ID A0A182WJ43_9DIPT Unreviewed; 1953 AA.
AC A0A182WJ43;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 24-JAN-2024, entry version 33.
DE RecName: Full=DNA-directed DNA polymerase {ECO:0008006|Google:ProtNLM};
OS Anopheles minimus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=112268 {ECO:0000313|EnsemblMetazoa:AMIN010397-PA, ECO:0000313|Proteomes:UP000075920};
RN [1] {ECO:0000313|Proteomes:UP000075920}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MINIMUS1 {ECO:0000313|Proteomes:UP000075920};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles minimus MINIMUS1.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AMIN010397-PA}
RP IDENTIFICATION.
RC STRAIN=MINIMUS1 {ECO:0000313|EnsemblMetazoa:AMIN010397-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 112268.A0A182WJ43; -.
DR EnsemblMetazoa; AMIN010397-RA; AMIN010397-PA; AMIN010397.
DR VEuPathDB; VectorBase:AMIN010397; -.
DR OrthoDB; 208079at2759; -.
DR Proteomes; UP000075920; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; IEA:InterPro.
DR GO; GO:0006281; P:DNA repair; IEA:UniProt.
DR GO; GO:0006261; P:DNA-templated DNA replication; IEA:InterPro.
DR CDD; cd18026; DEXHc_POLQ-like; 1.
DR CDD; cd08638; DNA_pol_A_theta; 1.
DR CDD; cd18795; SF2_C_Ski2; 1.
DR Gene3D; 1.10.3380.20; -; 1.
DR Gene3D; 3.30.70.370; -; 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR011545; DEAD/DEAH_box_helicase_dom.
DR InterPro; IPR001098; DNA-dir_DNA_pol_A_palm_dom.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR002298; DNA_polymerase_A.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR046931; HTH_61.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR048960; POLQ-like_helical.
DR InterPro; IPR036397; RNaseH_sf.
DR PANTHER; PTHR10133; DNA POLYMERASE I; 1.
DR PANTHER; PTHR10133:SF62; DNA POLYMERASE THETA; 1.
DR Pfam; PF00270; DEAD; 1.
DR Pfam; PF00476; DNA_pol_A; 1.
DR Pfam; PF00271; Helicase_C; 1.
DR Pfam; PF20470; HTH_61; 1.
DR Pfam; PF21099; POLQ_helical; 1.
DR PRINTS; PR00868; DNAPOLI.
DR SMART; SM00487; DEXDc; 1.
DR SMART; SM00490; HELICc; 1.
DR SMART; SM00482; POLAc; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF158702; Sec63 N-terminal domain-like; 1.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
PE 4: Predicted;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741}.
FT DOMAIN 409..582
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 629..834
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT REGION 17..113
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 130..152
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 875..894
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 22..64
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 93..112
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1953 AA; 218672 MW; 83FBA3C403E740E0 CRC64;
MFSQSLQLGD STFDALEKQH TKACTPPTNE RTTRSQRNKV VKPQGSTEEV IEESPTNGKC
SAELSRQRSV RERLQIISTQ ARGRKAKKRT ARERQRTSGS DSREDKDTTG SSILRKDNFS
ELFTSDIQFN VDQSSTRRNG LDGPSPEKQP SVRISDEFEE LFQNSAFSMD TGRSQSMAMP
ANQNEPKEGS FHTLFDQSDF TLDKADEVAA PVDNPQHPAT MPESEELEIF SETLFDRAIS
SEPTMDDLHE NSDLLESLRV DSADDTIEDD VAIDIENVTF SQPLTSASIA LGQERSDESH
HQPQQTEIHD FIESEMANSF RNTKESLSDT VVSRHDESGL YPTSVTALPK KRKKAEPEGA
TVSCSTSLDQ TKDLRFLGNW GLSSSITSEY ARKGIVELFP WQVECLSRKE VVLEGKNLIY
SAPTSAGKTL VSEFLLAKTI TERKLKALLI LPFVAVAREK MLYLKDLLEP GGLRVEGFYG
GYHPPGGFES VDLAVCTIEK ANSIVNRLLE QNALASLGLV VVDEVHLISD PSRGYILELL
LTKVRFVSAR YEHRIQIVCM SATLPNIDLL ARWLEADLYR TNFRPIALVE MVKIGNTIFS
AAGEPIRVVN GSLLGYTLPK DTDHVALLCL ETILEGCAVI VFCPSKDWCE QLAISLASTL
HTLRKENHPH EELRNQLHKQ LDGVRQEEVL LQLRNCPAGL DSVLEKTVRY GVAFHHAGLT
TDERDILEGA FRDGALRIIV ATSTLSSGVN LPARRVIIRT PKFGGRPMSS LTYKQMIGRA
GRTGRDTLGE SILICSPAEE KIGRELTGAE LPPVRSCLDS ENYTHLKRAI LEIIASGSAT
TTQELETFVN ATLYSCERGH RFVVSDQLLQ RKSFKPKQTQ VREETDPDDS SGEMDPIVSC
ISFLLEYEFI RLLQQDCVED GNAAKRTVLN ATRLGLACLA ASLPPKDGFL LFSELQRARQ
CFVLESELHA IYLVTPYSVA YQWQQIDWMD FLDLWEKLSA AAKRVGELVG VKESFMVRAM
RGASNLDYRS LQIHKRFYTA LALLELVHEV PLCTVARKFK CCRGLLQSLQ QVSATFAGIV
TSFCASLNWT LLQLLVAQFR ERLFFGVAHE LLDLMRIPSL NGQRARLLHD GGITGLVQLA
NADRLTIETI LHNRTSFEAE RVREHENEYE AERRKRLRNL YLTGRAGVTV EEAARLLIQE
ARNYLQLEHG LANPCWSQTE KAESGERNKD KAKVELKTEE EQTSSLLLSH GESSSSTLMH
FHNSLLMRND HAISKNDSMH NASNARLTII DACKEHAVYK HFSELVACQS TITIAFGVEQ
EDSSSGVASI GGHLRRTDRT AKARERLRTY TFDDNLYLAG MAIAVQDASC TVYYVDFQDD
TVIDREEKVR FIRTLFGREE LTITLLDAKD QLKIIYRAGL VSAAESDELM AAALQDPRVA
CWLLQTDAET LTLHAMIERH CPALMQAELL WTGQRWVTPK WTGHGLNHHS PIDAKQRCAV
ECLLISQLMP CIGKRLAASG DSLPICFTSR EMPVQLALAR AEVIGFPVDR ERLGVLITRL
KACRDRIADE ARKLNGNRRL DFGSSRAVAK ALQLVGGEKK RCRTARQVLE RLESPLAALV
IAYRKIESNL TRTIEPLYRV IRPGTVRVHG RSFCFTSTGR ITMHEPNLQM VAKDFTVPSG
LGGFETDAER MFSCRSTFVC ADPDRTVLLS ADFCQLELCI LTHLSQDRQL MVALGASNGR
GTTVRNDVFR ALAARWNQLD SESAVSEELR NRTKAIVYGV IYGMGVRAMA AELQLDEDAA
RTLMEQFHAT YPEIRRYIER VVRLTRQLGY IETLTGRRRH LPAISSENAS ERAEAERQAV
CTTIQGSAAD ILKNAIVRMR RNLGKYRNVL ALGEIRLVLH MHDELIYEVP RKQLPKVAKI
LKSSMENCAK LSVPLRVKLK AGSSWGTLQD LSC
//