GenomeNet

Database: UniProt
Entry: A0A177EMH8_9MICR
LinkDB: A0A177EMH8_9MICR
Original site: A0A177EMH8_9MICR 
ID   A0A177EMH8_9MICR        Unreviewed;      1302 AA.
AC   A0A177EMH8;
DT   07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT   07-SEP-2016, sequence version 1.
DT   27-MAR-2024, entry version 26.
DE   RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE            EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN   ORFNames=NEIG_01785 {ECO:0000313|EMBL:OAG32906.1}, NEIG_02291
GN   {ECO:0000313|EMBL:OAG33108.1};
OS   Nematocida sp. ERTm5.
OC   Eukaryota; Fungi; Fungi incertae sedis; Microsporidia; Nematocida.
OX   NCBI_TaxID=1805481 {ECO:0000313|EMBL:OAG33108.1, ECO:0000313|Proteomes:UP000077089};
RN   [1] {ECO:0000313|EMBL:OAG33108.1, ECO:0000313|Proteomes:UP000077089}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ERTm5 {ECO:0000313|EMBL:OAG33108.1,
RC   ECO:0000313|Proteomes:UP000077089};
RA   Reinke A.W., Balla K.M., Bennett E.J., Troemel E.R.;
RT   "Identification of microsporidia host-exposed proteins reveals a repertoire
RT   of large paralogous families and rapidly evolving proteins.";
RL   Submitted (FEB-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OAG33108.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LTDK01000102; OAG32906.1; -; Genomic_DNA.
DR   EMBL; LTDK01000090; OAG33108.1; -; Genomic_DNA.
DR   VEuPathDB; MicrosporidiaDB:NEIG_01785; -.
DR   VEuPathDB; MicrosporidiaDB:NEIG_02291; -.
DR   OrthoDB; 1364977at2759; -.
DR   Proteomes; UP000077089; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProt.
DR   GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd00303; retropepsin_like; 1.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041373; RT_RNaseH.
DR   InterPro; IPR001878; Znf_CCHC.
DR   PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR   Pfam; PF17917; RT_RNaseH; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SMART; SM00343; ZnF_C2HC; 1.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
DR   PROSITE; PS50158; ZF_CCHC; 1.
PE   4: Predicted;
KW   Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW   RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW   Transposable element {ECO:0000256|ARBA:ARBA00022464};
KW   Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW   Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT   DOMAIN          260..275
FT                   /note="CCHC-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50158"
FT   DOMAIN          535..715
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          1049..1196
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          228..256
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          277..326
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          930..951
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        228..250
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        277..291
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        292..326
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1302 AA;  150343 MW;  D3B96DE8EE9E14C2 CRC64;
     MTNKSSHLRG RGKNQLEADT NTASYSQIEH QEENGGGSNA GKILQGFTEL DFRNGREELE
     SQMQVTTYLF CQSWKNINPR YKIAILRNEL TRRSGSIEAT MVRELRSLIA TVSPSSPEEW
     TEELWNQIWE RILQIAEQAR PSPVADLPCF KEFLRMTEKG SYLDWCAMMI GQTMAPAIRP
     IADIALEFAI RKAIYPANFS EIWRMGGENR DEANRCLWEE CARLDQQRER NRSEMANRKE
     RKERRHVSTN HQGHGSSFSC HRCGLAGHRA RDCRRDISSD RRKDFKGKEQ NSRRNWNNPA
     PDQSKDGVKV VSTTGPHSGN PEGASSYTTA QIQGVHLPAL CDTGAEINLM SWEDFNLLRG
     KGVVLTPRPT QKVISTAGPQ DLICENYVLA DIVVGNSNQI PAKVYIGKDF HGTMILGNPT
     LEELRIFLVQ LPQSQDRAGV FYTATRMKPS ENVVRNIEEI LENENPEQIE GISRESLMTK
     EDYKSLEAAI ARIQPTQQVE VAEINTPPGK FASEYRFMPN HNLEKAKKLV LDFIKEGFME
     ETQASKWLIP IRLARKPNGT WRFCLDYRRL NELVEQDNYP LPDVQSIFDG LYGKRIFSVL
     DIANGFFRVP LREIDRQKTT CKIGNRFFRM KVLPMGYKNS PAIFQRIMDK MLKEELETGN
     VKVYIDDILI ATESYEEHVE ILRRVLGILK AHGMEVNWEK VRLAYKEIRF LGHTMWMNKI
     KPMDDRITEV MKIPVPQTPK QVRSFLGKVN YMGRHIYNLS ELKAPLNEFT SKGRKFVWND
     KHQEAFENLK VAVSKIIAAT MPDPRKKFTL ETDASSTGVG AVLRQEDSTI GFFSMRLTPT
     QQRYTITERE LYAIVWAMGR CKQYLLGVEF DVVTDHKAIE AFFTKKDQEF GNERIARWMQ
     ALESFQFTPK YRRGEDMVIP DGLSRMYEGT SRDLGNPSEV NGKKVLDPSD GNASRVAATL
     DVLPRHCNEG SEEMRRDALG RQGWQDAGGA WEEDILSWHE EFGHRKMIRD DLKEKGVLVS
     ARELVRILRK CRVCLERDNQ FMKHNEYIDT QDPGELVGID LMEYQRRYIF VAVDYYSRMA
     FTFELKNKEA KNIVACLESV YGTFPFQKLL CDNGKEFANA QVKLWLRKHG VEVGYRPPYF
     HEGTGRVERL IRTLRDSLNR TKGTVRQKLK RVTRAYNQKV HRAIGTSPEL ASKPENRGMV
     TQAIERYKRE FGKTTEPRLA EGTQVLIRKD IRQKDDKHFE RVGQIVGQAG PHSYEVMPEK
     GKILVRNRNQ LKVLKIREEE FITRNEIGIP LQEWQDHHEE RY
//
DBGET integrated database retrieval system