ID A0A0W0FF01_9AGAR Unreviewed; 1536 AA.
AC A0A0W0FF01;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=WG66_12556 {ECO:0000313|EMBL:KTB34870.1};
OS Moniliophthora roreri.
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Agaricomycetes;
OC Agaricomycetidae; Agaricales; Marasmiineae; Marasmiaceae; Moniliophthora.
OX NCBI_TaxID=221103 {ECO:0000313|EMBL:KTB34870.1, ECO:0000313|Proteomes:UP000054988};
RN [1] {ECO:0000313|EMBL:KTB34870.1, ECO:0000313|Proteomes:UP000054988}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MCA 2952 {ECO:0000313|EMBL:KTB34870.1,
RC ECO:0000313|Proteomes:UP000054988};
RA Aime M.C., Diaz-Valderrama J.R., Kijpornyongpan T., Phillips-Mora W.;
RT "Draft genome sequence of Moniliophthora roreri, the causal agent of frosty
RT pod rot of cacao.";
RL Submitted (DEC-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KTB34870.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LATX01002025; KTB34870.1; -; Genomic_DNA.
DR eggNOG; KOG0017; Eukaryota.
DR Proteomes; UP000054988; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd18972; CD_POL_like; 1.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00298; CHROMO; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000054988};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 97..112
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 319..500
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 872..1032
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1171..1230
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 1403..1512
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1438..1471
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1472..1492
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1536 AA; 174126 MW; C25762A79C062E61 CRC64;
MADESGYDDQ ALIHIFRKGL PNSLSAKILN QLQGRPADLE GWYEAAIRYD EQYKYYEAVQ
KPKRFRLTDE KKKKVSINRM TNQLSDEEQK KYMADGRCFR CAKQGHMSRD CPTKQNGEVK
EEKKKLSTRE AYAKIRAIVF NVDGTPNKAA WITKLVTATY TVGTRQLTDT FLISGLGSEE
VILGLPWLQK YNLQVDWNTG RTAFPEKRYI KIPRVTGVLD YESPEGLIHR IDIRAKLSTS
QRLQHSADDG LQPAIVKIPE YLSQYQGQFE DREAERFPIS RSYDHAIELK PEFTLRDCKV
YSLTALEQAE LNNFLAENLR KGYIRKSKSP MASPFFFVGK KEKGKLRPTQ DYRRLNHGTV
KNAYPLPLVS DLIDKLNGAM IFSKLDLCNG YNNVRIKDGD QWKAAFKTNR GLFEPTVMFF
GLMNSPVTFQ AFMDDVLQDF MAEGWCLVYM DDILIYSQNP EQHRERTLRL LQRLKEQDLY
LKPHKCKFDI NEIDFLGLVI KPGQIAMDPT KLAGISDWPA PRTVTGVRSF TGFTNFYRKF
IGNYSAIAKP LYDLTKKGVP FEWTKECDHA FRTLKRRFQQ EPVLRLPDLT RAFIIETDAS
KWASGGVLRQ EGPDGELHPC GYISHAFDAT ERNYEIYDRE LFAIVWALQT WRHYLMEGPH
PVTVLCDHKN LTYFRTAQKL NRRQARWSLI LSMFNLRLVH VPGREMVQSD VLSRRDDHVQ
GMDDDNDDVI LLPERLFINV VDLDLQAKLQ DRLGSDDFHK MTLESLTTKG VPPIKSALSD
WEVNDSLIRY QGWIYVPDDV LLRREITKTI HEGQPFGHPG QFGTLDLVQR DYWWPGMAKF
VKNFVDGCAT CQQMKINTHP TKVGLQPIAG VPNATPFQII TMDLVTDLPE SHSFDTVMVV
VDHSSSKGVI FIPCTKTLDA PQAADLLLRN VYKRYGLPDK IISDRDPRFA AAVFQETMRL
LGVKHAMSTA YHPQSDGETE RVNQEMEIYL RMFCSKEQTE WSSYLHMAEF AHNNRTHSVT
RNSPFFMIMG YNPRPLPTAF EPTSVPSVEE RLNKLRKLRG DVAGMMEIAR KKMIEKAGKG
TDKFIEGQKV WLEGKNLDFG YPSKKLSPKR EGPFVIEKVM GPVTYKLKLP HQWKIHPVFH
AGLLKPYKET EAHGRNFLEP PPDIIEGHEE FEIEAIIGHK LLWKPRRFLV SWKGFDSSHN
EWKTKPQLEH AMDLYLDYIV RNKLNSSYSV RSLPDTPPGV TAFADDVTYT GPFHPSPFEL
LFLVNTGGHD DALRNASVGQ PHLQYAVDGL IRLRAQRVSL NAIIEETNVY AANLAFNAVA
VGPAPLILQG PMTVPSLPPR STSFEPQSST LALHLGPLSS DVCDADPNSP EFLRAAATYN
ARIRAAVGLA RDQYEFCPRY TGTEPVLPRQ ASPAPVVPKL ESSPDPDPVK SEPDSDSPSD
LVYPSDSSSR DLSPIYAQSI GVATASPSPT PRPSSAESPE SQSPPAPAPA SSDPFNPFDV
NDEPLAPSTV PFLASGQFNP AFVSIRDVAR ANPGSR
//