ID E2M0D8_MONPE Unreviewed; 628 AA.
AC E2M0D8;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|PROSITE:PS50878};
GN ORFNames=MPER_13114 {ECO:0000313|EMBL:EEB88864.1};
OS Moniliophthora perniciosa (strain FA553 / isolate CP02) (Witches'-broom
OS disease fungus) (Marasmius perniciosus).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Agaricomycetes;
OC Agaricomycetidae; Agaricales; Marasmiineae; Marasmiaceae; Moniliophthora.
OX NCBI_TaxID=554373 {ECO:0000313|EMBL:EEB88864.1, ECO:0000313|Proteomes:UP000000741};
RN [1] {ECO:0000313|EMBL:EEB88864.1, ECO:0000313|Proteomes:UP000000741}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=FA553 / isolate CP02 {ECO:0000313|Proteomes:UP000000741};
RX PubMed=19019209; DOI=10.1186/1471-2164-9-548;
RA Mondego J.M., Carazzolle M.F., Costa G.G., Formighieri E.F., Parizzi L.P.,
RA Rincones J., Cotomacci C., Carraro D.M., Cunha A.F., Carrer H., Vidal R.O.,
RA Estrela R.C., Garcia O., Thomazella D.P., de Oliveira B.V., Pires A.B.,
RA Rio M.C., Araujo M.R., de Moraes M.H., Castro L.A., Gramacho K.P.,
RA Goncalves M.S., Neto J.P., Neto A.G., Barbosa L.V., Guiltinan M.J.,
RA Bailey B.A., Meinhardt L.W., Cascardo J.C., Pereira G.A.;
RT "A genome survey of Moniliophthora perniciosa gives new insights into
RT Witches' broom disease of cacao.";
RL BMC Genomics 9:548-548(2008).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EEB88864.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ABRE01017908; EEB88864.1; -; Genomic_DNA.
DR AlphaFoldDB; E2M0D8; -.
DR STRING; 554373.E2M0D8; -.
DR KEGG; mpr:MPER_13114; -.
DR HOGENOM; CLU_000384_33_4_1; -.
DR InParanoid; E2M0D8; -.
DR OrthoDB; 1605248at2759; -.
DR Proteomes; UP000000741; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF13650; Asp_protease_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Aspartyl protease {ECO:0000256|ARBA:ARBA00022750};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022750};
KW Protease {ECO:0000256|ARBA:ARBA00022750};
KW Reference proteome {ECO:0000313|Proteomes:UP000000741}.
FT DOMAIN 362..541
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 205..230
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 205..222
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 628 AA; 72731 MW; FAFC8F2FE727C06C CRC64;
MGPETQNETL QASYATTAYI NSKTEQAFII KTNLFENDDN VPALIDSGAS RLFLDREEAA
KYTRKQRRLE KPVKLTLFDG ESTSSGLITH ALDGTITFED GTVHKEELLI TKLHPEAKLV
LGLPWLRKYN PDIDWSELKL SFRNGVKLCA SIIKNLDFMQ QPQQKVEIEE EEELRPDYGV
PLGVGEPLLL HESEWEEYEW QRNKEKLGKQ KQEDESIWKD PRPPDQQDSP YISLIGAAPF
MTLIQQGCEI YTLRIMPETE EKANLQSICT KTMDIVDGKP RLGSNDDVMT PEERADFEKH
VPKAYHQFDK VFSDKEAQEM PPHRSYDMKI QTEEEQYPPP GKVYNMSGTE LKALKEYIDD
MLGKGFIRPS NSPIGAPVLF AKKKDGSLRL CVDFRALNKL TIKDRYTIPL IGNLIDQLKN
AKVYTKLDLR AGYYNVRIAE GEEWKTAFRT RYGSFEYLVM PMGLSNAPSV FQRFMNDIFH
DMVDICVIVY LDDILIYSDN EEEHERHVKE VFRRLEKNDL HLKLKKCEFH TKEVEYLGVI
VTPNGVRMDP SKVKVIRDWP VPTNLKELQA FLGFANFYRR FIDNYSGIVK PLTTLTSKNK
SWEWTKERQS VFELLKEAFG YCTYTETL
//