ID M5CEG3_THACB Unreviewed; 1058 AA.
AC M5CEG3;
DT 29-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 29-MAY-2013, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Retrotransposable element Tf2 155 kDa protein type 1 {ECO:0000313|EMBL:CCO37604.1};
GN ORFNames=BN14_11760 {ECO:0000313|EMBL:CCO37604.1};
OS Thanatephorus cucumeris (strain AG1-IB / isolate 7/3/14) (Lettuce bottom
OS rot fungus) (Rhizoctonia solani).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Agaricomycetes;
OC Cantharellales; Ceratobasidiaceae; Rhizoctonia; Rhizoctonia solani AG-1.
OX NCBI_TaxID=1108050 {ECO:0000313|EMBL:CCO37604.1, ECO:0000313|Proteomes:UP000012065};
RN [1] {ECO:0000313|EMBL:CCO37604.1, ECO:0000313|Proteomes:UP000012065}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AG1-IB / isolate 7/3/14 {ECO:0000313|Proteomes:UP000012065};
RX PubMed=23280342; DOI=10.1016/j.jbiotec.2012.12.010;
RA Wibberg D.W., Jelonek L.J., Rupp O.R., Hennig M.H., Eikmeyer F.E.,
RA Goesmann A.G., Hartmann A.H., Borriss R.B., Grosch R.G., Puehler A.P.,
RA Schlueter A.S.;
RT "Establishment and interpretation of the genome sequence of the
RT phytopathogenic fungus Rhizoctonia solani AG1-IB isolate 7/3/14.";
RL J. Biotechnol. 0:0-0(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCO37604.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAOJ01017334; CCO37604.1; -; Genomic_DNA.
DR AlphaFoldDB; M5CEG3; -.
DR HOGENOM; CLU_000384_38_0_1; -.
DR Proteomes; UP000012065; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR017984; Chromo_dom_subgr.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR023779; Chromodomain_CS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR PRINTS; PR00504; CHROMODOMAIN.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS00598; CHROMO_1; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT DOMAIN 148..327
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 699..858
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 996..1045
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
SQ SEQUENCE 1058 AA; 121896 MW; DEC474A965BBE7C8 CRC64;
MLDGSTPQTG KIWKKVALQF TYDGRTMTHE FLVTPIGHHS AILGIKWLEQ EQPDIDWSSR
QLSFPIPHSE FAHIAQEEEA DDDPLEGIPT QYHAFAKVFG EEEFNKLPPH RSYDIEIELT
EDGPLNSPLY SMTDAESITL KQWLEDELKA GKIRPSKSPI SSPVMFVPKK DGSRRLVVDY
RKLNARSKKN VYPLPRPDDL MSKLRGAKLF TKLDLRWGYN NVRVKEGDEW KTAFRTKYGL
FETLVMPFGL SGAPGAFQAM MNEVFQDLLD VSVIIYLDDI LIFSSDPEEH ESHVKEVLKR
LMEMQLFCKG SKCEFHQTTV EYLGIIVSDK GFSLDKLKIQ AVQEWPTPTT VKQVQSFLGF
ANFVRRFVAN FSQIARPLHN LVKKEVKWQW TDKEEHAFRE LQKAIVNAPV IVHADPSKPY
FLETDASGAA LGSVLSQRQE DGRLHPIGYL SESFKGAEQN YDTHDKELLA IIRSFEHWRI
YLERTVLPIT VFTDHRNLEY WKESRTFNRR HARWHLLLAG FHFQIMYRPG KQSTKPDALS
RRADHLDIPP ADQSMLPESV FANVSLILPE KEIQSRIELS LDQDESLTEI LEHLRNGSTA
PASIKKAFKD YEMEAGLLFY QGRILVPDAG DLREELLRIY HDSPMAGHPG RQRTLELLSR
AYYWPGIRAD VYLHVDGCET CQRIRLPKTK LIPAQPLEIP SRPWQHVSYD MITDLPKDGP
YDCILVIVDS FTKFVVLVPV SKKLKAPELA EIFLNRVWKQ YGLPEKTVSD RGTVFNNKFL
RALYKRLGID PHFSSAYHPQ SDGQTERVNP TIEHFLRAYA SVNQSDWVKW LPMTEFAYNN
ATHSATGRSP FMALYGWQPT LTPSKVETNV PEANDLANAI EKQWEEVAAA LRQSKSRLVE
NQNEEVPISF EVGEEAWLDA KNVNLKTKSD KLTERRLGPF KVIEKISDRA YRLELPETMR
IHDVFYVGLL SKVKRNELQA WENRPPPITV DGEEEYEVEG IMDSRENKGK WEYLIKWKGY
GPEESTWEPK SNLKNAAKHL KKYEKFLRQK SLDAAKGL
//