Database: UniProt
Entry: M5CCP2_THACB
ID   M5CCP2_THACB            Unreviewed;       850 AA.
AC   M5CCP2;
DT   29-MAY-2013, integrated into UniProtKB/TrEMBL.
DT   29-MAY-2013, sequence version 1.
DT   24-JAN-2024, entry version 44.
DE   SubName: Full=Retrotransposable element Tf2 155 kDa protein type 1 {ECO:0000313|EMBL:CCO36910.1};
GN   ORFNames=BN14_11057 {ECO:0000313|EMBL:CCO36910.1};
OS   Thanatephorus cucumeris (strain AG1-IB / isolate 7/3/14) (Lettuce bottom
OS   rot fungus) (Rhizoctonia solani).
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina; Agaricomycetes;
OC   Cantharellales; Ceratobasidiaceae; Rhizoctonia; Rhizoctonia solani AG-1.
OX   NCBI_TaxID=1108050 {ECO:0000313|EMBL:CCO36910.1, ECO:0000313|Proteomes:UP000012065};
RN   [1] {ECO:0000313|EMBL:CCO36910.1, ECO:0000313|Proteomes:UP000012065}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=AG1-IB / isolate 7/3/14 {ECO:0000313|Proteomes:UP000012065};
RX   PubMed=23280342; DOI=10.1016/j.jbiotec.2012.12.010;
RA   Wibberg D.W., Jelonek L.J., Rupp O.R., Hennig M.H., Eikmeyer F.E.,
RA   Goesmann A.G., Hartmann A.H., Borriss R.B., Grosch R.G., Puehler A.P.,
RA   Schlueter A.S.;
RT   "Establishment and interpretation of the genome sequence of the
RT   phytopathogenic fungus Rhizoctonia solani AG1-IB isolate 7/3/14.";
RL   J. Biotechnol. 167:142-155(2013).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CCO36910.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAOJ01016549; CCO36910.1; -; Genomic_DNA.
DR   AlphaFoldDB; M5CCP2; -.
DR   HOGENOM; CLU_000384_38_1_1; -.
DR   Proteomes; UP000012065; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProt.
DR   GO; GO:1901363; F:heterocyclic compound binding; IEA:UniProt.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd00024; CD_CSD; 1.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 2.40.50.40; -; 1.
DR   Gene3D; 3.10.20.370; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR016197; Chromo-like_dom_sf.
DR   InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR   InterPro; IPR023780; Chromo_domain.
DR   InterPro; IPR023779; Chromodomain_CS.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041373; RT_RNaseH.
DR   PANTHER; PTHR24559:SF425; RT_RNASEH DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR   Pfam; PF00385; Chromo; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF17917; RT_RNaseH; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SMART; SM00298; CHROMO; 1.
DR   SUPFAM; SSF54160; Chromo domain-like; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS00598; CHROMO_1; 1.
DR   PROSITE; PS50013; CHROMO_2; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
PE   4: Predicted;
KW   RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW   Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT   DOMAIN          1..123
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          488..647
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   DOMAIN          789..850
FT                   /note="Chromo"
FT                   /evidence="ECO:0000259|PROSITE:PS50013"
SQ   SEQUENCE   850 AA;  97890 MW;  FEA35E73EEC0A043 CRC64;
     MGAKIFTKFD LKAGYNLVRI KEGDEWKTAF KTKYGLFEYL VMPFGLTNAP ASFQDMMNSI
     FIDLLDVCVI IYLDDILIFS KDESSHEEHV REVLRRLKDN DLFCNIEKCS FHVKKINYLG
     FIISEDGVEV DQSKVTAALQ WSSPKNVKNV QEFLGFVNFY RRFVPDFNKI AHPLYDLLKK
     DSIWNWSLAA QNSFDQLKEK LTTAPLLIQP DVTKQFFLEC DASDYATGAI LSQKNLEGKL
     HPVAYLSKSL APAERNYDIF DKELLAVIRA LKEWRHLLEG SELPIQILTD HKNLEYFSTS
     QSLNKRQIRW ANFLVDYNFQ IIYRPGSQNK KADILSRRYD LVPLEGGVEN QVLLKPDFFI
     ASITPDQEIN DLIGEAIYED PRSKEVLDTI KKGKKVQDWE LKEGLLWLKG KIFVPKDETI
     RNLILESRHD ALAAGHPGQA RTLELISRTY YWSSMKKFVN SYVNHCKTCI RAKPTNQLPV
     GLLKPLQTPE RPWEDIAYDM IVGLPSSEGF DAILTVIDRF SKMVHLIPTH STASAIDIAN
     LFVTYVWKLH GLPRSTVSDR GPTFNAKFIR HLYQRLDIKP TFSTAYHPQT DGQTERIQRE
     VEIFIRMFGN HRQSDWVSLL PLAEFACNNN LQTSTGKSPF QICYGINPKF SIGQHTNNDV
     PNAEEHADFL EKGYDEVKAA LTLAQERMKR FYDQRHRDIE PITVGDKVWL SHQNISTDRP
     SIKLSHKKLG PYLVLEKIGS HAFKLSLPHT MHIHPVFHVN LLQKFHPDPH GRDPHQPPPI
     ITEEGEEEYE VEEILDSKWK GRGKNKKIWY LIKWVGYDAG SNSWEPADNV GNAEEAIQEF
     HKKFPEAVGP
//