GenomeNet

Database: UniProt
Entry: A0A388KGG8_CHABU
ID   A0A388KGG8_CHABU        Unreviewed;      4365 AA.
AC   A0A388KGG8;
DT   05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 1.
DT   27-MAR-2024, entry version 19.
DE   RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE            EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN   ORFNames=CBR_g3864 {ECO:0000313|EMBL:GBG69164.1};
OS   Chara braunii (Braun's stonewort).
OC   Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC   Chara.
OX   NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG69164.1, ECO:0000313|Proteomes:UP000265515};
RN   [1] {ECO:0000313|EMBL:GBG69164.1, ECO:0000313|Proteomes:UP000265515}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=S276 {ECO:0000313|EMBL:GBG69164.1,
RC   ECO:0000313|Proteomes:UP000265515};
RX   PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA   Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA   Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA   Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA   Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA   Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA   Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA   Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA   Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA   Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA   Fujiyama A., Delaux P.-M., Quint M., Theissen G., Hagemann M., Harholt J.,
RA   Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA   Rensing S.A.;
RT   "The Chara Genome: Secondary Complexity and Implications for Plant
RT   Terrestrialization.";
RL   Cell 174:448-464(2018).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GBG69164.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BFEA01000110; GBG69164.1; -; Genomic_DNA.
DR   EnsemblPlants; GBG69164; GBG69164; CBR_g3864.
DR   Gramene; GBG69164; GBG69164; CBR_g3864.
DR   Proteomes; UP000265515; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd00303; retropepsin_like; 1.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 2.
DR   CDD; cd01647; RT_LTR; 2.
DR   Gene3D; 1.10.340.70; -; 2.
DR   Gene3D; 3.30.70.270; -; 4.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 4.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041373; RT_RNaseH.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   InterPro; IPR011990; TPR-like_helical_dom_sf.
DR   InterPro; IPR019734; TPR_repeat.
DR   PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR   Pfam; PF17921; Integrase_H2C2; 2.
DR   Pfam; PF17917; RT_RNaseH; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF00078; RVT_1; 2.
DR   Pfam; PF13181; TPR_8; 1.
DR   SMART; SM00028; TPR; 7.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 3.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   SUPFAM; SSF48452; TPR-like; 3.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
DR   PROSITE; PS50005; TPR; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000265515};
KW   TPR repeat {ECO:0000256|PROSITE-ProRule:PRU00339}.
FT   DOMAIN          1013..1192
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          1961..2130
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REPEAT          3319..3352
FT                   /note="TPR"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00339"
FT   REGION          417..453
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          668..694
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2532..2551
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          4318..4339
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   4365 AA;  489407 MW;  2EA848070EDA1D0E CRC64;
     MLFGCELIPF VSSSGSGAEL HPATGDKAKV RGGGEVILFS ANKRGGGGGR GGEGLNGYGL
     CYVYSELNLI ITLFLLQGLA ELHTATGDKA KVIDAWEHIR QLSSEAGDTD RVYDYTRRLA
     LAYGTEGRRE DELKTWRSLA NSNEGAVSSA VRIEALCRIA DAEMETMTQD IDAAAKRGIA
     KWKKANQQLA SAELGGPVTD SQDSWNEDKV RRLVADEWRD AHAGVVESTL LDVVEAAPAF
     GEQLHATSAV PRQSQCHVSM TGPAMGLPGQ LANEPIANYK KRFQAQLAFI EAEEQRQLAA
     AAARLQAEEA ATAEKLRLQA EADVDAQAPR KEAQDLLQRH EANSIERLEF WHFEPNGDDA
     TPEEQRKEFL SKLVTRLLCA CNYQRSELEK QNQELQQQYQ DLKTQHQELA NLRRMVQSHE
     DATRAPNSRV LDLEQAVPGP NAGASSSAPS SRQLEERVDH IVAMLGDIST FAAPTTISNQ
     LDTLKTEFQQ LQSTNTDGNN PKQYKMPIFQ LEKFDDYTHQ DPVLWWEAFT TQLRILPVAK
     HAYIGALFMN SKGGCEIWLT HLAATHSVDV ADLKDKIAWE ELTRLWKKRF IVDDAPALAI
     NRLFTMSQGN TATREATPPL TPPDLAALLA ASYTSGEDAH VASPRYTYED YAVHLVPPLD
     QPLHVQXSTX CTVSSPSTTD STASPSSTAG DSTSWSRLEE LDPLTFEDFQ WMPLPRSGRL
     PKPHCNVLKA QLRDYLHTAV QTLLMDARVE VVDLHAYIAK IDREFKTQRY DDIDAPLLYV
     RIQIGEATCS ALIDCRASRN YMSQDFMVRA GLGPRVRRKA HPTQVTLADG HTHKSIDRCI
     DDVPVYFAPH ASEAVSFDIL DTKFDMILGM SWLRSKDHPV NFYRRTVHVR DRNGVVVPCT
     VAPPHPSVSC HVVSAASMRA SIIRDDIEEM GVCFLHALPP QDASSMDSSS DPHITELLDA
     YSDVFEGPHG VVPDRPIRHE IILEDGAVPP RGCIYRMSEE ELSVLRAQLD DLQEKGRIRP
     SSSPYDAIVL FVRKKNKDLR LCIDYRKLNA QTIRNVGPLP RIDDLLERLG GAKFFSKLDL
     KSGYHQLEIR KEDRYKTAFK TRYGHFEWLV MPFGLTNAPT TFQAAMATEF RHILDRYVLI
     YLDDILVYSR SLEEHVEHLR TVLERLRQAK YKANRDKCEF ARQELEYLGH YVTPQGIRPL
     ADKIEALRVW PEPTNTTDVR AFMGLAGYYQ RFITGYSRIA APMTRLQSPK VPFVFDDDAC
     RSFQALKTAM LMAPVLNIYD PTLPTRVRTA ASLVLEQCVR MIKKQSAIST PWEVAMSFCE
     DDHANEICEP LRPIFDSTGM NLEQEWSQMR RNVWKGLLGR KLLHAFPLHY DGLVGLVCAL
     RTEISRSIRQ GAGVFEEALI QYQENCSSVS GWLMLTELKL LHQDFEGTLE CADKGRQCVI
     QAQDELGLGL SRAEMELIVL QAKALAELGD LDRAESLLHH VLQRAEAASL DDKLHIGMAA
     KKQAQIVTDG LLKLYSRKYK SVRELTTTVE RLIVVPGVEY NPQVLLTMFL RCLPTEIKNL
     LASDACLEYH TFETFSKKAL DLEAMLGGAQ TPATDERKKK TPQEWKKKGS RLMMVDSDGN
     QIEIDDVSEL VEVSKLDGKE SVEGSNLAAV VKTKAGGRGK GGQQRSQGQV ANPNKIAAWV
     RAGLDQEVWR DRWSRGACIN YDEYGHQQFK CKNPKSRRRS LLREGKKLRP VEHMSKKMPS
     KKLAKSTYER ELYALYKALV HWRHYPLGRC FYLRTDHHTL KWIKTQPVLS DALKRWTAFI
     DQYDFKLDYV KGEYNKVVDA LSRRENYLCA LISEFGLSED VTRSLGETYK EDPLTMDIIN
     KQQAKDKATS DEFVMVDGLE KARFKRLVVP SREILRSSFL GECHDATGHF GYKKTCANLV
     QRFWWPNMLD DAKKYVQTSQ VCPRDKPRTQ APLGLLKPLP IPAGPGQSIS MDFMDTLVTS
     KNGKRHIFVI VDRFTKYTRV IAMLETAGTE HVVKLFMDNW VRDFGLPKTI VSDRDVRFTS
     EMWKKAAEQM GSQLQMTSGN HPEANGQGEQ MNQVVQHLLR HYIKPSQDDW DEKLPLIASL
     YNNAVHSSTD VSLNQLHLGW KPRSALDVLL PENRTAATPG TIEFGVQYEK LLQQTVEHIK
     KSQEAMIASE NKRRRQSIFQ GLAGVCLRSG KDAEGRQLLE EIVEWDDRND WAVGELGWLA
     FTNGDNQAAI DLLSSAISIQ PKSYLHHLRL GKVYWNSREE LTDGSSKSHA RLLEAAKINP
     RCAEAFRYLG LVYRTALGDL QRASRCFQKA ISLDAQDRVS GISSATSAVP GHIAPSDIVP
     RQQHHVRHSA TPAVTSDTAP RQQCHVSNMT GLPGQLANET IAAYKQRCLA QIEAEEQRLL
     AVEAARIKAE EAAAAEKLRL QADADADSQA RRKEAQDLLQ RHEATSIDKL KFWLFKPNGD
     EATPEEQHKE FLSKLVTRLL YACNYQRSEL ERQHQDLTQQ HQELATLRRT VQCHEDATRA
     LNARLLNLEQ TVPGPAAGAS SSAPPSRQLE DRVDHVVAML GDISTFAAPT TTISSQLHTL
     KTEVQQLQTT NANGNPKMYK MPTFTLEKFD DYTQQDPVLW WEAFTTQLRI LPGGCQTWLT
     HLATSHGVDV PDLKDVITWE ELTRLWKKRF IIDDAPALAI NRLFTMSQGN TATRDWLTEW
     QKIAAAPNLN LPFEHLRREF YNRSSAALSQ ALGDREQYAT FVEIIDKARE IIKTNRTVSM
     GQVWLDRGGV QSSRPLRQLL LVQQHQAQDL PVPGSGQGGC ATPPQLGKLS FAEGYHQLEI
     RKEDRYKTAF KTRYGHFEWL VMPFGLTNAP AIFQAAMTTE FRHMLDRYVL IYLDDILVYS
     RSLEEHVEHP RTVLEWLRQA KYKANHDKCE FARQELEYLG HYVTPQGIRP LADKIEALRV
     WPEPTNTTDL RSFMGLAGYY QRFITGYSRI AAPMTRLQSP KVPFLFDDDA RQSFQALKTA
     MLMAPVLSIY DPTLPTRVTT DASGYGIGAV LEQHDGDDWH PVEYFSHKVP PINLLDDARK
     KELLAFVMAL KRRRHFLLGR RRFTWVTDNN PLTYYKTQDT VSSTIGRWMY FIDQFDFTPK
     HVPDLSNRAA DALSRRPDLC AMTHHAFAFD EELQRQFIRA YESDPDFGTL YAQLSSDHPP
     ASHYRIADQY LLLHSRGKDL LCVPRDRRLR TRLLGEYHDS RLAGHFGVNR TIARLRQRFR
     WPDLITDVTR YCDSCKVCRR SKPRNRNPYG GLHPMPIPRE PGLSIAMDVT GPFPRDRLGH
     DGILTVVDRL SLLRWHADEA LPLALAYAYQ QLGRFTAALK SYGRVLQLDS DRVFSLIQAS
     IINHQMGLHS KAVEGFQRAL CRSPKHVAAL CGLVSALKGQ AEVSVFSGAY GAAASFLKEA
     ESHIVLCREL HPTLQLARKL HGDIMILYAR VVPYEDDTKD EKGAEDDKAV EESKGAVGIL
     KKMEEEKKEA GLRAIRSYSH ALHKHPSNEG AWADLAFAYY QHAHLLSLQL VPKAPKEEIE
     RLSSFAERLA LGALRLNGGN AMLWNLLGVV ARRKFLRQHA FIRALQIDSK LPIVWANLGQ
     MPEARWGFGL VASRAGLLHK AEVYAALQQC ITQTPQEVSV FNVAALSAEA RGLIQEAIAL
     LKAAEEIACL EVGSPMASEE LRKDQVSSLQ KKIMISLNLA RVLMKAGRAA EALAIYQHVL
     HDEVASAASA SLLRGQNELS IIRGYAIALW SSGHDSSVAT STGIQSLTTA LQKASSSPPS
     PQYLATLRVL CQLYYHHLGP SGTAAILDCL TAVPAVCLED TQLQSAGLAS VVASRRFIDM
     SHMLSLCRTL FDHSSAHVGQ MIITDAYCIQ GDYAQAVRHL KRALHRYPHL VSLRGALGRA
     LLNLSIENVP KALHICLLSD EGFSDAENRE GKALCDAVCA WCAFACYACG MARGCHSTPF
     DHHTQAEQLQ TLCRKLQRLV HMNPSNGSAR LLMIATGLQR ARSLHYPSGL CTGLLRASMQ
     SLSLSVKEGL SPLERALLAI CASELALHVR KDGAISLGLR LAKKAVQVAN VPDSKPESPL
     HAVSPLVRRQ IGRCHAAAGE WHQAFLAHQE AMSDALERFD FPAALEVLEA MYLVGKKSEA
     EVELRRFEQA VQQMGTVGEK NVWLGALRAL QGQTKGDEGD GPAAVRFTSE AVTWTDEDNA
     VLHYLDGALR LESAEAEKDI EGVVLAKRSL GRALIMTQSE LGKPKVSPLS HAGVGAGIGG
     IYLLLARAEM LSNRGAPPSA SRLAKWTDLI KHEWSLWPRG TQPAELCFQM GVISGMSSSS
     KGGDSWEEEG SNRNKTPSLG TQGWFQRAVH LDPTCHRFWR RMILA
//