GenomeNet

Database: UniProt
Entry: A0A388KM24_CHABU
ID   A0A388KM24_CHABU        Unreviewed;      4617 AA.
AC   A0A388KM24;
DT   05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 1.
DT   27-MAR-2024, entry version 18.
DE   RecName: Full=Reverse transcriptase domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=CBR_g8399 {ECO:0000313|EMBL:GBG71100.1};
OS   Chara braunii (Braun's stonewort).
OC   Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC   Chara.
OX   NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG71100.1, ECO:0000313|Proteomes:UP000265515};
RN   [1] {ECO:0000313|EMBL:GBG71100.1, ECO:0000313|Proteomes:UP000265515}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=S276 {ECO:0000313|EMBL:GBG71100.1,
RC   ECO:0000313|Proteomes:UP000265515};
RX   PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA   Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA   Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA   Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA   Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA   Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA   Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA   Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA   Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA   Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA   Fujiyama A., Delaux P.-M., Quint M., Theissen G., Hagemann M., Harholt J.,
RA   Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA   Rensing S.A.;
RT   "The Chara Genome: Secondary Complexity and Implications for Plant
RT   Terrestrialization.";
RL   Cell 174:448-464(2018).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GBG71100.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BFEA01000141; GBG71100.1; -; Genomic_DNA.
DR   EnsemblPlants; GBG71100; GBG71100; CBR_g8399.
DR   Gramene; GBG71100; GBG71100; CBR_g8399.
DR   Proteomes; UP000265515; Unassembled WGS sequence.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd00303; retropepsin_like; 1.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 3.
DR   Gene3D; 3.30.70.270; -; 6.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.40.50.1820; alpha/beta hydrolase; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 3.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR029058; AB_hydrolase.
DR   InterPro; IPR049492; BD-FAE-like_dom.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR007021; DUF659.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR   PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR   Pfam; PF20434; BD-FAE; 1.
DR   Pfam; PF04937; DUF659; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 2.
DR   Pfam; PF08284; RVP_2; 1.
DR   Pfam; PF00078; RVT_1; 3.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   SUPFAM; SSF53474; alpha/beta-Hydrolases; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 4.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 3.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils}; Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000265515};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        85..108
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          1702..1881
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          2025..2185
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   DOMAIN          2628..2828
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          4052..4243
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   REGION          122..224
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          874..906
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          963..988
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1001..1069
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1084..1166
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3354..3398
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3613..3649
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3858..3880
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3941..3985
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          616..643
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COILED          3502..3529
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        125..170
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        171..224
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1020..1037
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1038..1059
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1120..1140
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3369..3398
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3615..3640
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3862..3876
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3962..3982
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   4617 AA;  517296 MW;  B745C9209843EF86 CRC64;
     MMMHWSLKAA TTTTTMAIQT TKSGTYWRGK TGELVPSWTA ASEVSYMRQG SGKKKGGVRA
     MIGAAVRGAR IEVSKAGTPS ERLKIAAVLG MIIVKEIIGM ILLIPYGLHA LGIHRALATP
     VRTAEEGGEG EREGEREGER EGDGSKTSEK RERSAVGGQR RQKEEGTHLN VDQQMAESLV
     SNSNARQHVS STTIRRRASF PQSLQEQNEG LQLQGGNGAS CMSSMTSDVR RRRSGELRGS
     VRGRGGRELS AAGHVGTRLL TWLAPKERSV SLVRDVRYGV RERNLMTKNS SQAVKKHFRE
     VGISGQAYKG NKKRVCNYND KPMAGTGNRA REHFLKGLRC NVERLRGFRD AKFENTRTKR
     VEVTRGRVGK RVEELQQEWP TTGCMLQLDG WTDRRQRPHI NVMVSFPKGS IFWRSVCMSQ
     RNKDASAYYA ILKRAIEEIG AEAVVGAIMD NAAVCATAGR MIEADYLHIF SVPCIAHSLD
     LMFESVTKIG WVGAIMKMAS ELAKFFTNHS RVRDLLIHYS NGGVVSRPGA TRFATNFIML
     SSLHGLYLPR RACMTDGDWK PVIVHTSLRD LFVKATHSIL DDTFWADDEK VMQTSKNLLK
     LLKKVDGMGP TISKVYARME SAVEKLRESK HFVEAEKDEL EEIIMRRWNA MTSPLDCVAL
     FLDPEYRASR PETDAEVADG FWTWLYLWGP PKRHKRVSQK VKPAAVGRER SPLEGVSEEE
     IASEIKERKR KEPQRLTQDR IAGMDIGDEN LTPQERKRVI EILKTCDKAI AFSDAERGRV
     DPRYVKPARI YTVPHVPWND AGWKYAQKEK EEVIAFLKEK MVSHVAEPSD SAYANRWFFL
     RKPNGKIRWI QDLQKVNAVT IRDVGSVPHA DLLAEGAAEE GSPGGPVDTL EREARPPSPS
     SNLEHHPIVA AVGADAGVPV TNSGADATEQ LVVGTSATAR RDRATVLPTV AFYASGKSTG
     GMDEQGVRNV GRPSAPSMGK RSIGSVDGGR HTAMAEFEDR HGSALPTKTS DVHATRAAKA
     SLSRARKKAS ARKASRSSSH MRSRERGSGV VHLEDGEIAP DGDALDIGGR DAMTADIVGR
     EGTGNAVAGQ KRRGSVLIVH DDNTNVAPGE TTGTDDAGDS DYVPKPRAED GDDGGGRRVR
     PRTRLGPQGQ RAQGTPSAMI PGPIDRRAQA QRLLRKMEAT LQQTHGLDIY VPNESANGDL
     LPTVVFVHGG VWASGEKWQY SPIGVRLAKE GTVAVLVQYT LFPEVLADDM IKEVSLALTW
     VMDNICLYNG NPERIFFMGH SSDRELSHES QGPRSVGKVS TAESDATTLS TPSELVVTSL
     GSRTHAKVVS PPILHYFKDY AARLVPSLNS RAQGQDVCAV SSPFGSSGIE SSSSGPSRDS
     ARVFNIEDLD LLTPEDFAWL PLPSTGCLPE PQCATLSAHL HTYLAFYAPP TSPTEDEVAV
     GDILAYVSKV AREFRTQRYD DNNAPLLYVR IQVGQVSCSA LLDSGATRNF ISHSFMQRVG
     LGPQVRRKAH PMAIKLVDGR TQQLLDRYIE AVPVYFAPHA CEPVMFDVLD TDFDIVLGMP
     WLASADHTVN FHRRTLTVRD AFGAEVPCTI PLPHPSIRCQ VVTTKFFRVT CSYERADEIG
     LCFLRTVAAA ESQPTDLSSN PWVVRLLDEF ADIFESPTGM VPDRSISHEV ILEAGVVPPK
     GCIYRMSKEE LTVLRAQLDD LLDKGWIRPS SSPYGVPVLF VRKKNEDLRL CIDYRKLDAQ
     TVKNVGPLPH IDDLLERLGG ANFFPKLDLK SGYHQIWIRP QDRYKTAFKT RYGHFEWVVM
     PFGLTNAPTT FQVAMTNEFR AMLDRFVLVY LDDILVYSRT LEEHLEHLGR VLEMLRRAKY
     KANHDKCEFV RQELEYLGHF VTPQGISPLS DKIQAIQDWP EPRNITDVRS FLGLASYYQR
     FIKGYSKIAT HLYKLQCEDR PFDFGTDARE SFLALKAALL SAEVLRIYDP LLPTRVTTDA
     PGYGIDAVLE QHDGVDWHPV KYFSKKVSVV HSIDDARKKE LLAFVHALKR EAIAMDITGP
     FPKHKTGVDG ILTVVDRLTK FSMFLPCRYH AKAPELAEVL YAGWIRTKGY PKEIVCDXDT
     XFMSDFWLAL IKRWGSSLKP SSARHPQTDG QTERAHQTAQ VLIRTLICPD QKDWVERMPN
     IELAYNSSIH PAIEMSPFEF EHGLPVNSPL DTIIPQAVES DNHLLFIRRM QEPLVKACDQ
     MSKTQQRMRQ QANRQRLPCP FRVGELEEGV KEPGYGGWVK EVEEREGVKV VGYGGWVKGV
     EEEEGARVVG YGGWMEVEEE ENVKPVGYGG WVKEVEEEGV RVVGYGGWLK EVEEEEGVRV
     VGYGGWVREV EDEEGGGGGR RSEGGWLRWM GGWVEEVEEE EGVRVVGYGG WLKEVEEEGA
     RVVGYGGWVK EVEEVEEEEG AKVVGYGGWV KKVEEVEEEE GAKVVDYGGW VKKVEDEEDG
     KAVDYGGWVK EVEEEDGVRE VEEEEGVRVV GYGGWVKEVE EEEGVKVVGY GRWFKEVEEE
     EGLRVLGYGG WVKEVEEEGV RAVGYGGWVK EVEEEKGVRV AGYGGWVKEV EKEGLREVEE
     EEGVRVVGYG GWVEVEEEEG VRVVGYGGWL KEVEEEGVRV VGYGVWVEVE EEEGQKVVGY
     SGWVKEVDKG EGVKVVGYGG WVKEMEEEGV EVVGYGGWVK EVSFGGWVVE VEEEGVKVVP
     KLPVACGCKV FSKIDLKSGY HQIEVDPADQ HKAAFKTRDG LYMFTVMPFG LTNAPATFQS
     LMDKVLREQI GRFVVVYLDD VLIFSKSMEE HLKHLEEVLT ILKKMQLHLN LEKSEFGKDS
     VIYLGHRLSA AGLEPETTKV EVIRNWPQPV NVRELRSFLG LASYYRKFVP RFSIVAHPLS
     QLTSKNVPYS WDTTCTNAFQ ALKDALVSYS VLRIADPKLT FVVTTDASQY GIDVVLQQDD
     GDGLRPLEFY NKRMPNVKVA TSTSMRELYA LRMALSHWKH YLLGCHFKVF LDHDTLKWIK
     ERTTLSPTLI RWFHEIDIFD FELRHKKGCY NRVADALSSH PEYMTCLVKS YDLRKKLKEE
     LVEHTAKDPE LSPILERLHL GWQIRNPLFY LFPEQPAGLT PGKPGFRAKY DRLLKIAIAD
     MTNLLVYYGP WEVLDVIGED HFGPSYVVDV PAHLRTYPVF HASKLYLHRD ATTFDYRENM
     ITRAIKGGRE INGIKQHVGK GRNKPYQVHF MYHPLDDLYW ISKQELLHVV KEAWTGRGGT
     VDGAEKPIVL KGTGEEVGEG HRWLVVDSGC NVLLADPHEA GDEDVVADLV VGNVLAEGTD
     ILDEAVGGAV LAKPAKLVNV VVDCLLRAEG GGEEGGPLEQ GEGRRVCTSR MLMRSGTVKP
     SPTPAEQAEI DHKLADKKKE KEAKKKQKEE EARKKMKEKM ERELEEELKS IQEEEEEQQE
     EVKTLVRRRP IDIPESSTTQ KPPVEPLNWA NQYYYSSELL KESEEECDVF LAKLAMVTDT
     VERNLMMEEK RDELHSKLLA TRRQEIDEKK RLQAEGERLQ KALEAQKEDP SATEAQLALL
     REAVLNTRQD MNLMRQTLQR VETHRVEFET VWNNFLEKSA KDVDHHVQTY IQVLDDHVSK
     TFTLEVIEKI MKGAGGGGDD GDDDGDGDKK GKKKIGDPQD KPSQETGQGN KIKLKLPWTY
     NGKKEESMLH WAAAIETYVY GQRIPYWDRV LMATSCMGGD AISFAISLQK EAGCSSMVEY
     LQQTRIEDFL KLIREKFEDK NLARRTKMLI LSLPDRKWKS TSALKATMDE LLQCPDHGLT
     PAQILNSFAR ALPDPLRTQL YPRTKEEGTT YEKFGKIAID HAGFLANYCH YWKDLQAAPR
     PIGRPTLSFS KRSVDIIGTT AAPRPDPPPP ELSVPTPSPT ITVTSPRQFA HFIRQDDVTF
     FTVNVTDLLD YGPPCPDVEL ISLKPDPPSI SMAPISTSVP PPSVHYDPPC PNAELISLEP
     DPPSIPMAPI STSVPPPSVE STPPSLADAD VEELARYTAD LEPAVRDLIR EYHDVFPSSF
     SYVGIPPMRD VENSIQLMPA YRVHHQAPYK LSIPEATELK RQLEELLRLG FIKPSNALWG
     APVLFADGTL RLCIDYLGLN RYTIKNNYPM PRSDELFDRL AGNRFFTKID LLNSYHQIRV
     VAADQPKTAF RSRFGHYEFT VMPSALTNAP ATFQRAMNDI FRDILEHYVL VYLDDILVYS
     RTLEEHLRHL HDVLDRLRRH GFYAKLSKCR FAHHKVDFLG HYVSDQGLHM DDVKITAIAE
     WLIPTSAKQL RSFLGLTSYY RAHVSAMVLW QRAIHQLKHG ARSLEQEDGI ERTTRDLQGS
     DATEDSEFRI CERMPARGEV NEETYESQGA KDQNGHGKVA GAYSSQTGEQ IDSRQPYCFL
     GLSGVYDIGR HYAFEQRRGV AGISCMRPAI GGVDKFDAFS PSLLFALMGS MQISNGNGCT
     HCTPCTEPAK DHVKSTLRSL RGEKSAAHSK KNQMRLYSYM DDLPGARAAR KENPALDGRP
     IISTPRPSTE KAVNTNRHSD YQRFDSSRAT CVVPPCLLLC SPGDKTVPVD SSLALEKALL
     RIGCHAKAIL YDDLGHTDFS IWGTKTHCKT IDELGPHVSD ILDVVSERIP LDSMCWD
//