ID A0A388KM24_CHABU Unreviewed; 4617 AA.
AC A0A388KM24;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBR_g8399 {ECO:0000313|EMBL:GBG71100.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG71100.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG71100.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG71100.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG71100.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000141; GBG71100.1; -; Genomic_DNA.
DR EnsemblPlants; GBG71100; GBG71100; CBR_g8399.
DR Gramene; GBG71100; GBG71100; CBR_g8399.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 3.
DR Gene3D; 3.30.70.270; -; 6.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.40.50.1820; alpha/beta hydrolase; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 3.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR029058; AB_hydrolase.
DR InterPro; IPR049492; BD-FAE-like_dom.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR007021; DUF659.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF20434; BD-FAE; 1.
DR Pfam; PF04937; DUF659; 1.
DR Pfam; PF17919; RT_RNaseH_2; 2.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 3.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF53474; alpha/beta-Hydrolases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 4.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 3.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils}; Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 85..108
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1702..1881
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 2025..2185
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 2628..2828
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 4052..4243
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 122..224
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 874..906
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 963..988
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1001..1069
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1084..1166
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3354..3398
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3613..3649
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3858..3880
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3941..3985
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 616..643
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 3502..3529
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 125..170
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..224
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1020..1037
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1038..1059
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1120..1140
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3369..3398
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3615..3640
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3862..3876
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3962..3982
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4617 AA; 517296 MW; B745C9209843EF86 CRC64;
MMMHWSLKAA TTTTTMAIQT TKSGTYWRGK TGELVPSWTA ASEVSYMRQG SGKKKGGVRA
MIGAAVRGAR IEVSKAGTPS ERLKIAAVLG MIIVKEIIGM ILLIPYGLHA LGIHRALATP
VRTAEEGGEG EREGEREGER EGDGSKTSEK RERSAVGGQR RQKEEGTHLN VDQQMAESLV
SNSNARQHVS STTIRRRASF PQSLQEQNEG LQLQGGNGAS CMSSMTSDVR RRRSGELRGS
VRGRGGRELS AAGHVGTRLL TWLAPKERSV SLVRDVRYGV RERNLMTKNS SQAVKKHFRE
VGISGQAYKG NKKRVCNYND KPMAGTGNRA REHFLKGLRC NVERLRGFRD AKFENTRTKR
VEVTRGRVGK RVEELQQEWP TTGCMLQLDG WTDRRQRPHI NVMVSFPKGS IFWRSVCMSQ
RNKDASAYYA ILKRAIEEIG AEAVVGAIMD NAAVCATAGR MIEADYLHIF SVPCIAHSLD
LMFESVTKIG WVGAIMKMAS ELAKFFTNHS RVRDLLIHYS NGGVVSRPGA TRFATNFIML
SSLHGLYLPR RACMTDGDWK PVIVHTSLRD LFVKATHSIL DDTFWADDEK VMQTSKNLLK
LLKKVDGMGP TISKVYARME SAVEKLRESK HFVEAEKDEL EEIIMRRWNA MTSPLDCVAL
FLDPEYRASR PETDAEVADG FWTWLYLWGP PKRHKRVSQK VKPAAVGRER SPLEGVSEEE
IASEIKERKR KEPQRLTQDR IAGMDIGDEN LTPQERKRVI EILKTCDKAI AFSDAERGRV
DPRYVKPARI YTVPHVPWND AGWKYAQKEK EEVIAFLKEK MVSHVAEPSD SAYANRWFFL
RKPNGKIRWI QDLQKVNAVT IRDVGSVPHA DLLAEGAAEE GSPGGPVDTL EREARPPSPS
SNLEHHPIVA AVGADAGVPV TNSGADATEQ LVVGTSATAR RDRATVLPTV AFYASGKSTG
GMDEQGVRNV GRPSAPSMGK RSIGSVDGGR HTAMAEFEDR HGSALPTKTS DVHATRAAKA
SLSRARKKAS ARKASRSSSH MRSRERGSGV VHLEDGEIAP DGDALDIGGR DAMTADIVGR
EGTGNAVAGQ KRRGSVLIVH DDNTNVAPGE TTGTDDAGDS DYVPKPRAED GDDGGGRRVR
PRTRLGPQGQ RAQGTPSAMI PGPIDRRAQA QRLLRKMEAT LQQTHGLDIY VPNESANGDL
LPTVVFVHGG VWASGEKWQY SPIGVRLAKE GTVAVLVQYT LFPEVLADDM IKEVSLALTW
VMDNICLYNG NPERIFFMGH SSDRELSHES QGPRSVGKVS TAESDATTLS TPSELVVTSL
GSRTHAKVVS PPILHYFKDY AARLVPSLNS RAQGQDVCAV SSPFGSSGIE SSSSGPSRDS
ARVFNIEDLD LLTPEDFAWL PLPSTGCLPE PQCATLSAHL HTYLAFYAPP TSPTEDEVAV
GDILAYVSKV AREFRTQRYD DNNAPLLYVR IQVGQVSCSA LLDSGATRNF ISHSFMQRVG
LGPQVRRKAH PMAIKLVDGR TQQLLDRYIE AVPVYFAPHA CEPVMFDVLD TDFDIVLGMP
WLASADHTVN FHRRTLTVRD AFGAEVPCTI PLPHPSIRCQ VVTTKFFRVT CSYERADEIG
LCFLRTVAAA ESQPTDLSSN PWVVRLLDEF ADIFESPTGM VPDRSISHEV ILEAGVVPPK
GCIYRMSKEE LTVLRAQLDD LLDKGWIRPS SSPYGVPVLF VRKKNEDLRL CIDYRKLDAQ
TVKNVGPLPH IDDLLERLGG ANFFPKLDLK SGYHQIWIRP QDRYKTAFKT RYGHFEWVVM
PFGLTNAPTT FQVAMTNEFR AMLDRFVLVY LDDILVYSRT LEEHLEHLGR VLEMLRRAKY
KANHDKCEFV RQELEYLGHF VTPQGISPLS DKIQAIQDWP EPRNITDVRS FLGLASYYQR
FIKGYSKIAT HLYKLQCEDR PFDFGTDARE SFLALKAALL SAEVLRIYDP LLPTRVTTDA
PGYGIDAVLE QHDGVDWHPV KYFSKKVSVV HSIDDARKKE LLAFVHALKR EAIAMDITGP
FPKHKTGVDG ILTVVDRLTK FSMFLPCRYH AKAPELAEVL YAGWIRTKGY PKEIVCDXDT
XFMSDFWLAL IKRWGSSLKP SSARHPQTDG QTERAHQTAQ VLIRTLICPD QKDWVERMPN
IELAYNSSIH PAIEMSPFEF EHGLPVNSPL DTIIPQAVES DNHLLFIRRM QEPLVKACDQ
MSKTQQRMRQ QANRQRLPCP FRVGELEEGV KEPGYGGWVK EVEEREGVKV VGYGGWVKGV
EEEEGARVVG YGGWMEVEEE ENVKPVGYGG WVKEVEEEGV RVVGYGGWLK EVEEEEGVRV
VGYGGWVREV EDEEGGGGGR RSEGGWLRWM GGWVEEVEEE EGVRVVGYGG WLKEVEEEGA
RVVGYGGWVK EVEEVEEEEG AKVVGYGGWV KKVEEVEEEE GAKVVDYGGW VKKVEDEEDG
KAVDYGGWVK EVEEEDGVRE VEEEEGVRVV GYGGWVKEVE EEEGVKVVGY GRWFKEVEEE
EGLRVLGYGG WVKEVEEEGV RAVGYGGWVK EVEEEKGVRV AGYGGWVKEV EKEGLREVEE
EEGVRVVGYG GWVEVEEEEG VRVVGYGGWL KEVEEEGVRV VGYGVWVEVE EEEGQKVVGY
SGWVKEVDKG EGVKVVGYGG WVKEMEEEGV EVVGYGGWVK EVSFGGWVVE VEEEGVKVVP
KLPVACGCKV FSKIDLKSGY HQIEVDPADQ HKAAFKTRDG LYMFTVMPFG LTNAPATFQS
LMDKVLREQI GRFVVVYLDD VLIFSKSMEE HLKHLEEVLT ILKKMQLHLN LEKSEFGKDS
VIYLGHRLSA AGLEPETTKV EVIRNWPQPV NVRELRSFLG LASYYRKFVP RFSIVAHPLS
QLTSKNVPYS WDTTCTNAFQ ALKDALVSYS VLRIADPKLT FVVTTDASQY GIDVVLQQDD
GDGLRPLEFY NKRMPNVKVA TSTSMRELYA LRMALSHWKH YLLGCHFKVF LDHDTLKWIK
ERTTLSPTLI RWFHEIDIFD FELRHKKGCY NRVADALSSH PEYMTCLVKS YDLRKKLKEE
LVEHTAKDPE LSPILERLHL GWQIRNPLFY LFPEQPAGLT PGKPGFRAKY DRLLKIAIAD
MTNLLVYYGP WEVLDVIGED HFGPSYVVDV PAHLRTYPVF HASKLYLHRD ATTFDYRENM
ITRAIKGGRE INGIKQHVGK GRNKPYQVHF MYHPLDDLYW ISKQELLHVV KEAWTGRGGT
VDGAEKPIVL KGTGEEVGEG HRWLVVDSGC NVLLADPHEA GDEDVVADLV VGNVLAEGTD
ILDEAVGGAV LAKPAKLVNV VVDCLLRAEG GGEEGGPLEQ GEGRRVCTSR MLMRSGTVKP
SPTPAEQAEI DHKLADKKKE KEAKKKQKEE EARKKMKEKM ERELEEELKS IQEEEEEQQE
EVKTLVRRRP IDIPESSTTQ KPPVEPLNWA NQYYYSSELL KESEEECDVF LAKLAMVTDT
VERNLMMEEK RDELHSKLLA TRRQEIDEKK RLQAEGERLQ KALEAQKEDP SATEAQLALL
REAVLNTRQD MNLMRQTLQR VETHRVEFET VWNNFLEKSA KDVDHHVQTY IQVLDDHVSK
TFTLEVIEKI MKGAGGGGDD GDDDGDGDKK GKKKIGDPQD KPSQETGQGN KIKLKLPWTY
NGKKEESMLH WAAAIETYVY GQRIPYWDRV LMATSCMGGD AISFAISLQK EAGCSSMVEY
LQQTRIEDFL KLIREKFEDK NLARRTKMLI LSLPDRKWKS TSALKATMDE LLQCPDHGLT
PAQILNSFAR ALPDPLRTQL YPRTKEEGTT YEKFGKIAID HAGFLANYCH YWKDLQAAPR
PIGRPTLSFS KRSVDIIGTT AAPRPDPPPP ELSVPTPSPT ITVTSPRQFA HFIRQDDVTF
FTVNVTDLLD YGPPCPDVEL ISLKPDPPSI SMAPISTSVP PPSVHYDPPC PNAELISLEP
DPPSIPMAPI STSVPPPSVE STPPSLADAD VEELARYTAD LEPAVRDLIR EYHDVFPSSF
SYVGIPPMRD VENSIQLMPA YRVHHQAPYK LSIPEATELK RQLEELLRLG FIKPSNALWG
APVLFADGTL RLCIDYLGLN RYTIKNNYPM PRSDELFDRL AGNRFFTKID LLNSYHQIRV
VAADQPKTAF RSRFGHYEFT VMPSALTNAP ATFQRAMNDI FRDILEHYVL VYLDDILVYS
RTLEEHLRHL HDVLDRLRRH GFYAKLSKCR FAHHKVDFLG HYVSDQGLHM DDVKITAIAE
WLIPTSAKQL RSFLGLTSYY RAHVSAMVLW QRAIHQLKHG ARSLEQEDGI ERTTRDLQGS
DATEDSEFRI CERMPARGEV NEETYESQGA KDQNGHGKVA GAYSSQTGEQ IDSRQPYCFL
GLSGVYDIGR HYAFEQRRGV AGISCMRPAI GGVDKFDAFS PSLLFALMGS MQISNGNGCT
HCTPCTEPAK DHVKSTLRSL RGEKSAAHSK KNQMRLYSYM DDLPGARAAR KENPALDGRP
IISTPRPSTE KAVNTNRHSD YQRFDSSRAT CVVPPCLLLC SPGDKTVPVD SSLALEKALL
RIGCHAKAIL YDDLGHTDFS IWGTKTHCKT IDELGPHVSD ILDVVSERIP LDSMCWD
//