ID A0A388LVR8_CHABU Unreviewed; 1639 AA.
AC A0A388LVR8;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBR_g41402 {ECO:0000313|EMBL:GBG86406.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG86406.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG86406.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG86406.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG86406.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000563; GBG86406.1; -; Genomic_DNA.
DR EnsemblPlants; GBG86406; GBG86406; CBR_g41402.
DR Gramene; GBG86406; GBG86406; CBR_g41402.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 582..597
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 796..975
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1327..1442
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..24
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 80..135
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 283..303
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 466..491
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 7..24
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..124
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 422
FT /note="D or N"
FT /evidence="ECO:0000313|EMBL:GBG86406.1"
FT UNSURE 425
FT /note="D or N"
FT /evidence="ECO:0000313|EMBL:GBG86406.1"
SQ SEQUENCE 1639 AA; 185575 MW; D0379CCBCD4A63C5 CRC64;
MSGSDPGDDR ADSRRLTRSE ALRVPHVFRT LDEGEERRLR AERRARALAA GIAAANAAAR
EKATAARAAA MANSASSAAS TSASSSRTSG THLGMPLSST SSSGTSGSTR SRQSSSQMAG
SQFSPLTPRE RELREIQQVE RLCQQLEEDL KKATDREKEI KSRAARLDTL EADKAALEGL
DESSLSDTLK VLRDNMLSLH AHVDSRMDFM QSTLDQILDA LTNPGFRPPA QSSLPLTAMS
GPFVVQAGTQ PSGTSAATAQ TVASSSSGPA VIATLPQQPI RQSGQQQGQW YPKTPMKLPL
PFSGERKDEE LNTWLRTVPV WVKAKRTLPE DEVVTAASYL EGKAAKWLDG VVVKAGYGRR
MADWAKSLTL DQFMEMVEAR WHNPQEAQRT TDAINKLDQR KFRSVRELTT TVESLILXPG
IDYSDQFLLT TFVRCLXENM RNLLASEART EYHTFETLSR KALDLEATLG NAQPTSGNDS
KKKKSPQEWK KKGAKLMMVE SDGTQTEIDE LPDLMDYSEY DGEEVAEGRT LAAVVKTKAA
GRGKGQPRGQ GKAASNQTKP AEWVKADLDQ DVWRDRRMCG VCINCGEYGH MQWKCQNAKD
RVWHLSRPVA STLANKERMI VTDYIKNVVC TFSYGGGELN HKISFLVSDD LPFDMLLGMY
YLEVAEPQFD WDKKVLKYKL PDGRTVRLTK FKASSIVDIY GCLCASTFYN YYKQNQEEGM
YLVYVSEKRE AVKTPPEIER VVAKFPDLFE EPSGIVDREV VHAIEIIPGS KTPKGRVYRM
APAELDELRK QLKELTEKGW IRPSTSPYRS PVLFVPKKGG TLRMCNDYRG LNAITVKNAE
PLPRIDDLLD RVQGCKYYTK IDLKSGYHQI AIRSEDEHKT AFQNRYGLYE FVVMPFSLCN
APGTFQHAMN RIFHDHLDKF IVVYLDDILI FSKSVEEHAQ HVETVLSLLR QHKYKVNLEK
CEFGRTKILY LGHEVSAEGI RPEDAKVASI RDWPRPQTVI EVRSFLGMCD YYRNFIKNYS
TVASPLTNLT RLDTPWDWSD ECEGAFKRLK HALMNHEVLM VPDPQKPFIV TTDASQYGIG
AVLAQQDGKK LRPIEYISKK XPSKKLAKST XEREXYALYK ALVHWRHFLL GRFFYLRTDH
QTLKXIKTQP ALSDALKRWI EVIDQYDFKL EYLKGEYNKV ADALSRRADY LGALVSEFGV
SNEVTQSLVG AYQEDPVTMD IIRKLQAKDK ATESEFVMVD GLIYLDKAGV KRLVVPSSEQ
LRSLFLGECH DATGHFGYKK TSANLVQRFW WPRMLDDAKK YVETCQVCQR DKPRTQAPLG
LLKPLPIPDG PGLSVSMDFM DTLVTSKSGK RHIFVIIDRF TKYARLIPMP EAARTECVIK
LFKDNWVRDF GLPKTIISDR DVRFTTVPAL GPNQLHLGWK RKSALDFLLP ENRPAATPGT
LEYGVQYEKL LQEAVEHMKK AQQAMIASEN QHRRQSTFQV GERVWVKASE LGQEFGISRK
LMPQYFGPRE VLDVVGDEMD GPTYVIRVPG HLRTHPVFHA SKLAPFAETD QFPSRRSMLP
PTMDGQVDID DIVDHRDMPV PKPLGRGRPP KPKREYRIRF RHHTNPKEDR WFTREELMET
APQVVPDYER KLKGKALAK
//