ID A0A388KM79_CHABU Unreviewed; 1635 AA.
AC A0A388KM79;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBR_g8448 {ECO:0000313|EMBL:GBG71145.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG71145.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG71145.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG71145.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG71145.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000141; GBG71145.1; -; Genomic_DNA.
DR EnsemblPlants; GBG71145; GBG71145; CBR_g8448.
DR Gramene; GBG71145; GBG71145; CBR_g8448.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 2.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 4.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF13650; Asp_protease_2; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 2.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 2.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 2.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 1..82
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1256..1416
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 244..288
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 342..424
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1569..1596
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 250..266
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 267..288
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 352..410
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1635 AA; 185007 MW; D81A56B906177E26 CRC64;
MPFGLTNAPA TFQAAMNTEF RHMLDRFVLI YLDDILVYSR SLDKHVEHLR TVLERLRQAK
YKANRDKCKF AGQELEYLGH YVTPQGIRLL ADKIEAICIW PEPTNTTDVR SFMGLAGYYQ
RFITGYSRIV APMTRLQSPK VPFVFDDGAR RSFQALKTAM LMAPVLSIYD PTLPTRVTTD
APSYGIGAVL EQHDGNDWHP VEYFSHKVPP INSLDDARKK EAAHVKEEEK KAKKTKAAKA
AKKAKAKVKA KPKEAKEEEK AKVAKKAKPA KATKKAKVKK KKAKAAKAKV KAEKAKAKAK
AEKAKKKANG KATATAKKAT VTATTLDFMQ STLDQIMDAL TRPGFRPPAQ SSLPLSAMSG
PFPVQAGTQP SGTSAAVAQT VASSSSGPAV GATSPQQPVP QQGQPQGQWY PKTPNKPPLA
FSGERKDEEL NTWLRTVPVW VKAKRTLLED EVVTAASYLE GKAAKWLDGV VVKAGYGRRM
ADWPKSMTLD QFMEMVEARW HNPQEAQRAT DAINKMDQRY AKDTSKDGGI LFPGFKLGKR
VRPVEVLAAS TQDEVGLIPS PLHDKAKAND SLVTPQTHLD QTMLCTWDVS EALEDLEGEW
SNKDPRESWV ALKKGPRGEH FVVKVDVGGR KCGAFIDIGS TRNYISRDCL ERLHLQNRVR
HLSRPVASTL ANKERMIVTG YIKDVVCAFS YGGGELNHKI SFLVSDDLPF DMLLGMYYLE
VAKPQFDWDK KVLKRKLPDR QTVRLTKFKA SSIIDTYGCL CTSVFYNYYK QNEEEGMYLV
YVSEKGEAVK TPPEIERVVA KFPDLFEEPT GVVEREVVHA IEIIPGMMPF GLCNAPRTFQ
HAMNQIFHDH LDKLVVIYLN DILIFSKSAK EHAQHVEKVL SLLRQQKYKV NLEKCEFGRT
KILYLGHEVS AEGIRPEDAK VASIRDWPRP HTVTEVRSFL GMCGYYRNFV KNYSTVASPL
TDLTRLDMPW DWNDECEGAF KRMKHALMNH EVLMVPDPQK PFIVTTDASQ YGIGAVLAQQ
DGKELRPIEY MSKTMPSKKL AKSTYERELY ALYKALVHWR HFLLGRFFYL RTDHQTLKWI
KTQPALSDAL KRWIEVIDQY DFKLEYLKGE NNKVADALSR RADYLGALVS DEVTQSLVGA
YQEDPVTMDI IRKLQAKDKA TESEFVMVDG LLYLDKAGIK RLVVPSSEPL CSLFLGECHD
ATGHFGYKKT SANLVQRFWW PGMFNDAKKY VETCQVCQRD KPRTQAPLGL LKPLPIPAGP
GQSVSMDFMD TLVTSKSGKR HIFVIIDRFT KYARLAAMPE TARTDFVIKL FKDNWVRDFG
LPKSIVSDRD VRFTSELWKK TAEQMGSQLQ MTSGNHPEAN GQAGQMNRVV QHLLRHYIKP
SQDDWDEQLP LIASLYNNAI HSSTGVSPNQ LHLRWKPRSA LDFLLPENRP AATPGTLEYG
VQYEKLLQEA VEHMKKAQQA MIASENQHHR QSTFQVGERV WVKASELGQE FGFSRKLMPQ
YFGPWEILDI VGDEMDGPTY VIRVPGHLRT HPVFHASKLA PFPETDQFPS RRSMLPPTMD
GQVDIDGIVD HRDMPVPKPL GRGRPPKPKR EYRVRFRHHT DPKEDRWFTR EEVIETAPQV
AAEYERLLKG KAPAK
//