ID A0A388LCK9_CHABU Unreviewed; 1116 AA.
AC A0A388LCK9;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|PROSITE:PS50878};
GN ORFNames=CBR_g30395 {ECO:0000313|EMBL:GBG80026.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG80026.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG80026.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG80026.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG80026.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000335; GBG80026.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A388LCK9; -.
DR EnsemblPlants; GBG80026; GBG80026; CBR_g30395.
DR Gramene; GBG80026; GBG80026; CBR_g30395.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 678..857
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 1..30
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 143..205
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 144..158
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 159..174
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1116 AA; 126316 MW; 676AF6073B662632 CRC64;
MKVEGQQPST STPPSPPLTA TQQEKKELQE QLRWQQDRLA ELERQEATEL EAATDNSRRD
YLLQQLDKVL TDDRCAQVTK HLAETIILEH KITSFYFTNW DDRFERLERR VDDLAAQQSK
ILDAIQNLTA QLSTAKLIAP QLPLLKPKPS PPSRPSTPPG SAHYSRSPSH SQKAAGKATY
AAVTAGDQRG PKIPPPNKFR GDDPKTDVGD WAAGTKAYLR GFVCAEQTKA GTVLGLLEGP
ALKWATSTSS SLQQSMEDWT FGLGVDRLLQ TLEDRFADKE RASKAADRIA RLGQQRYSGT
LQTLFTEFKQ LTSTPGLVMS RDDLLTNFCR AAPEKFVVAL YSAGHKDWRS FGRAALDMEA
KLHVQAPSSD RRKVSSVAPS STQVDESAAV SIETVGDNVC LGTARATSFC YEDPDPDQDP
GQDELQSLAT FFLHLKEQKK RKSDLIMLRP LINRIRITGF LDCGATRNFI SPSAVKKLHL
RMKVQQLQQP LLVRIDDSTV PSIREKVQGI PVTFDSAGEV RHSLNFYIFP ELPFDLVFSM
QWLKAVNPRI DWQIPKVELP NSQGVYQPCM IAADHHLKTS CYCLRAREFH DLSRRYHHER
LFIALVKQTH APPVSCPPEI QQVVDQYADL TKEPFGLPNW PTKHHIELLP GAVPPKGRIY
KMFPAELEEP RKQLETLTSK GWIRRNTSEF GAPVLFMPKG NGEFRMCIDY RGLNKITGKS
TEPLPRIDDL LDMVQGCTVF SKVDLKSGYH QIEMAEEDVY KIAFKTRYGI YEFLVMPFRL
CNAPGTFQTE MYRIFRPYLD KFMVVYLDDI LVFSKTAREH VEHLALVLQS LRDSLYKINR
EKSSFGVPSV IYLGHVISGD GPTPEAAKIA AIQEWPQPQT VRDVRSFMGL ASYYRKFVRN
FSAVAASLTN LTKKDTPFLW SLPCQLAFIR LKKALTRAPM LKLPDPTLPF ILTTDASQYG
IGAVLQQDDG NGLRPVEFMS KKIKTQKLQD STYEKELYAL VCALKHWKHF LLGRHFKIFS
NHSTLQWMKS QGKLNDKLAR YIQFIDMFDF ELKHKKGCYN KVADALSRRP DSFAPISSTH
SFGEETRQTI ARLLPQDETF GPIVRSLQAD PNSEPG
//