ID A0A388LKI5_CHABU Unreviewed; 1345 AA.
AC A0A388LKI5;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBR_g36350 {ECO:0000313|EMBL:GBG82819.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG82819.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG82819.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG82819.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG82819.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000419; GBG82819.1; -; Genomic_DNA.
DR EnsemblPlants; GBG82819; GBG82819; CBR_g36350.
DR Gramene; GBG82819; GBG82819; CBR_g36350.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00024; CD_CSD; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR023779; Chromodomain_CS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR24559:SF425; RT_RNASEH DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS00598; CHROMO_1; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 321..500
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 911..1076
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1235..1297
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 1..20
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 128..157
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 199..241
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 616..643
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1286..1345
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..17
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 206..238
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 616..633
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1345 AA; 153227 MW; 99FAC9F84F0EE8DE CRC64;
MKGSSFEEIN AAPSTSQGDF VRPRVVEEAR LATSGSHSPI SVTLGDNKTQ RFFDQTVPDL
HFSLTLQPPD STQAPRRYHS SAYFDVLETG YDFILGTPWS RRFRSTEAEW AFETLILKTK
CGQTHLVPFI GTTGGTPPNL PPQDPRPSGS HPDISFTSPR QFAHFIRQED VTLYSVNVMD
LLRYDPLCPE VELISLEPDP PDPSSIFTAL ISTSTPQPAD TPSTSQVPTP STAESTHTSR
ADADVKELMR FTTDLEPVIC DLIREYSDVF PPYFSYSGIP PMRGVEHSIQ LVPDYRVHHQ
APYRLSIPEA TELKRQLEEL LRLGFIKPSN SPWGAPVLFA RKADETLRLC IDYHGLNRYT
VKNSYPMPRA DELFDRLTDN RFFTKIDLRS GYHQIRIAAE DQPKTAFRSR FGHYEFTVMP
FGLTNAPATF QTTMNDIFRD ILEEYVLVYL YDILVYSRTL EDHIRHLRDV LQCLRKHGFY
AKLSKCCFAQ RKVDFLGHHV SDQGLHMDDA KINAIAEWAV PTSAKQLRSF LGLTSYYSNF
IQGYARYSYV LTSTLLRKNP PWFWTPLCED AFRALKKAMT CAPVLRLPDF DHPFIVTTDA
SDFAVGTVLS QVFPSPPDSP YPHVPPSPPP LADTASRLTP ILPPLPSTDS PPITYSPTIA
EDGTVEARAG DCLIAFYSRQ LLPAEINYTA DEREVLAVVY ATRHWRHYLH GAPFTVRTDN
SVVQAFLTKP KLSPRQARWW RDLSEFSFTT QPIKGETNRV ADALSRRPYH NQEPIHLAVI
SITSVDQSVI DAYRTQYCHC PDYRVIHTTL RSGKTVPSYS LGENGLVYWH GRSGQLEPRI
CVPSTGQLRI QAVAEFHNQA AAGHMGFHKT LARVCRLYVW PKSKDFVKAY IQECPTCQEV
NSANHLPYGL LQPLPILEGR WQSISMDFIG PLRPPTQRGH DAXLVVVDRF TKXARFVPCR
YRISAREVAD IVFDRVVRDH GLPQSIISDR DPRFTNTFWR RLHEVYGSQL CFSSSYHPQT
DGQTEVTNKT LGNILRKFVR DDQQWDLHLA HAEIAYNHAV SPATGMSPFY CDLDYHPRVP
ADFLRPLRLR PDTRCPALDD WIAHMAAIMK XAHESLXXSQ TRMAARANRS RMDHPFKVGD
DVLVDARHLQ LEADMLRKFR RRFFGPCRIL QAVGSNTAAS PVSFRVKLPD YLXQARVHDV
YHVSLLRPYR RPSERFAGRP YERPPLIMVD GHEEFVLSDI VSRRVTDDTP PRIEYLVRWK
GYPDEEATWE PLEHLQHTRM LVRAYDRARR AGTSASTQPT DPLPPPPAEE IAEDEPAPSE
AVPQPQMVAS ERPQRTHRRP SCYDD
//