ID A0A388KT94_CHABU Unreviewed; 2642 AA.
AC A0A388KT94;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBR_g13003 {ECO:0000313|EMBL:GBG73284.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG73284.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG73284.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG73284.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG73284.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000180; GBG73284.1; -; Genomic_DNA.
DR EnsemblPlants; GBG73284; GBG73284; CBR_g13003.
DR Gramene; GBG73284; GBG73284; CBR_g13003.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 2.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 3.
DR PROSITE; PS50994; INTEGRASE; 2.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 271..458
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 809..931
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1160..1175
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 1386..1549
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 156..211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1116..1137
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1735..1758
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2197..2245
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2286..2505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2530..2642
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 174..189
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2197..2211
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2226..2245
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2286..2315
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2346..2364
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2386..2498
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2530..2547
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2620..2642
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2642 AA; 300845 MW; 79E795848B90C53D CRC64;
MRSYFEVMRT PQEDRSMIMG TNTEPAVRNH IELQAVAAGY ERIDLTDWLK VTPVRALEDL
LLDRYRDKHA ALKARLKLEA LKGQTWRSSM QALEQHLTGL FTTPDLGMTD VSCMDVXMGV
APKEYLSLLA LKDHATLREL MKDLVDLEAK DLARRKKAPA AGGKPQRKRF GSSNQLALHD
HREAEDQSYA DDLSLDDDLE PDSDTGCSTS ALECDRNDDE KLNAFRKTAS NKGPNCGSGV
RITFDKDRTV THELNFYFLD KCPFDAVIGL GWLKAHCLRT MWADNQFVVL DAKGNERTVL
LDETRESPVT LLSANKFCRK NAYSLPRIDD LLDAAGGCKI FSKIDLKSGY HQIEVDPSDQ
HKTAFKTRDG LYEFIVMPFG LTNAPTTFQC LMDKVLRHQL NRFVVVYLDD ILIFSKFMDE
HVKHLEEVLQ VLKEAQLHLN LEKSEFGRGS VIYLGHRLSA NDLEPEATKV EVIQNWPRLA
NVRELRSFLG LASYYRKFVP RFSIIAHPLS RLTSKNVAYA WCEKCEFAFQ ALKEALVNHP
VLRIADPNLT FVVTTDASQF GIGAVLQQDD GDGLRPLEYY SKRMPSHKVA TSTYMRELYA
LRKALAHWKH YLLGRHLKVY SDHQTLQWIQ TQSELSPTLT RWLHDIDVYS FEFKHKKGCY
NRVADALSRH PEYLTCLVGS YDLRRKLKED LIEHTAKDPD LSPILEQLKA HPNSQPDFHK
CEGLVFRRYG KFDRLCVPNH APLRTHFLDL AHGRSEHFGF EKTYGSLLQQ FDWPGMKGSA
QKFIAECQVC QRIKVHRHKP YGLLRPLPIP DGPGESISID FTDMGKVSEA GNSQVMVIVD
RFSKFLNLIP LPPHAPTKLV IEEFHQQYIL QSGVPKTIVS DRDTRFISKD WKDFTSQIYD
IKLNRTSGRH TEANGLAEEI NQTVIQLLRA MIVPDQNTWD KELHKVKGLM LMSHLAEVAL
HCVEAQRATD AINKLDQRKF KSVRELTTTV ESLILVPGIN YSDQFMLTTF VRCLPENIRH
LLASEALTEY HSFETLSRKA LDLEATLGNA QPTSQIDTKK KKSPQEWKKK GAKLMMVESD
GTRTEIDELT DLMDYTEFDG EEVAEGSTLA AVVQTKASGR GKGQPRSQGK AASNQTKQAE
WVKAGLDQDV WRDRRVRGAC INCGEYGHTQ YKCQNAKVSQ KDRVLHLSRP VASTLANKER
MIVTDYMKDV VCTFSYGGGE LNHKISFLVS DDLPFDMLLG MYYLEVAKPQ FDWDKKVLKH
KLPDGRTVRL TKFKASSIID TYGCLCASAF YNYYKQNQEE GMYLVYVSEK GEAVKTPPEI
ERVVAKFPDL FEEPTGVVER EVVHAIEIIP GRMFHDAKQY VETCQVCHRD KPRTQATLGL
LKPLPIPAGP GQSVSMDFMD TLVTSKSGKR HIFVIIDRFT KYARLVAMPE TARTDYVIKL
FKDNWVHDFG LPKSIVSDRD VRFTSELWKK TAEQMGSQLQ MTSGNHPEAN GQAEQMNRVV
QHLLRHYIKP NQDDWDEQLH LIASLYNNAI NSSTGYEKLL QEAVEHMKKA RQAMIASENQ
HRRQSTFQIG ERVWVKASTS RKKEAAGSDS TATRVKEAFV PQPEARQLQL QATKVAFKWV
TQGQMVPPLS KGKYRVKCTL CGADWVASYT RVWPHFTRKT LPFPGRFPEM LHILAATGHK
IDYKKMQRLI QLYRMEHNIP FDGLTPAIPE EAQGDTLDKF IVPPAGRRSR TVPLSTVDEE
GPQGIDEEGG GMARTSAQGS AAGQKSGKLT QVSIKRWTTN DSQRRLDIAW GMHLCRHAHC
LSLLMKDICE LDWVKEVVQR TKMMVKFIRR HHHTASLYTK CSELSWELTL ILPTEVWFAS
SYMMMSRFWG RRRVLEDMME EGWRLLRWSA RKDRDKSDTT YMTVTENEWW VKLRTVLDVL
EPIDELLRKM DRNGTAPPSL WHFDEGLGRR LNALTGLTDV QWQAIMKAVR KRTKMMRQPV
HAANFLLDPR RRDMKWLLDM QTPLVQNTLK FFLSQCKEEV TWGCREQLDL WADLQAFHRE
PTGDVVKDPV TGKEVEESLW TEFVKFDSSL TQMTASEWWN AHGASHKKLR DIAVRVTALL
DIKKKSMGSL AGYLDMWAAF FDDVEAPPPN DPAALPKAAT VADLTGDELV HQANLTKTPW
ARVLKYRAVD ESSSSDNSDD GEDLIWRGKG KKKSVVEVLD DGKGKAQMSA DVEEDDAEES
EEDDENFTLR SPRASDMSSD DHEVDEALTR NVERGHLDSD LKVLRPRGMD FNARIGADTD
LDDDAERARA QSLAQRDRAL VEQRVREETA KRTVVPPSGR KDMTPGGRGS CLVADDVQQQ
QHKEGVPQQQ EEEEMHHQRE DGLQQQEQAE GLQQKTHEQE DGLQHQPQQE DGLQQQQHQH
QQHQQHQEPA QDGLQQQQED RLQQHQQQHQ EPGQDGLLQQ QETGQQQLQQ QHHEGLQQQQ
PDEDGLRQHQ QQQHDGLQQQ QQEKESAQPI TRVYSRRPQG STAAAILDAV QTLPFPPEIE
DMQHDNLDVS LAGRKRKVQP DTGPKVARKR GRPRKYPLAA SAGAAAGVDG AEGGEVQEHV
IAGRKVQGGP TLAVDGVETS LPKRPKRKAA KKARVVEEDP TDSDAEESSS ESDEEGRESD
WE
//