ID A0A388KRA3_CHABU Unreviewed; 2266 AA.
AC A0A388KRA3;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|PROSITE:PS50878};
GN ORFNames=CBR_g12166 {ECO:0000313|EMBL:GBG72594.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG72594.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG72594.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG72594.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the TFIIA subunit 1 family.
CC {ECO:0000256|ARBA:ARBA00010059}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG72594.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000168; GBG72594.1; -; Genomic_DNA.
DR EnsemblPlants; GBG72594; GBG72594; CBR_g12166.
DR Gramene; GBG72594; GBG72594; CBR_g12166.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0005672; C:transcription factor TFIIA complex; IEA:InterPro.
DR GO; GO:0006367; P:transcription initiation at RNA polymerase II promoter; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 2.
DR CDD; cd01647; RT_LTR; 2.
DR CDD; cd07976; TFIIA_alpha_beta_like; 1.
DR Gene3D; 3.30.70.270; -; 3.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 2.30.18.10; Transcription factor IIA (TFIIA), beta-barrel domain; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR041577; RT_RNaseH_2.
DR InterPro; IPR004855; TFIIA_asu/bsu.
DR InterPro; IPR009088; TFIIA_b-brl.
DR PANTHER; PTHR33064; POL PROTEIN; 1.
DR PANTHER; PTHR33064:SF36; RT_RNASEH_2 DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00078; RVT_1; 2.
DR Pfam; PF03153; TFIIA; 1.
DR SMART; SM01371; TFIIA; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 2.
DR SUPFAM; SSF50784; Transcription factor IIA (TFIIA), beta-barrel domain; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515};
KW Signal {ECO:0000256|SAM:SignalP};
KW Transcription {ECO:0000256|ARBA:ARBA00023163}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..2266
FT /note="Reverse transcriptase domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017419059"
FT DOMAIN 676..877
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 43..112
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 183..215
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 257..281
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 310..356
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 416..437
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1031..1053
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1080..1124
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1263..1301
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1367..1408
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1151..1189
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 63..77
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 190..204
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1038..1053
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1080..1107
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1272..1295
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1375..1405
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2266 AA; 256524 MW; 9A814B2C21D54A6F CRC64;
MAVRCLAPRS LLPAALAALL RHLRACAENS AALLLRSYDE EQDDLYSEPP RPSSPVTQDF
FAAAAERKRK RESPSPPRPR YQYDVNEIPQ QDGAPDSPED MAERLDILGR TGAADQHSSD
LRICAPAVAR EAQRRAIRDL TADEEARVDA DAALLRLAMG ECQVSTSGSF KEARPGGVRE
RCIPQVDGGD GEYEDYDAPD DDYNDQGNGS DWVPGPELAA ARVGISAELD MVLRARNARP
SPMEQEKLAN AALCVESGDK EDEEGELGRG GGGGGAKWQR SMRQSERAGV WRLGRDFEYA
LLGAKLARKK GGQERKGGES AGQSGMKREG RESRDGGRES IRRQEKEWGG HRGREVRKVK
QNGVEIGIER GEKRRGGRGV GRVGAGKWEL IVALGRERKM SDHGAQQQPQ LEVVMKVEGQ
QPSTSTPSPP LTATQQEKEL QEQLRRQQDG LVELERQEAA ELEVATDNSR RDYLLQQLDK
VLTDDRCAQV TKHLAETIIL EHKITSSYFT NWDDRFVRLE RRVNDLADQQ SKILDAIQNL
TAQLGLPQPQ SMEDWAFGLG VDKLLQALED RFADKERARK AVDRIACLGQ QRYSGTLHAL
FAEFEQLTST PGLVMSTDDL LTNFCRAAPE KFVVALYSVG HKDWRSFGRA ALDMEAKLHV
QAPSSDRRKR AFPRGGRKGK AAFTHVGFAS AVLFVPKGNG EFRMCIDYRG LNKITRKSTK
PVPRIDDLLD MVQGCTVFNK VDLKSSYHQI EMAEEDAYKT AFQTRYGTYE FLVMPFGLCN
APGTLQTEMH RIFRPYLDKF MVVYLDDISV FSKTVREHAE HLALVLQSLC DSQYKINREK
SSFGVPSVIY LHVISGDVLQ QDDGNGRRPV EFMSKKIKTR KLKDSTYEKE LYALVFALKH
WKHFLLGRHF KIFSDHSTLQ WMKSQGELND KLARYIQFID MFDFELKHKK GCYRVHTDAF
NGDNMAKVFD ARSGKRXFAE LGVKFLLAED RKNLAEVVKV GLEGGAEDKD VIKVDDDTDF
EEVAEDVVHG RLEGSGGIGE SEGHHEKLEV PEPRAERGLV GVLLADTDLV EATAKVDLGK
DDQGARKSEG KSPECGEAGR PRGTAEESRG AVHIQAEPEA PPHLLGAQVW STAFGAGGWE
QRWGAAIASS KQRLAREGEG QRRRKKKKKE EEEEEGRRRR RRKKKKKEEE EGGREKEPRY
LPAMVAAISY SLEQQMTQWF VVVALMTMVP VFVIWNDSNV SPVELCHADG YIVCLAGVSD
VGMKGPKQER LDEEEEDDEP PLNENDDDEP DDVNDPEEGP TIHNLVLAQF DKVTRSKNKW
KCTLKEGVMH LNGRDILFQK CTTPLYPGSR GAIRRHFKKE KRGVPQAGEV KGNCRGAGGE
EKEVGGGDAE IIEGGGREKE SCRRSSGRGV SIGRTQKAYR REEHSCGTKE EDMWMEKKIF
EWVASLSLGE DKEAMLYVPW EEKEAVVREM EAMEDPLDCQ TLEDEKRLES ILRLGREKKR
RREEANRMAK EVERIQSCKQ EVQAQQDIPA KLDKILSAIE GAKVVATAEA EARPRKEPVK
LKFLDSYSGK KDDNFDNWEA SINTYVYLQH IAPEKQVLVA FHALKDEAAS FARSLARAAD
CEHNMVAYSN LSPLPTFLKL LRERFTDVAR GARASDKLQT IHSRQWKRAR ALKAAMDDHV
AIPDHGVIET LLVNLFYRAM PEPLRGHFFD KTQQTNIAYD ALSREGVLFE AKSMPISTLW
HKDFDKGMKW KGCTISGQVR AKDHLILTFD EGGADEVPYS QIEWGLEEED SGHAMNRIFH
DCLDKFVIVY LDDILIFTKT VEEHVAHLDK IPSLMRQHKF KINGEKCEFG RTRVLYLGHE
MFAEGLKPDA AKVASIRDWP RPQSVTEMRS FLGMTGYYRN FVKNYSIVGA SLTDLTRLDT
PWEWTERCEA AFRHLKHALT HYEVLKLPDP DKPFIVTTDA SQYGIGAILA QQEGKKLRPA
EYMSKKMPSQ KLAKSTYEKE LYAVYKALTH WRHYLLGRSF ILRTDHQTLR WMRTQPVLSD
ALKRWIEVIE QYDFDPQYLK GEYNKVVDAL SRRPDFSANL LQRFWWPTLM RDAKLYMETC
QVCQRDKPRT QAPLGLLKPL PIPERPGESL SMDFMDTLVT SKSVMRYIYA IVDRFSNFAR
ANAARHKGRY VALQQIVKLY HLPAHKWKYQ HLTHSSVPNM PCVTPGMPTQ HMCHVSVKSQ
HGLLEGCYVS NAWNANSSNI GMPCQIEGTL LECGEPRQQA NGEFEW
//