ID A0A388K617_CHABU Unreviewed; 2440 AA.
AC A0A388K617;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|PROSITE:PS50878};
GN ORFNames=CBR_g51093 {ECO:0000313|EMBL:GBG65498.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG65498.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG65498.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG65498.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG65498.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000062; GBG65498.1; -; Genomic_DNA.
DR EnsemblPlants; GBG65498; GBG65498; CBR_g51093.
DR Gramene; GBG65498; GBG65498; CBR_g51093.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd01647; RT_LTR; 2.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR027806; HARBI1_dom.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR24559:SF434; RNA-DIRECTED DNA POLYMERASE HOMOLOG; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF13359; DDE_Tnp_4; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 2.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 105..285
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 467..515
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 899..963
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1126..1175
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1197..1219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1732..1777
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1799..1824
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1839..1927
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 782..817
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 1263..1290
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 916..940
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1150..1172
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1732..1764
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1887..1904
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 812
FT /note="D or N"
FT /evidence="ECO:0000313|EMBL:GBG65498.1"
SQ SEQUENCE 2440 AA; 275436 MW; 043FCF7B545654AD CRC64;
MVHDPCRLLL MKENCLAGDG CIWKHDRTIL RQGWCRTGEL VKMLPEIQGV VAKYPDLFEE
LTGVVEKEVV HAIEIIPGSG IPKGRIYRMS PGELDELRRQ LKELVEKGWI RPIVPPYGSP
VLFVPKKKEG TLRMCIDYMG LNAIIVKNRE PLPRIDDLLD RVQGCRYFSK IDLKSGYHQI
AIQPEDQHKT AFQTRYGLYE FVVMPFGLCN APDTFQHAMN RIFHDYMDKF EIVYLDDILI
FSKTVEEHVA HLDKVPSLLR QHKFKINGEK CEFGRTRVLY LGHEISAEGL KPDDAKVASI
RDWPRPQSVT EMRSFLGITG YYRNFVKNYS IVVAPLTDLT HLDTPWEWTD RCEAAFRHLK
HALTHHEVLK LPDPNKPFIV TMDASQFGIG AVLAQQEGKK LRPVEYTSKK MPSQKLAKST
YEKELYAIYK AHPLEALPPW EIVSNTTAIE TMTRLVVIEE VARGEERKEP KEKFMKDAIK
STRVTPEKPR SSKKEHKKPA EKKVEPTKEP AKEVEKLKRK EKVKIKLPFT YNGKRGEHLL
LWIAKIQTYC GTAPVEPESQ VAFTTACLCE TAKEWVLSEA NAAGFEDIGE WAKTLTLREF
LQKIEERFLD KTATNKAFDE LTTIGQKRWT SAGTLSHEVD RLLQVPGLNL QDNQVLYIFS
RALHEPIRGH LVAEAKSGKY NYRQRHDLAL QREQMTTHVK GTYASVVKFG TVGGYGKRVL
WRQKRQDHML VVFDDDTVEK LPHDESEGGE QQPQVEVVMK VEGEQPSAST PPSPPLTATQ
RLKELEEQLR QQQARLAEVE QQEAAELEAA TDHSRREYLL QQLERSLADD RCSQVTKHMA
AMILLEHKIT NSQFTNWDDR FVRLERRVDE LSAQQTKILK SIQGLTAQLA AAKLTTPQTP
LLQPKSSPHS SPPSSPHPSH AGSVHSSRSS TPSQKASAKA TYAVVTAGDQ RGPKITAPNK
FRGDPKTDVG DWAAGTRAYL RGFACAEQTN VATVLGLLEG PALKWATSTS SSLQQSMEDW
AFGLGVDRLL QALEERFADK ERARKAAGRI ARLGQQRYSG TLQALFLEFE QLTSTPELVM
SPDDLLTNFF RAAPEKFVVA LYNAGHKDWR SFGRAALEME AKLHVQAPSS DRRKGAFPRG
GRKGKATFTH AGSASGSDSD SQADTPSAGT RSAAETDVAA AVTQAVLGAL QLQKSFLRPQ
PQPVPRGRKR DTNAPSVSYP PEIQQVVDQY ADLMQEPFGL PNWPTKHHIE LLPGAVPPKG
RIYRMSSVEL EELRKQLEEL RKQLETLTSK DWIRPSTSEF GALVLFVPKG NGEFKMCIDY
MGLNKITRKS TEPLPRIDHL VDMVQGCTVF SKVNLKSGYH QIEMAEGDVY KTAFKTGHFK
IFSDHSTLQW MKSQGELNNK LVQYIQFIDM FDFELKHKKG CDNKVADALS RRPDSFALIS
STHSFGEETR QTIAHLLPQD ETFGPIVRNL QANPNSEPGY VLSSDLLYTC SRGEERLCIP
QDQRLKTLLM SECHDARGHF GFLKSYAALS QRFFWKEMRS EMLRYVDTCE LCQRNKVHRR
PHLGLLKPLP IPDGPAESLS IDFTDLGKTT PRGMRQVMVC VEVLLHEVLQ ESMPPDLRVL
LFQGRNLANF LRDFQDFCLI KKWNRKAILY MFPLFVCEGL SEEVYALKQK AKTWDELESS
LRLKFPEYGI EGQCGECSVK PVVGAPSQAE LSGLQRQVGA LEERLARLEE AKRGKRKVSE
DSPSELDDQR ERGVEEDDQE SPLSIGAPKR RVGAQAKNKG EGFVEIKEEQ DANMALPVIA
PDPKRRLTNR ETSSAAGQER QKNGWWPEHW VKGVFDCWGN TGRTPQGSEE KGVTGEKGEL
EPSKLTKAQQ GTSLRAVKVP KKGRARGRWQ LPRESWKDDL DGGDDGDSVN AVGLGTRGGA
SGSRGTAWSR VHGESQLWRQ VSPLHRRDSH RSLLSFLSMD PPSVRHSDDG WEDSLVVMTL
LVVQILQERN VAAMMVLQAA AALPLSIPGV GDNIALIAGG LLHNHIMQCA ALKTLHRTAE
RRRALWVLER NGGVWSDLQK EGEQYDRVSR RLCRLPPAMF LEVLDRIGPH IQRQDTNWRR
SLPAALKFAC ALGRWAMGTY YRQYGHSLGV GLASAQRSNI DVAEALIKEY GHVIAWPEGR
RLQETLDAFE RKGFPGCVGV IDCTHRYIEK PKGARVECFY DRTGGHSIVA QVCCDHEGRI
LNVFVGCPGA VHDARVLRLS PMYNNVQEGR VIFHSGACTL RDGGEVGYYL LGDAGYPLLP
WIMTLVGGSE RTTLQRQYDD FHTAARSIIE RCFGRLKGVW RNFIKRHICN LKTLTKEFMA
VCILHNLMID AHVVIYRSLL TAESDSEEEN DGVGRPRRQW RRRRINRRDE VHNREEGWGW
DDASLAHSTD ASKAIRDRLI AHVHHHARVR GAPPSNPWGV
//