ID A0A388K4T8_CHABU Unreviewed; 2385 AA.
AC A0A388K4T8;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN ORFNames=CBR_g49142 {ECO:0000313|EMBL:GBG65070.1};
OS Chara braunii (Braun's stonewort).
OC Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC Chara.
OX NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG65070.1, ECO:0000313|Proteomes:UP000265515};
RN [1] {ECO:0000313|EMBL:GBG65070.1, ECO:0000313|Proteomes:UP000265515}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S276 {ECO:0000313|EMBL:GBG65070.1,
RC ECO:0000313|Proteomes:UP000265515};
RX PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA Fujiyama A., Delaux P.-M., Quint M., Theissen G., Hagemann M., Harholt J.,
RA Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA Rensing S.A.;
RT "The Chara Genome: Secondary Complexity and Implications for Plant
RT Terrestrialization.";
RL Cell 174:448-464(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBG65070.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BFEA01000057; GBG65070.1; -; Genomic_DNA.
DR EnsemblPlants; GBG65070; GBG65070; CBR_g49142.
DR Gramene; GBG65070; GBG65070; CBR_g49142.
DR Proteomes; UP000265515; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 2.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 4.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 2.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17919; RT_RNaseH_2; 2.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 2.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 2.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 2.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT DOMAIN 795..975
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1272..1432
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 1909..2088
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 1..363
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 408..445
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 691..719
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1456..1483
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 35..101
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 111..133
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 147..208
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 257..311
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 325..356
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 408..427
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 1621
FT /note="E or Q"
FT /evidence="ECO:0000313|EMBL:GBG65070.1"
SQ SEQUENCE 2385 AA; 267595 MW; 1BD04F248EE3A6C0 CRC64;
MADTMKPAGG GADEARRGGR EAQRSGGGTG GAAEEVGRRS REAAEDGGRR SGEAAEEEGW
SLGGAEKLRR RAGGGAEEQR SGRGRGEEGW RSEEDVVEGG RTGGAEKLQR MRGGGGEERT
RGTAEKLRRR RGGGAENGGG AGEEGWRSGE AAEEQRRSSC RGGPEEERRN SGGAAEEQRR
SSGGAAEKDG QRSGGGMEEE HGRRGGAEGA HRSGGAVGKV QRGSPGGENM QRTTGRGGPE
ERRSSGGAAE EAQRISGGGG AEERRGVDVE WGSSGGGVED ERGRRGRGEE DERRRRNRGG
EAQRRRGTEK LQRMSGGGGA EARRMSEGDG VEEQRTYTGG AEDEPGRGKG DEEGGDDPGG
VSPYVAPSLV AIVSLGMPLS GGVSAGTTLH VPVQTVFTQS PSQVAVTTQP AVSQPLQPQG
TQPQQLAPQP VSPGPQPGVM QGPGQTRWVP KTAIAAPKPF TGDKRGEDLD TWLRAVPVYV
RCKLTLPHEE VLVAASYLEG SAARWLSGLV QLQGYGHDFR AWAASQKLED FLKMVEETWH
DPQEARRATD AILTLHTRQF KSVREATDAV KRLICVPGVR YDPQVLLTSY LRRFSQPLRN
QLAKEANINM HNFPSFSKVA LDLEANIGHG QAPTTDGRKK TLPPNWKAKG RLMFVDNDGS
TIELDGNFQE GVGSKAGSVD ASEGGVVAAV SQKGKATGRR CGGSRSRSQV DPNAPPWEKA
GLTKDVWRDR YSRQASIRCG QYGHIQFKCH NKKVTEKIPP TMGQVLGSSQ PVGSNVANTS
GSELDELRRQ LKELVKKGWI RPSVSPYGSP VLFVPKKKEG TFRMCIDYRG LNAITVKNRE
PLPRIDNLLD RVQGCRYFSK IDLKSGYHQI AIGPEDQHKT AFQTRYGLYE FVVMPFGLCN
APGTFQHAMN WIFHDYLDKF VIVYLDDILI FSKTFEEHVA HLDKVLSLLR QHNFKINCEK
CEFGRTRVLY LGHEISAEGL KPNDAKVVSI RDWPRPQSVT EMRSFLGMTG YYRTFVKNYS
IVATPLTDLT RLNTPWEWTD ECEAAFRHVK HALTHYEVLK LPDPDKPFIV TTDASQYGIG
VVLAQQEGPK LRPVEYMSKK MPSQKLAKST YEKELYAVYK ALTHWRHYLL GRPDFSGALI
IEFDLTNNVT QSLVEAYRED QFMSEIVRRL EAKDKKTSAE FELVNGLLFL EKARNKRLCV
PNSESLRSLF LGEFHDATGH FGYKKTAANL VQRFWWPTMM RDAQLYMETC QVCQRDKPRT
QAPLGLLKPL SVLERPGESL SMDFMDTLIT SKSGMCHIFV IVDRFSKYAR LVAMPETAKT
EYVIRMFKEN WVRDFGLPKS IVSDRDVRLT SELWKAAAAE QGTQLQMTSG NHPEANDQTE
QLNRAVQHLL RHYIKPNQVD WDEKLALIAS LYNNVVHSAT GVSPNSLLLT FKPRLPLDFL
LPENQSTAAL GTLEFAYCYE QKMQQAVEQM QKAQAAMIES ENRHRWPSTF QVGDRVWVKS
SQLGQEYGIS RKLMPQYFGP WEVLDIVGTD PDGPSYVIRI PGHLCTYLVF HASKLAPFKE
TIQFPSRRSM LPPTMDEEWT SMILWITESC PYQDPQGTVR PRSIEADLCP PVQKYRSSLS
EAIDLPAFYR YGVFNGPDTI SSTVATIKTD ITKLQTKPDA TTKNYKMPHF DISKFDDHNK
TDALAWWQRF LTEASYRTVP DDYLMKALYL QLIGGAQAWM NHLAATHTCT IAELHTHITW
KDFEKLWFTR FMVHNVMKAA MNEVYTCSQG SMPTRDWTTK WQKIVTTPGF DLSFPNRRSE
FFSRSCAGLR SALGNEYDYT SFQAILDRAN LVIQTDDKAA TRNTPSHIMW PSRAINYRDV
FEAPTGTVLD RPIRHGITLE AGTVPQRGCI YRMSEEELEV LRAQLDDLLD KGWIRPSCSP
YGPPVLFVRK KNKDLRLCID YRKLNAQTVK NAGPLPRIDD LLERLGGATY FSKLDLKSGY
HQIEIQPQDR YKTAFKTRYG HFEWVVMPFG LTNAPATFQA VMTTEFCDFL DRSVLIYLDD
ILVYSRTLDE HIIHLRVVLD RLRLAKYKAN LDKCEFAKQK LEYLGHFVTP KGIRPLVDKI
QAIVDWPEPR CTTDVRSFMG LAGYYQRFVE SYSKVAAPLS RLQSSKVPFE FDDAARGAFT
TLKAAMQAAP ALRIYDPTLP TPVTTDASGY GIGAVLEQCH EDGWHPVEYF SQKVPLINTL
DDARKKELLA FVAVLKRWRH FLLSRRRFKW NTDNNPLTFY KTQDTVTSTI GRWMYYINQF
DFDPCHIPGP ANRAADALSR RPDFCAIVTM AFDLDDDLQP HFVKGYKSDP TYSTIYAELS
SDHPPASHYR ISDRFLLLHT RGKDLLVVPQ ERILRTRLLG EVHDA
//