GenomeNet

Database: UniProt
Entry: A0A388K4T8_CHABU
LinkDB: A0A388K4T8_CHABU
Original site: A0A388K4T8_CHABU 
ID   A0A388K4T8_CHABU        Unreviewed;      2385 AA.
AC   A0A388K4T8;
DT   05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 1.
DT   27-MAR-2024, entry version 18.
DE   RecName: Full=Reverse transcriptase {ECO:0008006|Google:ProtNLM};
GN   ORFNames=CBR_g49142 {ECO:0000313|EMBL:GBG65070.1};
OS   Chara braunii (Braun's stonewort).
OC   Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC   Chara.
OX   NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG65070.1, ECO:0000313|Proteomes:UP000265515};
RN   [1] {ECO:0000313|EMBL:GBG65070.1, ECO:0000313|Proteomes:UP000265515}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=S276 {ECO:0000313|EMBL:GBG65070.1,
RC   ECO:0000313|Proteomes:UP000265515};
RX   PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA   Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA   Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA   Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA   Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA   Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA   Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA   Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA   Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA   Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA   Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA   Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA   Rensing S.A.;
RT   "The Chara Genome: Secondary Complexity and Implications for Plant
RT   Terrestrialization.";
RL   Cell 174:448-464(2018).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GBG65070.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BFEA01000057; GBG65070.1; -; Genomic_DNA.
DR   EnsemblPlants; GBG65070; GBG65070; CBR_g49142.
DR   Gramene; GBG65070; GBG65070; CBR_g49142.
DR   Proteomes; UP000265515; Unassembled WGS sequence.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 2.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.70.270; -; 4.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 2.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 2.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF00078; RVT_1; 2.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 2.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 2.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT   DOMAIN          795..975
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          1272..1432
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   DOMAIN          1909..2088
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   REGION          1..363
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          408..445
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          691..719
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          1456..1483
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        35..101
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        111..133
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        147..208
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        257..311
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        325..356
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        408..427
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   UNSURE          1621
FT                   /note="E or Q"
FT                   /evidence="ECO:0000313|EMBL:GBG65070.1"
SQ   SEQUENCE   2385 AA;  267595 MW;  1BD04F248EE3A6C0 CRC64;
     MADTMKPAGG GADEARRGGR EAQRSGGGTG GAAEEVGRRS REAAEDGGRR SGEAAEEEGW
     SLGGAEKLRR RAGGGAEEQR SGRGRGEEGW RSEEDVVEGG RTGGAEKLQR MRGGGGEERT
     RGTAEKLRRR RGGGAENGGG AGEEGWRSGE AAEEQRRSSC RGGPEEERRN SGGAAEEQRR
     SSGGAAEKDG QRSGGGMEEE HGRRGGAEGA HRSGGAVGKV QRGSPGGENM QRTTGRGGPE
     ERRSSGGAAE EAQRISGGGG AEERRGVDVE WGSSGGGVED ERGRRGRGEE DERRRRNRGG
     EAQRRRGTEK LQRMSGGGGA EARRMSEGDG VEEQRTYTGG AEDEPGRGKG DEEGGDDPGG
     VSPYVAPSLV AIVSLGMPLS GGVSAGTTLH VPVQTVFTQS PSQVAVTTQP AVSQPLQPQG
     TQPQQLAPQP VSPGPQPGVM QGPGQTRWVP KTAIAAPKPF TGDKRGEDLD TWLRAVPVYV
     RCKLTLPHEE VLVAASYLEG SAARWLSGLV QLQGYGHDFR AWAASQKLED FLKMVEETWH
     DPQEARRATD AILTLHTRQF KSVREATDAV KRLICVPGVR YDPQVLLTSY LRRFSQPLRN
     QLAKEANINM HNFPSFSKVA LDLEANIGHG QAPTTDGRKK TLPPNWKAKG RLMFVDNDGS
     TIELDGNFQE GVGSKAGSVD ASEGGVVAAV SQKGKATGRR CGGSRSRSQV DPNAPPWEKA
     GLTKDVWRDR YSRQASIRCG QYGHIQFKCH NKKVTEKIPP TMGQVLGSSQ PVGSNVANTS
     GSELDELRRQ LKELVKKGWI RPSVSPYGSP VLFVPKKKEG TFRMCIDYRG LNAITVKNRE
     PLPRIDNLLD RVQGCRYFSK IDLKSGYHQI AIGPEDQHKT AFQTRYGLYE FVVMPFGLCN
     APGTFQHAMN WIFHDYLDKF VIVYLDDILI FSKTFEEHVA HLDKVLSLLR QHNFKINCEK
     CEFGRTRVLY LGHEISAEGL KPNDAKVVSI RDWPRPQSVT EMRSFLGMTG YYRTFVKNYS
     IVATPLTDLT RLNTPWEWTD ECEAAFRHVK HALTHYEVLK LPDPDKPFIV TTDASQYGIG
     VVLAQQEGPK LRPVEYMSKK MPSQKLAKST YEKELYAVYK ALTHWRHYLL GRPDFSGALI
     IEFDLTNNVT QSLVEAYRED QFMSEIVRRL EAKDKKTSAE FELVNGLLFL EKARNKRLCV
     PNSESLRSLF LGEFHDATGH FGYKKTAANL VQRFWWPTMM RDAQLYMETC QVCQRDKPRT
     QAPLGLLKPL SVLERPGESL SMDFMDTLIT SKSGMCHIFV IVDRFSKYAR LVAMPETAKT
     EYVIRMFKEN WVRDFGLPKS IVSDRDVRLT SELWKAAAAE QGTQLQMTSG NHPEANDQTE
     QLNRAVQHLL RHYIKPNQVD WDEKLALIAS LYNNVVHSAT GVSPNSLLLT FKPRLPLDFL
     LPENQSTAAL GTLEFAYCYE QKMQQAVEQM QKAQAAMIES ENRHRWPSTF QVGDRVWVKS
     SQLGQEYGIS RKLMPQYFGP WEVLDIVGTD PDGPSYVIRI PGHLCTYLVF HASKLAPFKE
     TIQFPSRRSM LPPTMDEEWT SMILWITESC PYQDPQGTVR PRSIEADLCP PVQKYRSSLS
     EAIDLPAFYR YGVFNGPDTI SSTVATIKTD ITKLQTKPDA TTKNYKMPHF DISKFDDHNK
     TDALAWWQRF LTEASYRTVP DDYLMKALYL QLIGGAQAWM NHLAATHTCT IAELHTHITW
     KDFEKLWFTR FMVHNVMKAA MNEVYTCSQG SMPTRDWTTK WQKIVTTPGF DLSFPNRRSE
     FFSRSCAGLR SALGNEYDYT SFQAILDRAN LVIQTDDKAA TRNTPSHIMW PSRAINYRDV
     FEAPTGTVLD RPIRHGITLE AGTVPQRGCI YRMSEEELEV LRAQLDDLLD KGWIRPSCSP
     YGPPVLFVRK KNKDLRLCID YRKLNAQTVK NAGPLPRIDD LLERLGGATY FSKLDLKSGY
     HQIEIQPQDR YKTAFKTRYG HFEWVVMPFG LTNAPATFQA VMTTEFCDFL DRSVLIYLDD
     ILVYSRTLDE HIIHLRVVLD RLRLAKYKAN LDKCEFAKQK LEYLGHFVTP KGIRPLVDKI
     QAIVDWPEPR CTTDVRSFMG LAGYYQRFVE SYSKVAAPLS RLQSSKVPFE FDDAARGAFT
     TLKAAMQAAP ALRIYDPTLP TPVTTDASGY GIGAVLEQCH EDGWHPVEYF SQKVPLINTL
     DDARKKELLA FVAVLKRWRH FLLSRRRFKW NTDNNPLTFY KTQDTVTSTI GRWMYYINQF
     DFDPCHIPGP ANRAADALSR RPDFCAIVTM AFDLDDDLQP HFVKGYKSDP TYSTIYAELS
     SDHPPASHYR ISDRFLLLHT RGKDLLVVPQ ERILRTRLLG EVHDA
//
DBGET integrated database retrieval system