GenomeNet

Database: UniProt
Entry: A0A388JZ42_CHABU
LinkDB: A0A388JZ42_CHABU
Original site: A0A388JZ42_CHABU 
ID   A0A388JZ42_CHABU        Unreviewed;      3883 AA.
AC   A0A388JZ42;
DT   05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 1.
DT   27-MAR-2024, entry version 17.
DE   RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|PROSITE:PS50878};
GN   ORFNames=CBR_g36536 {ECO:0000313|EMBL:GBG63052.1};
OS   Chara braunii (Braun's stonewort).
OC   Eukaryota; Viridiplantae; Streptophyta; Charophyceae; Charales; Characeae;
OC   Chara.
OX   NCBI_TaxID=69332 {ECO:0000313|EMBL:GBG63052.1, ECO:0000313|Proteomes:UP000265515};
RN   [1] {ECO:0000313|EMBL:GBG63052.1, ECO:0000313|Proteomes:UP000265515}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=S276 {ECO:0000313|EMBL:GBG63052.1,
RC   ECO:0000313|Proteomes:UP000265515};
RX   PubMed=30007417; DOI=10.1016/j.cell.2018.06.033;
RA   Nishiyama T., Sakayama H., Vries J.D., Buschmann H., Saint-Marcoux D.,
RA   Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D.,
RA   Vosolsobe S., Rombauts S., Wilhelmsson P.K.I., Janitza P., Kern R.,
RA   Heyl A., Rumpler F., Villalobos L.I.A.C., Clay J.M., Skokan R., Toyoda A.,
RA   Suzuki Y., Kagoshima H., Schijlen E., Tajeshwar N., Catarino B.,
RA   Hetherington A.J., Saltykova A., Bonnot C., Breuninger H., Symeonidi A.,
RA   Radhakrishnan G.V., Van Nieuwerburgh F., Deforce D., Chang C., Karol K.G.,
RA   Hedrich R., Ulvskov P., Glockner G., Delwiche C.F., Petrasek J.,
RA   Van de Peer Y., Friml J., Beilby M., Dolan L., Kohara Y., Sugano S.,
RA   Fujiyama A., Delaux P.-M., Quint M., TheiBen G., Hagemann M., Harholt J.,
RA   Dunand C., Zachgo S., Langdale J., Maumus F., Straeten D.V.D., Gould S.B.,
RA   Rensing S.A.;
RT   "The Chara Genome: Secondary Complexity and Implications for Plant
RT   Terrestrialization.";
RL   Cell 174:448-464(2018).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GBG63052.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BFEA01000035; GBG63052.1; -; Genomic_DNA.
DR   EnsemblPlants; GBG63052; GBG63052; CBR_g36536.
DR   Gramene; GBG63052; GBG63052; CBR_g36536.
DR   Proteomes; UP000265515; Unassembled WGS sequence.
DR   CDD; cd00303; retropepsin_like; 1.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 2.
DR   Gene3D; 3.30.70.270; -; 6.
DR   Gene3D; 2.40.70.10; Acid Proteases; 2.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 3.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041577; RT_RNaseH_2.
DR   PANTHER; PTHR33064; POL PROTEIN; 1.
DR   PANTHER; PTHR33064:SF40; REVERSE TRANSCRIPTASE_RETROTRANSPOSON-DERIVED PROTEIN RNASE H-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF17919; RT_RNaseH_2; 3.
DR   Pfam; PF00078; RVT_1; 2.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 3.
DR   PROSITE; PS50878; RT_POL; 2.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Reference proteome {ECO:0000313|Proteomes:UP000265515}.
FT   DOMAIN          1644..1823
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          2693..2872
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   REGION          88..140
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          246..314
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          341..487
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          856..884
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1006..1031
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1086..1105
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1236..1400
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          3221..3264
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          2069..2099
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        88..132
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        246..266
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        341..372
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1236..1263
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1276..1307
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1308..1332
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1361..1376
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        3249..3264
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3883 AA;  432215 MW;  23183CE22E0F3427 CRC64;
     MLAKGYWSGL NTGLAARQFF PVTSSSISEH GVETGAASEV GKKQTSEVAS SCPAQTSHEE
     NVFSMLFRLH IASIGALFLE DYQPTLGSGL GHDTQEGNGN QQQPNPPLPF LTSFPHQSKS
     DSWTSGLSGN QRFLSPGRDD DGRDGAFVDS FWIEMFAGDG YRAHASLGWE GFRMGMNVEG
     EDGEAALDRQ QVQKSSEREG KGKEAARRAK KTAKIECSIG QCCLNLCGPQ GCGQVGDIEN
     IVKRRMMSEA TERRDGDKDV HGEEDAEGIP TAGSQVNNDH RDEQGEVDGQ AGPAGDEGDY
     VRPEFGVPVP ESFAAQESVD AGSTFTDFGS AADRFSVWSS RYGRSSTFSE TGELGTRLSA
     EGQIQDVSKG QGGSRAGGSD GEAPSEARSG QKLGMLSRSR SLGSMRDLEK ENLEHNVGES
     GGGIDLRRPT DFPATPKAAP VDQRKHTGSK STHSQNSGRR SLSRLKGLYV DGSSGEKNKG
     DTWEDALSPR SVGEMSFMTA EAESSDGELM TEMGLLNEMS TSGIMWGKQW AVASLQIGSL
     VLRRGDYDGV ADLRAVAEQV MKGSDEIDIT LLVAENLTKV RMKSEGGLVY IDTPVLEGVI
     TRAWRYYTSL LQSVSKMEIL KMALLSSRPV SAETVTESGD VSDSHHNASS SSGMSITWQG
     EGSMVISDLS VVLARPPLDA NVTGEGLLLE LDSSLNVVMD MQEILLSTQI SGFAARGLLT
     SRKPLTVRRA DIIGGNGYHK EVDESVVRDP ELAVQFPTET FYEAERETRQ AGSEEVLFEG
     EPRVILDCME SACDLRRAIH QDGSADDLGP SKEGVWEGRV SIGGLHAAVT TYEVQLLTGL
     LTPLLRVSSS MTSAENKEDD AGIFPGAMTS SSVPALPDSE TESPEIPYLP HGSVIAIRDV
     YEESFVAVEE HSGTLGTFSL TSVRHYGLAG ERALFKVKRL RNKKHKEQPW FCLLSLYARH
     STTGRALRLH YQPASALPEI ATTNNGAWEQ WQLLPAKPPD YLDGVIAHDV DEGNKDGDDK
     EQRKDKTSAQ RTVYLENKKA GRCLSISDGS VVLGKYSHPL KIKIVALPKE GETDVEFQME
     EAEERAKRKK RRKGESSAPE PVQELVRSPT EQVAIALANI PSVNVVIKDG LSITLLYEAA
     GGAHLLPLVR ARGEKMEATV QLGTEKTRFI ASCSIGLDYF EVERGKWEKI FSRISLEMLS
     RGRVLASSLA RKEWKAPSRT FITFQKGDME RRRRMWTRRT TRIERTTRSQ DGGGEEMDGG
     KRGRAGGGGR VGGGGDMDGE EGEDTRKGRR RGGMEDEEVR GGGGDKEDEE GQEEEDEQEE
     EEEEEEEEEE EIKRRREEET WSMRKGRRSM SELANLQRAV RNHKTQHEDA TRALDARVLD
     LEQAVPGPDA GASSSASSSR QLEERVDHVV AMLGDISAFT ELATISQRFE LLDTKIRLHG
     ARRPWSTCPE EVPAYATLAD GHTHKSIDRC IDVVPVYFAP HASEAVSFDI LDTKFDMILG
     MSWLRSEDHP VNFFHRTVHI RDRNGVLVPC TVPLPHPSIS CHVVSAASMR ASIIRDDIEE
     MGGCFLHALP PRDASSTDSS SDPRITELLD AYSDLFEGPH GVVPDRPIRH EIILEDGVVP
     PCGCIYRMSE EELSVLRAQL DDLLEKGWIR PSSSPYGAPV LFVRKKNKDL RLCIDYRKLN
     AQTIRNVGPL PRIDDLLERL GGAQFFSKLD LKSGYHQLEI RKDDRYKTAF KTRYGHFEWL
     VMPFGLTNAP ATFQAAMTTE FQHMLDRFVL IYLDDILVYS RSLDEHVEHL RTVLERLRQA
     KYKANRDXCE FAXQEXEYLG HYVTPQGIRP LADKIEALRV WPEPTNTTDV RSVMGLAGYY
     QRFITGYSRI AAPMTRLQSP KVPFVFDDAA RWSFQALKTA MLMAPVLSIY DPTLPTRVTT
     DASGCGIGAV LEQHDGDDWH PVEYFSHKVP PINSLDDARK KELLAFVMAL KRWRHFLLGR
     RRFTWVTDNN PLTYYKMQDT DTGALPRRSA RLAVRARPVV PPRPKQRLSK RKTPAASSTA
     MTVPVSTGVL PACPVQGAGE PLAAYLQRVQ AFTDAVATAK AQEEAAEAER QRLANEAAAQ
     AQHTAEADAA ARDQRNASST ESLIHSENQW TMFLQGMIFL PSDAQADPTP AEAEKTHLAN
     LMLGMMRGIM WNNTLLQAHL RTEQQQRQKY QQDIAVLTAA IRAEASQQQQ QHQLLNSALA
     RINNIEANAT AALGCTMDAT KQLNERIDHV VTIIGDIGDF TIPATISSTV AAVKTDITKL
     QTRPDAATKT YKMPHFDISK FDDYNKSDAL AWWQRFLTEA SCRTVPADDM MKALYLQLIG
     GGHAWMNHLA ATKKCTIAEP HTHIPWKEFE KLWLTRFMVR NVMNAAMNEV YTCSQGSMPT
     RDWTTKWQKI VTTPGFDLSF TNQRSEFFSR SCAGLRSALG NEYDYDSFQA ILDRANLVIQ
     TDDKAANERQ SQPHYVAKQG YQRPTHNNVV ISEETVDLHA AAASSSDGRI VAALPPKRPK
     RVRKNKATQE TASMSVSFDI LDTKFDMILG MSWLRSADHP MNFQDRTIHI RDRNGVLVPC
     TVATAHTSIA CHVVSVARIR AAIARNDVEE MGLALLHALP SPDGPAASPP DPRITHLLDK
     YGDVFEAPTG KVSDRPIRQE ITLEAGAVRP RGCIYRMSEE GLEVLRAQLD DLLNKGWIRR
     SCSPYGAPVL FAWKKNKDLP LCIDYRKLNS QTVNNADPLP RIDDLLERLG GATYFSKLDL
     KSGYHQIEIQ PQDRYRTAFK TLYGHFEWVV MPFGLTNAPA TFQAAMTTEF RDLLDRSVLI
     YLDEILVYSS TLDEHITHLR AVLNRLRLAK YKANRDKCEF AKQELEYLGQ YVTPKGIRPL
     ADKIQAIMDW PEPRCRTDVH SFLDLAGYYQ RFVESYSKVA APLSRLQSPK VPFEFDDAVR
     GAFTTLKAAM QAAPALRIYD PTLPTQVTTD ASGYGIGAVL EQCHEDGWHP VEYFSQKVQN
     HTGVPLNCHL GTWHKIQPQT AEDFLIRRMA SRRDIRAALM SSSIVIGLGD PTTGVTISAL
     MSVPLTDPGV QIVNFQLRGT AEGDQKSHVP LVVSISKQSQ EGLFVSISPL LTVGNLSGMP
     LFVRYWKLGA SDENGKAEDG VMRLDKNEAV DDLMHSFNVR TEGGERSHLF GGLDCRHPEG
     EERGVPEAGE VKGYCRGAGG EEEAVRRRDV EVSEEKIMML QREEEEKRRA AEEEAAAEEE
     EEEEGEPLER RCGEERGEVS GTKEEDKWRE KKISEWVANL SLGEDEETQL YVSQEEREAF
     ARALELIEDP LECQATEDEK KLEWKLKMMR EKKRRRKEAS RIAVEVERVR TGRQELQAQT
     EVFAKLDKMM GFLEVLSEAW LEAHQARKGQ EVTLQAMRSG FREFASDVVG HVGSEIRRLR
     DGVDKFCAGA IETAKIVATT EATARPHKEP VNYISCKALK KLRLGLKVQK LEDPIVTVLA
     DNRTMRVEDY VEGVQAYFRL EKDGKVEKVI HSPTLLVQDD LPFDIVLGLD WGEAAGATLH
     LREHECRLPS PSGEVKTARL FHVSRVDNSL AHCCLSAPAF ARLVRKEQLE DQVFVVYVRP
     VTEPKEEDRS TDPAIAKLLE EFEDLAEPPT GVVPWPIQQR IEIEPGSRTP KGAVYRMSPR
     ELKELRKQLD EPLQKGWIRP SSSPFGAPVL GKLREANFKI NAKKCEGAKT QVLYLGHVID
     GDGIKPEDSK IAAIRDKPTP RTLTELRSFL GLANYYRKFV MNFSTIAAPL RRLLKKEAIW
     QWDKDCTSAL KKLKRALIEY PVLKVADPSL PFVVTTDASQ YGIGVVLQED DDNGYRPVEF
     MSARMPSEKV ATSTYERELK LSGKLRAQEA LPAWEALQGV FRP
//
DBGET integrated database retrieval system