ID A0A1S4CHP7_TOBAC Unreviewed; 1643 AA.
AC A0A1S4CHP7;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=UDP-glucose:glycoprotein glucosyltransferase-like isoform X1 {ECO:0000313|RefSeq:XP_016500516.1};
GN Name=LOC107818951 {ECO:0000313|RefSeq:XP_016500516.1};
OS Nicotiana tabacum (Common tobacco).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC Nicotiana.
OX NCBI_TaxID=4097 {ECO:0000313|Proteomes:UP000084051, ECO:0000313|RefSeq:XP_016500516.1};
RN [1] {ECO:0000313|Proteomes:UP000084051}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TN90 {ECO:0000313|Proteomes:UP000084051};
RX PubMed=24807620; DOI=10.1038/ncomms4833;
RA Sierro N., Battey J.N., Ouadi S., Bakaher N., Bovet L., Willig A.,
RA Goepfert S., Peitsch M.C., Ivanov N.V.;
RT "The tobacco genome sequence and its comparison with those of tomato and
RT potato.";
RL Nat. Commun. 5:3833-3833(2014).
RN [2] {ECO:0000313|RefSeq:XP_016500516.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- COFACTOR:
CC Name=Ca(2+); Xref=ChEBI:CHEBI:29108;
CC Evidence={ECO:0000256|ARBA:ARBA00001913};
CC -!- PATHWAY: Protein modification; protein glycosylation.
CC {ECO:0000256|ARBA:ARBA00004922}.
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum lumen
CC {ECO:0000256|ARBA:ARBA00004319}.
CC -!- SIMILARITY: Belongs to the glycosyltransferase 8 family.
CC {ECO:0000256|ARBA:ARBA00006351}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016500516.1; XM_016645030.1.
DR SMR; A0A1S4CHP7; -.
DR STRING; 4097.A0A1S4CHP7; -.
DR PaxDb; 4097-A0A1S4CHP7; -.
DR GeneID; 107818951; -.
DR KEGG; nta:107818951; -.
DR OrthoDB; 1734at2759; -.
DR UniPathway; UPA00378; -.
DR Proteomes; UP000084051; Unplaced.
DR GO; GO:0005783; C:endoplasmic reticulum; IBA:GO_Central.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-SubCell.
DR GO; GO:0003980; F:UDP-glucose:glycoprotein glucosyltransferase activity; IBA:GO_Central.
DR GO; GO:0051082; F:unfolded protein binding; IBA:GO_Central.
DR GO; GO:0071712; P:ER-associated misfolded protein catabolic process; IBA:GO_Central.
DR GO; GO:0018279; P:protein N-linked glycosylation via asparagine; IBA:GO_Central.
DR CDD; cd06432; GT8_HUGT1_C_like; 1.
DR InterPro; IPR040497; Glyco_transf_24.
DR InterPro; IPR029044; Nucleotide-diphossugar_trans.
DR InterPro; IPR009448; UDP-g_GGtrans.
DR InterPro; IPR040693; UGGT_TRXL_1.
DR InterPro; IPR040694; UGGT_TRXL_2.
DR InterPro; IPR040692; UGGT_TRXL_3.
DR InterPro; IPR040525; UGGT_TRXL_4.
DR PANTHER; PTHR11226; UDP-GLUCOSE GLYCOPROTEIN:GLUCOSYLTRANSFERASE; 1.
DR PANTHER; PTHR11226:SF0; UDP-GLUCOSE:GLYCOPROTEIN GLUCOSYLTRANSFERASE; 1.
DR Pfam; PF18404; Glyco_transf_24; 1.
DR Pfam; PF18400; Thioredoxin_12; 1.
DR Pfam; PF18401; Thioredoxin_13; 1.
DR Pfam; PF18402; Thioredoxin_14; 1.
DR Pfam; PF18403; Thioredoxin_15; 1.
DR Pfam; PF06427; UDP-g_GGTase; 1.
DR SUPFAM; SSF53448; Nucleotide-diphospho-sugar transferases; 1.
PE 3: Inferred from homology;
KW Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000084051};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..1643
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5010223049"
FT DOMAIN 47..272
FT /note="UGGT thioredoxin-like"
FT /evidence="ECO:0000259|Pfam:PF18400"
FT DOMAIN 360..481
FT /note="UGGT thioredoxin-like"
FT /evidence="ECO:0000259|Pfam:PF18401"
FT DOMAIN 488..754
FT /note="UGGT thioredoxin-like"
FT /evidence="ECO:0000259|Pfam:PF18402"
FT DOMAIN 774..1015
FT /note="UDP-glucose:glycoprotein glucosyltransferase
FT thioredoxin-like"
FT /evidence="ECO:0000259|Pfam:PF18403"
FT DOMAIN 1337..1603
FT /note="Glucosyltransferase 24 catalytic"
FT /evidence="ECO:0000259|Pfam:PF18404"
FT REGION 135..155
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1613..1643
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1643 AA; 185743 MW; 3D7FEC3C21AA6127 CRC64;
METRFRFGFW VVIAVAFCIC LSGHSVSAVT SKPKNVQVAL RAKWSGTPVL LEAGELLSKE
SKDLYWDFTE FWLQSADENS DCRTAKDCLK RIVKYGRSQL SESLASIFEF SLTLRSASPR
IVLYRQLAEE SLSSFPLTDD NSSSSPEGGV FQQNDNAKNK KVNPLLVGEN PRSPEGNCCW
IDTGGRLFFV VAELLVWLQN AKEVSLDTFH PELFEFDHVH PDSNVGSPVA ILYGALGTYC
FEQFHRTLAN AAREGKIYYV VRPVLPSGCE SKSGPCGALG TRDSLNLGGY GVELALKNME
YKAMDDSTVK KGVTLEDPHT EDLRQEVRGF IFSRILERKP ELTSEIMAFR DYLLSSAVSD
TLDVWELKDL GHQTAQRIVH AADPLQLMQD INQNFPSVVS SLSRMKLNES IKEEIVENQR
MIPPGKSLMA LNGALVNIED IDLYLLVDMV HKELSLADQY SKMKIPISTV RKLLSALPPS
ESSNFRVDFR SDHVHYLNNL EVDVMYKRWR SNLNEILMPV FPGQLRYIRK NLFHAVYVLD
PASICGLEAI DTIVSLFENH IPMRFGVILY SAKLIEEIES SGGELPLSYR EKDSPSQEDF
SSLIIRLFIY IKENQGIATA FQFLSNINKL RIESAADDPL EVHHVEAAFV ETLLPQAKTP
PQDTLLKLEK EHSFKELSEE SSLFVFKLGL AKRRCCLLFN GLVHDPTEDA LMNAMNDELP
RIQEQVYFGH INSHTDILEK FLSESGVQRY NPQIIAEGKV KPRFISLSAI ILAEDSFLND
VSYLHSTETI DDLKPVTHLL AVNMASKKGM RLLREGIHYL MAGTTTGRLG VLFNSVLDPH
SPSSLFMKVF QITASSYSHK KGVLEFLDQI CSFYEHDYIH ASSAGTESSE AFLDKVFELA
NSNGLSSKAL KSALSGLSDE KLRMHLNKVG TFLFGQVGLE YGANAVITNG RVIGLVDDTT
FLSHDLQLLE SLEFKQRIKH VVEIIEEVKW EEIDPDMLTS KFISDIVMSV SSSISMRDRS
SEGARFELLS AKYSAVVLEN ESSSIHIDAV IDPLSSSGQK LSSLLRLLSK SIRPSMRLVL
NPMSSLVDLP LKNYYRYVIP TLDDFSSADY TIYGPKAFFA NMPPSKTLTM NLDVPEPWLV
EPVVAIHDLD NILLENLGET RTLQAVYELE ALVLTGHCSE KDHEPPRGLQ LILGTKSTPH
LVDTLVMANL GYWQMKVFPG VWYLQLAPGR SSELYALKED GDGGQETTLS KRITIDDLRG
KLVHMEVMKK KGKEHEKLLV SADDNSYSQE KKKGNQDSWN SNILKWASGF IGGGDQSKKS
KSTPVKQVTS GRHGKTINIF SVASGHLYER FLKIMILSVL KNTQRPVKFW FIKNYLSPQF
KDVIPHMARE YGFDYELITY KWPTWLHKQK EKQRIIWAYK ILFLDVIFPL ALEKVIFVDA
DQIIRTDMGE LYDMDLKGRP LAYTPFCDNN REMDGYRFWK QGFWKEHLRG RPYHISALYV
VDLLKFRETA AGDNLRVFYE TLSKDPNSLS NLDQDLPNYA QHSVPIFSLP QEWLWCESWC
GNATKPKAKT IDLCNNPMTK EPKLQGAKRI VAEWPELDYE ARHFTAKILG EDFDPLEQAA
PSAETQQTIS DTPLEDEESK SEL
//