ID A0A1S3YS97_TOBAC Unreviewed; 475 AA.
AC A0A1S3YS97;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=Legumin A-like {ECO:0000313|RefSeq:XP_016455081.1};
GN Name=LOC107779191 {ECO:0000313|RefSeq:XP_016455081.1};
OS Nicotiana tabacum (Common tobacco).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; lamiids; Solanales; Solanaceae; Nicotianoideae; Nicotianeae;
OC Nicotiana.
OX NCBI_TaxID=4097 {ECO:0000313|Proteomes:UP000084051, ECO:0000313|RefSeq:XP_016455081.1};
RN [1] {ECO:0000313|Proteomes:UP000084051}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TN90 {ECO:0000313|Proteomes:UP000084051};
RX PubMed=24807620; DOI=10.1038/ncomms4833;
RA Sierro N., Battey J.N., Ouadi S., Bakaher N., Bovet L., Willig A.,
RA Goepfert S., Peitsch M.C., Ivanov N.V.;
RT "The tobacco genome sequence and its comparison with those of tomato and
RT potato.";
RL Nat. Commun. 5:3833-3833(2014).
RN [2] {ECO:0000313|RefSeq:XP_016455081.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Seed storage protein. {ECO:0000256|RuleBase:RU003681}.
CC -!- SUBUNIT: Hexamer; each subunit is composed of an acidic and a basic
CC chain derived from a single precursor and linked by a disulfide bond.
CC {ECO:0000256|RuleBase:RU003681}.
CC -!- SIMILARITY: Belongs to the 11S seed storage protein (globulins) family.
CC {ECO:0000256|ARBA:ARBA00007178, ECO:0000256|RuleBase:RU003681}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016455081.1; XM_016599595.1.
DR AlphaFoldDB; A0A1S3YS97; -.
DR SMR; A0A1S3YS97; -.
DR STRING; 4097.A0A1S3YS97; -.
DR PaxDb; 4097-A0A1S3YS97; -.
DR GeneID; 107779191; -.
DR KEGG; nta:107779191; -.
DR OMA; QAKHYQG; -.
DR OrthoDB; 1219266at2759; -.
DR Proteomes; UP000084051; Unplaced.
DR GO; GO:0045735; F:nutrient reservoir activity; IEA:UniProtKB-KW.
DR CDD; cd02243; cupin_11S_legumin_C; 1.
DR CDD; cd02242; cupin_11S_legumin_N; 1.
DR Gene3D; 2.60.120.10; Jelly Rolls; 2.
DR InterPro; IPR022379; 11S_seedstore_CS.
DR InterPro; IPR006044; 11S_seedstore_pln.
DR InterPro; IPR006045; Cupin_1.
DR InterPro; IPR014710; RmlC-like_jellyroll.
DR InterPro; IPR011051; RmlC_Cupin_sf.
DR PANTHER; PTHR31189:SF64; LEGUMIN A-LIKE; 1.
DR PANTHER; PTHR31189; OS03G0336100 PROTEIN-RELATED; 1.
DR Pfam; PF00190; Cupin_1; 2.
DR PRINTS; PR00439; 11SGLOBULIN.
DR SMART; SM00835; Cupin_1; 2.
DR SUPFAM; SSF51182; RmlC-like cupins; 1.
DR PROSITE; PS00305; 11S_SEED_STORAGE; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157,
KW ECO:0000256|RuleBase:RU003681};
KW Reference proteome {ECO:0000313|Proteomes:UP000084051};
KW Seed storage protein {ECO:0000256|ARBA:ARBA00023129,
KW ECO:0000256|RuleBase:RU003681}; Signal {ECO:0000256|RuleBase:RU003681};
KW Storage protein {ECO:0000256|ARBA:ARBA00022761,
KW ECO:0000256|RuleBase:RU003681}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|RuleBase:RU003681"
FT CHAIN 22..475
FT /evidence="ECO:0000256|RuleBase:RU003681"
FT /id="PRO_5010002920"
FT DOMAIN 35..239
FT /note="Cupin type-1"
FT /evidence="ECO:0000259|SMART:SM00835"
FT DOMAIN 299..448
FT /note="Cupin type-1"
FT /evidence="ECO:0000259|SMART:SM00835"
FT REGION 262..281
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 265..279
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 475 AA; 53664 MW; 56847ED45ABB6513 CRC64;
MASNWLSFSL SFLLVLHGTF AQQRYQQQQG ECQLNRLSPQ EPTVRIQAEA GVTELWDPNN
QQFQCAGVSL IRHVIQSRGM LLPSYVNTPL LAYVERGRGF YGIMQSGCPE TFQSSQQMQQ
GERGAGSRFQ DRHQRIGQFR QGDIIAFPAG AAHWVYNEGN EELVLVVLED SSNNANQLGR
TSRRFFIAGN PQQGQQQQQQ GQYGGRSLRR EQFQSGNVFN GFDVQVLAEA FGVDQETARR
LQGQEDQRGH IVNIQQGLRV VRPPFSQEQE EREERQEQGQ YGPRMNGIEE TICSAKVRQN
IDNPSRADIY NPHAGRFTTV NSLTLPILSF LRLSAARGVL YRDSIMAPHW VTNAHKVIYI
TKGESRIQIV DHRGQAVLDD RVRQGQVVVV PQNYAVVKHA ETEGCEWVEF NTNDNAMINT
LSGRTSAIRG LPVDVIANSY QISRDEARRL KFNREETLIF RSSGRARSSE RVAAA
//