ID Q9FEC5_SOYBN Unreviewed; 536 AA.
AC Q9FEC5;
DT 01-MAR-2001, integrated into UniProtKB/TrEMBL.
DT 01-MAR-2001, sequence version 1.
DT 27-MAR-2024, entry version 122.
DE SubName: Full=Glycinin subunit G7 {ECO:0000313|EMBL:AAG42489.1};
GN Name=Gy7 {ECO:0000313|EMBL:AAG42489.1};
GN Synonyms=547611 {ECO:0000313|EnsemblPlants:KRG95666};
GN ORFNames=GLYMA_19G164800 {ECO:0000313|EMBL:KRG95666.1};
OS Glycine max (Soybean) (Glycine hispida).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3847 {ECO:0000313|EMBL:AAG42489.1};
RN [1] {ECO:0000313|EMBL:AAG42489.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=12582623; DOI=10.1007/s00122-002-0884-6;
RA Beilinson V., Chen Z., Shoemaker C., Fischer L., Goldberg B., Nielsen C.;
RT "Genomic organization of glycinin genes in soybean.";
RL Theor. Appl. Genet. 104:1132-1140(2002).
RN [2] {ECO:0000313|EMBL:KRG95666.1, ECO:0000313|EnsemblPlants:KRG95666}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRG95666};
RC TISSUE=Callus {ECO:0000313|EMBL:KRG95666.1};
RX PubMed=20075913; DOI=10.1038/nature08670;
RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA Stacey G., Shoemaker R.C., Jackson S.A.;
RT "Genome sequence of the palaeopolyploid soybean.";
RL Nature 463:178-183(2010).
RN [3] {ECO:0000313|EnsemblPlants:KRG95666}
RP IDENTIFICATION.
RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRG95666};
RG EnsemblPlants;
RL Submitted (FEB-2018) to UniProtKB.
RN [4] {ECO:0000313|EMBL:KRG95666.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Callus {ECO:0000313|EMBL:KRG95666.1};
RA Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA Jackson S.;
RT "WGS assembly of Glycine max.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Seed storage protein. {ECO:0000256|ARBA:ARBA00003839}.
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum
CC {ECO:0000256|ARBA:ARBA00004240}. Protein storage vacuole
CC {ECO:0000256|ARBA:ARBA00004558}. Vacuole
CC {ECO:0000256|ARBA:ARBA00004116}.
CC -!- SIMILARITY: Belongs to the 11S seed storage protein (globulins) family.
CC {ECO:0000256|ARBA:ARBA00007178}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AF319776; AAG42488.1; -; Genomic_DNA.
DR EMBL; AF319777; AAG42489.1; -; mRNA.
DR EMBL; CM000852; KRG95666.1; -; Genomic_DNA.
DR RefSeq; NP_001235354.1; NM_001248425.1.
DR AlphaFoldDB; Q9FEC5; -.
DR SMR; Q9FEC5; -.
DR PaxDb; 3847-GLYMA19G34770-2; -.
DR EnsemblPlants; KRG95666; KRG95666; GLYMA_19G164800.
DR GeneID; 547611; -.
DR Gramene; KRG95666; KRG95666; GLYMA_19G164800.
DR KEGG; gmx:547611; -.
DR eggNOG; ENOG502QU1J; Eukaryota.
DR HOGENOM; CLU_026341_2_0_1; -.
DR InParanoid; Q9FEC5; -.
DR OMA; MHQKLEN; -.
DR OrthoDB; 1219266at2759; -.
DR Proteomes; UP000008827; Chromosome 19.
DR ExpressionAtlas; Q9FEC5; baseline and differential.
DR GO; GO:0005783; C:endoplasmic reticulum; IEA:UniProtKB-SubCell.
DR GO; GO:0000326; C:protein storage vacuole; IEA:UniProtKB-SubCell.
DR GO; GO:0045735; F:nutrient reservoir activity; IEA:UniProtKB-KW.
DR GO; GO:0048316; P:seed development; IEA:UniProt.
DR CDD; cd02243; cupin_11S_legumin_C; 1.
DR CDD; cd02242; cupin_11S_legumin_N; 1.
DR Gene3D; 2.60.120.10; Jelly Rolls; 2.
DR InterPro; IPR006044; 11S_seedstore_pln.
DR InterPro; IPR006045; Cupin_1.
DR InterPro; IPR014710; RmlC-like_jellyroll.
DR InterPro; IPR011051; RmlC_Cupin_sf.
DR PANTHER; PTHR31189:SF35; 12S SEED STORAGE PROTEIN CRC; 1.
DR PANTHER; PTHR31189; OS03G0336100 PROTEIN-RELATED; 1.
DR Pfam; PF00190; Cupin_1; 2.
DR PRINTS; PR00439; 11SGLOBULIN.
DR SMART; SM00835; Cupin_1; 2.
DR SUPFAM; SSF51182; RmlC-like cupins; 1.
PE 2: Evidence at transcript level;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00022824};
KW Reference proteome {ECO:0000313|Proteomes:UP000008827};
KW Seed storage protein {ECO:0000256|ARBA:ARBA00023129};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Storage protein {ECO:0000256|ARBA:ARBA00022761};
KW Vacuole {ECO:0000256|ARBA:ARBA00022554}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..536
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014589386"
FT DOMAIN 36..220
FT /note="Cupin type-1"
FT /evidence="ECO:0000259|SMART:SM00835"
FT DOMAIN 360..509
FT /note="Cupin type-1"
FT /evidence="ECO:0000259|SMART:SM00835"
FT REGION 250..339
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 517..536
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 536 AA; 60486 MW; 40F452FAA067FBC7 CRC64;
MFNHSALHYY FLLFFTCTCL ARQQCQFKQE CQLDTIHALK PDNLIESQGG VTETWNASHP
ELCCAGVAFI KRTINPNGLH LPSYVNYPEL HFVLQGEGVL GIVIPGCDET FEEPQREREH
DRHQKVRYLK QGDIFAVPPG IPYWTYNYAN VSLVVITLLD TANFENQLDR VPRRFYLAGN
PKEEHPCGRK QEEGNNINMF GGFDPRFLAE ASNVKVGITK KLQSHIGDQI IKVEKGLSII
RPPLEHEVRE AEVEEKPKTR EHCECQKERK HKEGEGEEEV VQEKEIRKRK HHIGEHEGCG
ECEDKEEEEQ SRSRERGEWH EHKGQQHGKE KGRERYKEGG EGRVRSNVLE EILCTLKLHE
NIADPSHADI FNPRAGRVRT INSLTLPVLK LLRLSAQWVK LYKSGIYVPH WSMNANSVAY
VTSGGGWVQV VNSQGKSVFS GAVGRGRVVV VPQNFAVAIQ AGRDGMEYIV FRTNDRAMMG
TLVGPTSAIT AIPGEVLANA FGLSPEEVSE LKNNRKEAVL SSPASHHSPN PLIVTM
//