GenomeNet

Database: UniProt
Entry: K7K8G9_SOYBN
LinkDB: K7K8G9_SOYBN
Original site: K7K8G9_SOYBN 
ID   K7K8G9_SOYBN            Unreviewed;       455 AA.
AC   K7K8G9;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   27-MAR-2024, entry version 67.
DE   RecName: Full=MSP domain-containing protein {ECO:0000259|PROSITE:PS50202};
GN   Name=100804997 {ECO:0000313|EnsemblPlants:KRH71504};
GN   ORFNames=GLYMA_02G151200 {ECO:0000313|EMBL:KRH71504.1};
OS   Glycine max (Soybean) (Glycine hispida).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC   Glycine subgen. Soja.
OX   NCBI_TaxID=3847 {ECO:0000313|EMBL:KRH71504.1};
RN   [1] {ECO:0000313|EMBL:KRH71504.1, ECO:0000313|EnsemblPlants:KRH71504}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRH71504};
RC   TISSUE=Callus {ECO:0000313|EMBL:KRH71504.1};
RX   PubMed=20075913; DOI=10.1038/nature08670;
RA   Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA   Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA   Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA   Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA   Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA   Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA   Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA   Stacey G., Shoemaker R.C., Jackson S.A.;
RT   "Genome sequence of the palaeopolyploid soybean.";
RL   Nature 463:178-183(2010).
RN   [2] {ECO:0000313|EnsemblPlants:KRH71504}
RP   IDENTIFICATION.
RC   STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRH71504};
RG   EnsemblPlants;
RL   Submitted (FEB-2018) to UniProtKB.
RN   [3] {ECO:0000313|EMBL:KRH71504.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Callus {ECO:0000313|EMBL:KRH71504.1};
RA   Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA   Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA   Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA   Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA   Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA   Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA   Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA   Jackson S.;
RT   "WGS assembly of Glycine max.";
RL   Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM000835; KRH71504.1; -; Genomic_DNA.
DR   RefSeq; XP_006575098.1; XM_006575035.2.
DR   AlphaFoldDB; K7K8G9; -.
DR   SMR; K7K8G9; -.
DR   STRING; 3847.K7K8G9; -.
DR   PaxDb; 3847-GLYMA02G17021-1; -.
DR   EnsemblPlants; KRH71504; KRH71504; GLYMA_02G151200.
DR   GeneID; 100804997; -.
DR   Gramene; KRH71504; KRH71504; GLYMA_02G151200.
DR   KEGG; gmx:100804997; -.
DR   eggNOG; KOG4177; Eukaryota.
DR   HOGENOM; CLU_000134_53_1_1; -.
DR   InParanoid; K7K8G9; -.
DR   OMA; PGPHVFR; -.
DR   OrthoDB; 2385921at2759; -.
DR   Proteomes; UP000008827; Chromosome 2.
DR   Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 2.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   InterPro; IPR002110; Ankyrin_rpt.
DR   InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR000535; MSP_dom.
DR   InterPro; IPR008962; PapD-like_sf.
DR   PANTHER; PTHR24189; MYOTROPHIN; 1.
DR   PANTHER; PTHR24189:SF50; MYOTROPHIN-RELATED; 1.
DR   Pfam; PF12796; Ank_2; 2.
DR   PRINTS; PR01415; ANKYRIN.
DR   SMART; SM00248; ANK; 5.
DR   SUPFAM; SSF48403; Ankyrin repeat; 1.
DR   SUPFAM; SSF49354; PapD-like; 1.
DR   PROSITE; PS50297; ANK_REP_REGION; 4.
DR   PROSITE; PS50088; ANK_REPEAT; 4.
DR   PROSITE; PS50202; MSP; 1.
PE   4: Predicted;
KW   ANK repeat {ECO:0000256|ARBA:ARBA00023043, ECO:0000256|PROSITE-
KW   ProRule:PRU00023}; Reference proteome {ECO:0000313|Proteomes:UP000008827};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          4..125
FT                   /note="MSP"
FT                   /evidence="ECO:0000259|PROSITE:PS50202"
FT   REPEAT          248..280
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REPEAT          281..303
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REPEAT          374..406
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REPEAT          407..433
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
SQ   SEQUENCE   455 AA;  49130 MW;  D73FBA24FE359081 CRC64;
     MDRLVKADTK EVEMIFLKGQ KCSSSFKLTN LMHTMSVAVS LTTTNSSLFS INKPFSTIPP
     LSTASYKLQL SQPSDQPPLS DPPDAITVRA TMLPTGKATA ADLRRFFSKP GPHVFRDAVL
     TVSLVGPHVA EFLISQTPQS RNLFAKSISA CTKPQLMRLL KPAVECGSTD AVADLLNAGA
     DATATTESLM PLAIRVGNLH AVKLLEASGC KIDGSSLHEA AAMDRIDAME FLLARYDGEL
     DVDAVDSEGR TAIHVAAREG HARVIQFCVA MGGNPNRVDS KGWTPLHYAA WKGHVKAAEC
     LLECSNVKCA RDREGRTAFS VAAESEHEQS HARTRLVDLL GWGDALLRAV RVDDVHGVKK
     CLGEGVSVNG RDQNGWTPLH WAAFKGRIKS LKVLLEHGAE VETVDDAGYT PLHCAAQAGH
     LQVALYLIAH GASQPNLKSF PHLAHPFQNH SFTLI
//
DBGET integrated database retrieval system