Database: UniProt
Entry: K7MXW3_SOYBN
ID   K7MXW3_SOYBN            Unreviewed;       527 AA.
AC   K7MXW3;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   27-MAR-2024, entry version 72.
DE   RecName: Full=MSP domain-containing protein {ECO:0000259|PROSITE:PS50202};
GN   Name=100782232 {ECO:0000313|EnsemblPlants:KRG94872};
GN   ORFNames=GLYMA_19G114600 {ECO:0000313|EMBL:KRG94872.1};
OS   Glycine max (Soybean) (Glycine hispida).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC   Glycine subgen. Soja.
OX   NCBI_TaxID=3847 {ECO:0000313|EMBL:KRG94872.1};
RN   [1] {ECO:0000313|EMBL:KRG94872.1, ECO:0000313|EnsemblPlants:KRG94872}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRG94872};
RC   TISSUE=Callus {ECO:0000313|EMBL:KRG94872.1};
RX   PubMed=20075913; DOI=10.1038/nature08670;
RA   Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA   Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA   Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA   Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA   Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA   Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA   Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA   Stacey G., Shoemaker R.C., Jackson S.A.;
RT   "Genome sequence of the palaeopolyploid soybean.";
RL   Nature 463:178-183(2010).
RN   [2] {ECO:0000313|EnsemblPlants:KRG94872}
RP   IDENTIFICATION.
RC   STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRG94872};
RG   EnsemblPlants;
RL   Submitted (FEB-2018) to UniProtKB.
RN   [3] {ECO:0000313|EMBL:KRG94872.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Callus {ECO:0000313|EMBL:KRG94872.1};
RA   Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA   Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA   Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA   Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA   Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA   Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA   Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA   Jackson S.;
RT   "WGS assembly of Glycine max.";
RL   Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM000852; KRG94872.1; -; Genomic_DNA.
DR   RefSeq; XP_003553341.1; XM_003553293.3.
DR   AlphaFoldDB; K7MXW3; -.
DR   SMR; K7MXW3; -.
DR   STRING; 3847.K7MXW3; -.
DR   PaxDb; 3847-GLYMA19G29190-2; -.
DR   EnsemblPlants; KRG94872; KRG94872; GLYMA_19G114600.
DR   GeneID; 100782232; -.
DR   Gramene; KRG94872; KRG94872; GLYMA_19G114600.
DR   KEGG; gmx:100782232; -.
DR   eggNOG; KOG4177; Eukaryota.
DR   HOGENOM; CLU_000134_53_1_1; -.
DR   InParanoid; K7MXW3; -.
DR   OMA; WAVFGGW; -.
DR   OrthoDB; 2385921at2759; -.
DR   Proteomes; UP000008827; Chromosome 19.
DR   GO; GO:0030941; F:chloroplast targeting sequence binding; IBA:GO_Central.
DR   GO; GO:0045036; P:protein targeting to chloroplast; IBA:GO_Central.
DR   Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 5.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR   InterPro; IPR002110; Ankyrin_rpt.
DR   InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR000535; MSP_dom.
DR   InterPro; IPR008962; PapD-like_sf.
DR   PANTHER; PTHR23206:SF7; ANKYRIN REPEAT AND KH DOMAIN-CONTAINING PROTEIN 1 ISOFORM X1; 1.
DR   PANTHER; PTHR23206; MASK PROTEIN; 1.
DR   Pfam; PF00023; Ank; 1.
DR   Pfam; PF12796; Ank_2; 2.
DR   Pfam; PF13637; Ank_4; 1.
DR   Pfam; PF00635; Motile_Sperm; 1.
DR   PRINTS; PR01415; ANKYRIN.
DR   SMART; SM00248; ANK; 8.
DR   SUPFAM; SSF48403; Ankyrin repeat; 1.
DR   SUPFAM; SSF49354; PapD-like; 1.
DR   PROSITE; PS50297; ANK_REP_REGION; 7.
DR   PROSITE; PS50088; ANK_REPEAT; 7.
DR   PROSITE; PS50202; MSP; 1.
PE   4: Predicted;
KW   ANK repeat {ECO:0000256|ARBA:ARBA00023043, ECO:0000256|PROSITE-
KW   ProRule:PRU00023}; Reference proteome {ECO:0000313|Proteomes:UP000008827};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          4..135
FT                   /note="MSP"
FT                   /evidence="ECO:0000259|PROSITE:PS50202"
FT   REPEAT          172..204
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REPEAT          205..237
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REPEAT          238..270
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REPEAT          271..303
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REPEAT          305..337
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REPEAT          392..424
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REPEAT          425..457
FT                   /note="ANK"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT   REGION          499..527
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   527 AA;  57525 MW;  20433BFDC7D4B643 CRC64;
     MDRLIKVDPT NTVPIRIEPG QKCHGQITLR NVMYTMPVAF RLQSLIKTRY TLKPQSGIIS
     PLSTVTIEIL YNLPLGSTLP HSFPHSEDSF LLHSVVVPGA TVKEPSSMFE SVPSDWFTAK
     KKQVFIDSGI KIIFVGSLIL AQLVHNGSID EIREVLEHSE HTWKAVDSVD QNGDTLLHVA
     ISKSRPDIVQ LLLEFNADVE SKNRTGETPL ESACASGEEL IVELLLAHKA NTERTESSSL
     GAIHLSAREG RREVLRLLLL KGASVDSLTK DGYTALHLAV REGSRDCARL LLANNARTDI
     RDSRDGDTCL HVAAGVGDES MVKLLLNKGA NKDVRNFNGK TAYDVAAEKG HARVFDALRL
     GDGLCVAARK GEVRSIQRLI EGGAVVDGRD QHGWTALHRA CFKGRVEAVR ALLERGIDVE
     ARDEDGYTAL HCAVEAGHAD VAEVLVKRGV DVEARTNKGV TALQIAEALG YGGIARLLGA
     AAGHVAEGEQ QSVLGEMKKE KKNKLGRRRR EREIRGSFDR SMPLPVL
//