Database: UniProt
Entry: K7MUA4_SOYBN
ID   K7MUA4_SOYBN            Unreviewed;       929 AA.
AC   K7MUA4;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   27-MAR-2024, entry version 50.
DE   RecName: Full=DUF4378 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   Name=100782204 {ECO:0000313|EnsemblPlants:KRH00757};
GN   ORFNames=GLYMA_18G233100 {ECO:0000313|EMBL:KRH00757.1};
OS   Glycine max (Soybean) (Glycine hispida).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC   Glycine subgen. Soja.
OX   NCBI_TaxID=3847 {ECO:0000313|EMBL:KRH00757.1};
RN   [1] {ECO:0000313|EMBL:KRH00757.1, ECO:0000313|EnsemblPlants:KRH00757}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRH00757};
RC   TISSUE=Callus {ECO:0000313|EMBL:KRH00757.1};
RX   PubMed=20075913; DOI=10.1038/nature08670;
RA   Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA   Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA   Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA   Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA   Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA   Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA   Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA   Stacey G., Shoemaker R.C., Jackson S.A.;
RT   "Genome sequence of the palaeopolyploid soybean.";
RL   Nature 463:178-183(2010).
RN   [2] {ECO:0000313|EnsemblPlants:KRH00757}
RP   IDENTIFICATION.
RC   STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRH00757};
RG   EnsemblPlants;
RL   Submitted (FEB-2018) to UniProtKB.
RN   [3] {ECO:0000313|EMBL:KRH00757.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Callus {ECO:0000313|EMBL:KRH00757.1};
RA   Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA   Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA   Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA   Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA   Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA   Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA   Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA   Jackson S.;
RT   "WGS assembly of Glycine max.";
RL   Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM000851; KRH00757.1; -; Genomic_DNA.
DR   EMBL; CM000851; KRH00758.1; -; Genomic_DNA.
DR   RefSeq; XP_003551662.1; XM_003551614.3.
DR   AlphaFoldDB; K7MUA4; -.
DR   EnsemblPlants; KRH00757; KRH00757; GLYMA_18G233100.
DR   EnsemblPlants; KRH00758; KRH00758; GLYMA_18G233100.
DR   GeneID; 100782204; -.
DR   Gramene; KRH00757; KRH00757; GLYMA_18G233100.
DR   Gramene; KRH00758; KRH00758; GLYMA_18G233100.
DR   KEGG; gmx:100782204; -.
DR   HOGENOM; CLU_014707_0_0_1; -.
DR   OMA; ESSEIWY; -.
DR   OrthoDB; 543602at2759; -.
DR   Proteomes; UP000008827; Chromosome 18.
DR   ExpressionAtlas; K7MUA4; baseline and differential.
DR   InterPro; IPR022212; DUF3741.
DR   InterPro; IPR025486; DUF4378.
DR   PANTHER; PTHR47212; ADHESIN-LIKE PROTEIN, PUTATIVE (DUF3741)-RELATED; 1.
DR   PANTHER; PTHR47212:SF4; ADHESIN-LIKE PROTEIN, PUTATIVE (DUF3741)-RELATED; 1.
DR   Pfam; PF12552; DUF3741; 1.
DR   Pfam; PF14309; DUF4378; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000008827}.
FT   DOMAIN          223..267
FT                   /note="DUF3741"
FT                   /evidence="ECO:0000259|Pfam:PF12552"
FT   DOMAIN          763..910
FT                   /note="DUF4378"
FT                   /evidence="ECO:0000259|Pfam:PF14309"
FT   REGION          107..174
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          526..567
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          691..723
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        107..132
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        142..174
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        528..544
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   929 AA;  105073 MW;  7A3935432FFCBCD3 CRC64;
     MAKRCQRFPV NYEKDQSGCM WGFISIFDFR HARFTRKLIA DRRHGSKHAV GAALTKNKFE
     VLSNLDEEYE GNFDRGESKR LTLTNDADKL SVKKLIEEEM IIDQDEIKDQ GNAEVESKQS
     RLGHEGPPKT DSKRKKKSRK KSRDMDSHDL NSDATLKSEF SHKPHSRQQS KDNLDLNKIM
     DDFCHVEAAC SMMNDDHGKI DEQSNQKHVI SENLANAIHE FANQMRLNGK DLPEDGQLLS
     SHELMEALQV ISSDKQLFLR LLQDPNSHLL KYIQELENAQ GRGGKECSSV TSSNCSEHEL
     VKLKQTRETA NRKHRNFFRK RVKSQPKDST NENEKTEFSN RIVILKPALT GMQISESGNN
     LASTLNSHDI AQYKNPSVRV GSHFSLTEIK RKLKCAMGKE RHGNPELIPR KLPVERQNKL
     PRGKCKDNAG MRSPNKDHFF IEKITRPMFN VVKGNKTGTM KDSELNVEHE SGIPNQSVSN
     IYIEARKHLC EMLDNADENT NISSRQMPKT LGRILSLPEY NFSSPGRDLE HHSVTAQATF
     SSSDKTREVS EDKLSPKPAT CIGLPDQEIN NSEKQSSICD ERSDNKVQEI KLVSNLSHDV
     NHVNTSEACY PVRDEIVTEG NVESTKEKND LESSLDPNGF IIGKDQNIDI SEIPDGAGCS
     ECLNQDIPEE NQSSSLLSSP QSSITKKIEE LENGTDVSGR PSPVSVLDTS FSDDDFGPGH
     SRYQPVKLPV QPLQIKFEEH DSSPAEQFDR RKYCFEESEL IYDYIKAVLH ASGLTTDQLL
     MKCLSSDKIL DPSLFDQVEL FSNLLCNNQK LLFDSINEVL MEICQHYFGA SPWVSFVNPS
     TRLTPSMKRV TLKVWEGVCW HMLPLPPPRT LEQIVRKDMA RRGTWMDLGL DTETIGFEMG
     EAILAELMED TILSLVIESP ESKCFSASI
//