Database: UniProt
Entry: I1MQL8_SOYBN
ID   I1MQL8_SOYBN            Unreviewed;       718 AA.
AC   I1MQL8;
DT   13-JUN-2012, integrated into UniProtKB/TrEMBL.
DT   13-JUN-2012, sequence version 1.
DT   24-JAN-2024, entry version 74.
DE   RecName: Full=Homeodomain/HOMEOBOX transcription factor {ECO:0008006|Google:ProtNLM};
GN   Name=100785023 {ECO:0000313|EnsemblPlants:KRH09477};
GN   ORFNames=GLYMA_16G217800 {ECO:0000313|EMBL:KRH09477.1};
OS   Glycine max (Soybean) (Glycine hispida).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC   NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC   Glycine subgen. Soja.
OX   NCBI_TaxID=3847 {ECO:0000313|EMBL:KRH09477.1};
RN   [1] {ECO:0000313|EMBL:KRH09477.1, ECO:0000313|EnsemblPlants:KRH09477}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRH09477};
RC   TISSUE=Callus {ECO:0000313|EMBL:KRH09477.1};
RX   PubMed=20075913; DOI=10.1038/nature08670;
RA   Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA   Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA   Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA   Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA   Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA   Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA   Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA   Stacey G., Shoemaker R.C., Jackson S.A.;
RT   "Genome sequence of the palaeopolyploid soybean.";
RL   Nature 463:178-183(2010).
RN   [2] {ECO:0000313|EnsemblPlants:KRH09477}
RP   IDENTIFICATION.
RC   STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRH09477};
RG   EnsemblPlants;
RL   Submitted (FEB-2018) to UniProtKB.
RN   [3] {ECO:0000313|EMBL:KRH09477.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Callus {ECO:0000313|EMBL:KRH09477.1};
RA   Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA   Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA   Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA   Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA   Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA   Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA   Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA   Jackson S.;
RT   "WGS assembly of Glycine max.";
RL   Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class IV subfamily.
CC       {ECO:0000256|ARBA:ARBA00006789}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM000849; KRH09477.1; -; Genomic_DNA.
DR   EMBL; CM000849; KRH09478.1; -; Genomic_DNA.
DR   RefSeq; XP_003549189.1; XM_003549141.3.
DR   RefSeq; XP_014624636.1; XM_014769150.1.
DR   AlphaFoldDB; I1MQL8; -.
DR   SMR; I1MQL8; -.
DR   STRING; 3847.I1MQL8; -.
DR   PaxDb; 3847-GLYMA16G34350-1; -.
DR   EnsemblPlants; KRH09477; KRH09477; GLYMA_16G217800.
DR   EnsemblPlants; KRH09478; KRH09478; GLYMA_16G217800.
DR   GeneID; 100785023; -.
DR   Gramene; KRH09477; KRH09477; GLYMA_16G217800.
DR   Gramene; KRH09478; KRH09478; GLYMA_16G217800.
DR   KEGG; gmx:100785023; -.
DR   eggNOG; ENOG502QQXM; Eukaryota.
DR   HOGENOM; CLU_015002_2_1_1; -.
DR   InParanoid; I1MQL8; -.
DR   OMA; GSSMHSH; -.
DR   OrthoDB; 3036383at2759; -.
DR   Proteomes; UP000008827; Chromosome 16.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   GO; GO:0008289; F:lipid binding; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   CDD; cd08875; START_ArGLABRA2_like; 1.
DR   Gene3D; 3.30.530.20; -; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR042160; GLABRA2/ANL2/PDF2/ATML1-like.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR023393; START-like_dom_sf.
DR   InterPro; IPR002913; START_lipid-bd_dom.
DR   PANTHER; PTHR45654:SF1; HOMEOBOX-LEUCINE ZIPPER PROTEIN HDG12; 1.
DR   PANTHER; PTHR45654; HOMEOBOX-LEUCINE ZIPPER PROTEIN MERISTEM L1; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   Pfam; PF01852; START; 1.
DR   SMART; SM00389; HOX; 1.
DR   SMART; SM00234; START; 1.
DR   SUPFAM; SSF55961; Bet v1-like; 2.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
DR   PROSITE; PS50848; START; 1.
PE   3: Inferred from homology;
KW   Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000008827}.
FT   DOMAIN          22..82
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          215..451
FT                   /note="START"
FT                   /evidence="ECO:0000259|PROSITE:PS50848"
FT   DNA_BIND        24..83
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          1..32
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          643..666
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..23
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        649..666
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   718 AA;  79149 MW;  9FF3F52CBF67DC5D CRC64;
     MEFGSGSPGD RHHHHDGSSD SQRRKKRYHR HTANQIQRLE SMFKECPHPD EKQRLQLSRE
     LGLAPRQIKF WFQNRRTQMK AQHERADNCA LRAENDKIRC ENIAIREALK NVICPSCGGP
     PMNDDCYFDE QKLRLENAQL KEELDRVSSI AAKYIGRPIS QLPPVQPIHI SSLDLSMGTF
     ASQGLGGPSL DLDLLPGSSS SSMPNVPPFQ PPCLSDMDKS LMSDIASNAM EEMIRLLQTN
     EPLWMKGADG RDVLDLDSYE RMFPKANSHL KNPNVHVEAS RDSGVVIMNG LTLVDMFMDP
     NKWMELFSTI VTMARTIEVI SSGMMGGHGG SLQLMYEELQ VLSPLVSTRE FYFLRYCQQI
     EQGLWAIVDV SYDFTQDNQF APQFRSHRLP SGVFIQDMPN GYSKVTWIEH VEIEDKTPVH
     RLYRNIIYSG IAFGAQRWLT TLQRMCERIA CLLVTGNSTR DLGGVIPSPE GKRSMMKLAQ
     RMVTNFCASI SSSAGHRWTT LSGSGMNEVG VRVTVHKSSD PGQPNGVVLS AATTIWLPIP
     PQTVFNFFKD EKKRPQWDVL SNGNAVQEVA HIANGSHPGN CISVLRAFNS SQNNMLILQE
     SCVDSSGSLV VYCPVDLPAI NIAMSGEDPS YIPLLPSGFT ISPDGQADQD GGGASTSTSS
     RVMGGGSGSG GSLITVAFQI LVSSLPSAKL NMESVTTVNS LIGNTVQHIK AALNCPSS
//