ID I1MQL8_SOYBN Unreviewed; 718 AA.
AC I1MQL8;
DT 13-JUN-2012, integrated into UniProtKB/TrEMBL.
DT 13-JUN-2012, sequence version 1.
DT 24-JAN-2024, entry version 74.
DE RecName: Full=Homeodomain/HOMEOBOX transcription factor {ECO:0008006|Google:ProtNLM};
GN Name=100785023 {ECO:0000313|EnsemblPlants:KRH09477};
GN ORFNames=GLYMA_16G217800 {ECO:0000313|EMBL:KRH09477.1};
OS Glycine max (Soybean) (Glycine hispida).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3847 {ECO:0000313|EMBL:KRH09477.1};
RN [1] {ECO:0000313|EMBL:KRH09477.1, ECO:0000313|EnsemblPlants:KRH09477}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRH09477};
RC TISSUE=Callus {ECO:0000313|EMBL:KRH09477.1};
RX PubMed=20075913; DOI=10.1038/nature08670;
RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA Stacey G., Shoemaker R.C., Jackson S.A.;
RT "Genome sequence of the palaeopolyploid soybean.";
RL Nature 463:178-183(2010).
RN [2] {ECO:0000313|EnsemblPlants:KRH09477}
RP IDENTIFICATION.
RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRH09477};
RG EnsemblPlants;
RL Submitted (FEB-2018) to UniProtKB.
RN [3] {ECO:0000313|EMBL:KRH09477.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Callus {ECO:0000313|EMBL:KRH09477.1};
RA Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA Jackson S.;
RT "WGS assembly of Glycine max.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class IV subfamily.
CC {ECO:0000256|ARBA:ARBA00006789}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000849; KRH09477.1; -; Genomic_DNA.
DR EMBL; CM000849; KRH09478.1; -; Genomic_DNA.
DR RefSeq; XP_003549189.1; XM_003549141.3.
DR RefSeq; XP_014624636.1; XM_014769150.1.
DR AlphaFoldDB; I1MQL8; -.
DR SMR; I1MQL8; -.
DR STRING; 3847.I1MQL8; -.
DR PaxDb; 3847-GLYMA16G34350-1; -.
DR EnsemblPlants; KRH09477; KRH09477; GLYMA_16G217800.
DR EnsemblPlants; KRH09478; KRH09478; GLYMA_16G217800.
DR GeneID; 100785023; -.
DR Gramene; KRH09477; KRH09477; GLYMA_16G217800.
DR Gramene; KRH09478; KRH09478; GLYMA_16G217800.
DR KEGG; gmx:100785023; -.
DR eggNOG; ENOG502QQXM; Eukaryota.
DR HOGENOM; CLU_015002_2_1_1; -.
DR InParanoid; I1MQL8; -.
DR OMA; GSSMHSH; -.
DR OrthoDB; 3036383at2759; -.
DR Proteomes; UP000008827; Chromosome 16.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0008289; F:lipid binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR CDD; cd08875; START_ArGLABRA2_like; 1.
DR Gene3D; 3.30.530.20; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR042160; GLABRA2/ANL2/PDF2/ATML1-like.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR023393; START-like_dom_sf.
DR InterPro; IPR002913; START_lipid-bd_dom.
DR PANTHER; PTHR45654:SF1; HOMEOBOX-LEUCINE ZIPPER PROTEIN HDG12; 1.
DR PANTHER; PTHR45654; HOMEOBOX-LEUCINE ZIPPER PROTEIN MERISTEM L1; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF01852; START; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00234; START; 1.
DR SUPFAM; SSF55961; Bet v1-like; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50848; START; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000008827}.
FT DOMAIN 22..82
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 215..451
FT /note="START"
FT /evidence="ECO:0000259|PROSITE:PS50848"
FT DNA_BIND 24..83
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..32
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 643..666
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..23
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 649..666
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 718 AA; 79149 MW; 9FF3F52CBF67DC5D CRC64;
MEFGSGSPGD RHHHHDGSSD SQRRKKRYHR HTANQIQRLE SMFKECPHPD EKQRLQLSRE
LGLAPRQIKF WFQNRRTQMK AQHERADNCA LRAENDKIRC ENIAIREALK NVICPSCGGP
PMNDDCYFDE QKLRLENAQL KEELDRVSSI AAKYIGRPIS QLPPVQPIHI SSLDLSMGTF
ASQGLGGPSL DLDLLPGSSS SSMPNVPPFQ PPCLSDMDKS LMSDIASNAM EEMIRLLQTN
EPLWMKGADG RDVLDLDSYE RMFPKANSHL KNPNVHVEAS RDSGVVIMNG LTLVDMFMDP
NKWMELFSTI VTMARTIEVI SSGMMGGHGG SLQLMYEELQ VLSPLVSTRE FYFLRYCQQI
EQGLWAIVDV SYDFTQDNQF APQFRSHRLP SGVFIQDMPN GYSKVTWIEH VEIEDKTPVH
RLYRNIIYSG IAFGAQRWLT TLQRMCERIA CLLVTGNSTR DLGGVIPSPE GKRSMMKLAQ
RMVTNFCASI SSSAGHRWTT LSGSGMNEVG VRVTVHKSSD PGQPNGVVLS AATTIWLPIP
PQTVFNFFKD EKKRPQWDVL SNGNAVQEVA HIANGSHPGN CISVLRAFNS SQNNMLILQE
SCVDSSGSLV VYCPVDLPAI NIAMSGEDPS YIPLLPSGFT ISPDGQADQD GGGASTSTSS
RVMGGGSGSG GSLITVAFQI LVSSLPSAKL NMESVTTVNS LIGNTVQHIK AALNCPSS
//