ID I1MYP2_SOYBN Unreviewed; 322 AA.
AC I1MYP2;
DT 13-JUN-2012, integrated into UniProtKB/TrEMBL.
DT 13-JUN-2012, sequence version 1.
DT 27-MAR-2024, entry version 79.
DE RecName: Full=Homeobox-leucine zipper protein {ECO:0000256|RuleBase:RU369038};
DE AltName: Full=HD-ZIP protein {ECO:0000256|RuleBase:RU369038};
DE AltName: Full=Homeodomain transcription factor {ECO:0000256|RuleBase:RU369038};
GN Name=100777924 {ECO:0000313|EnsemblPlants:KRG97544};
GN ORFNames=GLYMA_18G014900 {ECO:0000313|EMBL:KRG97544.1};
OS Glycine max (Soybean) (Glycine hispida).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3847 {ECO:0000313|EnsemblPlants:KRG97544};
RN [1] {ECO:0000313|EMBL:KRG97544.1, ECO:0000313|EnsemblPlants:KRG97544}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRG97544};
RC TISSUE=Callus {ECO:0000313|EMBL:KRG97544.1};
RX PubMed=20075913; DOI=10.1038/nature08670;
RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA Stacey G., Shoemaker R.C., Jackson S.A.;
RT "Genome sequence of the palaeopolyploid soybean.";
RL Nature 463:178-183(2010).
RN [2] {ECO:0000313|EnsemblPlants:KRG97544}
RP IDENTIFICATION.
RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRG97544};
RG EnsemblPlants;
RL Submitted (FEB-2018) to UniProtKB.
RN [3] {ECO:0000313|EMBL:KRG97544.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Callus {ECO:0000313|EMBL:KRG97544.1};
RA Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA Jackson S.;
RT "WGS assembly of Glycine max.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Transcription factor. {ECO:0000256|RuleBase:RU369038}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class I subfamily.
CC {ECO:0000256|ARBA:ARBA00025748, ECO:0000256|RuleBase:RU369038}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000851; KRG97544.1; -; Genomic_DNA.
DR RefSeq; XP_003552015.1; XM_003551967.3.
DR AlphaFoldDB; I1MYP2; -.
DR SMR; I1MYP2; -.
DR STRING; 3847.I1MYP2; -.
DR PaxDb; 3847-GLYMA18G01830-1; -.
DR EnsemblPlants; KRG97544; KRG97544; GLYMA_18G014900.
DR GeneID; 100777924; -.
DR Gramene; KRG97544; KRG97544; GLYMA_18G014900.
DR KEGG; gmx:100777924; -.
DR eggNOG; KOG0483; Eukaryota.
DR HOGENOM; CLU_060842_1_0_1; -.
DR InParanoid; I1MYP2; -.
DR OMA; MLNGYEE; -.
DR OrthoDB; 450969at2759; -.
DR Proteomes; UP000008827; Chromosome 18.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:UniProtKB-UniRule.
DR GO; GO:0043565; F:sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0045893; P:positive regulation of DNA-templated transcription; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR045224; HDZip_class_I_plant.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR000047; HTH_motif.
DR InterPro; IPR003106; Leu_zip_homeo.
DR PANTHER; PTHR24326; HOMEOBOX-LEUCINE ZIPPER PROTEIN; 1.
DR PANTHER; PTHR24326:SF547; HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-6; 1.
DR Pfam; PF02183; HALZ; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR PRINTS; PR00031; HTHREPRESSR.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000008827};
KW Transcription {ECO:0000256|RuleBase:RU369038};
KW Transcription regulation {ECO:0000256|RuleBase:RU369038}.
FT DOMAIN 53..113
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 55..114
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..29
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 154..206
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 172..187
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 188..203
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 322 AA; 36683 MW; BCA4DC1CB05EA0E3 CRC64;
MKRLSSSDSS SALMTICPST EEHSPRNSQH MYGREFQSML DGLDEEGCVE EPGYQSEKKR
RLSVDQVKAL EKNFEVENKL EPERKVKLAQ ELGLQPRQVA VWFQNRRARW KTKQLERDYG
VLKANYDALK LNFDTLDQDN EALRKQVKEL KSRLLQEENT GGSGVSVKEE IITRPADSED
KTMEQSKSDP SSETSNINPS SESEEDHLNY ECFNNNDDCV GGTAASLLQV DFKDGSSDSD
GSSAILNEDN MYSPLKFNNC SISTSPSSSS MMNCFQFQKP YHHHAQYVKM EEHNFLSADE
ACNFFSDEQA PTLQWYCPEQ WS
//