ID A0A445LUP1_GLYSO Unreviewed; 316 AA.
AC A0A445LUP1;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE SubName: Full=Homeobox protein SBH1 isoform A {ECO:0000313|EMBL:RZC26928.1};
GN ORFNames=D0Y65_005212 {ECO:0000313|EMBL:RZC26928.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:RZC26928.1, ECO:0000313|Proteomes:UP000289340};
RN [1] {ECO:0000313|EMBL:RZC26928.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZC26928.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/KNOX homeobox family.
CC {ECO:0000256|PROSITE-ProRule:PRU00559}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RZC26928.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QZWG01000002; RZC26928.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A445LUP1; -.
DR OrthoDB; 3180467at2759; -.
DR Proteomes; UP000289340; Chromosome 2.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR005539; ELK_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR005540; KNOX1.
DR InterPro; IPR005541; KNOX2.
DR PANTHER; PTHR11850:SF409; HOMEOBOX KNOTTED-LIKE PROTEIN; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF03789; ELK; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF03790; KNOX1; 1.
DR Pfam; PF03791; KNOX2; 1.
DR SMART; SM01188; ELK; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM01255; KNOX1; 1.
DR SMART; SM01256; KNOX2; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51213; ELK; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000289340}.
FT DOMAIN 200..220
FT /note="ELK"
FT /evidence="ECO:0000259|PROSITE:PS51213"
FT DOMAIN 220..283
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 221..284
FT /note="Homeobox; TALE-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
SQ SEQUENCE 316 AA; 36312 MW; 1E46B967F5699D99 CRC64;
MIGFGGNSCS ELSEMDMTNM ETNRKFLSLP LSNNNNSGDN RRVPVTSNSI AQEHHYSHHH
NPTDTCSVRD KIMAHPLFPR LLSSYLNCLK VGAPPEVVAS LEESYAKYES FNASSGRIGG
GSIGEDPALD QFMEAYCEML IKYEQELTKP FKEAMLFFSR IECQLKALAV SSDFGQSETS
SKNEVDVHEN NLDSQAEDRE LKVQLLRKYS GYLGSLKKEF LKKKKNGKLP KEARQQLLDW
WNRHYKWPYP SESQKQALAE STGLDMKQIN NWFINQRKRH WKPSEDMQFA VMDATNYYME
NVMCKPFPMD SMPMLL
//