ID A0A445G7C0_GLYSO Unreviewed; 312 AA.
AC A0A445G7C0;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE SubName: Full=Homeobox-leucine zipper protein HAT22 {ECO:0000313|EMBL:RZB57078.1};
GN ORFNames=D0Y65_045957 {ECO:0000313|EMBL:RZB57078.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:RZB57078.1, ECO:0000313|Proteomes:UP000289340};
RN [1] {ECO:0000313|EMBL:RZB57078.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZB57078.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class II subfamily.
CC {ECO:0000256|ARBA:ARBA00006074}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RZB57078.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QZWG01000017; RZB57078.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A445G7C0; -.
DR SMR; A0A445G7C0; -.
DR OrthoDB; 461623at2759; -.
DR Proteomes; UP000289340; Chromosome 17.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR003106; Leu_zip_homeo.
DR PANTHER; PTHR45714; HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT14; 1.
DR PANTHER; PTHR45714:SF34; HOMEOBOX-LEUCINE ZIPPER PROTEIN HAT9; 1.
DR Pfam; PF02183; HALZ; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR SMART; SM00340; HALZ; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000289340}.
FT DOMAIN 160..220
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 162..221
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 88..122
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 226..256
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 88..120
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 312 AA; 34423 MW; 2738CAEE8F5A5450 CRC64;
MMGLDQDASS NSGLHLILGL ALTATTTTTP SLPSISNKLD HVDHHHHLIT LRPTTKSPYN
SSEAEPSLTL GLSRESYLKV PKNIIGQNSN NKVSSCDDPL DHLSTQTNSP HHSAVSSFSS
GRVKRERDLS CEEVVDATEI DQRDHSCEGI VRATDEDEDG TAARKKLRLS KEQSALLEES
FKQHSTLNPK QKQALAKQLN LRPRQVEVWF QNRRARTKLK QTEVDCEFLK KCCETLTDEN
RRLQKELQEL KALKLAQPLY MPMPAATLTM CPSCERLGGG INGGGGGSPK TPFSMAPKPH
FFNPFANPSA AC
//