ID A0A1U8MGE6_GOSHI Unreviewed; 311 AA.
AC A0A1U8MGE6;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Homeobox protein HD1-like {ECO:0000313|RefSeq:XP_016724639.1};
GN Name=LOC107936422 {ECO:0000313|RefSeq:XP_016724639.1};
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016724639.1};
RN [1] {ECO:0000313|Proteomes:UP000189702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX PubMed=25893780; DOI=10.1038/nbt.3208;
RA Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT provides insights into genome evolution.";
RL Nat. Biotechnol. 33:524-530(2015).
RN [2] {ECO:0000313|RefSeq:XP_016724639.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_016724639.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/KNOX homeobox family.
CC {ECO:0000256|PROSITE-ProRule:PRU00559}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016724639.1; XM_016869150.1.
DR AlphaFoldDB; A0A1U8MGE6; -.
DR PaxDb; 3635-A0A1U8MGE6; -.
DR GeneID; 107936422; -.
DR KEGG; ghi:107936422; -.
DR OrthoDB; 3180467at2759; -.
DR Proteomes; UP000189702; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR005539; ELK_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR005540; KNOX1.
DR InterPro; IPR005541; KNOX2.
DR PANTHER; PTHR11850:SF74; HOMEOBOX PROTEIN KNOTTED-1-LIKE 7; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF03790; KNOX1; 1.
DR Pfam; PF03791; KNOX2; 1.
DR SMART; SM01188; ELK; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM01255; KNOX1; 1.
DR SMART; SM01256; KNOX2; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51213; ELK; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000313|RefSeq:XP_016724639.1};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000189702}.
FT DOMAIN 214..234
FT /note="ELK"
FT /evidence="ECO:0000259|PROSITE:PS51213"
FT DOMAIN 234..297
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 235..298
FT /note="Homeobox; TALE-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
SQ SEQUENCE 311 AA; 35744 MW; 49F092EA437A23E6 CRC64;
MKEHKFREMQ EPELGMMANF GGTTTAAIGG LSSRELSVPL DQNHRQLKAE IATHPLYDQL
LTAHVSCLRV ATTIEQLPLI DAQLAQCHNV LRSYASQHPQ HGHSLSPHHR QDLDNFLAQY
LIMLCRFKEE LQQHVRVDAV EAVMACREIE NNLHALTGVT LGESTGATMS DDEDELQMDF
PLDNSGVEAN DLMGFGPLLP TESERTLMER VRKELKIELK QGYKSKIEDV REEILRKRRA
GKLPGDTTSV LKNWWQQHSK WPYPTEDDKA KLVEETGLQL KQINNWFINQ RKRNWHTNYQ
STTSLKSKRK R
//