ID A0A1U8JN57_GOSHI Unreviewed; 295 AA.
AC A0A1U8JN57;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE SubName: Full=Homeobox protein knotted-1-like 3 {ECO:0000313|RefSeq:XP_016690138.1};
GN Name=LOC107907313 {ECO:0000313|RefSeq:XP_016690138.1};
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016690138.1};
RN [1] {ECO:0000313|Proteomes:UP000189702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX PubMed=25893780; DOI=10.1038/nbt.3208;
RA Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT provides insights into genome evolution.";
RL Nat. Biotechnol. 33:524-530(2015).
RN [2] {ECO:0000313|RefSeq:XP_016690138.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_016690138.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/KNOX homeobox family.
CC {ECO:0000256|PROSITE-ProRule:PRU00559}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016690138.1; XM_016834649.1.
DR AlphaFoldDB; A0A1U8JN57; -.
DR SMR; A0A1U8JN57; -.
DR PaxDb; 3635-A0A1U8JN57; -.
DR GeneID; 107907313; -.
DR KEGG; ghi:107907313; -.
DR OMA; MNFAKKM; -.
DR OrthoDB; 3180467at2759; -.
DR Proteomes; UP000189702; Chromosome 19.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR005539; ELK_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR005540; KNOX1.
DR InterPro; IPR005541; KNOX2.
DR PANTHER; PTHR11850:SF358; HOMEOBOX PROTEIN KNOTTED-1-LIKE 13; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF03789; ELK; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF03790; KNOX1; 1.
DR Pfam; PF03791; KNOX2; 1.
DR SMART; SM01188; ELK; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM01255; KNOX1; 1.
DR SMART; SM01256; KNOX2; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51213; ELK; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000189702}.
FT DOMAIN 200..220
FT /note="ELK"
FT /evidence="ECO:0000259|PROSITE:PS51213"
FT DOMAIN 220..283
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 221..284
FT /note="Homeobox; TALE-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
SQ SEQUENCE 295 AA; 33294 MW; 9217CC3047191C9E CRC64;
MAFDGFVSDN TKDMQTLALN EPPPSAADID HWERAKCKAE IMGHPMYDQL LEAHVACLRV
ATPVDQLAQI DAQLARSQDV LAKYSSAAAA AGSAEEELDH FMANYVLLLG FFKDQLQQHV
RVHAMEAVMA CWDLEQSLQS LTGVSPGEGT GATMSDDEDE VVDSDTSLFD GSFDGIDSMG
FGPLVPSETE RSLMERVRQE LKHELKQGYK EKIVDIREEI LRKRRAGKLP GDTTSFLKAW
WQSHSKWPYP TEEDKAKLVQ ETGLQLKQIN NWFINQRKRN WHSNPSTSLK SKRKR
//