GenomeNet

Database: UniProt
Entry: A0A1U8K8W0_GOSHI
ID   A0A1U8K8W0_GOSHI        Unreviewed;       281 AA.
AC   A0A1U8K8W0;
DT   10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT   10-MAY-2017, sequence version 1.
DT   31-JUL-2019, entry version 13.
DE   SubName: Full=homeobox-leucine zipper protein HAT22-like {ECO:0000313|RefSeq:XP_016697398.1};
GN   Name=LOC107913352 {ECO:0000313|RefSeq:XP_016697398.1};
OS   Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
OC   Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae;
OC   Gossypium.
OX   NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016697398.1};
RN   [1] {ECO:0000313|Proteomes:UP000189702}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX   PubMed=25893780; DOI=10.1038/nbt.3208;
RA   Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H.,
RA   Ma X., Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W.,
RA   Chen W., Du X., Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W.,
RA   Wei H., Wei S., Huang G., Zhang X., Zhu S., Zhang H., Sun F., Wang X.,
RA   Liang J., Wang J., He Q., Huang L., Wang J., Cui J., Song G., Wang K.,
RA   Xu X., Yu J.Z., Zhu Y., Yu S.;
RT   "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT   provides insights into genome evolution.";
RL   Nat. Biotechnol. 33:524-530(2015).
RN   [2] {ECO:0000313|RefSeq:XP_016697398.1}
RP   IDENTIFICATION.
RC   TISSUE=Leaf {ECO:0000313|RefSeq:XP_016697398.1};
RG   RefSeq;
RL   Submitted (APR-2017) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-
CC       ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   RefSeq; XP_016697398.1; XM_016841909.1.
DR   GeneID; 107913352; -.
DR   KEGG; ghi:107913352; -.
DR   KO; K09338; -.
DR   Proteomes; UP000189702; Genome assembly.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR003106; Leu_zip_homeo.
DR   Pfam; PF02183; HALZ; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   SMART; SM00340; HALZ; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF46689; SSF46689; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Complete proteome {ECO:0000313|Proteomes:UP000189702};
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000256|RuleBase:RU000682, ECO:0000313|RefSeq:XP_016697398.1};
KW   Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000256|RuleBase:RU000682, ECO:0000313|RefSeq:XP_016697398.1};
KW   Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000256|RuleBase:RU000682};
KW   Reference proteome {ECO:0000313|Proteomes:UP000189702}.
FT   DOMAIN      132    192       Homeobox. {ECO:0000259|PROSITE:PS50071}.
FT   DNA_BIND    134    193       Homeobox. {ECO:0000256|PROSITE-ProRule:
FT                                PRU00108}.
FT   REGION       83    103       Disordered. {ECO:0000256|SAM:MobiDB-
FT                                lite}.
FT   COILED      198    228       {ECO:0000256|SAM:Coils}.
SQ   SEQUENCE   281 AA;  31231 MW;  C00E0A22C74ED13A CRC64;
     MGIDDACNTG LVLGLGFSST LGTPSKANNN QTPKKSSMSM AAASFEPSLT LALSGEIYLV
     NDNSKKIDVN KGVGYLHNHE EPGSGDLYRQ ASPHSAVSSF SSGRVKRERD LSCEEVEVEK
     NSSRVSEEDE DGVNARKKLR LTKDQSALLE ESFKQHSTLN PKQKQALAKQ LNLRPRQVEV
     WFQNRRARTK LKQTEVDCEF LKKCCETLTD ENRRLQKELQ ELKALKLAQP FYMHMPAATL
     TMCPSCERIG GVSDGSSKNP FSVLPSKPHF YNRFTNPSAA C
//