Database: UniProt
Entry: A0A1U8K8U1_GOSHI
ID   A0A1U8K8U1_GOSHI        Unreviewed;       801 AA.
AC   A0A1U8K8U1;
DT   10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT   10-MAY-2017, sequence version 1.
DT   24-JAN-2024, entry version 24.
DE   SubName: Full=BEL1-like homeodomain protein 4 {ECO:0000313|RefSeq:XP_016697058.1, ECO:0000313|RefSeq:XP_016697059.1};
GN   Name=LOC107913110 {ECO:0000313|RefSeq:XP_016697058.1,
GN   ECO:0000313|RefSeq:XP_016697059.1};
OS   Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX   NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016697059.1};
RN   [1] {ECO:0000313|Proteomes:UP000189702}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX   PubMed=25893780; DOI=10.1038/nbt.3208;
RA   Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA   Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA   Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA   Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA   Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT   "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT   provides insights into genome evolution.";
RL   Nat. Biotechnol. 33:524-530(2015).
RN   [2] {ECO:0000313|RefSeq:XP_016697058.1, ECO:0000313|RefSeq:XP_016697059.1}
RP   IDENTIFICATION.
RC   TISSUE=Leaf {ECO:0000313|RefSeq:XP_016697058.1,
RC   ECO:0000313|RefSeq:XP_016697059.1};
RG   RefSeq;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PROSITE-ProRule:PRU00108}.
CC   -!- SIMILARITY: Belongs to the TALE/BELL homeobox family.
CC       {ECO:0000256|ARBA:ARBA00006454}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_016697058.1; XM_016841569.1.
DR   RefSeq; XP_016697059.1; XM_016841570.1.
DR   STRING; 3635.A0A1U8K8U1; -.
DR   PaxDb; 3635-A0A1U8K8U1; -.
DR   GeneID; 107913110; -.
DR   KEGG; ghi:107913110; -.
DR   OrthoDB; 3180467at2759; -.
DR   Proteomes; UP000189702; Chromosome 21.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR   GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR   GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR008422; Homeobox_KN_domain.
DR   InterPro; IPR006563; POX_dom.
DR   PANTHER; PTHR11850:SF361; BEL1-LIKE HOMEODOMAIN PROTEIN 2; 1.
DR   PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR   Pfam; PF05920; Homeobox_KN; 1.
DR   Pfam; PF07526; POX; 1.
DR   SMART; SM00389; HOX; 1.
DR   SMART; SM00574; POX; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
PE   3: Inferred from homology;
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW   ECO:0000313|RefSeq:XP_016697058.1};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000189702}.
FT   DOMAIN          561..624
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DNA_BIND        563..625
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          211..230
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          405..444
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          632..692
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          705..731
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        405..437
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        632..647
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        648..692
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        705..730
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   801 AA;  86742 MW;  19F1A4E1B6BD30DC CRC64;
     MGIATPPLVA SILSQESQPS KKIHHQIPIQ HKSNNSTNSM SQDYHQAAGI FSFSNGFERS
     AVSHQEHQQQ QQQHLEQQIR RDKLRVQGFE PPPPPLVGIE EEESTSLPVY ETAGMLSEMF
     NFPSGAAAAA AAATASTELL DQHVQPNYRA HRPAPGNTNE WYSNRQGAVG GLGQLGESKS
     HITRDSLAQQ QQQLPSINAD SAAAMHLFLM NPQQRSPSPP PPPPATSSNT LHMLLPNPST
     SLQGFNVSGP GGGFGTSTVL SPPQFTWVPG SAHEGGNNTD SQLSSLNEIG SVVEGQGLSL
     SLSSSLQHLE AAKAEELGMG DGGLLYYNQG GGSAAQFHQY RNLGSHHHQT MHLQGGVGQN
     QQVHHVGFGS SLGMVNVLRN SKYVKAAQEL LEEFCSVGRG QFKKNKFGRN NTNPSSDPGS
     SGGAGGGGSS SSTKDLPPLS AADRIEHQRR KVKLLSMLDE VERRYNHYCE QMQMVVNSFD
     LVMGFGAAVP YTALAQKAMS RHFRCLKDAI SAQLKHSCEM LGEKDGAGSS GITKGETPRL
     RMLEQSLRQQ RAFNQMGMME QEAWRPQRGL PERSVNILRA WLFEHFLHPY PSDADKHLLA
     RQTGLSRNQV SNWFINARVR LWKPMVEEMY QQETKEEGDH NNNNNNNHTE RERNPNNNNN
     AQTSTPSTTA APTTTTTTSV GKGSQINATE NDPSLIAINT PQCFSENQAN PNATEVAPPI
     SQPFTTSIPH DSDIHHQRIA GTTVAAADYG TTAGGNTDIG SSLIRLGTTT AGDVSLTLGL
     RHAGNMPENT SSFSVRDFGG C
//
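
The record above follows the UniProtKB flat-file layout: each annotation line starts with a two-letter line code, the SQ header declares the sequence length, the residues follow in space-separated blocks, and "//" ends the entry. As a minimal sketch of how such a record could be read and sanity-checked, the Python below collects lines by code, joins the sequence blocks, and compares the joined length with the length declared on the SQ line. The function names (parse_flatfile, declared_length, feature_spans) and the SAMPLE record are hypothetical and only illustrate the layout; this is not an official UniProt or GenomeNet parser.

```python
import re

# Annotation lines: two capital letters, three spaces, then the payload.
LINE_RE = re.compile(r"^([A-Z]{2}) {3}(.*)$")

def parse_flatfile(text):
    """Return (fields, sequence) for the first entry in `text`.

    fields maps each line code (ID, DE, FT, ...) to its list of payloads;
    sequence is the residue string with the block spacing removed.
    """
    fields, seq_parts, in_seq = {}, [], False
    for line in text.splitlines():
        if line.startswith("//"):          # entry terminator
            break
        if in_seq:                         # residue lines follow the SQ header
            seq_parts.append("".join(line.split()))
            continue
        m = LINE_RE.match(line)
        if not m:                          # skip page text that is not a record line
            continue
        code, payload = m.groups()
        fields.setdefault(code, []).append(payload)
        if code == "SQ":
            in_seq = True
    return fields, "".join(seq_parts)

def declared_length(fields):
    """Length stated on the SQ header, e.g. 'SEQUENCE   801 AA; ...'."""
    m = re.search(r"SEQUENCE\s+(\d+)\s+AA", fields["SQ"][0])
    return int(m.group(1))

def feature_spans(fields):
    """List (key, start, end) for FT lines of the form 'KEY   start..end'."""
    spans = []
    for payload in fields.get("FT", []):
        m = re.match(r"^(\S+)\s+(\d+)\.\.(\d+)$", payload.rstrip())
        if m:                              # qualifier lines (/note, /evidence) do not match
            spans.append((m.group(1), int(m.group(2)), int(m.group(3))))
    return spans

# Hypothetical miniature record, used only to exercise the functions.
SAMPLE = """ID   TEST_ENTRY        Unreviewed;       12 AA.
FT   DOMAIN          3..8
FT                   /note="Example"
SQ   SEQUENCE   12 AA;  0 MW;  0 CRC64;
     MGIATPPLVA SI
//
"""

fields, seq = parse_flatfile(SAMPLE)
assert len(seq) == declared_length(fields)
```

Applied to the entry above, the same check would compare the joined residue blocks against the 801 AA declared on its SQ line, and feature_spans would list the DOMAIN, DNA_BIND, REGION and COMPBIAS ranges.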