ID A0A1U8JS51_GOSHI Unreviewed; 288 AA.
AC A0A1U8JS51;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=MYB transcription factor {ECO:0000256|ARBA:ARBA00032813};
GN Name=LOC107908833 {ECO:0000313|RefSeq:XP_016691598.1,
GN ECO:0000313|RefSeq:XP_016691599.1};
OS Gossypium hirsutum (Upland cotton) (Gossypium mexicanum).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX NCBI_TaxID=3635 {ECO:0000313|Proteomes:UP000189702, ECO:0000313|RefSeq:XP_016691598.1};
RN [1] {ECO:0000313|Proteomes:UP000189702}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. TM-1 {ECO:0000313|Proteomes:UP000189702};
RX PubMed=25893780; DOI=10.1038/nbt.3208;
RA Li F., Fan G., Lu C., Xiao G., Zou C., Kohel R.J., Ma Z., Shang H., Ma X.,
RA Wu J., Liang X., Huang G., Percy R.G., Liu K., Yang W., Chen W., Du X.,
RA Shi C., Yuan Y., Ye W., Liu X., Zhang X., Liu W., Wei H., Wei S., Huang G.,
RA Zhang X., Zhu S., Zhang H., Sun F., Wang X., Liang J., Wang J., He Q.,
RA Huang L., Wang J., Cui J., Song G., Wang K., Xu X., Yu J.Z., Zhu Y., Yu S.;
RT "Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1)
RT provides insights into genome evolution.";
RL Nat. Biotechnol. 33:524-530(2015).
RN [2] {ECO:0000313|RefSeq:XP_016691598.1, ECO:0000313|RefSeq:XP_016691599.1}
RP IDENTIFICATION.
RC TISSUE=Leaf {ECO:0000313|RefSeq:XP_016691598.1,
RC ECO:0000313|RefSeq:XP_016691599.1};
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_016691598.1; XM_016836109.1.
DR RefSeq; XP_016691599.1; XM_016836110.1.
DR STRING; 3635.A0A1U8JS51; -.
DR PaxDb; 3635-A0A1U8JS51; -.
DR GeneID; 107908833; -.
DR KEGG; ghi:107908833; -.
DR OrthoDB; 325267at2759; -.
DR Proteomes; UP000189702; Chromosome 20.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0000786; C:nucleosome; IEA:InterPro.
DR GO; GO:0003691; F:double-stranded telomeric DNA binding; IEA:InterPro.
DR GO; GO:0006334; P:nucleosome assembly; IEA:InterPro.
DR CDD; cd11660; SANT_TRF; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR InterPro; IPR005818; Histone_H1/H5_H15.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR044597; SMH1-6.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR PANTHER; PTHR46267; SINGLE MYB HISTONE 4; 1.
DR PANTHER; PTHR46267:SF8; TELOMERE REPEAT-BINDING FACTOR 1; 1.
DR Pfam; PF00538; Linker_histone; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM00526; H15; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS51504; H15; 1.
DR PROSITE; PS51294; HTH_MYB; 1.
DR PROSITE; PS50090; MYB_LIKE; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000189702}.
FT DOMAIN 1..33
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 5..57
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 120..188
FT /note="H15"
FT /evidence="ECO:0000259|PROSITE:PS51504"
FT COILED 241..268
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 288 AA; 32049 MW; 9896E4CA4F711B50 CRC64;
MGAPKQKWTP EEAAALKAGV IKHGAGKWRT ILKDPEFSGV LYLRSNVDLK AKWRNMSVMA
NGWGSRDKAR LAVKRTSSFP KQEESAVDLA VAPSDEEIVD VKSVPVSSAT LQIPSAAKRS
IVRLDNLIME AITTLKEPGG SNKTNIAAYI EEQYWAPPDF KRLLSAKLKY LTACGRLIKV
KRRYRIAPAL SFSDRRRNHP MLFSEGRQRV SPRFDRDDLK IITKSQIDLE LARMRKMTPQ
EAAAIAEAEE AAREAEVAEA DAEAAQAFAE AAMKTLKGRN NQKVMVRA
//