ID G3P5E8_GASAC Unreviewed; 737 AA.
AC G3P5E8;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 54.
DE SubName: Full=SIX homeobox 5 {ECO:0000313|Ensembl:ENSGACP00000012821.1};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000012821.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000012821.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000012821.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Sequence-specific transcription factor which is part of a
CC developmental regulatory system that provides cells with specific
CC positional identities on the anterior-posterior axis.
CC {ECO:0000256|ARBA:ARBA00003263}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3P5E8; -.
DR STRING; 69293.ENSGACP00000012821; -.
DR Ensembl; ENSGACT00000012845.1; ENSGACP00000012821.1; ENSGACG00000009721.1.
DR eggNOG; KOG0775; Eukaryota.
DR GeneTree; ENSGT00940000166238; -.
DR InParanoid; G3P5E8; -.
DR OMA; EQTTNCT; -.
DR TreeFam; TF315545; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR Bgee; ENSGACG00000009721; Expressed in zone of skin and 2 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR031701; SIX1_SD.
DR PANTHER; PTHR10390; HOMEOBOX PROTEIN SIX; 1.
DR PANTHER; PTHR10390:SF65; HOMEOBOX PROTEIN SIX4.3; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF16878; SIX1_SD; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 171..222
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 173..223
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..32
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 211..257
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 394..414
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..18
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 213..252
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..414
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 737 AA; 77566 MW; AE361027FDD55513 CRC64;
MASLSLESTE QSESGPDEPS ATESKEEEDP VQISERLLQS FQKSALSFST DQVSCLCEAL
LQAGNVDRLW RFLSTIPPPS ELLRGNETML KAQALVAFHR DEFKELYAIL ESHDFHPSNH
GFLQDLYLKA RYKEAERSRG RSLGAVDKYR LRKKFPLPKT IWDGEETVYC FKEKSRNALK
DCYKSNRYPT PDEKKNLANV SGLSLTQVSN WFKNRRQRDR TPSGTHSKSE SDGNHSTEDE
ASPMDDGPDK PEEAAGSTAS IISLSALPCG AGGQLILNGS GGFLTAPQSL LLNGNSLLSG
TGAGVIINGL SLGDCQTVTL SPVATNSPLM LNGAPQALSV APQELSAVEA KSSSLPAVVL
GGNTASVSNP PSVISLTQPT RTDGGDATDY ISVPEGRTVS PSSSLSPSSP TLSSPTMLSS
LVLTQNNQCQ ESLTLPASMS SAGMLLSGTS VPLSGPHGEY VVFATGGSHL NPSSSVVPSS
NSTPQVFSLP QVVPSIQGVP VSQLVQHSSG AQVSQCPQLV PLSPLASSAP HFQNQTANTS
TRLVQQLQDS PTTLPEGATT IFSISQLNNH QLLQRMVHPP GDQSTNSTPS KMQVPQVISI
SSPTQVVSVP QNKGNTAAQL VPLSMPQLVP VSSIQTSSSI SFPQVVPASP SLSMSSAGVP
LQILTSSPAG VSQAPLRINQ LRPIQSVGPQ TIGAPGMQLL NSGIIQLPSP PVGNLLLGGS
PYLSVQQGKL ILTIPSG
//