ID G3NQ77_GASAC Unreviewed; 473 AA.
AC G3NQ77;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=SRY-box transcription factor 9a {ECO:0000313|Ensembl:ENSGACP00000007493.1};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000007493.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000007493.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000007493.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3NQ77; -.
DR STRING; 69293.ENSGACP00000007493; -.
DR Ensembl; ENSGACT00000007512.1; ENSGACP00000007493.1; ENSGACG00000005675.1.
DR eggNOG; KOG0527; Eukaryota.
DR GeneTree; ENSGT00940000158269; -.
DR InParanoid; G3NQ77; -.
DR OMA; AWMSKSQ; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR Bgee; ENSGACG00000005675; Expressed in embryo and 2 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd22031; HMG-box_SoxE; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR022151; Sox_N.
DR PANTHER; PTHR45803; SOX100B; 1.
DR PANTHER; PTHR45803:SF1; TRANSCRIPTION FACTOR SOX-9; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF12444; Sox_N; 1.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Isopeptide bond {ECO:0000256|ARBA:ARBA00022499};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00267}; Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Ubl conjugation {ECO:0000256|ARBA:ARBA00022843}.
FT DOMAIN 104..172
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 104..172
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..60
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 156..196
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 208..263
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 325..377
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 440..473
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 18..50
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 156..186
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 325..340
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 351..377
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 473 AA; 51390 MW; EB416A4F536DB4BB CRC64;
MNLLDPYLKM TEEQDKCLSD APSPSMSEDS AGSPCPSGSG SDTENTRPSE NGLLGLDGEF
KKDEDDKFPA CIREAVSQVL KGYDWTLVPM PVRVNGSSKN KPHVKRPMNA FMVWAQAARR
KLADQYPHLH NAELSKTLGK LWRLLNEGEK RPFVEEAERL RVQHKKDHPD YKYQPRRRKS
VKNGQSESED GSEQTHISPN AIFKALQQAD SPASSMGEVH SPGEHSGSQG PPTPPTTPKT
DVSSGKMDLK REGGIRSLSD GTGGRQLNID FRDVDIGELS SDVISHIETF DVNEFDQYLP
PNGHPGLAGA AGGSAPAGAA AAWLAKSQNQ QGQQQHTLTP LGGGGAEHRT QIKTEQLSPS
HYTEQQGSPQ HVAYNSPFNL QHYSPPSSAY PAAISRAQQY DYSDHQGGGA AGYYSHAGAG
QGSGLYSTFS YMSSPSQRPM YTPIADTTGV PSIPQSSPQH WEQAPVYTQL TRP
//