ID G3Q9R4_GASAC Unreviewed; 757 AA.
AC G3Q9R4;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 27-MAR-2024, entry version 79.
DE RecName: Full=SIM bHLH transcription factor 2 {ECO:0008006|Google:ProtNLM};
OS Gasterosteus aculeatus (Three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000026630.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000026630.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSGACP00000026630.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3Q9R4; -.
DR STRING; 69293.ENSGACP00000026630; -.
DR Ensembl; ENSGACT00000026682.1; ENSGACP00000026630.1; ENSGACG00000020158.1.
DR eggNOG; KOG3559; Eukaryota.
DR GeneTree; ENSGT00940000159985; -.
DR InParanoid; G3Q9R4; -.
DR OMA; ETPYSHI; -.
DR TreeFam; TF317772; -.
DR Proteomes; UP000007635; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR CDD; cd00130; PAS; 2.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR Gene3D; 3.30.450.20; PAS domain; 2.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013767; PAS_fold.
DR InterPro; IPR013655; PAS_fold_3.
DR InterPro; IPR010578; SIM_C.
DR NCBIfam; TIGR00229; sensory_box; 1.
DR PANTHER; PTHR23043; HYPOXIA-INDUCIBLE FACTOR 1 ALPHA; 1.
DR PANTHER; PTHR23043:SF36; SIM BHLH TRANSCRIPTION FACTOR 1B-RELATED; 1.
DR Pfam; PF00989; PAS; 1.
DR Pfam; PF08447; PAS_3; 1.
DR Pfam; PF06621; SIM_C; 1.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 2.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS50112; PAS; 2.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW Neurogenesis {ECO:0000256|ARBA:ARBA00022902};
KW Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..53
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT DOMAIN 78..141
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 234..289
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT REGION 712..757
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 721..737
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 757 AA; 84353 MW; B6926B4B5A6055B9 CRC64;
MKEKCKNAAK TRREKENGEF YELAKMLPLP AAITSQLDKA SIIRLTSSYL KMRTVFPDVG
LGDGWGRSTR LSPLDSMAKE LGSHLLQTLD GFVFVVASDG KIMYISETAS VHLGLSQVEL
TGNSIFEYIH PSDHDEMTAV LGLCQPPHHH FSLEYEIERS FFLRMKCVLA KRNAGLTCGG
YKVIHCSGYL KVRQYMMDMA LYESSYQTVG LVAVGHSLPP SGITEIKLHS NMFMFRASLD
LKLIFLDTRV AELTGYEPQD LIEKTLYHHV HTCDVFHLRY AHHLLLVKGQ VTTKYYRMLS
KHGGWVWVQS YATIVHNSRS SRPHCIVSVN YVLTDVECKE LQLSDDQSRA TKPGLSLASV
QDHRKQLKTK AVKMKTKLRA VHYPETVVPN LKDRLNCSPQ RGLWKERSEY PLTSCSERSS
SLSPETSRLS CSPTYNLTLH YPYNHRHLDA QNHLPGSGLS PQPAPQIIGS LHSRGSVGWN
ISCPAAPNKY TGVQFAPPAA HPLSRQSPEL AADRRHPDEI YNDCSSPTNP HSTGSLKEEP
YEHYTFFHGE LPEVCPHPRP QSAKDIKRLG QQVRHNTRGE GPQGDRGLAC PLIRDLVPGK
AEQGGSLHGA TGQPLPVQMA LEQRRRLCMM EAPYSHPASA YPQPANSLQQ HGATAELQPR
CAEQDRRMLI GMGVGVGLPT QNPYVSLKFH KVLGKHGSIK APAFTTLSHV TDGHSYHGKE
KASYSCNSQS PSSDSSAESH REIPHYIGTS VIITNER
//