ID G1RUD0_NOMLE Unreviewed; 766 AA.
AC G1RUD0;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 24-JAN-2024, entry version 85.
DE SubName: Full=SIM bHLH transcription factor 1 {ECO:0000313|Ensembl:ENSNLEP00000016855.1};
GN Name=SIM1 {ECO:0000313|Ensembl:ENSNLEP00000016855.1};
OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC Nomascus.
OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000016855.1, ECO:0000313|Proteomes:UP000001073};
RN [1] {ECO:0000313|Ensembl:ENSNLEP00000016855.1, ECO:0000313|Proteomes:UP000001073}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Gibbon Genome Sequencing Consortium;
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSNLEP00000016855.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- FUNCTION: Transcriptional factor that may have pleiotropic effects
CC during embryogenesis and in the adult. {ECO:0000256|ARBA:ARBA00037499}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADFV01180964; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01180965; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01180966; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_003258422.1; XM_003258374.3.
DR AlphaFoldDB; G1RUD0; -.
DR Ensembl; ENSNLET00000017708.3; ENSNLEP00000016855.1; ENSNLEG00000013870.3.
DR GeneID; 100607894; -.
DR KEGG; nle:100607894; -.
DR CTD; 6492; -.
DR eggNOG; KOG3559; Eukaryota.
DR GeneTree; ENSGT00940000156143; -.
DR HOGENOM; CLU_010044_4_0_1; -.
DR OrthoDB; 5396877at2759; -.
DR TreeFam; TF317772; -.
DR Proteomes; UP000001073; Chromosome 3.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR CDD; cd19738; bHLH-PAS_SIM1; 1.
DR CDD; cd00130; PAS; 2.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR Gene3D; 3.30.450.20; PAS domain; 2.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013767; PAS_fold.
DR InterPro; IPR013655; PAS_fold_3.
DR InterPro; IPR010578; SIM_C.
DR PANTHER; PTHR23043; HYPOXIA-INDUCIBLE FACTOR 1 ALPHA; 1.
DR PANTHER; PTHR23043:SF22; SINGLE-MINDED HOMOLOG 1; 1.
DR Pfam; PF00010; HLH; 1.
DR Pfam; PF00989; PAS; 1.
DR Pfam; PF08447; PAS_3; 1.
DR Pfam; PF06621; SIM_C; 1.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 2.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS50112; PAS; 2.
DR PROSITE; PS51302; SIM_C; 1.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW Neurogenesis {ECO:0000256|ARBA:ARBA00022902};
KW Reference proteome {ECO:0000313|Proteomes:UP000001073};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..53
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT DOMAIN 77..140
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 233..270
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 336..766
FT /note="Single-minded C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51302"
FT REGION 353..431
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 528..563
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 642..662
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 353..394
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 536..563
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 646..662
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 766 AA; 85482 MW; 11C88D429ED44C4F CRC64;
MKEKSKNAAR TRREKENSEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL KMRVVFPEGL
GEAWGHSSRT SPLDNVGREL GSHLLQTLDG FIFVVAPDGK IMYISETASV HLGLSQVELT
GNSIYEYIHP ADHDEMTAVL TAHQPYHSHF VQEYEIERSF FLRMKCVLAK RNAGLTCGGY
KVIHCSGYLK IRQYSLDMSP FDGCYQNVGL VAVGHSLPPS AVTEIKLHSN MFMFRASLDM
KLIFLDSRVA ELTGYEPQDL IEKTLYHHVH GCDTFHLRCA HHLLLVKGQV TTKYYRFLAK
HGGWVWVQSY ATIVHNSRSS RPHCIVSVNY VLTDTEYKGL QLSLDQISAS KPAFSYTSSS
TPTMTDNRKG AKSRLSSSKS KSRTSPYPQY SGFHTERSES DHDSQWGGSP LTDTASPQLL
DPADRPGSQH DASCAYRQFS DRSSLCYGFA LDHSRLVEER HFHTQACEGG RCEAGRYFLG
TPQAGREPWW GSRAALPLTK ASPESREAYE NSMPHIASVH RIHGRGHWDE DSVVSSPDPG
SASESGDRYR TEQYQSSPHE PSKIETLIRA TQQMIKEEEN RLQLRKAPSD QLASINGAGK
KHSLCFANYQ QPPPTGEVCH GSALANTSPC DHIQQREGKV LSPHENDYDN SPTALSRISS
PNSDRISKSS LILAKDYLHS DISPHQTAGD HPTVSPNCFG SHRQYFDKHA YTLTGYALEH
LYDSETIRNY SLGCNGSHFD VTSHLRMQPD PAQGHKGTSV IITNGS
//