GenomeNet

Database: UniProt
Entry: G1RUD0_NOMLE
ID   G1RUD0_NOMLE            Unreviewed;       766 AA.
AC   G1RUD0;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   19-OCT-2011, sequence version 1.
DT   24-JAN-2024, entry version 85.
DE   SubName: Full=SIM bHLH transcription factor 1 {ECO:0000313|Ensembl:ENSNLEP00000016855.1};
GN   Name=SIM1 {ECO:0000313|Ensembl:ENSNLEP00000016855.1};
OS   Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC   Nomascus.
OX   NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000016855.1, ECO:0000313|Proteomes:UP000001073};
RN   [1] {ECO:0000313|Ensembl:ENSNLEP00000016855.1, ECO:0000313|Proteomes:UP000001073}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Gibbon Genome Sequencing Consortium;
RL   Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSNLEP00000016855.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (JUL-2023) to UniProtKB.
CC   -!- FUNCTION: Transcriptional factor that may have pleiotropic effects
CC       during embryogenesis and in the adult. {ECO:0000256|ARBA:ARBA00037499}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADFV01180964; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01180965; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ADFV01180966; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   RefSeq; XP_003258422.1; XM_003258374.3.
DR   AlphaFoldDB; G1RUD0; -.
DR   Ensembl; ENSNLET00000017708.3; ENSNLEP00000016855.1; ENSNLEG00000013870.3.
DR   GeneID; 100607894; -.
DR   KEGG; nle:100607894; -.
DR   CTD; 6492; -.
DR   eggNOG; KOG3559; Eukaryota.
DR   GeneTree; ENSGT00940000156143; -.
DR   HOGENOM; CLU_010044_4_0_1; -.
DR   OrthoDB; 5396877at2759; -.
DR   TreeFam; TF317772; -.
DR   Proteomes; UP000001073; Chromosome 3.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR   GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR   GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR   CDD; cd19738; bHLH-PAS_SIM1; 1.
DR   CDD; cd00130; PAS; 2.
DR   Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR   Gene3D; 3.30.450.20; PAS domain; 2.
DR   InterPro; IPR011598; bHLH_dom.
DR   InterPro; IPR036638; HLH_DNA-bd_sf.
DR   InterPro; IPR001610; PAC.
DR   InterPro; IPR000014; PAS.
DR   InterPro; IPR035965; PAS-like_dom_sf.
DR   InterPro; IPR013767; PAS_fold.
DR   InterPro; IPR013655; PAS_fold_3.
DR   InterPro; IPR010578; SIM_C.
DR   PANTHER; PTHR23043; HYPOXIA-INDUCIBLE FACTOR 1 ALPHA; 1.
DR   PANTHER; PTHR23043:SF22; SINGLE-MINDED HOMOLOG 1; 1.
DR   Pfam; PF00010; HLH; 1.
DR   Pfam; PF00989; PAS; 1.
DR   Pfam; PF08447; PAS_3; 1.
DR   Pfam; PF06621; SIM_C; 1.
DR   SMART; SM00353; HLH; 1.
DR   SMART; SM00086; PAC; 1.
DR   SMART; SM00091; PAS; 2.
DR   SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR   SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR   PROSITE; PS50888; BHLH; 1.
DR   PROSITE; PS50112; PAS; 2.
DR   PROSITE; PS51302; SIM_C; 1.
PE   4: Predicted;
KW   Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW   Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW   Neurogenesis {ECO:0000256|ARBA:ARBA00022902};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001073};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          1..53
FT                   /note="BHLH"
FT                   /evidence="ECO:0000259|PROSITE:PS50888"
FT   DOMAIN          77..140
FT                   /note="PAS"
FT                   /evidence="ECO:0000259|PROSITE:PS50112"
FT   DOMAIN          233..270
FT                   /note="PAS"
FT                   /evidence="ECO:0000259|PROSITE:PS50112"
FT   DOMAIN          336..766
FT                   /note="Single-minded C-terminal"
FT                   /evidence="ECO:0000259|PROSITE:PS51302"
FT   REGION          353..431
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          528..563
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          642..662
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        353..394
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        536..563
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        646..662
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   766 AA;  85482 MW;  11C88D429ED44C4F CRC64;
     MKEKSKNAAR TRREKENSEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL KMRVVFPEGL
     GEAWGHSSRT SPLDNVGREL GSHLLQTLDG FIFVVAPDGK IMYISETASV HLGLSQVELT
     GNSIYEYIHP ADHDEMTAVL TAHQPYHSHF VQEYEIERSF FLRMKCVLAK RNAGLTCGGY
     KVIHCSGYLK IRQYSLDMSP FDGCYQNVGL VAVGHSLPPS AVTEIKLHSN MFMFRASLDM
     KLIFLDSRVA ELTGYEPQDL IEKTLYHHVH GCDTFHLRCA HHLLLVKGQV TTKYYRFLAK
     HGGWVWVQSY ATIVHNSRSS RPHCIVSVNY VLTDTEYKGL QLSLDQISAS KPAFSYTSSS
     TPTMTDNRKG AKSRLSSSKS KSRTSPYPQY SGFHTERSES DHDSQWGGSP LTDTASPQLL
     DPADRPGSQH DASCAYRQFS DRSSLCYGFA LDHSRLVEER HFHTQACEGG RCEAGRYFLG
     TPQAGREPWW GSRAALPLTK ASPESREAYE NSMPHIASVH RIHGRGHWDE DSVVSSPDPG
     SASESGDRYR TEQYQSSPHE PSKIETLIRA TQQMIKEEEN RLQLRKAPSD QLASINGAGK
     KHSLCFANYQ QPPPTGEVCH GSALANTSPC DHIQQREGKV LSPHENDYDN SPTALSRISS
     PNSDRISKSS LILAKDYLHS DISPHQTAGD HPTVSPNCFG SHRQYFDKHA YTLTGYALEH
     LYDSETIRNY SLGCNGSHFD VTSHLRMQPD PAQGHKGTSV IITNGS
//
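
The record above follows the UniProtKB flat-file layout: a two-letter line code, three spaces, then the content, with the sequence block after the SQ line and `//` as the terminator. As a rough illustration only, the sketch below groups lines by code and checks the sequence length against the value declared on the SQ line. The function names `parse_flat_record` and `declared_length` are hypothetical helpers introduced here, not part of any UniProt or GenomeNet tooling, and the parser assumes a single record per input.

```python
from collections import defaultdict


def parse_flat_record(text):
    """Group flat-file lines by their two-letter code and collect the sequence.

    Assumption: `text` holds exactly one record terminated by '//'.
    Lines that do not match the 'XX   content' shape (page headers, blanks)
    are skipped.
    """
    fields = defaultdict(list)
    seq_parts = []
    in_sequence = False
    for line in text.splitlines():
        if line.startswith("//"):
            break  # end of record
        if not line.strip():
            continue
        if in_sequence:
            # Sequence lines are indented and split into blocks of 10 residues.
            seq_parts.append(line.replace(" ", ""))
            continue
        is_coded = len(line) >= 5 and line[:2].isupper() and line[2:5] == "   "
        if not is_coded:
            continue  # not a flat-file line, e.g. surrounding page text
        code, rest = line[:2], line[5:].rstrip()
        fields[code].append(rest)
        if code == "SQ":
            in_sequence = True
    return dict(fields), "".join(seq_parts)


def declared_length(fields):
    """Read the residue count from an SQ header such as 'SEQUENCE   766 AA; ...'."""
    return int(fields["SQ"][0].split()[1])
```

Applied to the entry above, `declared_length` would be expected to return the 766 stated on the SQ line, and comparing it with the length of the parsed sequence is a quick check that the record was not truncated in transit.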