ID A0A096NJ48_PAPAN Unreviewed; 667 AA.
AC A0A096NJ48;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 2.
DT 24-JAN-2024, entry version 58.
DE SubName: Full=SIM bHLH transcription factor 2 {ECO:0000313|Ensembl:ENSPANP00000012997.2};
GN Name=SIM2 {ECO:0000313|Ensembl:ENSPANP00000012997.2};
OS Papio anubis (Olive baboon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Papio.
OX NCBI_TaxID=9555 {ECO:0000313|Ensembl:ENSPANP00000012997.2, ECO:0000313|Proteomes:UP000028761};
RN [1] {ECO:0000313|Ensembl:ENSPANP00000012997.2, ECO:0000313|Proteomes:UP000028761}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Liu Y.L., Abraham K.A., Akbar H.A., Ali S.A., Anosike U.A., Aqrawi P.A.,
RA Arias F.A., Attaway T.A., Awwad R.A., Babu C.B., Bandaranaike D.B.,
RA Battles P.B., Bell A.B., Beltran B.B., Berhane-Mersha D.B., Bess C.B.,
RA Bickham C.B., Bolden T.B., Carter K.C., Chau D.C., Chavez A.C.,
RA Clerc-Blankenburg K.C., Coyle M.C., Dao M.D., Davila M.L.D.,
RA Davy-Carroll L.D., Denson S.D., Dinh H.D., Fernandez S.F., Fernando P.F.,
RA Forbes L.F., Francis C.F., Francisco L.F., Fu Q.F., Garcia-Iii R.G.,
RA Garrett T.G., Gross S.G., Gubbala S.G., Hirani K.H., Hogues M.H.,
RA Hollins B.H., Jackson L.J., Javaid M.J., Jhangiani S.J., Johnson A.J.,
RA Johnson B.J., Jones J.J., Joshi V.J., Kalu J.K., Khan N.K., Korchina V.K.,
RA Kovar C.K., Lago L.L., Lara F.L., Le T.-K.L., Lee S.L., Legall-Iii F.L.,
RA Lemon S.L., Liu J.L., Liu Y.-S.L., Liyanage D.L., Lopez J.L.,
RA Lorensuhewa L.L., Mata R.M., Mathew T.M., Mercado C.M., Mercado I.M.,
RA Morales K.M., Morgan M.M., Munidasa M.M., Ngo D.N., Nguyen L.N.,
RA Nguyen T.N., Nguyen N.N., Obregon M.O., Okwuonu G.O., Ongeri F.O.,
RA Onwere C.O., Osifeso I.O., Parra A.P., Patil S.P., Perez A.P., Perez Y.P.,
RA Pham C.P., Pu L.-L.P., Puazo M.P., Quiroz J.Q., Rouhana J.R., Ruiz M.R.,
RA Ruiz S.-J.R., Saada N.S., Santibanez J.S., Scheel M.S., Schneider B.S.,
RA Simmons D.S., Sisson I.S., Tang L.-Y.T., Thornton R.T., Tisius J.T.,
RA Toledanes G.T., Trejos Z.T., Usmani K.U., Varghese R.V., Vattathil S.V.,
RA Vee V.V., Walker D.W., Weissenberger G.W., White C.W., Williams A.W.,
RA Woodworth J.W., Wright R.W., Zhu Y.Z., Han Y.H., Newsham I.N.,
RA Nazareth L.N., Worley K.W., Muzny D.M., Rogers J.R., Gibbs R.G.;
RT "Whole Genome Assembly of Papio anubis.";
RL Submitted (MAR-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPANP00000012997.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_003895560.1; XM_003895511.3.
DR RefSeq; XP_009200528.1; XM_009202264.2.
DR AlphaFoldDB; A0A096NJ48; -.
DR STRING; 9555.ENSPANP00000012997; -.
DR Ensembl; ENSPANT00000029081.3; ENSPANP00000012997.2; ENSPANG00000020491.3.
DR GeneID; 101013404; -.
DR KEGG; panu:101013404; -.
DR CTD; 6493; -.
DR eggNOG; KOG3559; Eukaryota.
DR GeneTree; ENSGT00940000159985; -.
DR HOGENOM; CLU_010044_4_1_1; -.
DR OMA; SECQWHY; -.
DR OrthoDB; 5396877at2759; -.
DR Proteomes; UP000028761; Chromosome 4.
DR Bgee; ENSPANG00000020491; Expressed in esophagus and 6 other cell types or tissues.
DR GO; GO:0016604; C:nuclear body; IEA:Ensembl.
DR GO; GO:0003677; F:DNA binding; IEA:Ensembl.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046982; F:protein heterodimerization activity; IEA:Ensembl.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0009880; P:embryonic pattern specification; IEA:Ensembl.
DR GO; GO:0030324; P:lung development; IEA:Ensembl.
DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IEA:Ensembl.
DR GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR CDD; cd00130; PAS; 2.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR Gene3D; 3.30.450.20; PAS domain; 2.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013767; PAS_fold.
DR InterPro; IPR013655; PAS_fold_3.
DR InterPro; IPR010578; SIM_C.
DR PANTHER; PTHR23043; HYPOXIA-INDUCIBLE FACTOR 1 ALPHA; 1.
DR PANTHER; PTHR23043:SF19; SINGLE-MINDED HOMOLOG 2; 1.
DR Pfam; PF00010; HLH; 1.
DR Pfam; PF00989; PAS; 1.
DR Pfam; PF08447; PAS_3; 1.
DR Pfam; PF06621; SIM_C; 1.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 2.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS50112; PAS; 2.
DR PROSITE; PS51302; SIM_C; 1.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW Neurogenesis {ECO:0000256|ARBA:ARBA00022902};
KW Reference proteome {ECO:0000313|Proteomes:UP000028761};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..53
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT DOMAIN 77..149
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 233..288
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 336..667
FT /note="Single-minded C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51302"
FT REGION 500..520
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 535..563
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 616..651
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 667 AA; 73424 MW; F7C9CC7E3E8FFD4B CRC64;
MKEKSKNAAK TRREKENGEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL KMRAVFPEGL
GDAWGQPSRA GPLDSVAKEL GSHLLQTLDG FVFVVASDGK IMYISETASV HLGLSQVELT
GNSIYEYIHP SDHDEMTAVL TAHQPLHHHL LQEYEIERSF FLRMKCVLAK RNAGLTCSGY
KVIHCSGYLK IRQYMLDMSL YDSCYQIVGL VAVGQSLPPS AITEIKLYSN MFMFRASLDL
KLIFLDSRVT EVTGYEPQDL IEKTLYHHVH GCDVFHLRYA HHLLLVKGQV TTKYYRLLSK
RGGWVWVQSY ATVVHNSRSS RPHCIVSVNY VLTEIEYKEL QLSLEQVSTA KSQDSWRTAL
STSQETRKLV KPKNTKMKTK LRTNPYPPQQ YSSFQMDKLE CGQLGNWRAS PPASAPVPPE
PQLHSESSDL LYTPSYSLPF SYHYGHFPLD SHVFSSKKPM LPAKFGQPQG SPCEVARFFL
STLPASGECQ WHYANPLVPS SSSTAKHPPE PPANTARHGL VPSYEAPAAA VRRFGEDTAP
PSFPSCGHYR EEPALGPPKA ARQAARDGVR LALARATPEC CAPPAPEPPG APPQLPFVLL
NYHRVLTRRG PLGGAAPAAS GLACAPGGPE AATSALRPRH PSPAAASPAG TPLPHYLGAS
VIITNGR
//