ID G3W683_SARHA Unreviewed; 656 AA.
AC G3W683;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 07-APR-2021, sequence version 2.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=SIM bHLH transcription factor 2 {ECO:0000313|Ensembl:ENSSHAP00000010938.2};
GN Name=SIM2 {ECO:0000313|Ensembl:ENSSHAP00000010938.2};
OS Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000010938.2, ECO:0000313|Proteomes:UP000007648};
RN [1] {ECO:0000313|Ensembl:ENSSHAP00000010938.2, ECO:0000313|Proteomes:UP000007648}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA Jones M.E., Schuster S.C.;
RT "Genetic diversity and population structure of the endangered marsupial
RT Sarcophilus harrisii (Tasmanian devil).";
RL Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN [2] {ECO:0000313|Ensembl:ENSSHAP00000010938.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; G3W683; -.
DR Ensembl; ENSSHAT00000011032.2; ENSSHAP00000010938.2; ENSSHAG00000009425.2.
DR GeneTree; ENSGT00940000159985; -.
DR HOGENOM; CLU_010044_4_0_1; -.
DR Proteomes; UP000007648; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR CDD; cd19739; bHLH-PAS_SIM2; 1.
DR CDD; cd00130; PAS; 2.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR Gene3D; 3.30.450.20; PAS domain; 2.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013767; PAS_fold.
DR InterPro; IPR013655; PAS_fold_3.
DR InterPro; IPR010578; SIM_C.
DR PANTHER; PTHR23043; HYPOXIA-INDUCIBLE FACTOR 1 ALPHA; 1.
DR PANTHER; PTHR23043:SF19; SINGLE-MINDED HOMOLOG 2; 1.
DR Pfam; PF00010; HLH; 1.
DR Pfam; PF00989; PAS; 1.
DR Pfam; PF08447; PAS_3; 1.
DR Pfam; PF06621; SIM_C; 1.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 2.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS50112; PAS; 2.
DR PROSITE; PS51302; SIM_C; 1.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW Neurogenesis {ECO:0000256|ARBA:ARBA00022902};
KW Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..53
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT DOMAIN 77..149
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 233..288
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 336..527
FT /note="Single-minded C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51302"
SQ SEQUENCE 656 AA; 74235 MW; FE790B627015A507 CRC64;
MKEKSKNAAK TRREKENGEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL KMRAVFPEGL
GDAWGQTSQI GPLDNVAKEL GSHLLQTLDG FVFVVASDGK IMYISETASV HLGLSQVELT
GNSIYEYIHP SDHDEMTAVL AAHQPLHHHL LQEYEIERSF FLRMKCVLAK RNAGLTCSGY
KVIHCSGYLK IRQYMLDMSL YDSCYQIVGL VAVGQSLPPS AITEIKLHSN MFMFRASLDL
KLIFLDSRVT ELTGYEPQDL IEKTLYHHVH GCDMFYLRYA HHLLLVKGQV TTKYYRLLSK
HGGWVWVQSY ATIVHNSRSS RPHCIVSVNY VLTDIEYKEI QLSLDQVTSS KSQYSCRTSM
STSQEPRKIA KYKSNKMKTK VRTNPYLPQQ YDSFQMDKSE CGQVGNWRAS SATNPTNTQE
QNFYSENSEL LYAPSYSLPF SYHYGPFPID SHVFRSKRQM LSSKFGQSQG SPCEVAHFFL
STLQTSGECQ WHYTNPLVPN SQSPAKNHPD QPMNITRHNL APNYEVPISA QRYNEDIISD
NFASCAAAIP NSQQKDVYDS SILKSNKSEH TMEMQPTGQV PFVLLNYHHL LSKHGTFQPS
SCAATRHVAE NYGSSSEEVN TFISKNPSLS SISPPETHRE TELCHYIGTS VIINSR
//