GenomeNet

Database: UniProt
Entry: G3W683_SARHA
LinkDB: G3W683_SARHA
Original site: G3W683_SARHA 
ID   G3W683_SARHA            Unreviewed;       656 AA.
AC   G3W683;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 2.
DT   27-MAR-2024, entry version 71.
DE   SubName: Full=SIM bHLH transcription factor 2 {ECO:0000313|Ensembl:ENSSHAP00000010938.2};
GN   Name=SIM2 {ECO:0000313|Ensembl:ENSSHAP00000010938.2};
OS   Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX   NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000010938.2, ECO:0000313|Proteomes:UP000007648};
RN   [1] {ECO:0000313|Ensembl:ENSSHAP00000010938.2, ECO:0000313|Proteomes:UP000007648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA   Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA   Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA   Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA   Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA   Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA   Jones M.E., Schuster S.C.;
RT   "Genetic diversity and population structure of the endangered marsupial
RT   Sarcophilus harrisii (Tasmanian devil).";
RL   Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN   [2] {ECO:0000313|Ensembl:ENSSHAP00000010938.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; G3W683; -.
DR   Ensembl; ENSSHAT00000011032.2; ENSSHAP00000010938.2; ENSSHAG00000009425.2.
DR   GeneTree; ENSGT00940000159985; -.
DR   HOGENOM; CLU_010044_4_0_1; -.
DR   Proteomes; UP000007648; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR   GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR   GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR   CDD; cd19739; bHLH-PAS_SIM2; 1.
DR   CDD; cd00130; PAS; 2.
DR   Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR   Gene3D; 3.30.450.20; PAS domain; 2.
DR   InterPro; IPR011598; bHLH_dom.
DR   InterPro; IPR036638; HLH_DNA-bd_sf.
DR   InterPro; IPR001610; PAC.
DR   InterPro; IPR000014; PAS.
DR   InterPro; IPR035965; PAS-like_dom_sf.
DR   InterPro; IPR013767; PAS_fold.
DR   InterPro; IPR013655; PAS_fold_3.
DR   InterPro; IPR010578; SIM_C.
DR   PANTHER; PTHR23043; HYPOXIA-INDUCIBLE FACTOR 1 ALPHA; 1.
DR   PANTHER; PTHR23043:SF19; SINGLE-MINDED HOMOLOG 2; 1.
DR   Pfam; PF00010; HLH; 1.
DR   Pfam; PF00989; PAS; 1.
DR   Pfam; PF08447; PAS_3; 1.
DR   Pfam; PF06621; SIM_C; 1.
DR   SMART; SM00353; HLH; 1.
DR   SMART; SM00086; PAC; 1.
DR   SMART; SM00091; PAS; 2.
DR   SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR   SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR   PROSITE; PS50888; BHLH; 1.
DR   PROSITE; PS50112; PAS; 2.
DR   PROSITE; PS51302; SIM_C; 1.
PE   4: Predicted;
KW   Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW   Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW   Neurogenesis {ECO:0000256|ARBA:ARBA00022902};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          1..53
FT                   /note="BHLH"
FT                   /evidence="ECO:0000259|PROSITE:PS50888"
FT   DOMAIN          77..149
FT                   /note="PAS"
FT                   /evidence="ECO:0000259|PROSITE:PS50112"
FT   DOMAIN          233..288
FT                   /note="PAS"
FT                   /evidence="ECO:0000259|PROSITE:PS50112"
FT   DOMAIN          336..527
FT                   /note="Single-minded C-terminal"
FT                   /evidence="ECO:0000259|PROSITE:PS51302"
SQ   SEQUENCE   656 AA;  74235 MW;  FE790B627015A507 CRC64;
     MKEKSKNAAK TRREKENGEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL KMRAVFPEGL
     GDAWGQTSQI GPLDNVAKEL GSHLLQTLDG FVFVVASDGK IMYISETASV HLGLSQVELT
     GNSIYEYIHP SDHDEMTAVL AAHQPLHHHL LQEYEIERSF FLRMKCVLAK RNAGLTCSGY
     KVIHCSGYLK IRQYMLDMSL YDSCYQIVGL VAVGQSLPPS AITEIKLHSN MFMFRASLDL
     KLIFLDSRVT ELTGYEPQDL IEKTLYHHVH GCDMFYLRYA HHLLLVKGQV TTKYYRLLSK
     HGGWVWVQSY ATIVHNSRSS RPHCIVSVNY VLTDIEYKEI QLSLDQVTSS KSQYSCRTSM
     STSQEPRKIA KYKSNKMKTK VRTNPYLPQQ YDSFQMDKSE CGQVGNWRAS SATNPTNTQE
     QNFYSENSEL LYAPSYSLPF SYHYGPFPID SHVFRSKRQM LSSKFGQSQG SPCEVAHFFL
     STLQTSGECQ WHYTNPLVPN SQSPAKNHPD QPMNITRHNL APNYEVPISA QRYNEDIISD
     NFASCAAAIP NSQQKDVYDS SILKSNKSEH TMEMQPTGQV PFVLLNYHHL LSKHGTFQPS
     SCAATRHVAE NYGSSSEEVN TFISKNPSLS SISPPETHRE TELCHYIGTS VIINSR
//
DBGET integrated database retrieval system