ID A0A3P8NZ33_ASTCA Unreviewed; 581 AA.
AC A0A3P8NZ33;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Single-minded homolog 1-A-like {ECO:0000313|Ensembl:ENSACLP00000010032.1};
OS Astatotilapia calliptera (Eastern happy) (Chromis callipterus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Astatotilapia.
OX NCBI_TaxID=8154 {ECO:0000313|Ensembl:ENSACLP00000010032.1, ECO:0000313|Proteomes:UP000265100};
RN [1] {ECO:0000313|Ensembl:ENSACLP00000010032.1, ECO:0000313|Proteomes:UP000265100}
RP NUCLEOTIDE SEQUENCE.
RA Datahose.;
RL Submitted (MAY-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSACLP00000010032.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P8NZ33; -.
DR Ensembl; ENSACLT00000010270.1; ENSACLP00000010032.1; ENSACLG00000006784.1.
DR GeneTree; ENSGT00940000156143; -.
DR Proteomes; UP000265100; Chromosome 11.
DR Bgee; ENSACLG00000006784; Expressed in ovary.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0030154; P:cell differentiation; IEA:UniProtKB-KW.
DR GO; GO:0007399; P:nervous system development; IEA:UniProtKB-KW.
DR CDD; cd19738; bHLH-PAS_SIM1; 1.
DR CDD; cd00130; PAS; 2.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR Gene3D; 3.30.450.20; PAS domain; 2.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013767; PAS_fold.
DR InterPro; IPR013655; PAS_fold_3.
DR InterPro; IPR010578; SIM_C.
DR PANTHER; PTHR23043; HYPOXIA-INDUCIBLE FACTOR 1 ALPHA; 1.
DR PANTHER; PTHR23043:SF36; SIM BHLH TRANSCRIPTION FACTOR 1B-RELATED; 1.
DR Pfam; PF00010; HLH; 1.
DR Pfam; PF00989; PAS; 1.
DR Pfam; PF08447; PAS_3; 1.
DR Pfam; PF06621; SIM_C; 2.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 2.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS50112; PAS; 2.
DR PROSITE; PS51302; SIM_C; 1.
PE 4: Predicted;
KW Developmental protein {ECO:0000256|ARBA:ARBA00022473};
KW Differentiation {ECO:0000256|ARBA:ARBA00022782};
KW Neurogenesis {ECO:0000256|ARBA:ARBA00022902};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..53
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT DOMAIN 80..143
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 236..276
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 339..419
FT /note="Single-minded C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51302"
FT REGION 395..433
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..418
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 581 AA; 64914 MW; 3E00D495329DE6F7 CRC64;
MKEKSKNAAR TRREKENSEF YELAKLLPLP SAITSQLDKA SIIRLTTSYL KMRVVFPEGL
GESWGHVSRS SSLDGLSQEL GSHLLQVLTT LDGFIFVVAP DGKIMYISET ASVHLGLSQV
ELTGNSIYEY IHPADHDEMI AVLTAHQPYH SHFVQEYEME RSFFLRMKCV LAKRNAGLTC
GGYKVIHCSG YLKIRQYSLD MSPFDGCYQN IGLVAVGHSL PPSAITEIKL HSNMFMFRAS
LDMKLIFLDS RVAELTGYEP QDLIEKTLYH HVHSCDCFHL RCAHHLLLVK GQVTTKYYRF
LAKHGGWVWV QSYATIVHNS RSSRPHCIVS VNYVLTETEY KGLQLSLDQA TSKASFPCTS
SLTDNCRTAK SRASRLWIIP FFLLSPSQYS SFQIERSESG QDSPWGSSPL TDSASPRLQF
SDPRAPPPGR ETWWDTARSI IPLSKSSLES YEDYDGSVPA PQSKLNSPGS ECLHKPKDYL
QTELPPLSLQ LHPFGRAGAC SGSPTPAPAL YPSHTHPRPY LDKHPAYSLA GYTLEHLYDP
ESLRGYCTST STGSTPYDHF RIPAEQTTGR KGTSVIITNG S
//