ID A8WHS5_CAEEL Unreviewed; 580 AA.
AC A8WHS5;
DT 15-JAN-2008, integrated into UniProtKB/TrEMBL.
DT 15-JAN-2008, sequence version 1.
DT 27-MAR-2024, entry version 113.
DE SubName: Full=Suppressor of activated egl-4 protein 1 {ECO:0000313|EMBL:CAP19336.1};
GN Name=saeg-1 {ECO:0000313|EMBL:CAP19336.1,
GN ECO:0000313|WormBase:F53H10.2c};
GN ORFNames=CELE_F53H10.2 {ECO:0000313|EMBL:CAP19336.1}, F53H10.2
GN {ECO:0000313|WormBase:F53H10.2c};
OS Caenorhabditis elegans.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
OC Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
OC Caenorhabditis.
OX NCBI_TaxID=6239 {ECO:0000313|EMBL:CAP19336.1, ECO:0000313|Proteomes:UP000001940};
RN [1] {ECO:0000313|EMBL:CAP19336.1, ECO:0000313|Proteomes:UP000001940}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Bristol N2 {ECO:0000313|EMBL:CAP19336.1,
RC ECO:0000313|Proteomes:UP000001940};
RX PubMed=9851916; DOI=10.1126/science.282.5396.2012;
RG The C. elegans sequencing consortium;
RA Sulston J.E., Waterston R.;
RT "Genome sequence of the nematode C. elegans: a platform for investigating
RT biology.";
RL Science 282:2012-2018(1998).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BX284605; CAP19336.1; -; Genomic_DNA.
DR RefSeq; NP_001122953.1; NM_001129481.2.
DR AlphaFoldDB; A8WHS5; -.
DR EPD; A8WHS5; -.
DR EnsemblMetazoa; F53H10.2c.1; F53H10.2c.1; WBGene00010012.
DR GeneID; 179505; -.
DR UCSC; F53H10.2a; c. elegans.
DR AGR; WB:WBGene00010012; -.
DR WormBase; F53H10.2c; CE41815; WBGene00010012; saeg-1.
DR HOGENOM; CLU_013904_0_0_1; -.
DR OrthoDB; 2906194at2759; -.
DR Proteomes; UP000001940; Chromosome V.
DR Bgee; WBGene00010012; Expressed in larva and 4 other cell types or tissues.
DR ExpressionAtlas; A8WHS5; baseline and differential.
DR GO; GO:0005634; C:nucleus; IDA:WormBase.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR16089; REST COREPRESSOR COREST PROTEIN-RELATED; 1.
DR PANTHER; PTHR16089:SF40; SUPPRESSOR OF ACTIVATED EGL-4 PROTEIN 1; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM01189; ELM2; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS51293; SANT; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 1.
PE 1: Evidence at protein level;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Proteomics identification {ECO:0007829|EPD:A8WHS5,
KW ECO:0007829|PeptideAtlas:A8WHS5};
KW Reference proteome {ECO:0000313|Proteomes:UP000001940};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 131..224
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT DOMAIN 245..291
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT DOMAIN 416..443
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 17..61
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 86..105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 390..409
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 17..32
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..56
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 390..407
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 580 AA; 64640 MW; E1BAA562FD8D2AC6 CRC64;
MRNGSGLFCQ IVKSANSSLP VAEQSPDAPS CSTNGVDGDM KHLMNGKKRS EDGDGPSRKN
GFFYMAQQMN QTNFANELEA LRKESWASTS SADEKMQTER KESLESIRKA SCMSDSYYEI
EEGPKISDPN PHINLGKNYQ ARVKKWCDRQ VSTSERDAIE DRDEIVFSSE ILQDIDPEQI
TAFELLACSQ ACPRAGRNKE LALHLLMENK GNIEAAVEDL LRSDTLDWEH YSSVFGYMYN
DSVLWTPDEI YQFQDAIYQS EKDFDKVAVE LPGKSVKECV QFYYTWKKDC PDDYRKLRNL
RRKRQLLDIN LQKNQSEEPV VPAKKISIIE SGDSDNESNA TDSSFIGNGH MEFRDRAFTS
PMMSSPREEP IIGLSPSSKD LFGIQKNYQP TAPRAHHTPS ASASKKGAQP SADGFFHCRL
CDKCFEKVKS LNAHMKSHAM KARAEQEAKA HDAQVAAAAA AQLTSAVGNV VGNPVATSPL
NSFANGHLGI SIPSTIGNLT PQQLTPQQLN LNQQLQTQLN SLSNQMSLNS PLTPQQQLQQ
FTQQHLMARA MQQNLFQPVT STPLVQPTHP LIQAGLHSIN
//