ID A0A0D9R0E6_CHLSB Unreviewed; 1708 AA.
AC A0A0D9R0E6;
DT 27-MAY-2015, integrated into UniProtKB/TrEMBL.
DT 27-MAY-2015, sequence version 1.
DT 24-JAN-2024, entry version 55.
DE SubName: Full=SET domain containing 1A, histone lysine methyltransferase {ECO:0000313|Ensembl:ENSCSAP00000002085.1};
GN Name=SETD1A {ECO:0000313|Ensembl:ENSCSAP00000002085.1};
OS Chlorocebus sabaeus (Green monkey) (Cercopithecus sabaeus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Chlorocebus.
OX NCBI_TaxID=60711 {ECO:0000313|Ensembl:ENSCSAP00000002085.1, ECO:0000313|Proteomes:UP000029965};
RN [1] {ECO:0000313|Ensembl:ENSCSAP00000002085.1, ECO:0000313|Proteomes:UP000029965}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2014) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCSAP00000002085.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AQIB01118760; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AQIB01118761; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_007988595.1; XM_007990404.1.
DR RefSeq; XP_007988601.1; XM_007990410.1.
DR RefSeq; XP_007988612.1; XM_007990421.1.
DR RefSeq; XP_007988623.1; XM_007990432.1.
DR RefSeq; XP_007988634.1; XM_007990443.1.
DR RefSeq; XP_007988642.1; XM_007990451.1.
DR STRING; 60711.ENSCSAP00000002085; -.
DR Ensembl; ENSCSAT00000003814.1; ENSCSAP00000002085.1; ENSCSAG00000005791.1.
DR GeneID; 103231168; -.
DR KEGG; csab:103231168; -.
DR CTD; 9739; -.
DR eggNOG; KOG1080; Eukaryota.
DR GeneTree; ENSGT00940000154575; -.
DR OMA; KVSRYPD; -.
DR OrthoDB; 950362at2759; -.
DR BioGRID-ORCS; 103231168; 0 hits in 9 CRISPR screens.
DR Proteomes; UP000029965; Chromosome 5.
DR Bgee; ENSCSAG00000005791; Expressed in adrenal cortex and 7 other cell types or tissues.
DR GO; GO:0000785; C:chromatin; IEA:Ensembl.
DR GO; GO:0016607; C:nuclear speck; IEA:Ensembl.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:Ensembl.
DR GO; GO:0008013; F:beta-catenin binding; IEA:Ensembl.
DR GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:Ensembl.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0061629; F:RNA polymerase II-specific DNA-binding transcription factor binding; IEA:Ensembl.
DR GO; GO:0007420; P:brain development; IEA:Ensembl.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR GO; GO:1902275; P:regulation of chromatin organization; IEA:Ensembl.
DR GO; GO:1902036; P:regulation of hematopoietic stem cell differentiation; IEA:Ensembl.
DR CDD; cd12548; RRM_Set1A; 1.
DR CDD; cd19169; SET_SETD1; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR024657; COMPASS_Set1_N-SET.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR044570; Set1-like.
DR InterPro; IPR034467; Set1A_RRM.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR037841; SET_SETD1A/B.
DR PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR PANTHER; PTHR45814:SF3; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1A; 1.
DR Pfam; PF11764; N-SET; 1.
DR Pfam; PF00076; RRM_1; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM01291; N-SET; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00360; RRM; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000029965};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}; S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 84..172
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 1569..1686
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1692..1708
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 194..308
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 335..486
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 505..655
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 834..854
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 891..1251
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1264..1293
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1311..1419
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1473..1500
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 206..308
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 335..381
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 427..444
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 463..486
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 565..580
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 593..655
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 892..920
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 929..964
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 971..991
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 992..1015
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1016..1059
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1078..1094
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1120..1161
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1176..1190
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1225..1244
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1311..1325
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1359..1381
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1478..1492
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1708 AA; 185888 MW; F2C5579A5E6A0F1D CRC64;
MDQEGGGDGQ KAPSFQWRNY KLIVDPALDP ALRRPSQKVY RYDGVHFSVN DSKYIPVEDL
QDPRCHVRSK NRDFSLPVPK FKLDEFYIGQ IPLKEVTFAR LNDNVRETFL KDMCRKYGEV
EEVEILLHPR TRKHLGLARV LFTSTRGAKE TVKNLHLTSV MGNIIHAQLD IKGQQRMKYY
ELIVNGSYTP QTVPTGGKAL SEKFQGSGVA TETAESRRRS SSDTAAYPAG TTAVGTPGNG
TPCSQDTSFS SSRQDTPSSF GQFTPQSSQG TPYTSRGSTP YSQDSAYSSS TTSTSFKPRR
SENSYQDAFS RRHFSASSAS TTASTAIAAT TAATAASSAS SSSLSSSSSS SSSSSSSQFR
SSDSNYPAYY ESWNRYQRHT SYPPRRATRE DPPGAPFAEN TAECFPPSYT SYLPPEPSRP
ADQDYRPPAS EAPPPEPPEP GGGGSGGGPS PEREEVRTSP RPASPARSGS PAPETTNESV
PFAQHSSLDS RIEMLLKEQR SKFSFLASDT EEEEENSSVG LGARDAGSEV PSGSGHGPCT
PPPAPANFED VAPTGSGEPG ATRESPKANG QNQASPCSSG DDMEISDDDR GGSPPPAPTP
PQQPPPPPPP PPPPPPYLAS LPLGYPPHQP AYLLPPRPDG PPPPEYPPPP PPPPHIYDFV
NSLELMDRLG AQWGGMPMSF QMQTQMLTRL HQLRQGKGLI AASAGPPGGT FGEAFLPFPP
PQEAAYGLPY ALYAQGQEGR GAYSREAYHL PVPMAAEPLP SSSVSGEEAR LPPREEAELA
EGKTLPTAGT VGRVLAMLVQ EMKSIMQRDL NRKMVENVAF GAFDQWWESK EEKAKPFQNA
AKQQAKEEDK EKTKLKEPGL LSLVDWAKSG GTTGIEAFAF GSGLRGALRL PSFKVKRKEP
SEISEASEEK RPRPSTPAEE DEDDPEREKE AGEPGRPGTK PPKRDEERGK TQGKHRKSFA
LDSEGEEASQ ESSSEKDEED DEEDEEDEDR EEAVDTTKKE TGVSDGEDEE SDSSSKCSLY
ADSDGENDST SDSESSSSSS SSSSSSSSSS SSSSSSSSES SSEDEEEEER PAALSSASPP
PREVPVPTPA PVEVPAPERV AGSPVTPLPE QETSPARPAG PTEELPPSVP PPPPEPPAGP
PAAAPCPDER PSSPIPLLPP PKKRRKTVSF SAIEVVPAPE PPPATPPQAK SPGPASRKAP
RGVERTIRNL PLDHASLVKS WPEEVSRGSR SRAGGRGRAT EEEEAEPGTE VDLAVLADLA
LTPARRGLPS LPAVDDSEAT ETSDEAERPG PLLSHILLEH NYALAIKPTA PALAPRPPEP
VPAPAALFSS PADEVLEAPE VVVAEAEEPK PQQLQQQREE GEEEEEEEEE EEEEEESSDS
SSSSDGEGAL RRRSLRSHAR RRRPPPPPPP RPPRAYEPRS EFEQMTILYD IWNSGLDSED
MSYLRLTYER LLQQTSGADW LNDTHWVHHT ITNLTTPKRK RRPQDGPREH QTGSARSEGY
YPISKKEKDK YLDVCPVSAR QLEGVDTQGT NRVLSERRSE QRRLLSAIGT SAIMDSDLLK
LNQLKFRKKK LRFGRSRIHE WGLFAMEPIA ADEMVIEYVG QNIRQMVADM REKRYVQEGI
GSSYLFRVDH DTIIDATKCG NLARFINHCC TPNCYAKVIT IESQKKIVIY SKQPIGVDEE
ITYDYKFPLE DNKIPCLCGT ESCRGSLN
//