ID M4A0Q2_XIPMA Unreviewed; 2168 AA.
AC M4A0Q2;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=Histone-lysine N-methyltransferase SETD1A {ECO:0000313|Ensembl:ENSXMAP00000008046.2};
OS Xiphophorus maculatus (Southern platyfish) (Platypoecilus maculatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Xiphophorus.
OX NCBI_TaxID=8083 {ECO:0000313|Ensembl:ENSXMAP00000008046.2, ECO:0000313|Proteomes:UP000002852};
RN [1] {ECO:0000313|Proteomes:UP000002852}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RA Walter R., Schartl M., Warren W.;
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000002852}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JP 163 A {ECO:0000313|Proteomes:UP000002852};
RX PubMed=23542700; DOI=10.1038/ng.2604;
RA Schartl M., Walter R.B., Shen Y., Garcia T., Catchen J., Amores A.,
RA Braasch I., Chalopin D., Volff J.N., Lesch K.P., Bisazza A., Minx P.,
RA Hillier L., Wilson R.K., Fuerstenberg S., Boore J., Searle S.,
RA Postlethwait J.H., Warren W.C.;
RT "The genome of the platyfish, Xiphophorus maculatus, provides insights into
RT evolutionary adaptation and several complex traits.";
RL Nat. Genet. 45:567-572(2013).
RN [3] {ECO:0000313|Ensembl:ENSXMAP00000008046.2}
RP IDENTIFICATION.
RC STRAIN=JP 163 A {ECO:0000313|Ensembl:ENSXMAP00000008046.2};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_014330779.1; XM_014475293.1.
DR STRING; 8083.ENSXMAP00000008046; -.
DR Ensembl; ENSXMAT00000008056.2; ENSXMAP00000008046.2; ENSXMAG00000008021.2.
DR GeneID; 102234339; -.
DR KEGG; xma:102234339; -.
DR CTD; 9739; -.
DR eggNOG; KOG1080; Eukaryota.
DR GeneTree; ENSGT00940000162290; -.
DR HOGENOM; CLU_001226_0_0_1; -.
DR InParanoid; M4A0Q2; -.
DR OMA; KVSRYPD; -.
DR OrthoDB; 950362at2759; -.
DR Proteomes; UP000002852; Unassembled WGS sequence.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:InterPro.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd12548; RRM_Set1A; 1.
DR CDD; cd19169; SET_SETD1; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR024657; COMPASS_Set1_N-SET.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR044570; Set1-like.
DR InterPro; IPR034467; Set1A_RRM.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR037841; SET_SETD1A/B.
DR PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR PANTHER; PTHR45814:SF3; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1A; 1.
DR Pfam; PF11764; N-SET; 1.
DR Pfam; PF00076; RRM_1; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM01291; N-SET; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00360; RRM; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000002852};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU00176}; S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 81..169
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 2029..2146
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 2152..2168
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 191..216
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 239..679
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 701..949
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1018..1037
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1230..1537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1563..1597
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1631..1651
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1720..1838
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..215
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 264..331
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 358..372
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 444..466
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 467..486
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 487..501
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 502..552
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 564..581
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 613..631
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 632..659
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 701..761
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 828..843
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 855..889
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 911..949
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1230..1268
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1289..1320
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1321..1341
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1342..1357
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1366..1397
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1765..1795
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1813..1828
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2168 AA; 238490 MW; B5B0790A9BB754D4 CRC64;
MDPDSGTETQ KAVSLQWESY KLVQDPAIRR VTQKVYRYDG VHFSVPDSGF PPVGELRDPR
PRRLWSRYTE ISLPVPKFKL DEFYVGPIPL KEVTFARLND NIKEPFLAEM CSKFGEVEEM
EILFHPKTRK HLGLARVLFT NTRGAKDTVK HLHNTSVMGN IIHAQLDIKG QQRQKYYDLI
VNGSYTPQTV PLGGKALTDS VQSQTPAQPQ PDTSEIRRRY SSELAVLAAG VHALTSGSMT
PCSVETGFGE QRVDTPPSSL PGPYTPSSSA SSQGGGGTPY SSRSGTPFSQ DSGYSGSRHP
NYNAGSLSSG YPPQDMLPSS SSSSAVSSSV GGFKVSRYPD EQDPSLYHRG RPSYPPTTSY
RPNEPPGYAP YPNLGGPGSH MAHHSSMPPP PICNQYDQPP LPDRERSESA GRYAAAGVGS
RRSSYHHQQD SNSSSKYHSH HSHHHSDRRD DRGYRRDSVG SRSGDHGHQR HRNHHHSHNH
HSSRRRSSHD RDRDRDADYA NSSDPRYNSN SYRSSSNSMS PPPSSYSAYA SSKDPAPTPP
QDGSSRSAGS TLAEKAGSDK DHHGALPPPP PPPPPPLPPA SVIAAAVAET LGTLDFNQDS
PAREDQWTKP KRRPTTPPAP PKTPPPSSPS QPTTAKSSSN SPSSASLPHH LSSSSSSPPP
PHRDSSPEPD STNESLPFVY HSSSLDSRIE MLLKEQKAKF SFLASDEEDE EDRKEEKQRS
RVREGAERRS DAAKEQEHRG DNGENDHRKK GERDREGHRG RKRGKGEGRK SPTVLPAGTQ
PSENYSPHIV PPEEPPPPVG IDATEALQQD DPQAGSADPG VRTGAHTPPP FNGQSQASPH
SSGEDMEISD DDAEETTTIT TVTTHQPTVA SGSTPSSSQA AAAGLSQPAD PSSSPPPISD
SSQHFGTSMH PPIPSYPPHL PPPPPPGYSL QPPPPPGIPP LPHMELHPEY PPPMPHHMYD
YATSMELMNQ YSGGAPMSFQ MQTHMLSRLH QMRLSSSNGT AGPSEAATGD YASYHLHSIP
PPHTHHPYMD QEGNGAGAHY DQDHRYMPPH MSYPYHEPHS TQIPPPPHHS IPPPHSGWPP
HVLPPHFQSY LPPPGYGTML TGEGDEYSAS GEALPMMAEN PHEATVQMVL ASLIQEMKNI
VQRDLNRKMV ENVAFATFDE WWDRKETKAK PFQTMVRGVS ALRDDEKKEE KVNRPREPLM
SLVDWAKSGG VEGFSLRGAL RLPSFKVKRK EPQEFKEGDL KRPRPSTPPD EDDEAGRMPE
ADRHAADRDN KRRKKKPRSR KPWELDSEGE ETSDGSSSEK EDEEVESEKE SDDDGLSSDS
DDESLSSSSE GSSSSASSSS SSSEDEEEEE GELVESAGPD SMDESTMDST TEKEDDRHIV
AGAVPKVDIK KGVSKEIKAD IPAAPLSPLG PRPSSPNAFV PPLKKRRKTV SFSTDENDSK
PLLPAPSPTQ PGSEAPPTAD KPLDSPIALT PPPSCRPAHS IQLLPFASKP GEGNALIVPP
PSRNQDPEES KKTPLAPSPK SPGKRVASKD SLRPAAAPVM VCRTVQNLPL DHASMCRMAF
EEAPAAAPGN KRSRGRTRTP SVCHAVREDE EDEEDGEQRL RIREQVGASS LLQLAGADLS
VLADVALKMD PEAGDSEETE TSDEAEEQKM EGDPFSLESL ALMMSPGVVV VLLEHNYCKP
PALAAATVAS GARKPTARQD ASVLVSADLN SISGVLEAPE EVIGEAAPPR GDKGEYFSPV
GDLGDSDDNR QMAPVSPSPK RKNYVGKGLE VDQDKSEKRK RKDKENLEPH RTKKQKEPLG
KKQRKRKLEE SDFEEDVDVE ELESGELSSS DTENEVAEGM RKSERLFLQE AGVTSSQRWP
KPRPAPEPLL LKFENRSEFE QMTILYDIWN SGLDGEDLSF LKKTYEKLLQ DDHSSDWLND
THWVNHTVTN LPNPRRKKKS ADGQLREHVT GCARSEGYYA ISRKEKDVYL DLDLPEQVIR
EVENVDSSGA NRVLSERRSE QRRLLTVIGT TAVMDSDLLK LNQLKFRKKK LRFGRSRIHE
WGLFAMEPIA ADEMVIEYVG QNIRQMVADN REKRYAQQGI GSSYLFRVDH DTIIDATKCG
NLARFINHCC TPNCYAKVIT IESQKKIVIY SKQPIAVNEE ITYDYKFPLE ENKIPCLCGT
ENCRGTLN
//