ID F7FC92_MACMU Unreviewed; 2659 AA.
AC F7FC92;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2019, sequence version 3.
DT 27-MAR-2024, entry version 87.
DE SubName: Full=Nuclear receptor binding SET domain protein 1 {ECO:0000313|Ensembl:ENSMMUP00000011852.4};
GN Name=NSD1 {ECO:0000313|Ensembl:ENSMMUP00000011852.4,
GN ECO:0000313|VGNC:VGNC:75401};
OS Macaca mulatta (Rhesus macaque).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000011852.4, ECO:0000313|Proteomes:UP000006718};
RN [1] {ECO:0000313|Proteomes:UP000006718}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=17573 {ECO:0000313|Proteomes:UP000006718};
RX PubMed=17431167; DOI=10.1126/science.1139247;
RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M.,
RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., Wilson R.K.,
RA Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., Hardison R.C.,
RA Makova K.D., Miller W., Milosavljevic A., Palermo R.E., Siepel A.,
RA Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J.,
RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., Dinh H.H.,
RA Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., Godfrey J.,
RA Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., Jhangiani S.N.,
RA Joshi V., Khan Z.M., Kirkness E.F., Cree A., Fowler R.G., Lee S.,
RA Lewis L.R., Li Z., Liu Y.-S., Moore S.M., Muzny D., Nazareth L.V.,
RA Ngo D.N., Okwuonu G.O., Pai G., Parker D., Paul H.A., Pfannkoch C.,
RA Pohl C.S., Rogers Y.-H.C., Ruiz S.J., Sabo A., Santibanez J.,
RA Schneider B.W., Smith S.M., Sodergren E., Svatek A.F., Utterback T.R.,
RA Vattathil S., Warren W., White C.S., Chinwalla A.T., Feng Y., Halpern A.L.,
RA Hillier L.W., Huang X., Minx P., Nelson J.O., Pepin K.H., Qin X.,
RA Sutton G.G., Venter E., Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P.,
RA Jones S.M., Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L.,
RA Csuros M., Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H.,
RA Liu Y., Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E.,
RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J.,
RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J.,
RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A.,
RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., Denby A.,
RA Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., Marklein A.,
RA Nielsen R., Vallender E.J., Clark A.G., Ferguson B., Hernandez R.D.,
RA Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., Pu L.-L., Ren Y.,
RA Smith D.G., Wheeler D.A., Schenck I., Ball E.V., Chen R., Cooper D.N.,
RA Giardine B., Hsu F., Kent W.J., Lesk A., Nelson D.L., O'brien W.E.,
RA Pruefer K., Stenson P.D., Wallace J.C., Ke H., Liu X.-M., Wang P.,
RA Xiang A.P., Yang F., Barber G.P., Haussler D., Karolchik D., Kern A.D.,
RA Kuhn R.M., Smith K.E., Zwieg A.S.;
RT "Evolutionary and biomedical insights from the rhesus macaque genome.";
RL Science 316:222-234(2007).
RN [2] {ECO:0000313|Ensembl:ENSMMUP00000011852.4}
RP IDENTIFICATION.
RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000011852.4};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR SMR; F7FC92; -.
DR Ensembl; ENSMMUT00000012639.4; ENSMMUP00000011852.4; ENSMMUG00000009041.4.
DR VEuPathDB; HostDB:ENSMMUG00000009041; -.
DR VGNC; VGNC:75401; NSD1.
DR eggNOG; KOG1081; Eukaryota.
DR GeneTree; ENSGT00940000155027; -.
DR Proteomes; UP000006718; Chromosome 6.
DR Bgee; ENSMMUG00000009041; Expressed in spleen and 21 other cell types or tissues.
DR ExpressionAtlas; F7FC92; baseline.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046975; F:histone H3K36 methyltransferase activity; IEA:UniProt.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd15648; PHD1_NSD1_2; 1.
DR CDD; cd15650; PHD2_NSD1; 1.
DR CDD; cd15653; PHD3_NSD1; 1.
DR CDD; cd15656; PHD4_NSD1; 1.
DR CDD; cd15659; PHD5_NSD1; 1.
DR CDD; cd20164; PWWP_NSD1_rpt2; 1.
DR CDD; cd19210; SET_NSD1; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR041306; C5HCH.
DR InterPro; IPR047426; PHD1_NSD1_2.
DR InterPro; IPR047428; PHD2_NSD1.
DR InterPro; IPR047429; PHD3_NSD1.
DR InterPro; IPR047430; PHD4_NSD1.
DR InterPro; IPR047432; PHD5_NSD1.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR047423; PWWP_NSD1_rpt2.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR047433; SET_NSD1.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR22884:SF312; HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-36 SPECIFIC; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF17982; C5HCH; 1.
DR Pfam; PF00628; PHD; 1.
DR Pfam; PF00855; PWWP; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00249; PHD; 5.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00293; PWWP; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50812; PWWP; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 2.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000006718};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 1506..1552
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 1670..1714
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 1719..1781
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 1853..1903
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 1905..2022
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 2029..2045
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 207..250
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 451..478
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 836..858
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 899..995
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1077..1100
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1128..1180
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1206..1235
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1265..1331
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1344..1391
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1424..1497
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2054..2074
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2176..2386
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2427..2487
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2516..2536
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2563..2588
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2626..2659
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 838..858
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 899..951
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1265..1281
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1363..1377
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1429..1444
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1445..1463
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1473..1491
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2180..2195
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2216..2231
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2243..2257
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2264..2278
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2294..2312
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2519..2535
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2638..2653
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2659 AA; 292538 MW; 1C2FE6588F1E0E47 CRC64;
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQTP IVCTSLSPGG
PTALAMKQEL SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI
EEIFEETQTN ATCNYETKSE NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ
RNEVDGSNEK AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS
SSTSQELPFV KTCNKFVFVS NRRPYRQYYV EAFGDPSERA WVAGKAIVMF EGRHQFEELP
VLRRRGKQKE KGYRHKVPQK ILSKWEASVG LAEQYDVPKG SKNRKCIPGS IKLDSEEDMP
FEDCTNDPES EHDLLLNGCL KSLAFDSEHS ADEKEKPCAK SRARKSSDNP KRTSVKKGHI
QFEAHKDERR GKIPENLGLN FISGDISDTQ ASNELSRIAN SLTGSNTAPG SFLFSSCGKN
TAKKEFETSN GDALLGLPEG ALISKCSREK NKPQRSLVCG SKVKLCYIGA GDEEKRSDSI
SICTTSDDGS SDLDPIEHSS ESDNSVLEIT DAFDRTENML SMQKNEKIKY SRFAATNTRV
KAKQKPLISN SHTDHLMGCT KSAEPGTETS QVNLSDLKAS TLVHKPQSDF TSDDLSPKFN
MSSSISSENS LIKGGAVNQA LLHSKSKQPK FRSIKCKHKE NPVMVEPPVT NDEYSLKCCS
SDTKGSPLAS ISKSGKVDGL KLLNNMHEKT RDSSDIETAV VKHVLSELKE LSYRSLGEDV
SDSGTSKPSK PLLFSSPSQN HIPIEPDYKF STLLMMLKDM HDSKTKEQRL MTAQNLVSYR
SPGRGDCSTN SPVGVSKVLV SGGSTHNSEK KGDGTQNSAN LSPSGGDSAL SGELSASLPG
LVSDKRDLPV SGKSRSNCVT RRNCGRSKPS SKLRDAFSAQ VVKNTVNRKA LKTERKRKLN
QLSSVTLDAV LQGDREHGGS LRGGAEDPSK EEPLQIMGHL TSEDGDHFSD VHFDNKVKQS
DPGKISEKGP SFENGKGPEL DSVMNSENDE LNGVNQVVPK KRWQRLNQRR TKPRKRMNRF
KEKENSECAF GALLPSDPVQ EGRDEFPEHR TSSASILEEP LTDQKHADCL DSVGPRLNVC
DKSSASIGDM EKEPGIPSLT PQAELPEPAV RSEKKRLRKP SKWLLEYTEE YDQIFAPKKK
QKKVQEQVHK VSSRCEEESL LARGRSSAQN KQVDENSLIS TKEEPPVLER EAPFLEGPLA
QSELGGGHAE LPQLTLSVPV APEVSPRPAL ESEELLVKTP GNYESKRQRK PTKKLLESND
LDPGFMPKKG DLGLSKKCYE AGHLENGITE SCATSYSKDF GGGTSKIFDR PRKRKRQRHA
AAKMQCKKVK NDDSSKEIPS LEGELMPHRT AASPKETVEE GVEHDSGMPA SKKMQGERGG
GAALKENVCQ NCEKLGELLL CEAQCCGAFH LECLGLTEMP RGKFICNECR TGIHTCFVCK
QSGEDVKRCL LPLCGKFYHE ECVQKYPPTV MQNKGFRCSL HICITCHAAN PANVSASKGR
LMRCVRCPVA YHANDFCLAA GSKILASNSI ICPNHFTPRR GCRNHEHVNV SWCFVCSEGG
SLLCCDSCPA AFHRECLNID IPEGNWYCND CKAGKKPHYR EIVWVKVGRY RWWPAEICHP
RAVPSNIDKM RHDVGEFPVL FFGSNDYLWT HQARVFPYME GDVSSKDKMG KGVDGTYKKA
LQEAAARFEE LKAQKELRQL QEDRKNDKKP PPYKHIKVNR PIGRVQIFTA DLSEIPRCNC
KATDENPCGI DSECINRMLL YECHPTVCPA GGRCQNQCFS KRQYPEVEIF RTLQRGWGLR
TKTDIKKGEF VNEYVGELID EEECRARIRY AQEHDITNFY MLTLDKDRII DAGPKGNYAR
FMNHCCQPNC ETQKWSVNGD TRVGLFALSD IKAGTELTFN YNLECLGNGK TVCKCGAPNC
SGFLGVRPKN QPIATEEKSK KFKKKQQGKR RTQGEITKER EDECFSCGDA GQLVSCKKPG
CPKVYHADCL NLTKRPAGKW ECPWHQCDIC GKEAASFCEM CPSSFCKQHR EGMLFISKLD
GRLSCTEHDP CGPNPLEPGE IREYVPPPVP LPPGPSTHLA EQSTGMAAQA PKMSDKPPAD
TNQTLSLSKK ALAGTCQRPL LPERPLERTD SRSQPLDKVR DLAGSGTKSQ SLVSSQRPLD
RQPAVAGPRP QLSDKPSPVT SPSSSPSVRS QPLERPLGTA DPRLDKSIGA ASPRPQSLEK
TPVPTGLRLP PPDRLLITSS PKPQTSDRPP DKPHASLSQR LPPPEKVLSA VVQTLVAKEK
ALRPVDQNTQ SKNRAALVMD LIDLTPRQKE RAASPHEVTP QADEKMPVLE SSSWPASKGL
GHMPRAVEKG SVSDPLQTSG KVAAHSEDPW QAVKSFTQAR LLSQPPAKAF LYEPTTQASG
RAPAGTEQTP GPLSQVPGLV KQAKQMIGGQ QLPALAARSG QSFRSLGKAP ASLPTEEKKL
VTTEQSPWAL GKASSRAGLW PIVAGQTLAQ SCWSPGSTQT LTQTCWSLGR GQDPKPEQNT
LPALNQAPSS HKCAESEQK
//