GenomeNet

Database: UniProt
Entry: F7FC92_MACMU
LinkDB: F7FC92_MACMU
Original site: F7FC92_MACMU 
ID   F7FC92_MACMU            Unreviewed;      2659 AA.
AC   F7FC92;
DT   27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT   11-DEC-2019, sequence version 3.
DT   27-MAR-2024, entry version 87.
DE   SubName: Full=Nuclear receptor binding SET domain protein 1 {ECO:0000313|Ensembl:ENSMMUP00000011852.4};
GN   Name=NSD1 {ECO:0000313|Ensembl:ENSMMUP00000011852.4,
GN   ECO:0000313|VGNC:VGNC:75401};
OS   Macaca mulatta (Rhesus macaque).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC   Cercopithecidae; Cercopithecinae; Macaca.
OX   NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000011852.4, ECO:0000313|Proteomes:UP000006718};
RN   [1] {ECO:0000313|Proteomes:UP000006718}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=17573 {ECO:0000313|Proteomes:UP000006718};
RX   PubMed=17431167; DOI=10.1126/science.1139247;
RA   Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M.,
RA   Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., Wilson R.K.,
RA   Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., Hardison R.C.,
RA   Makova K.D., Miller W., Milosavljevic A., Palermo R.E., Siepel A.,
RA   Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J.,
RA   Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., Dinh H.H.,
RA   Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., Godfrey J.,
RA   Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., Jhangiani S.N.,
RA   Joshi V., Khan Z.M., Kirkness E.F., Cree A., Fowler R.G., Lee S.,
RA   Lewis L.R., Li Z., Liu Y.-S., Moore S.M., Muzny D., Nazareth L.V.,
RA   Ngo D.N., Okwuonu G.O., Pai G., Parker D., Paul H.A., Pfannkoch C.,
RA   Pohl C.S., Rogers Y.-H.C., Ruiz S.J., Sabo A., Santibanez J.,
RA   Schneider B.W., Smith S.M., Sodergren E., Svatek A.F., Utterback T.R.,
RA   Vattathil S., Warren W., White C.S., Chinwalla A.T., Feng Y., Halpern A.L.,
RA   Hillier L.W., Huang X., Minx P., Nelson J.O., Pepin K.H., Qin X.,
RA   Sutton G.G., Venter E., Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P.,
RA   Jones S.M., Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L.,
RA   Csuros M., Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H.,
RA   Liu Y., Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E.,
RA   Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J.,
RA   Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J.,
RA   Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A.,
RA   Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., Denby A.,
RA   Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., Marklein A.,
RA   Nielsen R., Vallender E.J., Clark A.G., Ferguson B., Hernandez R.D.,
RA   Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., Pu L.-L., Ren Y.,
RA   Smith D.G., Wheeler D.A., Schenck I., Ball E.V., Chen R., Cooper D.N.,
RA   Giardine B., Hsu F., Kent W.J., Lesk A., Nelson D.L., O'brien W.E.,
RA   Pruefer K., Stenson P.D., Wallace J.C., Ke H., Liu X.-M., Wang P.,
RA   Xiang A.P., Yang F., Barber G.P., Haussler D., Karolchik D., Kern A.D.,
RA   Kuhn R.M., Smith K.E., Zwieg A.S.;
RT   "Evolutionary and biomedical insights from the rhesus macaque genome.";
RL   Science 316:222-234(2007).
RN   [2] {ECO:0000313|Ensembl:ENSMMUP00000011852.4}
RP   IDENTIFICATION.
RC   STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000011852.4};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   SMR; F7FC92; -.
DR   Ensembl; ENSMMUT00000012639.4; ENSMMUP00000011852.4; ENSMMUG00000009041.4.
DR   VEuPathDB; HostDB:ENSMMUG00000009041; -.
DR   VGNC; VGNC:75401; NSD1.
DR   eggNOG; KOG1081; Eukaryota.
DR   GeneTree; ENSGT00940000155027; -.
DR   Proteomes; UP000006718; Chromosome 6.
DR   Bgee; ENSMMUG00000009041; Expressed in spleen and 21 other cell types or tissues.
DR   ExpressionAtlas; F7FC92; baseline.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0046975; F:histone H3K36 methyltransferase activity; IEA:UniProt.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   CDD; cd15648; PHD1_NSD1_2; 1.
DR   CDD; cd15650; PHD2_NSD1; 1.
DR   CDD; cd15653; PHD3_NSD1; 1.
DR   CDD; cd15656; PHD4_NSD1; 1.
DR   CDD; cd15659; PHD5_NSD1; 1.
DR   CDD; cd20164; PWWP_NSD1_rpt2; 1.
DR   CDD; cd19210; SET_NSD1; 1.
DR   Gene3D; 2.30.30.140; -; 1.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR   InterPro; IPR006560; AWS_dom.
DR   InterPro; IPR041306; C5HCH.
DR   InterPro; IPR047426; PHD1_NSD1_2.
DR   InterPro; IPR047428; PHD2_NSD1.
DR   InterPro; IPR047429; PHD3_NSD1.
DR   InterPro; IPR047430; PHD4_NSD1.
DR   InterPro; IPR047432; PHD5_NSD1.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR000313; PWWP_dom.
DR   InterPro; IPR047423; PWWP_NSD1_rpt2.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR047433; SET_NSD1.
DR   InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR   InterPro; IPR011011; Znf_FYVE_PHD.
DR   InterPro; IPR001965; Znf_PHD.
DR   InterPro; IPR019787; Znf_PHD-finger.
DR   InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR   PANTHER; PTHR22884:SF312; HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-36 SPECIFIC; 1.
DR   PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR   Pfam; PF17907; AWS; 1.
DR   Pfam; PF17982; C5HCH; 1.
DR   Pfam; PF00628; PHD; 1.
DR   Pfam; PF00855; PWWP; 1.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM00570; AWS; 1.
DR   SMART; SM00249; PHD; 5.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00293; PWWP; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR   PROSITE; PS51215; AWS; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50812; PWWP; 1.
DR   PROSITE; PS50280; SET; 1.
DR   PROSITE; PS01359; ZF_PHD_1; 1.
DR   PROSITE; PS50016; ZF_PHD_2; 2.
PE   4: Predicted;
KW   Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000006718};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW   ProRule:PRU00146}.
FT   DOMAIN          1506..1552
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          1670..1714
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          1719..1781
FT                   /note="PWWP"
FT                   /evidence="ECO:0000259|PROSITE:PS50812"
FT   DOMAIN          1853..1903
FT                   /note="AWS"
FT                   /evidence="ECO:0000259|PROSITE:PS51215"
FT   DOMAIN          1905..2022
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
FT   DOMAIN          2029..2045
FT                   /note="Post-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50868"
FT   REGION          207..250
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          451..478
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          836..858
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          899..995
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1077..1100
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1128..1180
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1206..1235
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1265..1331
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1344..1391
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1424..1497
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2054..2074
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2176..2386
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2427..2487
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2516..2536
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2563..2588
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2626..2659
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        838..858
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        899..951
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1265..1281
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1363..1377
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1429..1444
FT                   /note="Basic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1445..1463
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1473..1491
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2180..2195
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2216..2231
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2243..2257
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2264..2278
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2294..2312
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2519..2535
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2638..2653
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2659 AA;  292538 MW;  1C2FE6588F1E0E47 CRC64;
     MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA
     YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQTP IVCTSLSPGG
     PTALAMKQEL SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI
     EEIFEETQTN ATCNYETKSE NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ
     RNEVDGSNEK AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS
     SSTSQELPFV KTCNKFVFVS NRRPYRQYYV EAFGDPSERA WVAGKAIVMF EGRHQFEELP
     VLRRRGKQKE KGYRHKVPQK ILSKWEASVG LAEQYDVPKG SKNRKCIPGS IKLDSEEDMP
     FEDCTNDPES EHDLLLNGCL KSLAFDSEHS ADEKEKPCAK SRARKSSDNP KRTSVKKGHI
     QFEAHKDERR GKIPENLGLN FISGDISDTQ ASNELSRIAN SLTGSNTAPG SFLFSSCGKN
     TAKKEFETSN GDALLGLPEG ALISKCSREK NKPQRSLVCG SKVKLCYIGA GDEEKRSDSI
     SICTTSDDGS SDLDPIEHSS ESDNSVLEIT DAFDRTENML SMQKNEKIKY SRFAATNTRV
     KAKQKPLISN SHTDHLMGCT KSAEPGTETS QVNLSDLKAS TLVHKPQSDF TSDDLSPKFN
     MSSSISSENS LIKGGAVNQA LLHSKSKQPK FRSIKCKHKE NPVMVEPPVT NDEYSLKCCS
     SDTKGSPLAS ISKSGKVDGL KLLNNMHEKT RDSSDIETAV VKHVLSELKE LSYRSLGEDV
     SDSGTSKPSK PLLFSSPSQN HIPIEPDYKF STLLMMLKDM HDSKTKEQRL MTAQNLVSYR
     SPGRGDCSTN SPVGVSKVLV SGGSTHNSEK KGDGTQNSAN LSPSGGDSAL SGELSASLPG
     LVSDKRDLPV SGKSRSNCVT RRNCGRSKPS SKLRDAFSAQ VVKNTVNRKA LKTERKRKLN
     QLSSVTLDAV LQGDREHGGS LRGGAEDPSK EEPLQIMGHL TSEDGDHFSD VHFDNKVKQS
     DPGKISEKGP SFENGKGPEL DSVMNSENDE LNGVNQVVPK KRWQRLNQRR TKPRKRMNRF
     KEKENSECAF GALLPSDPVQ EGRDEFPEHR TSSASILEEP LTDQKHADCL DSVGPRLNVC
     DKSSASIGDM EKEPGIPSLT PQAELPEPAV RSEKKRLRKP SKWLLEYTEE YDQIFAPKKK
     QKKVQEQVHK VSSRCEEESL LARGRSSAQN KQVDENSLIS TKEEPPVLER EAPFLEGPLA
     QSELGGGHAE LPQLTLSVPV APEVSPRPAL ESEELLVKTP GNYESKRQRK PTKKLLESND
     LDPGFMPKKG DLGLSKKCYE AGHLENGITE SCATSYSKDF GGGTSKIFDR PRKRKRQRHA
     AAKMQCKKVK NDDSSKEIPS LEGELMPHRT AASPKETVEE GVEHDSGMPA SKKMQGERGG
     GAALKENVCQ NCEKLGELLL CEAQCCGAFH LECLGLTEMP RGKFICNECR TGIHTCFVCK
     QSGEDVKRCL LPLCGKFYHE ECVQKYPPTV MQNKGFRCSL HICITCHAAN PANVSASKGR
     LMRCVRCPVA YHANDFCLAA GSKILASNSI ICPNHFTPRR GCRNHEHVNV SWCFVCSEGG
     SLLCCDSCPA AFHRECLNID IPEGNWYCND CKAGKKPHYR EIVWVKVGRY RWWPAEICHP
     RAVPSNIDKM RHDVGEFPVL FFGSNDYLWT HQARVFPYME GDVSSKDKMG KGVDGTYKKA
     LQEAAARFEE LKAQKELRQL QEDRKNDKKP PPYKHIKVNR PIGRVQIFTA DLSEIPRCNC
     KATDENPCGI DSECINRMLL YECHPTVCPA GGRCQNQCFS KRQYPEVEIF RTLQRGWGLR
     TKTDIKKGEF VNEYVGELID EEECRARIRY AQEHDITNFY MLTLDKDRII DAGPKGNYAR
     FMNHCCQPNC ETQKWSVNGD TRVGLFALSD IKAGTELTFN YNLECLGNGK TVCKCGAPNC
     SGFLGVRPKN QPIATEEKSK KFKKKQQGKR RTQGEITKER EDECFSCGDA GQLVSCKKPG
     CPKVYHADCL NLTKRPAGKW ECPWHQCDIC GKEAASFCEM CPSSFCKQHR EGMLFISKLD
     GRLSCTEHDP CGPNPLEPGE IREYVPPPVP LPPGPSTHLA EQSTGMAAQA PKMSDKPPAD
     TNQTLSLSKK ALAGTCQRPL LPERPLERTD SRSQPLDKVR DLAGSGTKSQ SLVSSQRPLD
     RQPAVAGPRP QLSDKPSPVT SPSSSPSVRS QPLERPLGTA DPRLDKSIGA ASPRPQSLEK
     TPVPTGLRLP PPDRLLITSS PKPQTSDRPP DKPHASLSQR LPPPEKVLSA VVQTLVAKEK
     ALRPVDQNTQ SKNRAALVMD LIDLTPRQKE RAASPHEVTP QADEKMPVLE SSSWPASKGL
     GHMPRAVEKG SVSDPLQTSG KVAAHSEDPW QAVKSFTQAR LLSQPPAKAF LYEPTTQASG
     RAPAGTEQTP GPLSQVPGLV KQAKQMIGGQ QLPALAARSG QSFRSLGKAP ASLPTEEKKL
     VTTEQSPWAL GKASSRAGLW PIVAGQTLAQ SCWSPGSTQT LTQTCWSLGR GQDPKPEQNT
     LPALNQAPSS HKCAESEQK
//
DBGET integrated database retrieval system