ID A0A1U8CGH1_MESAU Unreviewed; 2688 AA.
AC A0A1U8CGH1;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Histone-lysine N-methyltransferase, H3 lysine-36 and H4 lysine-20 specific isoform X1 {ECO:0000313|RefSeq:XP_005078407.1, ECO:0000313|RefSeq:XP_005078408.1};
GN Name=Nsd1 {ECO:0000313|RefSeq:XP_005078407.1,
GN ECO:0000313|RefSeq:XP_005078408.1, ECO:0000313|RefSeq:XP_012974748.1,
GN ECO:0000313|RefSeq:XP_012974749.1, ECO:0000313|RefSeq:XP_021087698.1};
OS Mesocricetus auratus (Golden hamster).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Cricetinae; Mesocricetus.
OX NCBI_TaxID=10036 {ECO:0000313|Proteomes:UP000189706, ECO:0000313|RefSeq:XP_012974749.1};
RN [1] {ECO:0000313|RefSeq:XP_005078407.1, ECO:0000313|RefSeq:XP_005078408.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_005078407.1; XM_005078350.3.
DR RefSeq; XP_005078408.1; XM_005078351.3.
DR RefSeq; XP_012974748.1; XM_013119294.2.
DR RefSeq; XP_012974749.1; XM_013119295.2.
DR RefSeq; XP_021087698.1; XM_021232039.1.
DR STRING; 10036.ENSMAUP00000016310; -.
DR GeneID; 101826029; -.
DR CTD; 64324; -.
DR eggNOG; KOG1081; Eukaryota.
DR OrthoDB; 950362at2759; -.
DR Proteomes; UP000189706; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046975; F:histone H3K36 methyltransferase activity; IEA:Ensembl.
DR GO; GO:0050681; F:nuclear androgen receptor binding; IEA:Ensembl.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IEA:Ensembl.
DR GO; GO:0003712; F:transcription coregulator activity; IEA:Ensembl.
DR GO; GO:0008270; F:zinc ion binding; IEA:Ensembl.
DR GO; GO:0045893; P:positive regulation of DNA-templated transcription; IEA:Ensembl.
DR GO; GO:0033135; P:regulation of peptidyl-serine phosphorylation; IEA:Ensembl.
DR CDD; cd15648; PHD1_NSD1_2; 1.
DR CDD; cd15650; PHD2_NSD1; 1.
DR CDD; cd15653; PHD3_NSD1; 1.
DR CDD; cd15656; PHD4_NSD1; 1.
DR CDD; cd15659; PHD5_NSD1; 1.
DR CDD; cd20161; PWWP_NSD1_rpt1; 1.
DR CDD; cd20164; PWWP_NSD1_rpt2; 1.
DR CDD; cd19210; SET_NSD1; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR041306; C5HCH.
DR InterPro; IPR047426; PHD1_NSD1_2.
DR InterPro; IPR047428; PHD2_NSD1.
DR InterPro; IPR047429; PHD3_NSD1.
DR InterPro; IPR047430; PHD4_NSD1.
DR InterPro; IPR047432; PHD5_NSD1.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR047423; PWWP_NSD1_rpt2.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR047433; SET_NSD1.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR001841; Znf_RING.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR22884:SF312; HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-36 SPECIFIC; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF17982; C5HCH; 1.
DR Pfam; PF00628; PHD; 1.
DR Pfam; PF00855; PWWP; 2.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00249; PHD; 5.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00293; PWWP; 2.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50812; PWWP; 2.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 2.
DR PROSITE; PS50089; ZF_RING_2; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000189706};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00175}.
FT DOMAIN 323..388
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 1541..1587
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 1591..1636
FT /note="RING-type"
FT /evidence="ECO:0000259|PROSITE:PS50089"
FT DOMAIN 1705..1749
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 1754..1816
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 1888..1938
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 1940..2057
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 2064..2080
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 18..41
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 210..251
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 281..306
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 486..506
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 733..753
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 793..833
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 932..989
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1013..1032
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1041..1092
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1105..1270
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1320..1342
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1380..1406
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1456..1533
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2211..2419
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2437..2531
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2548..2619
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2657..2688
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 234..248
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 808..825
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1105..1141
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1464..1480
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1481..1499
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2215..2230
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2231..2265
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2277..2291
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2298..2312
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2327..2344
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2386..2406
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2450..2472
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2581..2597
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2657..2680
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2688 AA; 295568 MW; CD3862CF6D2C414B CRC64;
MDQTCELSRR NCLLSFSNPV NLEAPEDKDS PFGNGQSNFS EPLKGCTMQL PTASGAPQNA
YGQDSPSCYI PLRRLQDLAS MINVEYLSGS ADGSESFQDP AKSDSRAQSA VACTSLSPGG
PTALAMKQDP SCNNSPELQL RVTKTTKNGF LHFENFTCVD DAVVDSEMDP EQPVTKDESI
VEIFEETQTN ATCNNEPKSE IGVEMAMGSE RDSTPESRHG AVKRPFLPLA PETEKQKNKQ
RNEVDRSNEK TALLPAPISL GDTNINVEEQ FNSINLSFQD DPDSSTSTLG NMLELPGTSS
SSTSQELPFC QLKKKSTPLK YEVGDLIWAK FKRRPWWPCR ICADPLINTH SKMKVANRRP
YREYYVEAFG DPSEKAWVAG KAIVMFEGRH QFEELPVLRK RGKQKEKGYR HKVPQKILSK
WEASVGLAEQ YDVPKGSKKQ KCVTSSIKLD SEEEMLFEDC TNDPESEQDL LLNGCLKSLA
FDSEHSADEK EKPCDKSGAR KSSDNIKRTS VKKGLLPFEA QKEEQRGKIP ENLDLDFIPG
SASDKQASNE LSRIANSLTG SKTAPGGFLF SSCTQSTAKA DFEISNCDSL PGLSESALSS
KHSVEKTKLQ PGLICGSKVQ LCYIGAGDEE KRSDSVSVCA TSDDGSSDLD PIERNSECDS
SVLEITDAFD RTESTLSMHK NETKYSRYPA TNRIKEKQKS LITNSHTDHL MDSTKTMEPG
TAEMSQVNLS DLKVCNPVPK PQPEFRNDSL TPKFSAPPSI CSENSLTKGG AANQTLLPLK
SRQPKFRSIK CKHRENPADA EPSATNEDPS LKCCSSDSKG SPLASTPKSG RAEGLKLLNN
MHEKTRESSD IETAVVKHVL SELKELSYRS LSEDASDSGT SKASKPLLFS AAASQHHLPI
EPDYKFSTLL MMLKDMHDSK TKEQRLMTAQ NLASYRTPDR GDCSTSNPGG TSKVLVLGGS
TQNSEKTGAG TQDTVRLSSS GGDSALSGEL SSLSSLASDK RDSTCGKSLN CVPRRNCGRV
RPSSKLRETV SAQMAKPLVN PKALKTERKR KVNQLPAVTI GANRLGDKES EGSLNGPSGG
AGDPGKEESP QLMGHLRNAD THFSDAQFDS KIKQPDPDKV LEKEPSFENR KGPELSSEVN
SENDETHGVS QVVPKKRWQR LNQRRPKPGK RTNRFREKEN SEGAFAALLP GEPVQKGRDD
YLEQRAPPTS ILEDSVADPN HGGHSDSVGP RLSVCDKSSV SMGDVEKETG IPSLTPQTKL
PEPAIRSEKK RLRKPSKWLL EYTEEYDQIF APKKKHKKVQ EQVHKVSSRC EDESLLARCR
SSAQNKQVDE NSLISTKEEP PVLEREAPFL EGPLAQSDLG VGHAELPQLT LSVPVAPEVS
PQPALESEEL LVKTPGNYES KRQRKPTKKL LESNDLDPGF MPKKGDLGLS RKCFEVGRLE
NSIADSRATS HLKAFGGGTT KIFDKPRKRK RQRHVTARVH YKKVKKEDSS KDTPRTEGEP
ILHRTAASPK ELPEDGVEQD SGMSASKKLQ GERGGGAALK ENVCQNCEKL GELLLCEAQC
CGAFHLECLG LTEMPRGKFI CNECRTGIHT CFVCKQSGED VKRCLLPLCG KFYHEECVQK
YPPTVTQNKG FRCPLHICIT CHAANPANVS ASKGRLMRCV RCPVAYHAND FCLAAGSKIL
ASNSIICPNH FTPRRGCRNH EHVNVSWCFV CSEGGSLLCC DSCPAAFHRE CLNIDIPEGN
WYCNDCKAGK KPHYREIVWV KVGRYRWWPA EICHPRAVPS NIDKMRHDVG EFPVLFFGSN
DYLWTHQARV FPYMEGDVSS KDKMGKGVDG TYKKALQEAA ARFEELKAQK ELRQLQEDRK
NDKKPPPYKH IKVNRPIGRV QIFTADLSEI PRCNCKATDE NPCGIDSECI NRMLLYECHP
TVCPAGGRCQ NQCFSKRQYP DVEIFRTLQR GWGLRTKTDI KKGEFVNEYV GELIDEEECR
ARIRYAQEHD ITNFYMLTLD KDRIIDAGPK GNYARFMNHC CQPNCETQKW SVNGDTRVGL
FALSDIKAGT ELTFNYNLEC LGNGKTVCKC GAPNCSGFLG VRPKNQPIVT EEKSRKFKKK
QHGKRRSQGE VTKEREDECF SCGDAGQLVS CKKPGCPKVY HADCLNLTKR PAGKWECPWH
QCDVCGKEAA SFCEMCPSSF CKQHREGMLF ISKLDGRLSC TEHDPCGPNP LEPGEIREYV
PPPVPLPPSP STQLTEQSSG RAPQGPKMSD QPPTDATQML PVSKKALTGT CQRPLPPEKP
ERTESSSHLL DRGRDVAGSG TKSQSLVSSQ RPQDRPPAKE GPRPQPSDKS SPVTKPSSSS
SVRPLPLERP LGMIDPRLDK SIGAASPKSQ SVEKTPALTG LRLSPPDRLL TTSSPKSQVS
DRPPDKSHAS LTQRLPPPEK VLSAVVQSLV AKEKALRPVD QNTQSKHRAA LVMDRMDLTP
RQKERAASPH DVTPQADEKM PVLESSSWPS SKGLGHMPRA VEKGSVSDSL QPAGKAASPS
EHPWQAVKSL TQARLLSPPS AKAFLYESAT QASGRAPVGA EQTLGPPNPA PGLVKQVKQL
SRGLTAKSGQ SFRSLGKIPA SLPNEEKKLA TTEQSPWGLG KASPGAGLWP IVAGQTLAQA
CWSAGGTQTL AQTCWSLGRG QDPKPEQNTI QAFNQAPSSR KCGESEKK
//