ID K7DBF3_PANTR Unreviewed; 2428 AA.
AC K7DBF3; A0A2J8J5S1;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 70.
DE SubName: Full=Nuclear receptor binding SET domain protein 1 {ECO:0000313|EMBL:JAA30527.1, ECO:0000313|Ensembl:ENSPTRP00000067069.1};
GN Name=NSD1 {ECO:0000313|EMBL:JAA30527.1,
GN ECO:0000313|Ensembl:ENSPTRP00000067069.1, ECO:0000313|VGNC:VGNC:6953};
OS Pan troglodytes (Chimpanzee).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9598 {ECO:0000313|EMBL:JAA30527.1};
RN [1] {ECO:0000313|Ensembl:ENSPTRP00000067069.1, ECO:0000313|Proteomes:UP000002277}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16136131; DOI=10.1038/nature04072;
RG Chimpanzee sequencing and analysis consortium;
RT "Initial sequence of the chimpanzee genome and comparison with the human
RT genome.";
RL Nature 437:69-87(2005).
RN [2] {ECO:0000313|EMBL:JAA30527.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Skeletal muscle {ECO:0000313|EMBL:JAA39912.1}, and Skin
RC {ECO:0000313|EMBL:JAA30527.1};
RA Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.;
RT "De novo assembly of the reference chimpanzee transcriptome from NextGen
RT mRNA sequences.";
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSPTRP00000067069.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACZ04060243; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AACZ04060244; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; GABD01002573; JAA30527.1; -; mRNA.
DR EMBL; GABE01004827; JAA39912.1; -; mRNA.
DR RefSeq; XP_016809818.1; XM_016954329.1.
DR RefSeq; XP_016809819.1; XM_016954330.1.
DR RefSeq; XP_016809820.1; XM_016954331.1.
DR Ensembl; ENSPTRT00000078053.1; ENSPTRP00000067069.1; ENSPTRG00000017575.6.
DR GeneID; 471754; -.
DR CTD; 64324; -.
DR VGNC; VGNC:6953; NSD1.
DR GeneTree; ENSGT00940000155027; -.
DR OrthoDB; 950362at2759; -.
DR Proteomes; UP000002277; Chromosome 5.
DR Bgee; ENSPTRG00000017575; Expressed in lymph node and 21 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046975; F:histone H3K36 methyltransferase activity; IEA:UniProt.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd15648; PHD1_NSD1_2; 1.
DR CDD; cd15650; PHD2_NSD1; 1.
DR CDD; cd15653; PHD3_NSD1; 1.
DR CDD; cd15656; PHD4_NSD1; 1.
DR CDD; cd15659; PHD5_NSD1; 1.
DR CDD; cd20161; PWWP_NSD1_rpt1; 1.
DR CDD; cd20164; PWWP_NSD1_rpt2; 1.
DR CDD; cd19210; SET_NSD1; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR041306; C5HCH.
DR InterPro; IPR047426; PHD1_NSD1_2.
DR InterPro; IPR047428; PHD2_NSD1.
DR InterPro; IPR047429; PHD3_NSD1.
DR InterPro; IPR047430; PHD4_NSD1.
DR InterPro; IPR047432; PHD5_NSD1.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR047423; PWWP_NSD1_rpt2.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR047433; SET_NSD1.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR22884:SF312; HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-36 SPECIFIC; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF17982; C5HCH; 1.
DR Pfam; PF00628; PHD; 1.
DR Pfam; PF00855; PWWP; 2.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00249; PHD; 5.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00293; PWWP; 2.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50812; PWWP; 2.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 2.
PE 2: Evidence at transcript level;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Receptor {ECO:0000313|EMBL:JAA30527.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 54..119
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 1275..1321
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 1439..1483
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 1488..1550
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 1622..1672
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 1674..1791
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1798..1814
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 1..42
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 218..244
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 603..622
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 667..763
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 800..820
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 843..865
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 975..1004
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1027..1076
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1114..1160
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1212..1267
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1823..1843
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1945..2154
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2196..2231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2328..2348
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2397..2428
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..42
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 605..622
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 667..719
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1027..1050
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1132..1146
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1214..1232
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1242..1260
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1949..1964
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1985..2000
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2012..2026
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2063..2081
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2127..2141
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2407..2422
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2428 AA; 267475 MW; 69C8606901CEF1BC CRC64;
MPLKTRTALS DDPDSSTSTL GNMLELPGTS SSSTSQELPF CQPKKKSTPL KYEVGDLIWA
KFKRRPWWPC RICSDPLINT HSKMKVSNRR PYRQYYVEAF GDPSERAWVA GKAIVMFEGR
HQFEELPVLR RRGKQKEKGY RHKVPQKILS KWEASVGLAE QYDVPKGSKN RKCIPGSIKL
DSEEDMPFED CTNDPESEHD LLLNGCLKSL AFDSEHSADE KEKPCAKSRA RKSSDNPKRT
SVKKGHIQFE AHKDERRGKI PENLGLNFIS GDISDTQASN ELSRIANSLT GSNTAPGSFL
FSSCGKNTAK KEFETSNGDS LLGLPEGALI SKCSREKNKP QRSLVCGSKV KLCYIGAGDE
EKRSDSISIC TTSDDGSSDL DPIEHSSESD NSVLEIPDAF DRTENMLSMQ KNEKIKYSRF
AATNTRVKAK QKPLISNSHT DHLMGCTKSA EPGTETSQVN LSDLKASTLV HKPQSDFTND
ALSPKFNMSS SISSENSLIK GGAANQALLH SKSKQPKFRS IKCKHKENPV MVEPPVINEE
CSLKCCSSDT KGSPLASISK SGKVDGLKLL NNMHEKTRDS SDIETAVVKH VLSELKELSY
RSLGEDVSDS GTSKPSKPLL FSSASSQNHI PIEPDYKFST LLMMLKDMHD SKTKEQRLMT
AQNLVSYRSP GRGDCSTNSP VGVSKVLVSG GSTHNSEKKG DGTQNSANPS PSGGDSALSG
ELSASLPGLV SDKRDLPASG KSRSDCVTRR NCGRSKPSSK LRDAFSAQMV KNTVNRKALK
TERKRKLNQL PSVTLDAVLQ GDREHGGSLR GGAEDPSKED PLQIMGHLTS EDGDHFSDVH
FDSKVKQSDP GKISEKGLSF ENGKGPELDS VMNSENDELN GVNQVVPKKR WQRLNQRRTK
PRKRMNRFKE KENSECAFRV LLPSDPVQEG RDEFPEHRTP PSASILEEPL TEQNHADCLD
SVGPRLNVCD KSSASIGDME KEPGIPSLTP QAELPEPAVR SEKKRLRKPS KWLLEYTEEY
DQIFAPKKKQ KKVQEQVHKV SSRCEEESLL ARGRSSAQNK QVDENSLIST KEEPPVLERE
APFLEGPLAQ SELGGGHAEL PQLTLSVPVA PEVSPRPALE SEELLVKTPG NYESKRQRKP
TKKLLESNDL DPGFMPKKGD LGLSKKCYEA GHLENGITES CATSYSKDFG GGTTKIFDKP
RKRKRQRHAA AKMQCKKVKN DDSSKEIPGS EGELMPHRTA TSPKETVEEG VEHDPGMPAS
KKMQGERGGG AALKENVCQN CEKLGELLLC EAQCCGAFHL ECLGLTEMPR GKFICNECRT
GIHTCFVCKQ SGEDVKRCLL PLCGKFYHEE CVQKYPPTVM QNKGFRCSLH ICITCHAANP
ANVSASKGRL MRCVRCPVAY HANDFCLAAG SKILASNSII CPNHFTPRRG CRNHEHVNVS
WCFVCSEGGS LLCCDSCPAA FHRECLNIDI PEGNWYCNDC KAGKKPHYRE IVWVKVGRYR
WWPAEICHPR AVPSNIDKMR HDVGEFPVLF FGSNDYLWTH QARVFPYMEG DVSSKDKMGK
GVDGTYKKAL QEAAARFEEL KAQKELRQLQ EDRKNDKKPP PYKHIKVNRP IGRVQIFTAD
LSEIPRCNCK ATDENPCGID SECINRMLLY ECHPTVCPAG GRCQNQCFSK RQYPEVEIFR
TLQRGWGLRT KTDIKKGEFV NEYVGELIDE EECRARIRYA QEHDITNFYM LTLDKDRIID
AGPKGNYARF MNHCCQPNCE TQKWSVNGDT RVGLFALSDI KAGTELTFNY NLECLGNGKT
VCKCGAPNCS GFLGVRPKNQ PIATEEKSKK FKKKQQGKRR TQGEITKERE DECFSCGDAG
QLVSCKKPGC PKVYHADCLN LTKRPAGKWE CPWHQCDICG KEAASFCEMC PSSFCKQHRE
GMLFISKLDG RLSCTEHDPC GPNPLEPGEI REYVPPPVPL PPGPSTHLAE QSTGMAAQAP
KMSDKPPADT NQTLSLSKKA LAGTCQRPLL PERPLERTDS RPQPLDKVRD LAGSGTKSQS
LVSSQRPLDR PPAVAGPRPQ LSDKPSPVTS PSSSPSVRSQ PLERPLGTAD PRLDKSIGAA
SPRPQSLEKT PVPTGLRLPP PDRLLITSSP KPQTSDRPTD KPHASLSQRL PPPEKVLSAV
VQTLVAKEKA LRPVDQNTQS KNRAALVMDL IDLTPRQKER AASPHEVTPQ ADEKMPVLES
SSWPASKGLG HMPRAVEKGC VSDPLQTSGK AAAPSEDPWQ AVKSLTQARL LSQPPAKAFL
YEPTTQASGR ASAGAEQTPG PLSQSLGLVK QAKQMVGGQQ LPALAAKSGQ SFRSLGKAPA
SLPTEEKKLV TTEQSPWALG KASSRAGLWP IVAGQTLAQS CWSAGSTQTL AQTCWSLGRG
QDPKPEQNTL PALNQAPSSH KCAESEQK
//