GenomeNet

Database: UniProt
Entry: K7DBF3_PANTR
LinkDB: K7DBF3_PANTR
Original site: K7DBF3_PANTR 
ID   K7DBF3_PANTR            Unreviewed;      2428 AA.
AC   K7DBF3; A0A2J8J5S1;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   27-MAR-2024, entry version 70.
DE   SubName: Full=Nuclear receptor binding SET domain protein 1 {ECO:0000313|EMBL:JAA30527.1, ECO:0000313|Ensembl:ENSPTRP00000067069.1};
GN   Name=NSD1 {ECO:0000313|EMBL:JAA30527.1,
GN   ECO:0000313|Ensembl:ENSPTRP00000067069.1, ECO:0000313|VGNC:VGNC:6953};
OS   Pan troglodytes (Chimpanzee).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Pan.
OX   NCBI_TaxID=9598 {ECO:0000313|EMBL:JAA30527.1};
RN   [1] {ECO:0000313|Ensembl:ENSPTRP00000067069.1, ECO:0000313|Proteomes:UP000002277}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=16136131; DOI=10.1038/nature04072;
RG   Chimpanzee sequencing and analysis consortium;
RT   "Initial sequence of the chimpanzee genome and comparison with the human
RT   genome.";
RL   Nature 437:69-87(2005).
RN   [2] {ECO:0000313|EMBL:JAA30527.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Skeletal muscle {ECO:0000313|EMBL:JAA39912.1}, and Skin
RC   {ECO:0000313|EMBL:JAA30527.1};
RA   Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.;
RT   "De novo assembly of the reference chimpanzee transcriptome from NextGen
RT   mRNA sequences.";
RL   Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|Ensembl:ENSPTRP00000067069.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AACZ04060243; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AACZ04060244; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; GABD01002573; JAA30527.1; -; mRNA.
DR   EMBL; GABE01004827; JAA39912.1; -; mRNA.
DR   RefSeq; XP_016809818.1; XM_016954329.1.
DR   RefSeq; XP_016809819.1; XM_016954330.1.
DR   RefSeq; XP_016809820.1; XM_016954331.1.
DR   Ensembl; ENSPTRT00000078053.1; ENSPTRP00000067069.1; ENSPTRG00000017575.6.
DR   GeneID; 471754; -.
DR   CTD; 64324; -.
DR   VGNC; VGNC:6953; NSD1.
DR   GeneTree; ENSGT00940000155027; -.
DR   OrthoDB; 950362at2759; -.
DR   Proteomes; UP000002277; Chromosome 5.
DR   Bgee; ENSPTRG00000017575; Expressed in lymph node and 21 other cell types or tissues.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0046975; F:histone H3K36 methyltransferase activity; IEA:UniProt.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   CDD; cd15648; PHD1_NSD1_2; 1.
DR   CDD; cd15650; PHD2_NSD1; 1.
DR   CDD; cd15653; PHD3_NSD1; 1.
DR   CDD; cd15656; PHD4_NSD1; 1.
DR   CDD; cd15659; PHD5_NSD1; 1.
DR   CDD; cd20161; PWWP_NSD1_rpt1; 1.
DR   CDD; cd20164; PWWP_NSD1_rpt2; 1.
DR   CDD; cd19210; SET_NSD1; 1.
DR   Gene3D; 2.30.30.140; -; 2.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR   InterPro; IPR006560; AWS_dom.
DR   InterPro; IPR041306; C5HCH.
DR   InterPro; IPR047426; PHD1_NSD1_2.
DR   InterPro; IPR047428; PHD2_NSD1.
DR   InterPro; IPR047429; PHD3_NSD1.
DR   InterPro; IPR047430; PHD4_NSD1.
DR   InterPro; IPR047432; PHD5_NSD1.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR000313; PWWP_dom.
DR   InterPro; IPR047423; PWWP_NSD1_rpt2.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR047433; SET_NSD1.
DR   InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR   InterPro; IPR011011; Znf_FYVE_PHD.
DR   InterPro; IPR001965; Znf_PHD.
DR   InterPro; IPR019787; Znf_PHD-finger.
DR   InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR   PANTHER; PTHR22884:SF312; HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-36 SPECIFIC; 1.
DR   PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR   Pfam; PF17907; AWS; 1.
DR   Pfam; PF17982; C5HCH; 1.
DR   Pfam; PF00628; PHD; 1.
DR   Pfam; PF00855; PWWP; 2.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM00570; AWS; 1.
DR   SMART; SM00249; PHD; 5.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00293; PWWP; 2.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR   PROSITE; PS51215; AWS; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50812; PWWP; 2.
DR   PROSITE; PS50280; SET; 1.
DR   PROSITE; PS01359; ZF_PHD_1; 1.
DR   PROSITE; PS50016; ZF_PHD_2; 2.
PE   2: Evidence at transcript level;
KW   Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Receptor {ECO:0000313|EMBL:JAA30527.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW   ProRule:PRU00146}.
FT   DOMAIN          54..119
FT                   /note="PWWP"
FT                   /evidence="ECO:0000259|PROSITE:PS50812"
FT   DOMAIN          1275..1321
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          1439..1483
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50016"
FT   DOMAIN          1488..1550
FT                   /note="PWWP"
FT                   /evidence="ECO:0000259|PROSITE:PS50812"
FT   DOMAIN          1622..1672
FT                   /note="AWS"
FT                   /evidence="ECO:0000259|PROSITE:PS51215"
FT   DOMAIN          1674..1791
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
FT   DOMAIN          1798..1814
FT                   /note="Post-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50868"
FT   REGION          1..42
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          218..244
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          603..622
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          667..763
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          800..820
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          843..865
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          975..1004
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1027..1076
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1114..1160
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1212..1267
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1823..1843
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1945..2154
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2196..2231
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2328..2348
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2397..2428
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        9..42
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        605..622
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        667..719
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1027..1050
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1132..1146
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1214..1232
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1242..1260
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1949..1964
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1985..2000
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2012..2026
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2063..2081
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2127..2141
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2407..2422
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2428 AA;  267475 MW;  69C8606901CEF1BC CRC64;
     MPLKTRTALS DDPDSSTSTL GNMLELPGTS SSSTSQELPF CQPKKKSTPL KYEVGDLIWA
     KFKRRPWWPC RICSDPLINT HSKMKVSNRR PYRQYYVEAF GDPSERAWVA GKAIVMFEGR
     HQFEELPVLR RRGKQKEKGY RHKVPQKILS KWEASVGLAE QYDVPKGSKN RKCIPGSIKL
     DSEEDMPFED CTNDPESEHD LLLNGCLKSL AFDSEHSADE KEKPCAKSRA RKSSDNPKRT
     SVKKGHIQFE AHKDERRGKI PENLGLNFIS GDISDTQASN ELSRIANSLT GSNTAPGSFL
     FSSCGKNTAK KEFETSNGDS LLGLPEGALI SKCSREKNKP QRSLVCGSKV KLCYIGAGDE
     EKRSDSISIC TTSDDGSSDL DPIEHSSESD NSVLEIPDAF DRTENMLSMQ KNEKIKYSRF
     AATNTRVKAK QKPLISNSHT DHLMGCTKSA EPGTETSQVN LSDLKASTLV HKPQSDFTND
     ALSPKFNMSS SISSENSLIK GGAANQALLH SKSKQPKFRS IKCKHKENPV MVEPPVINEE
     CSLKCCSSDT KGSPLASISK SGKVDGLKLL NNMHEKTRDS SDIETAVVKH VLSELKELSY
     RSLGEDVSDS GTSKPSKPLL FSSASSQNHI PIEPDYKFST LLMMLKDMHD SKTKEQRLMT
     AQNLVSYRSP GRGDCSTNSP VGVSKVLVSG GSTHNSEKKG DGTQNSANPS PSGGDSALSG
     ELSASLPGLV SDKRDLPASG KSRSDCVTRR NCGRSKPSSK LRDAFSAQMV KNTVNRKALK
     TERKRKLNQL PSVTLDAVLQ GDREHGGSLR GGAEDPSKED PLQIMGHLTS EDGDHFSDVH
     FDSKVKQSDP GKISEKGLSF ENGKGPELDS VMNSENDELN GVNQVVPKKR WQRLNQRRTK
     PRKRMNRFKE KENSECAFRV LLPSDPVQEG RDEFPEHRTP PSASILEEPL TEQNHADCLD
     SVGPRLNVCD KSSASIGDME KEPGIPSLTP QAELPEPAVR SEKKRLRKPS KWLLEYTEEY
     DQIFAPKKKQ KKVQEQVHKV SSRCEEESLL ARGRSSAQNK QVDENSLIST KEEPPVLERE
     APFLEGPLAQ SELGGGHAEL PQLTLSVPVA PEVSPRPALE SEELLVKTPG NYESKRQRKP
     TKKLLESNDL DPGFMPKKGD LGLSKKCYEA GHLENGITES CATSYSKDFG GGTTKIFDKP
     RKRKRQRHAA AKMQCKKVKN DDSSKEIPGS EGELMPHRTA TSPKETVEEG VEHDPGMPAS
     KKMQGERGGG AALKENVCQN CEKLGELLLC EAQCCGAFHL ECLGLTEMPR GKFICNECRT
     GIHTCFVCKQ SGEDVKRCLL PLCGKFYHEE CVQKYPPTVM QNKGFRCSLH ICITCHAANP
     ANVSASKGRL MRCVRCPVAY HANDFCLAAG SKILASNSII CPNHFTPRRG CRNHEHVNVS
     WCFVCSEGGS LLCCDSCPAA FHRECLNIDI PEGNWYCNDC KAGKKPHYRE IVWVKVGRYR
     WWPAEICHPR AVPSNIDKMR HDVGEFPVLF FGSNDYLWTH QARVFPYMEG DVSSKDKMGK
     GVDGTYKKAL QEAAARFEEL KAQKELRQLQ EDRKNDKKPP PYKHIKVNRP IGRVQIFTAD
     LSEIPRCNCK ATDENPCGID SECINRMLLY ECHPTVCPAG GRCQNQCFSK RQYPEVEIFR
     TLQRGWGLRT KTDIKKGEFV NEYVGELIDE EECRARIRYA QEHDITNFYM LTLDKDRIID
     AGPKGNYARF MNHCCQPNCE TQKWSVNGDT RVGLFALSDI KAGTELTFNY NLECLGNGKT
     VCKCGAPNCS GFLGVRPKNQ PIATEEKSKK FKKKQQGKRR TQGEITKERE DECFSCGDAG
     QLVSCKKPGC PKVYHADCLN LTKRPAGKWE CPWHQCDICG KEAASFCEMC PSSFCKQHRE
     GMLFISKLDG RLSCTEHDPC GPNPLEPGEI REYVPPPVPL PPGPSTHLAE QSTGMAAQAP
     KMSDKPPADT NQTLSLSKKA LAGTCQRPLL PERPLERTDS RPQPLDKVRD LAGSGTKSQS
     LVSSQRPLDR PPAVAGPRPQ LSDKPSPVTS PSSSPSVRSQ PLERPLGTAD PRLDKSIGAA
     SPRPQSLEKT PVPTGLRLPP PDRLLITSSP KPQTSDRPTD KPHASLSQRL PPPEKVLSAV
     VQTLVAKEKA LRPVDQNTQS KNRAALVMDL IDLTPRQKER AASPHEVTPQ ADEKMPVLES
     SSWPASKGLG HMPRAVEKGC VSDPLQTSGK AAAPSEDPWQ AVKSLTQARL LSQPPAKAFL
     YEPTTQASGR ASAGAEQTPG PLSQSLGLVK QAKQMVGGQQ LPALAAKSGQ SFRSLGKAPA
     SLPTEEKKLV TTEQSPWALG KASSRAGLWP IVAGQTLAQS CWSAGSTQTL AQTCWSLGRG
     QDPKPEQNTL PALNQAPSSH KCAESEQK
//
DBGET integrated database retrieval system