ID F6V1Z3_XENTR Unreviewed; 1255 AA.
AC F6V1Z3;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 4.
DT 27-MAR-2024, entry version 93.
DE SubName: Full=Nuclear receptor binding SET domain protein 2 {ECO:0000313|Ensembl:ENSXETP00000005100};
GN Name=nsd2 {ECO:0000313|Ensembl:ENSXETP00000005100};
OS Xenopus tropicalis (Western clawed frog) (Silurana tropicalis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Amphibia;
OC Batrachia; Anura; Pipoidea; Pipidae; Xenopodinae; Xenopus; Silurana.
OX NCBI_TaxID=8364 {ECO:0000313|Ensembl:ENSXETP00000005100};
RN [1] {ECO:0000313|Ensembl:ENSXETP00000005100}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Nigerian {ECO:0000313|Ensembl:ENSXETP00000005100};
RX PubMed=20431018; DOI=10.1126/science.1183670;
RA Hellsten U., Harland R.M., Gilchrist M.J., Hendrix D., Jurka J.,
RA Kapitonov V., Ovcharenko I., Putnam N.H., Shu S., Taher L., Blitz I.L.,
RA Blumberg B., Dichmann D.S., Dubchak I., Amaya E., Detter J.C., Fletcher R.,
RA Gerhard D.S., Goodstein D., Graves T., Grigoriev I.V., Grimwood J.,
RA Kawashima T., Lindquist E., Lucas S.M., Mead P.E., Mitros T., Ogino H.,
RA Ohta Y., Poliakov A.V., Pollet N., Robert J., Salamov A., Sater A.K.,
RA Schmutz J., Terry A., Vize P.D., Warren W.C., Wells D., Wills A.,
RA Wilson R.K., Zimmerman L.B., Zorn A.M., Grainger R., Grammer T.,
RA Khokha M.K., Richardson P.M., Rokhsar D.S.;
RT "The genome of the Western clawed frog Xenopus tropicalis.";
RL Science 328:633-636(2010).
RN [2] {ECO:0000313|Ensembl:ENSXETP00000005100}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUN-2011) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F6V1Z3; -.
DR Ensembl; ENSXETT00000005100; ENSXETP00000005100; ENSXETG00000026376.
DR AGR; Xenbase:XB-GENE-988824; -.
DR Xenbase; XB-GENE-988824; nsd2.
DR eggNOG; KOG1081; Eukaryota.
DR HOGENOM; CLU_004494_2_1_1; -.
DR TreeFam; TF329088; -.
DR Bgee; ENSXETG00000026376; Expressed in blastula and 11 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0042054; F:histone methyltransferase activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd21991; HMG-box_NSD2; 1.
DR CDD; cd15651; PHD2_NSD2; 1.
DR CDD; cd15654; PHD3_NSD2; 1.
DR CDD; cd15660; PHD5_NSD2; 1.
DR CDD; cd20162; PWWP_NSD2_rpt1; 1.
DR CDD; cd20165; PWWP_NSD2_rpt2; 1.
DR CDD; cd19211; SET_NSD2; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR041306; C5HCH.
DR InterPro; IPR047443; HMG-box_NSD2.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR047439; PHD2_NSD2.
DR InterPro; IPR047441; PHD3_NSD2.
DR InterPro; IPR047442; PHD5_NSD2.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR047434; PWWP_NSD2_rpt1.
DR InterPro; IPR047435; PWWP_NSD2_rpt2.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR047437; SET_NSD2.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR001841; Znf_RING.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR22884:SF293; HISTONE-LYSINE N-METHYLTRANSFERASE NSD2; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF17982; C5HCH; 1.
DR Pfam; PF00505; HMG_box; 1.
DR Pfam; PF00628; PHD; 1.
DR Pfam; PF00855; PWWP; 2.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00398; HMG; 1.
DR SMART; SM00249; PHD; 5.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00293; PWWP; 2.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR SUPFAM; SSF47095; HMG-box; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50118; HMG_BOX_2; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50812; PWWP; 2.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
DR PROSITE; PS50089; ZF_RING_2; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00175}.
FT DOMAIN 192..256
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 409..458
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DOMAIN 618..664
FT /note="RING-type"
FT /evidence="ECO:0000259|PROSITE:PS50089"
FT DOMAIN 732..776
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 781..843
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 912..962
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 964..1081
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1088..1104
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT DNA_BIND 409..458
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 119..176
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 331..408
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 469..528
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1109..1133
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 123..138
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 153..168
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 351..376
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 378..400
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 473..503
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1255 AA; 141565 MW; 8C09E44F2BBEE1DE CRC64;
MKQVPHILGS PAVKPSNPEV SRECSVLLGR AQISASLQEG VIQKLNGHDS LPFIPTEKLK
DLTSRVFNGE SGAQEAKVRF EAQEVQGIET PPSTTPTKNG SPEIKLKITK TYMNGKPLFE
SSICGDRGEE EPQPEQKKSK RGRKRKIITS EANEKSSSEM PSHSESSKMK PVSESEFLEH
KNKEDFSLKY SVGDVVWSKV SGYPWWPCMV TSDPVLYSHT KVKGQKKNVR QYHVQFFGNA
PERAWISEKS MVPFKGENEY DHLCQESAKQ APTKAEKTKL LKPITGKIRA QWDIGIEQAK
EGILMTPEER KEKFTFIYIK DRPQLNPRVA SEVGVPVESS EKSTESPQKE GSSSYKRRRV
SRTPAASKEV DPADKTTTAK GPENSSDPKG MTSPASKKRS GAYTPRSRKG DAVNQFLVFC
QKHKDEVINE HPDASDEEIE ELLESQWNML SNKQKARYNT KFAILTSPKS EEDSGSLHGH
KRNHLKTTHN QTEDSEIQES PRKRLRMHNR KQRETCKPPK ARASAPVKNS PAMFSLSDAC
KPLKKRNRSS TLGSVALPVS KSSSPASLTE NEVCEKVGDL MLCEGVCCSA FHLSCIGLST
RPAGKYLCKE CTSGARSCFL CKESNRDVKR CIVPHCGKFY HESCLRKYPL AVFESRGFRC
PLHRCATCYF SNPSNPRASK GKMVRCVRCP LAYHGAESCI VAGCTALTST SIICSSHFAA
NKAKSHHAHI NVSWCFVCSN GGSLLCCESC PAAFHPDCLN IEMPDGSWFC NDCRLGKKPR
FNDIIWVKLG NYRWWPAEVC HPKNVPPNIQ KMKHAIGEFP VFFFGSKDYY WTHQARVFPY
MEGDRGSKHH GGKSIGKVFK NALQEAETRF CEIMRQREAK VTQENEKKPP PYKHIKVNKP
YGKVQVYTAD ISEIPKCNCK PSSEKPCGFD SECLNRMLMY ECHPQVCPAG DRCQNQCFNK
RQYPETKIIK TEGKGWGLIA TRDIKKGEFV NEYIGELIDE EECMYRIRHA QENDITHFYM
LTIDKDRIID AGPKGNFSRF MNHSCQPNCE TQKWSVNGDT RVGLFAVRDI PAGEELTFNY
NLDCLGNEKT ICRCGAPNCS GFLGDRPKNN TASSHEEKVK KPKKKQKKRR TKTDGKKQSE
DYCFRCNDGG ELVLCDRKFC TKAYHLSCLS LTKRPFGKWE CPWHHCDVCG KASVSCCSLC
PNSFCKGHYD DSQFTRTAEG QLCCPEHDPE ESVECEVVKN LPKKKAKQRK NKKTR
//