ID A0A286XRF6_CAVPO Unreviewed; 1389 AA.
AC A0A286XRF6;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Nuclear receptor binding SET domain protein 3 {ECO:0000313|Ensembl:ENSCPOP00000028068.1};
GN Name=NSD3 {ECO:0000313|Ensembl:ENSCPOP00000028068.1};
OS Cavia porcellus (Guinea pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Caviidae;
OC Cavia.
OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000028068.1, ECO:0000313|Proteomes:UP000005447};
RN [1] {ECO:0000313|Proteomes:UP000005447}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2N {ECO:0000313|Proteomes:UP000005447};
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSCPOP00000028068.1}
RP IDENTIFICATION.
RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000028068.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAKN02052463; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02052464; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02052465; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02052466; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02052467; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_013000148.1; XM_013144694.1.
DR Ensembl; ENSCPOT00000040284.1; ENSCPOP00000028068.1; ENSCPOG00000014573.4.
DR GeneID; 100731623; -.
DR CTD; 54904; -.
DR VEuPathDB; HostDB:ENSCPOG00000014573; -.
DR GeneTree; ENSGT00940000155355; -.
DR OrthoDB; 950362at2759; -.
DR Proteomes; UP000005447; Unassembled WGS sequence.
DR Bgee; ENSCPOG00000014573; Expressed in pituitary gland and 12 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0042054; F:histone methyltransferase activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd15649; PHD1_NSD3; 1.
DR CDD; cd15652; PHD2_NSD3; 1.
DR CDD; cd15658; PHD4_NSD3; 1.
DR CDD; cd15661; PHD5_NSD3; 1.
DR CDD; cd20163; PWWP_NSD3_rpt1; 1.
DR CDD; cd20166; PWWP_NSD3_rpt2; 1.
DR CDD; cd19212; SET_NSD3; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR041306; C5HCH.
DR InterPro; IPR047456; PHD2_NSD3.
DR InterPro; IPR047458; PHD4_NSD3.
DR InterPro; IPR047527; PHD5_NSD3.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR047451; PWWP_NSD3_rpt1.
DR InterPro; IPR047453; PWWP_NSD3_rpt2.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR047461; SET_NSD3.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR001841; Znf_RING.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR22884:SF473; HISTONE-LYSINE N-METHYLTRANSFERASE NSD3; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF17982; C5HCH; 1.
DR Pfam; PF00855; PWWP; 2.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00249; PHD; 5.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00293; PWWP; 2.
DR SMART; SM00184; RING; 2.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50812; PWWP; 2.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 3.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000005447};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 270..333
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 701..748
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 863..907
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 912..974
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 1045..1095
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 1097..1214
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1221..1237
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT DOMAIN 1273..1320
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT REGION 121..151
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 183..242
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 344..367
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 402..465
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 543..657
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 670..692
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 986..1013
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 124..142
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 202..242
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 348..367
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 426..446
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 447..465
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 543..574
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 575..604
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 607..621
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 639..657
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1389 AA; 156118 MW; 25F44B9490E7F70C CRC64;
MDFSFSFMQG IMGNTIQQPP QLIDSANIRQ EDAFDNNSDI VEDGGQTPYE ATLQQGFQYP
PTTEDLPPLT NGYPPSIGMY ETQTKYQSYN QYPNGSANGF GAVRNFSPTD YYHSEIPNTR
PHEILEKPSP PQPPPPPSVP QTVIPKKTGS PEIKLKITKT IQNGRELFES SLCGDLLNEV
QATEHTKSKH ESRKEKRKKS NKHDSSRSEE RKSHKIPKLE PEEQNRPNER VDPTPEKPRE
EAVLKDTALV QPVLTSAPTT EVTTGIKFQV GDLVWSKVGT YPWWPCMVSS DPQLEVHTKI
NTRGAREYHV QFFSNQPERA WVHEKRVREY KGHKQYEELL AEATKQASNH SEKQKIRKPR
PQRERAQWDI GIAHAEKALK MTREERIEQY TFIYIDKQPE EALSQTKKNA ASKTEVKKPR
RPRSVLNAQP EQTNAGEVAS SQSSSDLRRH SQRRHTSVEE EDSPPVKIAW KTAAARKSLP
ASITMHKGSL DLQKCNMSPV VKIEQVFALQ NATGDGKFID QFVYSTKGIG NKTEISVRGQ
DRLIISSPNQ RSEKPTQNVS SPEATSGPTG SVEKKQQRRS IRTRSESEKS SEAVPKKKIK
KEQVETVPQA TVKTGLQKGA SEISDSCKPL KKRSRASTDV EMTSSAYRDA SDSDSRGLSD
LQVGFGKQVD SPSAAADADV SDVQSMDSSL SRRGIGMSRK DTVCQICESS GDSLVPCEGE
CYRYFHLECL GWTSVPDGKF TCMECKTGQH PCFSCKVPGE DVKRCSVGAC GKFYHEACVR
KFSTAIFESK GFRCPQHCCS ACSMEKDIHK ASKGRMMRCL RCPVAYHSAD ACIAAGSMFV
SSYILICSDH SKRSNNSAAA VNVGFCFVCA RGGRLLCCES CPASFHPECL NIDTPEGCWN
CNDCKAGKKL HYKQIVWVKL GNYRWWPAEI CNPRSVPLNI QSLKHDLGDF PVFFFGSHDY
YWVHQGRVFP YVEGDKSFAE GQTSINKTFK KALEEAAKRF QELKAQRESK EALEVEKNSR
KPPPYKHIKA NKVIGKVQIH VADLSEIPRC NCKPADENPC GLESECLNRM LQYECHPQVC
PAGDRCQNQC FTKRLYPDAE IIKTERRGWG LRTKRSIKKG EFVNEYVGEL IDEEECRLRI
KRAHENSVTN FYMLTVTKDR IIDAGPKGNY SRFMNHSCNP NCETQKWTVN GDVRVGLFAL
CDIPAGMELT FNYNLDCLGN GRTECHCGAD NCSGFLGVRP KAACAATAEE KAKNAKLKQK
RRKVKTEPKQ MHEDYCFQCG DGGELVMCDK KDCPKAYHLL CLNLTQPPYG KWECPWHQCN
MCSSAAVSFC EFCPHSFCKD HGKGALVASA LEGRLCCSEH DPVCPVSPEY WSKIKCKWES
QDRGEEAQE
//