ID A0A096LSX8_POEFO Unreviewed; 1448 AA.
AC A0A096LSX8;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE SubName: Full=Nuclear receptor binding SET domain protein 3 {ECO:0000313|Ensembl:ENSPFOP00000022269.1};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000022269.1, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000022269.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01009758; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR Ensembl; ENSPFOT00000031748.1; ENSPFOP00000022269.1; ENSPFOG00000011218.2.
DR GeneTree; ENSGT00940000155355; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0042054; F:histone methyltransferase activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd15565; PHD2_NSD; 1.
DR CDD; cd15661; PHD5_NSD3; 1.
DR CDD; cd20163; PWWP_NSD3_rpt1; 1.
DR CDD; cd20166; PWWP_NSD3_rpt2; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 4.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR041306; C5HCH.
DR InterPro; IPR047527; PHD5_NSD3.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR047451; PWWP_NSD3_rpt1.
DR InterPro; IPR047453; PWWP_NSD3_rpt2.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR22884:SF473; HISTONE-LYSINE N-METHYLTRANSFERASE NSD3; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF17982; C5HCH; 1.
DR Pfam; PF00628; PHD; 1.
DR Pfam; PF00855; PWWP; 2.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00249; PHD; 4.
DR SMART; SM00293; PWWP; 2.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50812; PWWP; 2.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 2.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 285..348
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 703..750
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 970..1032
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT DOMAIN 1104..1154
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 1156..1273
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1280..1296
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT DOMAIN 1332..1379
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT REGION 24..69
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 123..160
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 196..280
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 410..478
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 511..700
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1065..1087
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1422..1448
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 133..148
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 216..242
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 243..259
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..458
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 553..577
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 578..610
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 642..661
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 680..700
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1065..1081
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1434..1448
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1448 AA; 161121 MW; 79E26C82AC975BEA CRC64;
MDFSFSFMQG IMGNTIQQPP QLIDSANIRQ DGSCDTGSDP GEDSGPSYDA ALDAEFSYPP
SASEDMPQVP NGYPPGLGLY EPQAKFSMYS QFPNGSANGY GAIRSYGDHG LLPGEGTVLR
GPGLHERPLS PVSPPLSVHP PPSLPALAPS NTPGGGVLKK TSSPEIKLKI IKTYQNGKEL
FESALCGDLL QELQVTGQTQ RRHERKKEKR KKSARLQLQA QQSVERSQEQ TASTAQTEES
VQPQPREPPP LPPSDPPPAQ AEKTQRTVIK TEPKTPKAKS EEFVIGDLVW SKVGTYPWWP
CMVSSDPQMK VHTRINTRGH REYHVQFFGS VAERAWIHEK RIVTYQGKQQ FDELQAETLR
KATNPVERQK LLKPIPQRER TQWEVGVGHA EDAFLMTRQE RIDNYTFIYV DPDPEEAPPT
KKPSIRAEKR SRRSSGSVGK KEDVGVKSPD REQPPRRQLP RRQCSISNTE DSPPPVVRPW
KTAAARKLLP LSITMKRLNV EITKCDWPLL QKNITPSQDS RAKPEPSPEE EEEEGDEGEE
ESDERRGSPA SRRSESANRQ NSSPGSPSSS PQGSQERKPQ RRSVRSRSES ERGADPIPKK
KTKKEQAEMA PETTLKTGSQ KGASEISDAC KPLKKRSRAS TDVEMASSQY RDTSDSDSRG
LNDPQGLFGK SLDSPAAADA DASDTQSVDS GLSRQDSGTG QRDTVCQICE AYGEGLVVCE
GDCSRQFHLE CLGLSSPPEG RFTCTECRNG NHPCFSCKSV DPEVTRCSVS GCGCFYHEDC
VRKLPGTTSG SGGGFCCPQH SCSTCCLERD LQRASKGRLI RCIRCPLAYH PSDGCLAAGS
VILTHHIMIC SNHGSAKKNG LLSSPVNVGW CFLCARGLLV QDLTDTILSS YAYKSHYLLT
ESNRAELKLP MIPSPSSATK KNVGKDPNVG GKLLCCDSCP ASFHPECLEM EMPEGAWSCS
DCRAGKKPHY KQIVWVKLGN YRWWPAEICN PRLVPSNIQS LRHDIGDFPV FFFGSHDYYW
INQGRVFPYV ENDKNFVTGQ ININKTFKKA LEEAARRFQE LKAQRESREA LEQERNSRKP
PPYKIIKQSN KPVGKVQMHV ADLSEIPRCN CKPTEEHPCS LDSQCLNRML QYECHPQVCP
AGDRCENQCF SKRLYAETEV VKTEGCGWGL RTNQTLRKGD FVTEYVGEVI DSEECQQRIK
RAHENHVTNF YMLTLTKDRV IDAGPKGNSS RFMNHSCSPN CETQKWTVNG DVRIGLFTLC
NIEAGTELTF NYNLHCVGNR RMSCHCGSDN CSGFLGVQPT SAVVMEKEEK AKNAKLKPKK
RKLRPEGKHT HEYVCFCCGE GGELVMCDRK DCPKAYHLLC LNLTKPPYGR WECPWHDCSV
CGASASSLCD FCPRSFCRDH EAGALTASAL DDRLCCSNHD PLSPLGSDST QPRRFDRSPV
RVKEESKE
//