ID A0A3Q2DY07_CYPVA Unreviewed; 1250 AA.
AC A0A3Q2DY07;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=SET domain bifurcated histone lysine methyltransferase 1 {ECO:0000313|Ensembl:ENSCVAP00000024657.1};
OS Cyprinodon variegatus (Sheepshead minnow).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Cyprinodontidae;
OC Cyprinodon.
OX NCBI_TaxID=28743 {ECO:0000313|Ensembl:ENSCVAP00000024657.1, ECO:0000313|Proteomes:UP000265020};
RN [1] {ECO:0000313|Ensembl:ENSCVAP00000024657.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_015236739.1; XM_015381253.1.
DR AlphaFoldDB; A0A3Q2DY07; -.
DR STRING; 28743.ENSCVAP00000024657; -.
DR Ensembl; ENSCVAT00000003641.1; ENSCVAP00000024657.1; ENSCVAG00000008877.1.
DR GeneID; 107088829; -.
DR KEGG; cvg:107088829; -.
DR CTD; 768131; -.
DR GeneTree; ENSGT00940000157471; -.
DR OrthoDB; 2877903at2759; -.
DR Proteomes; UP000265020; Unplaced.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:UniProt.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd01395; HMT_MBD; 1.
DR CDD; cd10517; SET_SETDB1; 1.
DR CDD; cd20382; Tudor_SETDB1_rpt1; 1.
DR CDD; cd21181; Tudor_SETDB1_rpt2; 1.
DR Gene3D; 2.30.30.140; -; 3.
DR Gene3D; 2.170.270.10; SET domain; 2.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR040880; DUF5604.
DR InterPro; IPR025796; Hist-Lys_N-MeTrfase_SETDB1.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR047232; SETDB1/2-like_MBD.
DR InterPro; IPR002999; Tudor.
DR InterPro; IPR041292; Tudor_4.
DR InterPro; IPR041291; TUDOR_5.
DR PANTHER; PTHR46024; HISTONE-LYSINE N-METHYLTRANSFERASE EGGLESS; 1.
DR PANTHER; PTHR46024:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETDB1; 1.
DR Pfam; PF18300; DUF5604; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF18358; Tudor_4; 1.
DR Pfam; PF18359; Tudor_5; 1.
DR SMART; SM00391; MBD; 1.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SMART; SM00333; TUDOR; 2.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS51573; SAM_MT43_SUVAR39_1; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000265020};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 767..840
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 843..1225
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1234..1250
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 429..541
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 565..584
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 935..978
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 991..1102
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1122..1155
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 31..75
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 458..520
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 935..952
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 991..1011
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1012..1027
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1048..1067
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1076..1091
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1135..1152
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1250 AA; 138433 MW; B26F108068488848 CRC64;
MESGEGMEVD TWDPSLEDEL GVSLDELKKW IEEAVEQSEV VQKKKAQLKE LEEWVEQKEK
EEAETEKLLN DAYQSVTECE KLVKAVYESN GLVYRESSSE DEGGGGRRLP SEVIEIDDDE
DDDVIAVGCL VPPSKPLMPA KDPVLKDAST ALQKATQQVQ KVVQIVTKPS TGSALIRTAV
QSAAQPGIQT STPAIFVSHA PSQTMTKPNP NIKEDELKVG MSILGKKRTK TWHKGTLVAI
NPVGNGIFKY KVRFDKGKSL LSGNHVAFEY NPTLESLYVG ARVVAKYKDG NLVWLYAGIV
AEMPNNKNRM RFLIFFDDGY ASYVILPELY PVCRPLKRTW EDIEDASCRD FIEEYISAYP
SRPMVLLKVG QIIKTEWEGT WWKSKVEEVD SSLVKILFLD DKRSEWIYRG STRLEPMFNL
KLASANTHEK KLAGHQRTRP NMGALRSKGP VVQYTGVESV GGSPTKPQQS APSQPQATTL
IQQLQQRPQP QPIQQVQQSP QTQQTPQTPQ PSQSGQLPRV ENKHQMAKKS TSPFVPGVGG
THASKMLQAA SSNTSSLSTI KVVSTSNAPT SSFTSSYQRP TSTLVSPPPV VTHAMATIPQ
QPSYRAPTDR IFYLAHTCQP ACLNRVRPAK SDLHRGKNPL LTPLLYDFRR MTGRRKVNRK
MSFHVIYKAP CGLCLRSMGE IQNYLFQTQC DFIFLEMFCL DAYVLVDRPF QPQRPFYYIP
DITSGKEDIP LSCVNEIDTT PPPNVKYSKE RIPEDGVFIN TSSDFLVGCE CTDGCRDKSK
CSCHQLTLQA SGCTPGGQIN HSAGYSYKRL EECLPTGIYE CNKRCKCCAQ MCTNRLVQHG
LQVRLQLFKT QNKGWGIRCL DDIAKGSFVC IYAGKILTDD FADKEGLEMG DEYFANLDHI
ESVENFKEGY ESEAHCSDSE GSGVDVSRMK IQPSALVSSS ARKKAQSSSS SDDSNNEEEK
DSKSEDESDS SDDTFVKENF YTPGSVWRSY TTRGQAKGNK EGSQDSKDGL SVSAKGQEDD
KPPSMPEETG KSKVASWLTG QGLKKETGDN KSQIKTDLAK KQDVMTLSDS DDVQTISSGS
EDREKEREKV PPASVSGVTK KQVAVKSTRG IALKNTHSMM VKTAPSAGGV GPSGQGSKPG
QQGQSGGGTE NAPKNTRLFF DGEESCYIID AKLEGNLGRY LNHSCSPNLF VQNVFVDTHD
LRFPWVAFFA SKRIRAGTEL TWDYNYEVGS VEGKELLCCC GSTECRGRLL
//