ID W5QI91_SHEEP Unreviewed; 1292 AA.
AC W5QI91;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE RecName: Full=Histone-lysine N-methyltransferase SETDB1 {ECO:0008006|Google:ProtNLM};
GN Name=SETDB1 {ECO:0000313|Ensembl:ENSOARP00000022443.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000022443.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000022443.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000022443.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000022443.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01004611; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01004612; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR SMR; W5QI91; -.
DR STRING; 9940.ENSOARP00000022443; -.
DR PaxDb; 9940-ENSOARP00000022443; -.
DR Ensembl; ENSOART00000022750.1; ENSOARP00000022443.1; ENSOARG00000020880.1.
DR eggNOG; KOG1141; Eukaryota.
DR HOGENOM; CLU_003279_1_0_1; -.
DR OMA; IRAVTNC; -.
DR Proteomes; UP000002356; Chromosome 1.
DR Bgee; ENSOARG00000020880; Expressed in submandibular lymph node and 52 other cell types or tissues.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:UniProt.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd01395; HMT_MBD; 1.
DR CDD; cd10517; SET_SETDB1; 1.
DR CDD; cd20382; Tudor_SETDB1_rpt1; 1.
DR CDD; cd21181; Tudor_SETDB1_rpt2; 1.
DR Gene3D; 2.30.30.140; -; 3.
DR Gene3D; 2.170.270.10; SET domain; 2.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR040880; DUF5604.
DR InterPro; IPR025796; Hist-Lys_N-MeTrfase_SETDB1.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR047232; SETDB1/2-like_MBD.
DR InterPro; IPR002999; Tudor.
DR InterPro; IPR041292; Tudor_4.
DR InterPro; IPR041291; TUDOR_5.
DR PANTHER; PTHR46024; HISTONE-LYSINE N-METHYLTRANSFERASE EGGLESS; 1.
DR PANTHER; PTHR46024:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETDB1; 1.
DR Pfam; PF18300; DUF5604; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF18358; Tudor_4; 1.
DR Pfam; PF18359; Tudor_5; 1.
DR SMART; SM00391; MBD; 1.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SMART; SM00333; TUDOR; 2.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS51573; SAM_MT43_SUVAR39_1; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 595..666
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 728..801
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 804..1267
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1276..1292
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 115..153
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 411..555
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 869..1161
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 37..64
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 438..454
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 455..470
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 476..538
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 539..553
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 894..911
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 916..945
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 952..968
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 970..1010
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1035..1054
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1079..1141
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1292 AA; 143224 MW; C63C9245D272E9EC CRC64;
MSSLPGCIGL AAATAAVESE EIAELQQSVV EELGISMEEL RQFIDEELEK MDCVQQRKKQ
LAELETWVIQ KESEVAHVDQ LFDDASRCSN IRAVTNCESL VKDFYSKLGL QYRDSSSEDE
ASRPTEIIEI PDEDDDVLSI DSGDAGNRTP KDQKLREAMA ALRKSAQDVQ KFMDAVNKKS
SSQDLHKGTL SQMPGELSKD GDLVVSMRIL GKKRTKTWHK GTLIAIQTVG SGKKYKVKFD
NKGKSLLSGN HIAYDYHPPA DKLYVGSRVV AKYKDGNQVW LYAGIVAETP NVKNKLRFLI
FFDDGYASYV TQSELYPICR PLKKTWEDIE DISCRDFIEE YITAYPNRPM VLLKSGQLIK
TEWEGTWWKS RVEEVDGSLV RILFLDDKRC EWIYRGSTRL EPMFSMKTSS ASALEKKQGG
QLRTRPNMGA VRSKGPVVQY TQDLTSTGTQ FKPTEPPQPT APPAPPGPAL SPQAGDSESL
EGQLAQSRKQ VAKKSTSFRP GSVGSGHSSP TSPALSENAP GGKPGINQTY RSPLGSTTSA
PAPPAPPAPP AFHGMLERAP AEPSYRAPME KLFYLPHVCS YTCLSRVRPM RSEQYRGKNP
LLVPLLYDFR RMTARRRVNR KMGFHVIYKT PCGLCLRTMQ EIERYLFETG CDFLFLEMFC
LDPYVLVDRK FQPYKPFYYI LDITYGKEDV PLSCVNEIDT TPPPQVAYSK ERIPGKGVFI
NTGPEFLVGC DCKDGCRDKS KCACHQLTIQ ATACTPGGQI NPNSGYQYKR LEECLPTGVY
ECNKRCKCDP NMCTNRLVQH GLQVRLQLFK TQNKGWGIRC LDDIAKGSFV CIYAGKILTD
DFADKEGLEM GDEYFANLDH IESVENFKEG YESDAPCSSD SSGVDLKDQE DGNSGTEDPE
ESNDDSSDDN FCKDEDFSTS SVWRSYATRR QTRGQKENGL SEMSSKDSRP PDLGPPHIPV
PPSIPVGGCN PPSSEETSKN KVASWLSCNS ISEGGFADSD SRSSFKTSEG GEGRTGGGRG
EAEKASTSGL GFKDEGDIKQ AKKEDPDDRN RMSLVTESSR NYGYNPSPVK PEGLRRPPSK
TSMHQSRRLM ASAQPNPDDI LTLSSSTESE GESGTSRKPT AGQTSATAMD SDDIQTISSG
SEGDDFEDKK NMSGPMKRQV AVKSTRGFAL KSTHGIAIKS TNMASVEKGE SAPVRKNTRQ
FYDGEESCYI IDAKLEGNLG RYLNHSCSPN LFVQNVFVDT HDLRFPWVAF FASKRIRAGT
ELTWDYNYEV GSVEGKELLC CCGAIECRGR LL
//