ID A0A151M5C4_ALLMI Unreviewed; 1491 AA.
AC A0A151M5C4;
DT 08-JUN-2016, integrated into UniProtKB/TrEMBL.
DT 08-JUN-2016, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Histone-lysine N-methyltransferase SETDB1 isoform B {ECO:0000313|EMBL:KYO19719.1};
GN Name=SETDB1 {ECO:0000313|EMBL:KYO19719.1};
GN ORFNames=Y1Q_0012364 {ECO:0000313|EMBL:KYO19719.1};
OS Alligator mississippiensis (American alligator).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Crocodylia; Alligatoridae; Alligatorinae;
OC Alligator.
OX NCBI_TaxID=8496 {ECO:0000313|EMBL:KYO19719.1};
RN [1] {ECO:0000313|EMBL:KYO19719.1, ECO:0000313|Proteomes:UP000050525}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KSC_2009_1 {ECO:0000313|EMBL:KYO19719.1};
RX PubMed=22293439; DOI=10.1186/gb-2012-13-1-415;
RA St John J.A., Braun E.L., Isberg S.R., Miles L.G., Chong A.Y., Gongora J.,
RA Dalzell P., Moran C., Bed'hom B., Abzhanov A., Burgess S.C., Cooksey A.M.,
RA Castoe T.A., Crawford N.G., Densmore L.D., Drew J.C., Edwards S.V.,
RA Faircloth B.C., Fujita M.K., Greenwold M.J., Hoffmann F.G., Howard J.M.,
RA Iguchi T., Janes D.E., Khan S.Y., Kohno S., de Koning A.J., Lance S.L.,
RA McCarthy F.M., McCormack J.E., Merchant M.E., Peterson D.G., Pollock D.D.,
RA Pourmand N., Raney B.J., Roessler K.A., Sanford J.R., Sawyer R.H.,
RA Schmidt C.J., Triplett E.W., Tuberville T.D., Venegas-Anaya M.,
RA Howard J.T., Jarvis E.D., Guillette L.J.Jr., Glenn T.C., Green R.E.,
RA Ray D.A.;
RT "Sequencing three crocodilian genomes to illuminate the evolution of
RT archosaurs and amniotes.";
RL Genome Biol. 13:415-415(2012).
CC -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KYO19719.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AKHW03006582; KYO19719.1; -; Genomic_DNA.
DR STRING; 8496.A0A151M5C4; -.
DR Proteomes; UP000050525; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:UniProt.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd01395; HMT_MBD; 1.
DR CDD; cd10517; SET_SETDB1; 1.
DR CDD; cd20382; Tudor_SETDB1_rpt1; 1.
DR CDD; cd21181; Tudor_SETDB1_rpt2; 1.
DR Gene3D; 2.30.30.140; -; 3.
DR Gene3D; 2.170.270.10; SET domain; 2.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR040880; DUF5604.
DR InterPro; IPR025796; Hist-Lys_N-MeTrfase_SETDB1.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR047232; SETDB1/2-like_MBD.
DR InterPro; IPR002999; Tudor.
DR InterPro; IPR041292; Tudor_4.
DR InterPro; IPR041291; TUDOR_5.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR46024; HISTONE-LYSINE N-METHYLTRANSFERASE EGGLESS; 1.
DR PANTHER; PTHR46024:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETDB1; 1.
DR Pfam; PF18300; DUF5604; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF18358; Tudor_4; 1.
DR Pfam; PF18359; Tudor_5; 1.
DR SMART; SM00391; MBD; 1.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SMART; SM00333; TUDOR; 2.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS51573; SAM_MT43_SUVAR39_1; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW Methyltransferase {ECO:0000313|EMBL:KYO19719.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000050525};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Repressor {ECO:0000256|ARBA:ARBA00022491};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000313|EMBL:KYO19719.1}.
FT DOMAIN 799..870
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 932..1005
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 1008..1466
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1475..1491
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 155..198
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 226..260
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 696..732
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1077..1357
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 85..129
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 156..173
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 232..255
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1089..1115
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1120..1154
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1173..1205
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1237..1251
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1280..1334
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1491 AA; 165650 MW; 1B5AA364D700D4DA CRC64;
MRAGAAEVFP PESACVVLSP AFAAASRGEV WSARATVAGS GVGAPQCSAL ERSPALASRV
AGSEMDSQEI GELQQEVMEE LGISMEDLQQ FIDEELEKLE CVKQRKKQLE ELEKNVKQKE
AEVAHVDQLF DDASRAINKC EVLVKDLYSK LGLQYRESSS EDEDSASKPT EVIEIPDEDD
DDVMSVDSGD TSKNPKDQTL LREAMAAMRK SAQDVQKFMD AITKKKTTQE TQRDGVSSQQ
SSVNASQQAN SGGDLSKDGD LTVGMRILGK KRTKTWHKGT LIAIQSVGES RRPGGAGKKY
KVKFDNKGKS LLSGNHIAYD YHPLPEKLYV GSRVVAKYKD GNQVWLYAGI VAETPNVKNK
LRFLIFFDDG YASYVTQPEL YLVCRPLKKT WEDIEDVSCR DFIEEYITAY PNRPMVLLKS
GQLIKTEWEG TWWKSRVEEV DGSLVKILFL DDKRCEWIYR GSTRLEPMFS MKTSTASTQE
KKQSGQARTR PNVGAVRSKG PVVQYTQDLT GTGTQYKPIE QMQATVLGSQ PVSPQPAEIE
HWPPAALFSS PWTDRLMDPS DFGATESSGS LFGLDGYGPN PLELTSDIPK HAPYHPRSMD
WIVSEACTGP EGRLHEEGKE HYNGSHVHEN ASSPEPVFWD FKINFLGERD AIPVHFCHQC
GFPIRIYGRM IPCQHVYCYD CASLGSDSQL AQSRKQVAKK STSFRPGSVG SGQSSPASPV
LSETPPIGRT GVNQQVYRFP STSLPSQPIS GVAPSFHGIM DRVPNEPSYH APMEKLFYLP
HVCSFTCLSR VRPIRSDQYR SKNPLLIPLL YDFRRMTARR RINRKMGFHV LYKTPCGLCL
RTMQEIERYL FETDCDFLFL EMFCLDPYVL VDRKFQPYKP FYYIADITKG KEDVPLSCVN
EIDSTPPPQV AYSKERIPGK GVYINTSWEF LVGCDCKDGC RDKSKCACHQ LTVQATGCTP
GGQINHNSGY QYKRLDECLP TGVYECNKRC KCNVNMCTNR LVQHGLQVRL QLFKTQNKGW
GIRCLDDIAK GSFVCIYAGK ILTDDFADKE GLEMGDEYFA NLDHIESVEN FKEGYESDAK
CSSDSSGVDL KDDDDDNTGS EEQEESNEDS SDDNFGKDED FSTNSVWRSY ATRRQTRGQK
ENGLSETASK DSGLARQVSH EESAVAVSCK LPVSEETSKN KVASWLNSNS MTDSFQDNDS
ASSFKMNEAG EAKAGKMDAP GERERASTSG LGGLDAEGKG QKKEESEETN KFPLLSETTG
RLYGYNPSPP KLEGVRRPLS KTTLHQSRRQ SAPPQPSTDD VLTLSSSTDS EGENGQTASG
QAQGNANDSD DIQTISSGSE EEDDKKNPSS GLGPVKRQVA VKSTRGFALK STHGIAIKST
NLASADKGES APVRRTTRQF YDGEESCYII DAKLEGNLGR YLNHSCSPNL FVQNVFVDTH
DLRFPWVAFF ASKRIRAGTE LTWDYNYEVG SVEGKELLCC CGAIECRGRL L
//