ID A0A267FFV9_9PLAT Unreviewed; 624 AA.
AC A0A267FFV9;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=Pre-SET domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BOX15_Mlig025313g1 {ECO:0000313|EMBL:PAA72603.1};
OS Macrostomum lignano.
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes;
OC Rhabditophora; Macrostomorpha; Macrostomida; Macrostomidae; Macrostomum.
OX NCBI_TaxID=282301 {ECO:0000313|EMBL:PAA72603.1, ECO:0000313|Proteomes:UP000215902};
RN [1] {ECO:0000313|EMBL:PAA72603.1, ECO:0000313|Proteomes:UP000215902}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DV1 {ECO:0000313|EMBL:PAA72603.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:PAA72603.1};
RA Berezikov E.;
RT "A platform for efficient transgenesis in Macrostomum lignano, a flatworm
RT model organism for stem cell research.";
RL Submitted (JUN-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAA72603.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NIVC01001075; PAA72603.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A267FFV9; -.
DR STRING; 282301.A0A267FFV9; -.
DR Proteomes; UP000215902; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0042054; F:histone methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd21181; Tudor_SETDB1_rpt2; 1.
DR Gene3D; 2.30.30.140; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR041292; Tudor_4.
DR InterPro; IPR041291; TUDOR_5.
DR PANTHER; PTHR46024; HISTONE-LYSINE N-METHYLTRANSFERASE EGGLESS; 1.
DR PANTHER; PTHR46024:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE EGGLESS; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF18358; Tudor_4; 1.
DR Pfam; PF18359; Tudor_5; 1.
DR SMART; SM00391; MBD; 1.
DR SMART; SM00468; PreSET; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS50867; PRE_SET; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000215902};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Repressor {ECO:0000256|ARBA:ARBA00022491}.
FT DOMAIN 352..437
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 500..572
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT REGION 245..265
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 280..300
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 248..265
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 624 AA; 68820 MW; 3A93187B5561C744 CRC64;
MVPRRRYHEL PRDAYVVRFL GAPTLAANQQ PTQPQPQLSR TVQPVQMALG DSCLTRQERC
PVGTRVVSTV ADPDYADEGP DAEADGDETP RYAGIVAESP GEANKFRYLI FFDDGYCQYS
EQAAVHRVLA QSVNNWRLAG PESRFFIRDY LDAYPTRDLL RLQPGQAVVA ELRGGWHNAR
VEAVDGALIR LTFMHDGSCE WVYRGSTRLK TLYDRLYGPP KQQQQQPKQK PVQVQVSAAV
VASQARVGHA RKSTSASRLA SSKSTANAAE DDDDFVYIGS TGRRRRRREN GDSDDDSVAQ
DAAYRGRQVA TFDPESPCPQ PMLYRPHRCS PSCCPDPAGG LSSAVAFDDA AARGRNPLEA
PVALGWWRLL VLAADDDGGV GGGGGGGGGP AGRGREVVEY RAPCGRSLRS HADLDAYLRE
TCCRLHVSAF CFDAAVRINN EFQPLKIIYQ LKDISYGKEP VPVSAVNSLD NTSLPYIEYA
AERVPTQGVN TNESEPGFLV CCDCTDDCSD RRRCACQQLT IRSSQAITGK ADIRVTYRHG
RHMARALGGI YECNAKCRCR AATCRNRVVQ HGLRNRLQVF RTARKGWGIR TLHDIPKGAF
ICIYAGEVYN EETAVRYGTE LGDE
//