ID H2KS02_CLOSI Unreviewed; 1685 AA.
AC H2KS02;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2012, sequence version 1.
DT 24-JAN-2024, entry version 55.
DE SubName: Full=Histone-lysine N-methyltransferase SETD1B {ECO:0000313|EMBL:GAA31138.2};
GN ORFNames=CLF_107711 {ECO:0000313|EMBL:GAA31138.2};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA31138.2, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA31138.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA31138.2};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF143265; GAA31138.2; -; Genomic_DNA.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:InterPro.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd19169; SET_SETD1; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR044570; Set1-like.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR037841; SET_SETD1A/B.
DR PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR PANTHER; PTHR45814:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603,
KW ECO:0000313|EMBL:GAA31138.2}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008909};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000313|EMBL:GAA31138.2}.
FT DOMAIN 1546..1663
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1669..1685
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 30..77
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 93..184
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 197..258
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 304..325
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 477..857
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 924..984
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1030..1075
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1152..1182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1331..1359
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 33..57
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 115..129
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 130..145
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 163..184
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 213..258
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 304..324
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 481..525
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 536..572
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 591..613
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 614..639
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 646..683
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 698..722
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 741..768
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 792..806
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 823..844
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 924..949
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 964..978
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1035..1075
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1156..1174
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1685 AA; 186957 MW; 866ACC9A827DDA4B CRC64;
MADMLPRNGS VDQLPDAECA VARSRLLSGH HSSFGHHPNT VQSKHVSHGS MFTTESTAPG
PGRLRAQRTP SRDLPPSHCM TTFNSHLFSR ASANRSDHSG SLDHLPPTGE IRVVTDLKPN
DEESVRHQLR NKPLGSSSTF PTERGLHISD SIPEALDPGS AHPTNTDRVN KIRSNQPPCE
ESLESRIQKL LQLNALSGTP AASSPPPPLD VHSSTTPITV TSSVSSSSYK TATTTAPACS
SDPLSRHGDS HVSLTQSDRT RVSLESDVYF PHLSDRPQAL INRRTLLPTP ESLTDASILD
TSRAFSLPRQ SSSPSTSRTP GSKVAQKPLD TVELNMIALD VFTLFIDELK EIMRRDVTRR
IVEGNAFKIF SNWWDSKEND ILKAPLDTAG LHPSRSTANL NDDTQPPASE PVIPAEQITQ
TSQKTGVLPS SVVVTASLST CITSTTPCTS MPTTSAQPGF NLFGFGLFTG LRASLPKIRR
KPRPPSPEPS RSAKHSEQRG RSESANESEN DTPDPKPRRR STGRKSTASD TSSTDSDTDA
DVRAVSKHDG EWESPAHELR KTDNDQPPRN SPAKPRRARL HRRQRRLTSD SFSSVDHSPV
RPRTPSSRSS SAENSRSCDR DRNTEDRPRK PETSPQRKEP SRVADVFASS SGSDSDVSTL
SSQKRSVVEK PSVSSSDEST EPLNYLSEAE HLAKSRSPSE ASLKSQHSNL GSPSTEVGDN
DVAVVSTHSP IHPEWTKPSP PHHSSDISEL DTSLSSADEQ TPATPNKIGP SSGRRRGRPP
RTAVVSSARR GRGKSRKTES EHSTLLSPVR KYEGAYNRFF PARQPSESDS SPDRGTSTDE
GSQLAEPSRY PVSASRPDNW SALRASLHSP CERDAQSGFP VNYSRRKQSL SDLRRSRESL
PVNDTHAFSI SHVDRVHADK DYTPLVDRPR GMVTKPDHER HRQPRDSLKH LKNSLLSPMD
HWKDTMQSSR HDGDETDSAS NDSIEVVDTV RKPYSWPEVL LLEHNYFHLP PVGSTIKYRS
RIPLKIVTDS VQSVTKSDDS ERSRSSLVDR GRKRPLQSND HSNRMTADTD EMDSDSTWFR
NKKQALMSPV NHQNMDLESA QFLPEDERIP KQQYFKPDRE VTRYMEPKVR IRGNRELASL
FTPVVKADLE RENLSAPSKR PTNLASSDEV KPETPTFSPR SPEEEEYILR SILLRGADPE
DIMFFQMVCE HSLNCSPSSL CTWCKTPKYS AYLQGSQKLT LSLGNFRWVD HPPTLIPDPF
EIPTYLSHNG LLIRFPEEPS CDSARTSGRV CGLKRLKESP TKSSRCMRLA SFSSSHPDNV
VRNVTRDLFG SDSSSSTDLS DRSSTSKTEP PAAHSANSAS LATAVRVNNI QDKKLVRVDP
LPPQSWLKFD RLLIAVAAFS SSLRFSGIPS DVQLDTIDVH LGPAASLPPI HSSGCARTQG
FYWMPTEERF RRAWSVGRSL IAEDGRRRPI PLMPATVEAQ LGVNAVIQAI GGSIEDRLTE
QASEAKKKQL TQFREARSAQ RRLLAQFQDI ETGDLLKFNQ LKFRKKQLIF AKSPIHAWGL
IALEPIAAEE MVIEYVGQVV RKSVAELRER QYEAKGIGGS YLFRIDDDFV IDATMCGNNG
RFINHSCQPN CYAKIITVEG KKKIVIYSKR DINVMEEITY DYKFPYEEEK IPCQCGASTC
RGTLN
//