Database: UniProt
Entry: H2KS02_CLOSI
ID   H2KS02_CLOSI            Unreviewed;      1685 AA.
AC   H2KS02;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2012, sequence version 1.
DT   24-JAN-2024, entry version 55.
DE   SubName: Full=Histone-lysine N-methyltransferase SETD1B {ECO:0000313|EMBL:GAA31138.2};
GN   ORFNames=CLF_107711 {ECO:0000313|EMBL:GAA31138.2};
OS   Clonorchis sinensis (Chinese liver fluke).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC   Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX   NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA31138.2, ECO:0000313|Proteomes:UP000008909};
RN   [1] {ECO:0000313|EMBL:GAA31138.2}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Henan {ECO:0000313|EMBL:GAA31138.2};
RX   PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA   Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA   Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA   Yu X.;
RT   "The draft genome of the carcinogenic human liver fluke Clonorchis
RT   sinensis.";
RL   Genome Biol. 12:R107-R107(2011).
RN   [2]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Henan;
RA   Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA   Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA   Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA   Wu Z., Yu X.;
RT   "The genome and transcriptome sequence of Clonorchis sinensis provide
RT   insights into the carcinogenic liver fluke.";
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DF143265; GAA31138.2; -; Genomic_DNA.
DR   Proteomes; UP000008909; Unassembled WGS sequence.
DR   GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR   GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:InterPro.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR   CDD; cd19169; SET_SETD1; 1.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR044570; Set1-like.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR037841; SET_SETD1A/B.
DR   PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR   PANTHER; PTHR45814:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50280; SET; 1.
PE   4: Predicted;
KW   Methyltransferase {ECO:0000256|ARBA:ARBA00022603,
KW   ECO:0000313|EMBL:GAA31138.2}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008909};
KW   RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000313|EMBL:GAA31138.2}.
FT   DOMAIN          1546..1663
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
FT   DOMAIN          1669..1685
FT                   /note="Post-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50868"
FT   REGION          30..77
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          93..184
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          197..258
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          304..325
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          477..857
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          924..984
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1030..1075
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1152..1182
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1331..1359
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        33..57
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        115..129
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        130..145
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        163..184
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        213..258
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        304..324
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        481..525
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        536..572
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        591..613
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        614..639
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        646..683
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        698..722
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        741..768
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        792..806
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        823..844
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        924..949
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        964..978
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1035..1075
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1156..1174
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1685 AA;  186957 MW;  866ACC9A827DDA4B CRC64;
     MADMLPRNGS VDQLPDAECA VARSRLLSGH HSSFGHHPNT VQSKHVSHGS MFTTESTAPG
     PGRLRAQRTP SRDLPPSHCM TTFNSHLFSR ASANRSDHSG SLDHLPPTGE IRVVTDLKPN
     DEESVRHQLR NKPLGSSSTF PTERGLHISD SIPEALDPGS AHPTNTDRVN KIRSNQPPCE
     ESLESRIQKL LQLNALSGTP AASSPPPPLD VHSSTTPITV TSSVSSSSYK TATTTAPACS
     SDPLSRHGDS HVSLTQSDRT RVSLESDVYF PHLSDRPQAL INRRTLLPTP ESLTDASILD
     TSRAFSLPRQ SSSPSTSRTP GSKVAQKPLD TVELNMIALD VFTLFIDELK EIMRRDVTRR
     IVEGNAFKIF SNWWDSKEND ILKAPLDTAG LHPSRSTANL NDDTQPPASE PVIPAEQITQ
     TSQKTGVLPS SVVVTASLST CITSTTPCTS MPTTSAQPGF NLFGFGLFTG LRASLPKIRR
     KPRPPSPEPS RSAKHSEQRG RSESANESEN DTPDPKPRRR STGRKSTASD TSSTDSDTDA
     DVRAVSKHDG EWESPAHELR KTDNDQPPRN SPAKPRRARL HRRQRRLTSD SFSSVDHSPV
     RPRTPSSRSS SAENSRSCDR DRNTEDRPRK PETSPQRKEP SRVADVFASS SGSDSDVSTL
     SSQKRSVVEK PSVSSSDEST EPLNYLSEAE HLAKSRSPSE ASLKSQHSNL GSPSTEVGDN
     DVAVVSTHSP IHPEWTKPSP PHHSSDISEL DTSLSSADEQ TPATPNKIGP SSGRRRGRPP
     RTAVVSSARR GRGKSRKTES EHSTLLSPVR KYEGAYNRFF PARQPSESDS SPDRGTSTDE
     GSQLAEPSRY PVSASRPDNW SALRASLHSP CERDAQSGFP VNYSRRKQSL SDLRRSRESL
     PVNDTHAFSI SHVDRVHADK DYTPLVDRPR GMVTKPDHER HRQPRDSLKH LKNSLLSPMD
     HWKDTMQSSR HDGDETDSAS NDSIEVVDTV RKPYSWPEVL LLEHNYFHLP PVGSTIKYRS
     RIPLKIVTDS VQSVTKSDDS ERSRSSLVDR GRKRPLQSND HSNRMTADTD EMDSDSTWFR
     NKKQALMSPV NHQNMDLESA QFLPEDERIP KQQYFKPDRE VTRYMEPKVR IRGNRELASL
     FTPVVKADLE RENLSAPSKR PTNLASSDEV KPETPTFSPR SPEEEEYILR SILLRGADPE
     DIMFFQMVCE HSLNCSPSSL CTWCKTPKYS AYLQGSQKLT LSLGNFRWVD HPPTLIPDPF
     EIPTYLSHNG LLIRFPEEPS CDSARTSGRV CGLKRLKESP TKSSRCMRLA SFSSSHPDNV
     VRNVTRDLFG SDSSSSTDLS DRSSTSKTEP PAAHSANSAS LATAVRVNNI QDKKLVRVDP
     LPPQSWLKFD RLLIAVAAFS SSLRFSGIPS DVQLDTIDVH LGPAASLPPI HSSGCARTQG
     FYWMPTEERF RRAWSVGRSL IAEDGRRRPI PLMPATVEAQ LGVNAVIQAI GGSIEDRLTE
     QASEAKKKQL TQFREARSAQ RRLLAQFQDI ETGDLLKFNQ LKFRKKQLIF AKSPIHAWGL
     IALEPIAAEE MVIEYVGQVV RKSVAELRER QYEAKGIGGS YLFRIDDDFV IDATMCGNNG
     RFINHSCQPN CYAKIITVEG KKKIVIYSKR DINVMEEITY DYKFPYEEEK IPCQCGASTC
     RGTLN
//