Database: UniProt
Entry: H2KRL5_CLOSI
ID   H2KRL5_CLOSI            Unreviewed;      1763 AA.
AC   H2KRL5;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2012, sequence version 1.
DT   27-MAR-2024, entry version 54.
DE   SubName: Full=Histone-lysine N-methyltransferase MLL3 {ECO:0000313|EMBL:GAA32467.2};
DE   Flags: Fragment;
GN   ORFNames=CLF_106624 {ECO:0000313|EMBL:GAA32467.2};
OS   Clonorchis sinensis (Chinese liver fluke).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC   Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX   NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA32467.2, ECO:0000313|Proteomes:UP000008909};
RN   [1] {ECO:0000313|EMBL:GAA32467.2}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Henan {ECO:0000313|EMBL:GAA32467.2};
RX   PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA   Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA   Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA   Yu X.;
RT   "The draft genome of the carcinogenic human liver fluke Clonorchis
RT   sinensis.";
RL   Genome Biol. 12:R107-R107(2011).
RN   [2]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Henan;
RA   Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA   Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA   Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA   Wu Z., Yu X.;
RT   "The genome and transcriptome sequence of Clonorchis sinensis provide
RT   insights into the carcinogenic liver fluke.";
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DF143187; GAA32467.2; -; Genomic_DNA.
DR   Proteomes; UP000008909; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0008168; F:methyltransferase activity; IEA:UniProtKB-KW.
DR   GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR   GO; GO:0043933; P:protein-containing complex organization; IEA:UniProt.
DR   Gene3D; 3.30.160.360; -; 1.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR   InterPro; IPR034732; EPHD.
DR   InterPro; IPR003889; FYrich_C.
DR   InterPro; IPR003888; FYrich_N.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR   PANTHER; PTHR45888; HL01030P-RELATED; 1.
DR   PANTHER; PTHR45888:SF6; HL01030P-RELATED; 1.
DR   Pfam; PF05965; FYRC; 1.
DR   Pfam; PF05964; FYRN; 1.
DR   Pfam; PF00856; SET; 1.
DR   Pfam; PF13832; zf-HC5HC2H_2; 1.
DR   SMART; SM00542; FYRC; 1.
DR   SMART; SM00541; FYRN; 1.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   PROSITE; PS51805; EPHD; 1.
DR   PROSITE; PS51543; FYRC; 1.
DR   PROSITE; PS51542; FYRN; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50280; SET; 1.
PE   4: Predicted;
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Methyltransferase {ECO:0000313|EMBL:GAA32467.2};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008909};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000313|EMBL:GAA32467.2};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT   DOMAIN          1243..1351
FT                   /note="PHD-type"
FT                   /evidence="ECO:0000259|PROSITE:PS51805"
FT   DOMAIN          1623..1739
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
FT   DOMAIN          1747..1763
FT                   /note="Post-SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50868"
FT   REGION          114..148
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          254..291
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          565..603
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          796..815
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          828..880
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        254..274
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        832..858
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:GAA32467.2"
SQ   SEQUENCE   1763 AA;  192783 MW;  FE38B13AABBCF72E CRC64;
     SAFRLPQQRA RCGMELSGMS KVSESTLISS SASCQTHLYD RIPRSPTAVV QPSSPIPSAH
     NSAGTDGLKF TLVDESPCLA AFKSQTDNNH HSISPEISFA TFSNNLERTD DLSVTNQATS
     GPDLSQFPND DSSDINRSSN VGKSHSTPLI CTSPAAAPGS TFFSPDHQLL NGGSTPISFV
     CPADCTTPTA ADTSLDVNGL SGSHTTASPN DVDRTRMDLT KLDETIDYVL SAARSDDPCE
     GLFVLPCRTR EHTTLSSPTL RANTPTSGHL FSPSPSPTPS SPQSCHATAS PLNSTNKQLH
     LYVSSSSVSS IATATSMPRY VVDQSSGLVQ LSTGVVNSQC LPAMCNGQQS LSFVQVFSDP
     PSSSRIPNHE RSTVPQLGSL VTSPQVVFHS GLSHAVQNHS IAQVNHTLLP TAAAVRGCGP
     LVSPNKSNLL SPATGSRSTK LSTIPCAPFS SIDPLLSPNR AVVISPPMCV VQNPRLASPT
     RNVPVFSTSS NDMPNQCPQL PGERRHYVHS QVPVSFRQPI VPAQTFTPSC ISAANTVCLQ
     PTVPDQNIPL VHPGPTTAES LRHITQNHSA ASRKRSNTSG SGLSKRRKTT AAPPTSLANG
     STNLIKGSLI EPNSPYRANI VAHEYFEKLK LGDLKSLVSA PVLRPYPSVL PPVGGSDIVT
     PFLKVDDEQS VLCRKGLKYI HGSMSVHIPK CSLSEASDRK VDSYILRHLP NATFTGPLVC
     GSKNPTHAMS PTSPVCPSPE PLTFENHMLA LFSTFNRSPK EECKSTLVVD RSVSQNTSNM
     PSFASRLLPM LTEVTPNSGK PDPERSTPPL PLFHMPNPRL VYDHARSGST DRDGVEFNTT
     AQSETSRSNS TFIYNSPSRE PNDVKPPVIT APGTGPLIDS ETGEIVAGHR FRPPAESMHS
     DKLHITFTIN PSMTGGIPKI VQRIAELLGV EQDSVSYQVT RSGAQITLDQ KQSQSEGLMR
     NLESQLREHF TPLSLHVETA SVRPTDSNIH SDLQTVVERE ELSNHPGTAR PELKQNVDTS
     NLLKLDGPCI TNSEWARYSE EPVSVVSLLA NKNGVSKPCQ NCDTLVSPYS GHRKTLDDIA
     ALTGLTTRTD TNGDFIFCSK DCMLAFAHSI SVRRLSLESP IACPTSATTT GSVSDSRPQT
     LVCDLFSTVP VVLQSSLPKL GKSSRRHLTG VHIVASGHKR RQVSAVHGTA KHRRWRDSRW
     RIFSPEFVPT TTPAPSDKLI TSGPITFSRD DGPCLRPPPS ADPRRCTLCG TKGDASENGL
     GRLLPMNIDK WLHVNCALWC YEVYETVGGS LNDVDIWIKK AMETNCTHCG HLGAGLPCYN
     PRCTFSYHVS CAIAIGCMFF TDRGMYCPQH QPRETHPMQL PSLLVNRRVY ITRDENAEVG
     GVIHEEDRTK IVRIGSVCLH NVGQLLPHQI ESGYFHTRRY IYPVGFHTTR IYWSMRRPHR
     RARYVCEIDE ADGRPSFRIT ALDDGLEDVT VVSDSCTGAW QVILSRIEGM RRENHMVQLF
     PKHIKGEELY GLSEPHIVRA VESLPGMDGL RDYVFNFGRM ELIAEMPLAI NPSGCARTEP
     KMQTYVKRSF LSSHKLTPTT SSPVGFRRSV HPSLSISTPS DYSKQYQSSR SQQYRRLKGE
     ASSNVILGRS RIQGLGLFAA RNLEPQTMVI EYIGELIRLE LANKREKDYE AHNRGIYMFR
     LNDDTVIDAT VCGGLARYIN HSCQPNCFAE FLNFGDHSHI VIITNRLIEK GEELCYDYNF
     DLEDGGSKIP CLCRSTNCRK WMN
//