ID H2KPM4_CLOSI Unreviewed; 888 AA.
AC H2KPM4;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2012, sequence version 1.
DT 28-JUN-2023, entry version 28.
DE SubName: Full=Kyphoscoliosis peptidase {ECO:0000313|EMBL:GAA33255.2};
GN ORFNames=CLF_102240 {ECO:0000313|EMBL:GAA33255.2};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA33255.2, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA33255.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA33255.2};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF142919; GAA33255.2; -; Genomic_DNA.
DR AlphaFoldDB; H2KPM4; -.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR Gene3D; 3.10.620.30; -; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR002931; Transglutaminase-like.
DR PANTHER; PTHR47020; HILLARIN; 1.
DR PANTHER; PTHR47020:SF1; HILLARIN; 1.
DR Pfam; PF01841; Transglut_core; 1.
DR SMART; SM00460; TGc; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008909}.
FT DOMAIN 258..326
FT /note="Transglutaminase-like"
FT /evidence="ECO:0000259|SMART:SM00460"
FT REGION 56..99
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 62..99
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 888 AA; 99936 MW; 99BE285F7F50E91E CRC64;
MSNPTMRRNV DPDVIIEAFS NLMDDKGNLH SDIQQNTSWT VQTQSAFDRR GFGPQALQYG
RVDSPGQHSG SSHPELDSGC NGNGSVQYSQ VSPPGSAISS VSTQRLPFYR QQYDQFSATA
GGSPGYAPSG QFYRPDQSTA NWSAGPIQQE PPKEFSLLPI STETYVEKPR PQDLPSIGDK
STAPFGSFDV FRRVDEHAVS ISQQQQDSFK QLIWQLIYAR NITDELEKVR VIFLWLCTKD
LHKMNFDNVK PDTPEEILMG IRTGKSTYAQ IFYTLCRYAG LHCKLLIGYA KGAEYAPGMQ
FSGRQGQHSW NAVLVDKVWR LVDCHWAARR LIGKRPSPEN VRYGLDMFYF LANPGQLIYT
HFPHDSDWQL LRHPITLKEF ENLAPVKSAF FKYNLDLITH RNAVIVCTDP EVCVVIAFPP
QAEKYLSFTF GLSIDNKEGS EEYRGLPLTR YGRQEISVVE HRSLFYVRPP RPGAYKLLIY
AKHHQNGPAH QDNGAFDYSS DANSSSMYGA VCEYRLVANF HPNCSLPPYP PCQSSSYGPN
ELAKNYQIQA KQTQCTLRAT RGCLEIRFAL GSALGLPPPR MMGRLRCTLV PDEALTRSLL
QRSVNGNREA VFAVFLPEAG EYGLEIYANN PAKDGNSFYV VWQYIILSDS ESPVRGLPTI
PPAYLGPMPR FDSMGLSTVS HPDPFIQANT GELCIQLNQN PDIPVRMMAQ LIHASHDVSE
DCSQQILQQM RDNQVYFVVR FPHPGFFKFQ IYALPYSEPG ESLPGVFNYL LEATQVHRNR
SGQVMCFPQQ FSQWKEGCYL HSPLDGILSP SAQPSDVIPF KLTVPKAIAV AVVVGDEWTH
LNKVGDRWEG QVSLKQHWGR EHQLAVCANY NTGNGNYGTI LEYQVARR
//