ID H2KQN5_CLOSI Unreviewed; 431 AA.
AC H2KQN5;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2012, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Legumain {ECO:0000313|EMBL:GAA42795.2};
DE Flags: Fragment;
GN ORFNames=CLF_104342 {ECO:0000313|EMBL:GAA42795.2};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA42795.2, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA42795.2}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA42795.2};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C13 family.
CC {ECO:0000256|ARBA:ARBA00009941}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF143031; GAA42795.2; -; Genomic_DNA.
DR AlphaFoldDB; H2KQN5; -.
DR MEROPS; C13.007; -.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd21115; legumain_C; 1.
DR Gene3D; 1.10.132.130; -; 1.
DR Gene3D; 3.40.50.1460; -; 1.
DR InterPro; IPR048501; Legum_prodom.
DR InterPro; IPR046427; Legumain_prodom_sf.
DR InterPro; IPR001096; Peptidase_C13.
DR PANTHER; PTHR12000; HEMOGLOBINASE FAMILY MEMBER; 1.
DR PANTHER; PTHR12000:SF42; LEGUMAIN; 1.
DR Pfam; PF01650; Peptidase_C13; 1.
DR PIRSF; PIRSF019663; Legumain; 1.
DR PRINTS; PR00776; HEMOGLOBNASE.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000008909};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..431
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003564036"
FT ACT_SITE 153
FT /evidence="ECO:0000256|PIRSR:PIRSR019663-1"
FT ACT_SITE 194
FT /note="Nucleophile"
FT /evidence="ECO:0000256|PIRSR:PIRSR019663-1"
FT NON_TER 431
FT /evidence="ECO:0000313|EMBL:GAA42795.2"
SQ SEQUENCE 431 AA; 49486 MW; 0613650726F72B8A CRC64;
MRRSCLLIAF FYVNYAAWLG SVCVGSRLLH SDPTKNWVVL VAGSNGWGNY RHQADVFHAY
QILRHNNISA EQIITFAYDD IANNSENPFM GKVFNDYYHI DVYEGVIIDY RGEDVTPQNF
LRVLRGDKEL EAAGKKVLKS GPEDHVFIYF SDHGGDGIIS FPEDELSATD LNKTLGYMYK
NGKYKKLVLY VEACESGSMF EGILPSNIGI YVTTAANNQE ASWATFCHDE VIDTCLADEY
SYNWLTDSEE HDLTHRTLDQ QFKSVKRRTK RSHVSRFGEM DVGRLPVGDF QGHSEQSMLL
DSATMTQVLH SRPSRWAHLT TISRRLVHAE SVEEHELAAR KLYRTLQLGH IVKQTFDDIV
MDVTTFHQPT IHELSKSEEL QCYEAVFKQF RKRCFTIRQV PEVAQYAGYL RKLCKKGYET
KILIQSVHKV C
//