GenomeNet

Database: UniProt
Entry: H2KQN5_CLOSI
ID   H2KQN5_CLOSI            Unreviewed;       431 AA.
AC   H2KQN5;
DT   21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT   21-MAR-2012, sequence version 1.
DT   27-MAR-2024, entry version 32.
DE   SubName: Full=Legumain {ECO:0000313|EMBL:GAA42795.2};
DE   Flags: Fragment;
GN   ORFNames=CLF_104342 {ECO:0000313|EMBL:GAA42795.2};
OS   Clonorchis sinensis (Chinese liver fluke).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC   Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX   NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA42795.2, ECO:0000313|Proteomes:UP000008909};
RN   [1] {ECO:0000313|EMBL:GAA42795.2}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Henan {ECO:0000313|EMBL:GAA42795.2};
RX   PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA   Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA   Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA   Yu X.;
RT   "The draft genome of the carcinogenic human liver fluke Clonorchis
RT   sinensis.";
RL   Genome Biol. 12:R107-R107(2011).
RN   [2]
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Henan;
RA   Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA   Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA   Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA   Wu Z., Yu X.;
RT   "The genome and transcriptome sequence of Clonorchis sinensis provide
RT   insights into the carcinogenic liver fluke.";
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase C13 family.
CC       {ECO:0000256|ARBA:ARBA00009941}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; DF143031; GAA42795.2; -; Genomic_DNA.
DR   AlphaFoldDB; H2KQN5; -.
DR   MEROPS; C13.007; -.
DR   Proteomes; UP000008909; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd21115; legumain_C; 1.
DR   Gene3D; 1.10.132.130; -; 1.
DR   Gene3D; 3.40.50.1460; -; 1.
DR   InterPro; IPR048501; Legum_prodom.
DR   InterPro; IPR046427; Legumain_prodom_sf.
DR   InterPro; IPR001096; Peptidase_C13.
DR   PANTHER; PTHR12000; HEMOGLOBINASE FAMILY MEMBER; 1.
DR   PANTHER; PTHR12000:SF42; LEGUMAIN; 1.
DR   Pfam; PF01650; Peptidase_C13; 1.
DR   PIRSF; PIRSF019663; Legumain; 1.
DR   PRINTS; PR00776; HEMOGLOBNASE.
PE   3: Inferred from homology;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008909};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..16
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           17..431
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003564036"
FT   ACT_SITE        153
FT                   /evidence="ECO:0000256|PIRSR:PIRSR019663-1"
FT   ACT_SITE        194
FT                   /note="Nucleophile"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR019663-1"
FT   NON_TER         431
FT                   /evidence="ECO:0000313|EMBL:GAA42795.2"
SQ   SEQUENCE   431 AA;  49486 MW;  0613650726F72B8A CRC64;
     MRRSCLLIAF FYVNYAAWLG SVCVGSRLLH SDPTKNWVVL VAGSNGWGNY RHQADVFHAY
     QILRHNNISA EQIITFAYDD IANNSENPFM GKVFNDYYHI DVYEGVIIDY RGEDVTPQNF
     LRVLRGDKEL EAAGKKVLKS GPEDHVFIYF SDHGGDGIIS FPEDELSATD LNKTLGYMYK
     NGKYKKLVLY VEACESGSMF EGILPSNIGI YVTTAANNQE ASWATFCHDE VIDTCLADEY
     SYNWLTDSEE HDLTHRTLDQ QFKSVKRRTK RSHVSRFGEM DVGRLPVGDF QGHSEQSMLL
     DSATMTQVLH SRPSRWAHLT TISRRLVHAE SVEEHELAAR KLYRTLQLGH IVKQTFDDIV
     MDVTTFHQPT IHELSKSEEL QCYEAVFKQF RKRCFTIRQV PEVAQYAGYL RKLCKKGYET
     KILIQSVHKV C
//