ID A0A0V0SP18_9BILA Unreviewed; 728 AA.
AC A0A0V0SP18;
DT 16-MAR-2016, integrated into UniProtKB/TrEMBL.
DT 16-MAR-2016, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Cathepsin L {ECO:0000313|EMBL:KRX28363.1};
DE Flags: Fragment;
GN Name=CL1 {ECO:0000313|EMBL:KRX28363.1};
GN ORFNames=T07_3506 {ECO:0000313|EMBL:KRX28363.1};
OS Trichinella nelsoni.
OC Eukaryota; Metazoa; Ecdysozoa; Nematoda; Enoplea; Dorylaimia;
OC Trichinellida; Trichinellidae; Trichinella.
OX NCBI_TaxID=6336 {ECO:0000313|EMBL:KRX28363.1, ECO:0000313|Proteomes:UP000054630};
RN [1] {ECO:0000313|EMBL:KRX28363.1, ECO:0000313|Proteomes:UP000054630}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ISS37 {ECO:0000313|EMBL:KRX28363.1};
RA Korhonen P.K., Edoardo P., Giuseppe L.R., Gasser R.B.;
RT "Evolution of Trichinella species and genotypes.";
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRX28363.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JYDL01000001; KRX28363.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0V0SP18; -.
DR Proteomes; UP000054630; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 2.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 2.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF959; CATHEPSIN F; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 2.
DR Pfam; PF00112; Peptidase_C1; 2.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 2.
DR SMART; SM00645; Pept_C1; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 2.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 2.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000054630};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT DOMAIN 71..128
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 154..374
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT DOMAIN 390..447
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 473..689
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KRX28363.1"
SQ SEQUENCE 728 AA; 84233 MW; 7B9B3670CC39AF4E CRC64;
LNGIIMVSVK CTVFLFCLFY CTWALPMKQK RPLFTNVNHL ERYMDSKFDK NLLLKLLPEM
NAKESRSWEN FKQFMVEFNK WYETEKLTAE KYNIFKSNMV IAKRLQEEEQ GTAIYGPTIF
ADITPEEFRK THLNFNPNSV KKPKRMANIP KSNISERMDW RKFNAVTSVK DQGNCGSCWA
FCTVANIEGA WAVKTAQLIS LSEQQLVDCD RLDDGCEGGL PVNAYLEIIR LGGLEKEEDY
KYTARSGKCK FNHTKSVVYI NDTVVLPEDE DAIARYVSEN GPVAVGLNAD AMMFYRSGIA
HPSRLMCSPD GINHGVTIVG YDVKESLFWS TPYWIIKNSW GPNWGEKGYY YLYRGKDEFQ
NQTEKSFNIS NYFNRFLLMN ANEQQSLKAF LMFMKEFNKR YESEDEFIKK YSIFNDNMKI
AMHLQQQEKG TAIYGPTIFA DMTQDEFRKT YLNMQETSAL LPKQRIALLK VNRPNKFDWR
NYNVVTKVKR QGKCGSSWAF STIANIESAW AIKFGDLISL SEQQIIDCDK INRGCRGGQP
LKAYHEIIRM SGVQAESDYP YTGLHGSCKL SKEKIKVYIN DTVLLHKNET TIANYLYEHG
PVAVRMNADI LMLYRKGIIK PTKSSCNPNF LNHGATIIGY GKESWLHWWS NPYWIIKNSW
GVDWGENGYF RLYRGNEACG VNRMVTSMSE MQACNLFKLK DTLFSRLSTC GIGDRLMRLH
IREFIEKD
//