ID K7F689_PELSI Unreviewed; 330 AA.
AC K7F689;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 60.
DE RecName: Full=Cathepsin B {ECO:0000256|ARBA:ARBA00015559};
DE EC=3.4.22.1 {ECO:0000256|ARBA:ARBA00012537};
GN Name=CTSB {ECO:0000313|Ensembl:ENSPSIP00000003549.1};
OS Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC Trionychidae; Pelodiscus.
OX NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000003549.1, ECO:0000313|Proteomes:UP000007267};
RN [1] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG Soft-shell Turtle Genome Consortium;
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000007267}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX PubMed=23624526; DOI=10.1038/ng.2615;
RA Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT into the development and evolution of the turtle-specific body plan.";
RL Nat. Genet. 45:701-706(2013).
RN [3] {ECO:0000313|Ensembl:ENSPSIP00000003549.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Hydrolysis of proteins with broad specificity for peptide
CC bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule
CC substrates (thus differing from cathepsin L). In addition to being an
CC endopeptidase, shows peptidyl-dipeptidase activity, liberating C-
CC terminal dipeptides.; EC=3.4.22.1;
CC Evidence={ECO:0000256|ARBA:ARBA00001754};
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGCU01038101; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01038102; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01038103; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01038104; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01038105; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01038106; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01038107; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01038108; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01038109; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGCU01038110; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; K7F689; -.
DR STRING; 13735.ENSPSIP00000003549; -.
DR Ensembl; ENSPSIT00000003567.1; ENSPSIP00000003549.1; ENSPSIG00000003377.1.
DR eggNOG; KOG1543; Eukaryota.
DR GeneTree; ENSGT00940000158680; -.
DR HOGENOM; CLU_012184_3_3_1; -.
DR OMA; DEKIPYW; -.
DR TreeFam; TF314576; -.
DR Proteomes; UP000007267; Unassembled WGS sequence.
DR GO; GO:0009897; C:external side of plasma membrane; IEA:Ensembl.
DR GO; GO:0005615; C:extracellular space; IEA:Ensembl.
DR GO; GO:0005764; C:lysosome; IEA:Ensembl.
DR GO; GO:1904090; C:peptidase inhibitor complex; IEA:Ensembl.
DR GO; GO:0048471; C:perinuclear region of cytoplasm; IEA:Ensembl.
DR GO; GO:0005518; F:collagen binding; IEA:Ensembl.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:UniProtKB-EC.
DR GO; GO:0043394; F:proteoglycan binding; IEA:Ensembl.
DR GO; GO:0097067; P:cellular response to thyroid hormone stimulus; IEA:Ensembl.
DR GO; GO:0030574; P:collagen catabolic process; IEA:Ensembl.
DR GO; GO:0030855; P:epithelial cell differentiation; IEA:Ensembl.
DR GO; GO:0051603; P:proteolysis involved in protein catabolic process; IEA:Ensembl.
DR GO; GO:0006590; P:thyroid hormone generation; IEA:Ensembl.
DR GO; GO:0046718; P:viral entry into host cell; IEA:Ensembl.
DR CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR012599; Propeptide_C1A.
DR PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR Pfam; PF08127; Propeptide_C1; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..330
FT /note="Cathepsin B"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018739329"
FT DOMAIN 79..324
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 330 AA; 36029 MW; 91D44B5741D2CA77 CRC64;
PWSLAALFVL VALASARSVP HFAPLSPDLV NYINKLNTTW QAGHNFRNAD LSYVKQLCGT
FLHGPKLPVR AEFAGDLNLP DSFDSRKQWP NCPTINEIRD QGSCGSCWAF GAVEAISDRV
CVHTNGKMNV EISAEDLLSC CGFECGMGGG CSRPTAKDTG ASLRAEGLES TALGLSHPAG
CRPYSIPPCE HHVNGSRPPC TGEQGDTPKC DQHCEAGYSP SYETDKHFGA TSYNVPRSEK
EIMAEIYKNG PVEGAFSVYE DFLMYKSGKR GRDVPGGARS WGGSSPAYWQ PAQGWERDWR
ERCFFKILRG QDHCGIESEI GSGQPRTEPE
//