ID A0A669D608_ORENI Unreviewed; 345 AA.
AC A0A669D608;
DT 17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE RecName: Full=Cathepsin B {ECO:0000256|ARBA:ARBA00015559};
DE EC=3.4.22.1 {ECO:0000256|ARBA:ARBA00012537};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000055941.1, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000055941.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Hydrolysis of proteins with broad specificity for peptide
CC bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule
CC substrates (thus differing from cathepsin L). In addition to being an
CC endopeptidase, shows peptidyl-dipeptidase activity, liberating C-
CC terminal dipeptides.; EC=3.4.22.1;
CC Evidence={ECO:0000256|ARBA:ARBA00001754};
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A669D608; -.
DR Ensembl; ENSONIT00000036627.1; ENSONIP00000055941.1; ENSONIG00000038588.1.
DR GeneTree; ENSGT00940000166128; -.
DR Proteomes; UP000005207; Linkage group LG15.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:UniProtKB-EC.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR012599; Propeptide_C1A.
DR PANTHER; PTHR12411:SF990; CATHEPSIN B; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR Pfam; PF08127; Propeptide_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..345
FT /note="Cathepsin B"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5025577443"
FT DOMAIN 79..326
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 345 AA; 38147 MW; 794E1DD31F21DAC0 CRC64;
MSPLFLLSVV VMVCTTWAHP HHSLLSSEMV DFINKANTTW TATKNFQNID TTYVKQLCGT
ILNGPKLPEV LHNIEGIKLP DSFDARKQWP NCATIQQIRD QGSCGSCWAF GAAEAISDRL
CIQSGGKISV EISAEDLLAC CDECGMGCYG GYPSAAWEFW AKKGLVTGGL YDSKVGCLPY
TIAPCEHHVN GSRPPCGSSE TPKCVQQCAD GYSLSYEKDK HFGRRTYGVP SDPEQIMTEL
YKNGPVEASF TVYDDFLLYK SGVYQHVTGD VLGGHAIKIL GWGDDNGTPY WLAANSWNTD
WGDEGFFKIK RGNDECDIES EVVTGIPVTG RFVESKRRRT RKKIY
//