ID D2HMR2_AILME Unreviewed; 339 AA.
AC D2HMR2; G1LI42;
DT 09-FEB-2010, integrated into UniProtKB/TrEMBL.
DT 09-FEB-2010, sequence version 1.
DT 27-MAR-2024, entry version 66.
DE RecName: Full=Cathepsin B {ECO:0000256|ARBA:ARBA00015559};
DE EC=3.4.22.1 {ECO:0000256|ARBA:ARBA00012537};
DE Flags: Fragment;
GN Name=CTSB {ECO:0000313|Ensembl:ENSAMEP00000006590.2};
GN ORFNames=PANDA_012896 {ECO:0000313|EMBL:EFB23278.1};
OS Ailuropoda melanoleuca (Giant panda).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ailuropoda.
OX NCBI_TaxID=9646 {ECO:0000313|EMBL:EFB23278.1};
RN [1] {ECO:0000313|EMBL:EFB23278.1, ECO:0000313|Proteomes:UP000008912}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20010809; DOI=10.1038/nature08696;
RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., Li B.,
RA Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., Jian M., Li J.,
RA Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., Ryder O.A.,
RA Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., Guo X., Wang B.,
RA Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., Wang G., Yu C., Nie W.,
RA Wang J., Wu Z., Liang H., Min J., Wu Q., Cheng S., Ruan J., Wang M.,
RA Shi Z., Wen M., Liu B., Ren X., Zheng H., Dong D., Cook K., Shan G.,
RA Zhang H., Kosiol C., Xie X., Lu Z., Zheng H., Li Y., Steiner C.C.,
RA Lam T.T., Lin S., Zhang Q., Li G., Tian J., Gong T., Liu H., Zhang D.,
RA Fang L., Ye C., Zhang J., Hu W., Xu A., Ren Y., Zhang G., Bruford M.W.,
RA Li Q., Ma L., Guo Y., An N., Hu Y., Zheng Y., Shi Y., Li Z., Liu Q.,
RA Chen Y., Zhao J., Qu N., Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X.,
RA Vinar T., Wang Y., Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y.,
RA Wang X., Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L.,
RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., Wang J.,
RA Wang J.;
RT "The sequence and de novo assembly of the giant panda genome.";
RL Nature 463:311-317(2010).
RN [2] {ECO:0000313|Ensembl:ENSAMEP00000006590.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Hydrolysis of proteins with broad specificity for peptide
CC bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule
CC substrates (thus differing from cathepsin L). In addition to being an
CC endopeptidase, shows peptidyl-dipeptidase activity, liberating C-
CC terminal dipeptides.; EC=3.4.22.1;
CC Evidence={ECO:0000256|ARBA:ARBA00001754};
CC -!- SUBCELLULAR LOCATION: Apical cell membrane
CC {ECO:0000256|ARBA:ARBA00004465}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004465}; Extracellular side
CC {ECO:0000256|ARBA:ARBA00004465}. Lysosome
CC {ECO:0000256|ARBA:ARBA00004371}.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL193056; EFB23278.1; -; Genomic_DNA.
DR RefSeq; XP_002923704.1; XM_002923658.3.
DR STRING; 9646.ENSAMEP00000006590; -.
DR MEROPS; C01.060; -.
DR Ensembl; ENSAMET00000006868.2; ENSAMEP00000006590.2; ENSAMEG00000006251.2.
DR GeneID; 100476830; -.
DR KEGG; aml:100476830; -.
DR CTD; 1508; -.
DR eggNOG; KOG1543; Eukaryota.
DR GeneTree; ENSGT00940000158680; -.
DR HOGENOM; CLU_012184_3_3_1; -.
DR OrthoDB; 808912at2759; -.
DR TreeFam; TF314576; -.
DR Proteomes; UP000008912; Unassembled WGS sequence.
DR GO; GO:0016324; C:apical plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0009897; C:external side of plasma membrane; IEA:Ensembl.
DR GO; GO:0005615; C:extracellular space; IEA:Ensembl.
DR GO; GO:0005764; C:lysosome; IEA:UniProtKB-SubCell.
DR GO; GO:1904090; C:peptidase inhibitor complex; IEA:Ensembl.
DR GO; GO:0048471; C:perinuclear region of cytoplasm; IEA:Ensembl.
DR GO; GO:0005518; F:collagen binding; IEA:Ensembl.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:UniProtKB-EC.
DR GO; GO:0043394; F:proteoglycan binding; IEA:Ensembl.
DR GO; GO:0097067; P:cellular response to thyroid hormone stimulus; IEA:Ensembl.
DR GO; GO:0030574; P:collagen catabolic process; IEA:Ensembl.
DR GO; GO:0046697; P:decidualization; IEA:Ensembl.
DR GO; GO:0030855; P:epithelial cell differentiation; IEA:Ensembl.
DR GO; GO:0051603; P:proteolysis involved in protein catabolic process; IEA:Ensembl.
DR GO; GO:0006590; P:thyroid hormone generation; IEA:Ensembl.
DR GO; GO:0046718; P:viral entry into host cell; IEA:Ensembl.
DR CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR012599; Propeptide_C1A.
DR PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR Pfam; PF08127; Propeptide_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Lysosome {ECO:0000256|ARBA:ARBA00023228};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000008912};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..339
FT /note="Cathepsin B"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5041198121"
FT DOMAIN 80..329
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT NON_TER 339
FT /evidence="ECO:0000313|EMBL:EFB23278.1"
SQ SEQUENCE 339 AA; 37688 MW; 5542446C078EE1C0 CRC64;
MWQLLACLSC LVVLAGAQSR PPFQLLSDEL VNYVNKRNTT WKAGHNFHNV DPSYLRRLCG
TFLGGPKLPQ RVWFAENMVL PENFDAREQW PNCPTIKEIR DQGSCGSCWA FGAVEAISDR
ICIRTNGHVN VEVSAEDMLT CCGDQCGDGC NGGFPAEAWN FWTKQGLVSG GLYESHVGCR
PYSIPPCEHH VNGSRPPCTG EGDTPKCSKF CEPGYTPSYK EDKHYGCSSY SVSSSEKEIM
AEIYKNGPVE AAFTVYSDFL LYKSGVYQHV TGEMMGGHAV RILGWGVENG TPYWLVGNSW
NTDWGDNGFF KILRGRDHCG IESEIVAGIP CTDQYWKKI
//