ID   D2HMR2_AILME            Unreviewed;       339 AA.
AC   D2HMR2; G1LI42;
DT   09-FEB-2010, integrated into UniProtKB/TrEMBL.
DT   09-FEB-2010, sequence version 1.
DT   27-MAR-2024, entry version 66.
DE   RecName: Full=Cathepsin B {ECO:0000256|ARBA:ARBA00015559};
DE            EC=3.4.22.1 {ECO:0000256|ARBA:ARBA00012537};
DE   Flags: Fragment;
GN   Name=CTSB {ECO:0000313|Ensembl:ENSAMEP00000006590.2};
GN   ORFNames=PANDA_012896 {ECO:0000313|EMBL:EFB23278.1};
OS   Ailuropoda melanoleuca (Giant panda).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ailuropoda.
OX   NCBI_TaxID=9646 {ECO:0000313|EMBL:EFB23278.1};
RN   [1] {ECO:0000313|EMBL:EFB23278.1, ECO:0000313|Proteomes:UP000008912}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=20010809; DOI=10.1038/nature08696;
RA   Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., Li B.,
RA   Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., Jian M., Li J.,
RA   Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., Ryder O.A.,
RA   Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., Guo X., Wang B.,
RA   Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., Wang G., Yu C., Nie W.,
RA   Wang J., Wu Z., Liang H., Min J., Wu Q., Cheng S., Ruan J., Wang M.,
RA   Shi Z., Wen M., Liu B., Ren X., Zheng H., Dong D., Cook K., Shan G.,
RA   Zhang H., Kosiol C., Xie X., Lu Z., Zheng H., Li Y., Steiner C.C.,
RA   Lam T.T., Lin S., Zhang Q., Li G., Tian J., Gong T., Liu H., Zhang D.,
RA   Fang L., Ye C., Zhang J., Hu W., Xu A., Ren Y., Zhang G., Bruford M.W.,
RA   Li Q., Ma L., Guo Y., An N., Hu Y., Zheng Y., Shi Y., Li Z., Liu Q.,
RA   Chen Y., Zhao J., Qu N., Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X.,
RA   Vinar T., Wang Y., Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y.,
RA   Wang X., Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L.,
RA   Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., Wang J.,
RA   Wang J.;
RT   "The sequence and de novo assembly of the giant panda genome.";
RL   Nature 463:311-317(2010).
RN   [2] {ECO:0000313|Ensembl:ENSAMEP00000006590.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Hydrolysis of proteins with broad specificity for peptide
CC         bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule
CC         substrates (thus differing from cathepsin L). In addition to being an
CC         endopeptidase, shows peptidyl-dipeptidase activity, liberating C-
CC         terminal dipeptides.; EC=3.4.22.1;
CC         Evidence={ECO:0000256|ARBA:ARBA00001754};
CC   -!- SUBCELLULAR LOCATION: Apical cell membrane
CC       {ECO:0000256|ARBA:ARBA00004465}; Peripheral membrane protein
CC       {ECO:0000256|ARBA:ARBA00004465}; Extracellular side
CC       {ECO:0000256|ARBA:ARBA00004465}. Lysosome
CC       {ECO:0000256|ARBA:ARBA00004371}.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; GL193056; EFB23278.1; -; Genomic_DNA.
DR   RefSeq; XP_002923704.1; XM_002923658.3.
DR   STRING; 9646.ENSAMEP00000006590; -.
DR   MEROPS; C01.060; -.
DR   Ensembl; ENSAMET00000006868.2; ENSAMEP00000006590.2; ENSAMEG00000006251.2.
DR   GeneID; 100476830; -.
DR   KEGG; aml:100476830; -.
DR   CTD; 1508; -.
DR   eggNOG; KOG1543; Eukaryota.
DR   GeneTree; ENSGT00940000158680; -.
DR   HOGENOM; CLU_012184_3_3_1; -.
DR   OrthoDB; 808912at2759; -.
DR   TreeFam; TF314576; -.
DR   Proteomes; UP000008912; Unassembled WGS sequence.
DR   GO; GO:0016324; C:apical plasma membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0009897; C:external side of plasma membrane; IEA:Ensembl.
DR   GO; GO:0005615; C:extracellular space; IEA:Ensembl.
DR   GO; GO:0005764; C:lysosome; IEA:UniProtKB-SubCell.
DR   GO; GO:1904090; C:peptidase inhibitor complex; IEA:Ensembl.
DR   GO; GO:0048471; C:perinuclear region of cytoplasm; IEA:Ensembl.
DR   GO; GO:0005518; F:collagen binding; IEA:Ensembl.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:UniProtKB-EC.
DR   GO; GO:0043394; F:proteoglycan binding; IEA:Ensembl.
DR   GO; GO:0097067; P:cellular response to thyroid hormone stimulus; IEA:Ensembl.
DR   GO; GO:0030574; P:collagen catabolic process; IEA:Ensembl.
DR   GO; GO:0046697; P:decidualization; IEA:Ensembl.
DR   GO; GO:0030855; P:epithelial cell differentiation; IEA:Ensembl.
DR   GO; GO:0051603; P:proteolysis involved in protein catabolic process; IEA:Ensembl.
DR   GO; GO:0006590; P:thyroid hormone generation; IEA:Ensembl.
DR   GO; GO:0046718; P:viral entry into host cell; IEA:Ensembl.
DR   CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR012599; Propeptide_C1A.
DR   PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   Pfam; PF08127; Propeptide_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Lysosome {ECO:0000256|ARBA:ARBA00023228};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008912};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..339
FT                   /note="Cathepsin B"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5041198121"
FT   DOMAIN          80..329
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
FT   NON_TER         339
FT                   /evidence="ECO:0000313|EMBL:EFB23278.1"
SQ   SEQUENCE   339 AA;  37688 MW;  5542446C078EE1C0 CRC64;
     MWQLLACLSC LVVLAGAQSR PPFQLLSDEL VNYVNKRNTT WKAGHNFHNV DPSYLRRLCG
     TFLGGPKLPQ RVWFAENMVL PENFDAREQW PNCPTIKEIR DQGSCGSCWA FGAVEAISDR
     ICIRTNGHVN VEVSAEDMLT CCGDQCGDGC NGGFPAEAWN FWTKQGLVSG GLYESHVGCR
     PYSIPPCEHH VNGSRPPCTG EGDTPKCSKF CEPGYTPSYK EDKHYGCSSY SVSSSEKEIM
     AEIYKNGPVE AAFTVYSDFL LYKSGVYQHV TGEMMGGHAV RILGWGVENG TPYWLVGNSW
     NTDWGDNGFF KILRGRDHCG IESEIVAGIP CTDQYWKKI
//