GenomeNet

Database: UniProt
Entry: K7F689_PELSI
ID   K7F689_PELSI            Unreviewed;       330 AA.
AC   K7F689;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   27-MAR-2024, entry version 60.
DE   RecName: Full=Cathepsin B {ECO:0000256|ARBA:ARBA00015559};
DE            EC=3.4.22.1 {ECO:0000256|ARBA:ARBA00012537};
GN   Name=CTSB {ECO:0000313|Ensembl:ENSPSIP00000003549.1};
OS   Pelodiscus sinensis (Chinese softshell turtle) (Trionyx sinensis).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Testudinata; Testudines; Cryptodira; Trionychia;
OC   Trionychidae; Pelodiscus.
OX   NCBI_TaxID=13735 {ECO:0000313|Ensembl:ENSPSIP00000003549.1, ECO:0000313|Proteomes:UP000007267};
RN   [1] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RG   Soft-shell Turtle Genome Consortium;
RL   Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000007267}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Daiwa-1 {ECO:0000313|Proteomes:UP000007267};
RX   PubMed=23624526; DOI=10.1038/ng.2615;
RA   Wang Z., Pascual-Anaya J., Zadissa A., Li W., Niimura Y., Huang Z., Li C.,
RA   White S., Xiong Z., Fang D., Wang B., Ming Y., Chen Y., Zheng Y.,
RA   Kuraku S., Pignatelli M., Herrero J., Beal K., Nozawa M., Li Q., Wang J.,
RA   Zhang H., Yu L., Shigenobu S., Wang J., Liu J., Flicek P., Searle S.,
RA   Wang J., Kuratani S., Yin Y., Aken B., Zhang G., Irie N.;
RT   "The draft genomes of soft-shell turtle and green sea turtle yield insights
RT   into the development and evolution of the turtle-specific body plan.";
RL   Nat. Genet. 45:701-706(2013).
RN   [3] {ECO:0000313|Ensembl:ENSPSIP00000003549.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Hydrolysis of proteins with broad specificity for peptide
CC         bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule
CC         substrates (thus differing from cathepsin L). In addition to being an
CC         endopeptidase, shows peptidyl-dipeptidase activity, liberating C-
CC         terminal dipeptides.; EC=3.4.22.1;
CC         Evidence={ECO:0000256|ARBA:ARBA00001754};
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGCU01038101; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01038102; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01038103; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01038104; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01038105; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01038106; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01038107; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01038108; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01038109; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AGCU01038110; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; K7F689; -.
DR   STRING; 13735.ENSPSIP00000003549; -.
DR   Ensembl; ENSPSIT00000003567.1; ENSPSIP00000003549.1; ENSPSIG00000003377.1.
DR   eggNOG; KOG1543; Eukaryota.
DR   GeneTree; ENSGT00940000158680; -.
DR   HOGENOM; CLU_012184_3_3_1; -.
DR   OMA; DEKIPYW; -.
DR   TreeFam; TF314576; -.
DR   Proteomes; UP000007267; Unassembled WGS sequence.
DR   GO; GO:0009897; C:external side of plasma membrane; IEA:Ensembl.
DR   GO; GO:0005615; C:extracellular space; IEA:Ensembl.
DR   GO; GO:0005764; C:lysosome; IEA:Ensembl.
DR   GO; GO:1904090; C:peptidase inhibitor complex; IEA:Ensembl.
DR   GO; GO:0048471; C:perinuclear region of cytoplasm; IEA:Ensembl.
DR   GO; GO:0005518; F:collagen binding; IEA:Ensembl.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:UniProtKB-EC.
DR   GO; GO:0043394; F:proteoglycan binding; IEA:Ensembl.
DR   GO; GO:0097067; P:cellular response to thyroid hormone stimulus; IEA:Ensembl.
DR   GO; GO:0030574; P:collagen catabolic process; IEA:Ensembl.
DR   GO; GO:0030855; P:epithelial cell differentiation; IEA:Ensembl.
DR   GO; GO:0051603; P:proteolysis involved in protein catabolic process; IEA:Ensembl.
DR   GO; GO:0006590; P:thyroid hormone generation; IEA:Ensembl.
DR   GO; GO:0046718; P:viral entry into host cell; IEA:Ensembl.
DR   CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR012599; Propeptide_C1A.
DR   PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   Pfam; PF08127; Propeptide_C1; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
PE   3: Inferred from homology;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007267};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..16
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           17..330
FT                   /note="Cathepsin B"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018739329"
FT   DOMAIN          79..324
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   330 AA;  36029 MW;  91D44B5741D2CA77 CRC64;
     PWSLAALFVL VALASARSVP HFAPLSPDLV NYINKLNTTW QAGHNFRNAD LSYVKQLCGT
     FLHGPKLPVR AEFAGDLNLP DSFDSRKQWP NCPTINEIRD QGSCGSCWAF GAVEAISDRV
     CVHTNGKMNV EISAEDLLSC CGFECGMGGG CSRPTAKDTG ASLRAEGLES TALGLSHPAG
     CRPYSIPPCE HHVNGSRPPC TGEQGDTPKC DQHCEAGYSP SYETDKHFGA TSYNVPRSEK
     EIMAEIYKNG PVEGAFSVYE DFLMYKSGKR GRDVPGGARS WGGSSPAYWQ PAQGWERDWR
     ERCFFKILRG QDHCGIESEI GSGQPRTEPE
//
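
The FT lines above give feature coordinates as 1-based, inclusive residue positions (SIGNAL 1..16, CHAIN 17..330, DOMAIN 79..324). A minimal sketch of mapping those coordinates onto the SQ block, assuming the sequence lines are pasted verbatim as they appear in this entry; the names `RAW_SQ`, `SEQ`, and `feature_slice` are hypothetical helpers introduced only for illustration and are not part of UniProt or DBGET:

```python
# Sequence lines copied verbatim from the SQ block of this entry.
RAW_SQ = """
     PWSLAALFVL VALASARSVP HFAPLSPDLV NYINKLNTTW QAGHNFRNAD LSYVKQLCGT
     FLHGPKLPVR AEFAGDLNLP DSFDSRKQWP NCPTINEIRD QGSCGSCWAF GAVEAISDRV
     CVHTNGKMNV EISAEDLLSC CGFECGMGGG CSRPTAKDTG ASLRAEGLES TALGLSHPAG
     CRPYSIPPCE HHVNGSRPPC TGEQGDTPKC DQHCEAGYSP SYETDKHFGA TSYNVPRSEK
     EIMAEIYKNG PVEGAFSVYE DFLMYKSGKR GRDVPGGARS WGGSSPAYWQ PAQGWERDWR
     ERCFFKILRG QDHCGIESEI GSGQPRTEPE
"""

# Drop the spaces and line breaks that the flat-file layout adds.
SEQ = "".join(RAW_SQ.split())


def feature_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based and inclusive."""
    return seq[start - 1:end]


signal = feature_slice(SEQ, 1, 16)    # FT SIGNAL 1..16
chain = feature_slice(SEQ, 17, 330)   # FT CHAIN 17..330
domain = feature_slice(SEQ, 79, 324)  # FT DOMAIN 79..324

print(len(SEQ), len(signal), len(chain), len(domain))
```

Because the bounds are inclusive, a feature spanning `a..b` covers `b - a + 1` residues, and the SIGNAL and CHAIN slices together reconstruct the full sequence declared on the SQ line.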