GenomeNet

Database: UniProt
Entry: G3PUU4_GASAC
LinkDB: G3PUU4_GASAC
Original site: G3PUU4_GASAC 
ID   G3PUU4_GASAC            Unreviewed;       331 AA.
AC   G3PUU4;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   16-NOV-2011, sequence version 1.
DT   27-MAR-2024, entry version 57.
DE   RecName: Full=Cathepsin B {ECO:0000256|ARBA:ARBA00015559};
DE            EC=3.4.22.1 {ECO:0000256|ARBA:ARBA00012537};
OS   Gasterosteus aculeatus (Three-spined stickleback).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC   Gasterosteus.
OX   NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000021381.1, ECO:0000313|Proteomes:UP000007635};
RN   [1] {ECO:0000313|Ensembl:ENSGACP00000021381.1, ECO:0000313|Proteomes:UP000007635}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL   Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSGACP00000021381.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Hydrolysis of proteins with broad specificity for peptide
CC         bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule
CC         substrates (thus differing from cathepsin L). In addition to being an
CC         endopeptidase, shows peptidyl-dipeptidase activity, liberating C-
CC         terminal dipeptides.; EC=3.4.22.1;
CC         Evidence={ECO:0000256|ARBA:ARBA00001754};
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; G3PUU4; -.
DR   Ensembl; ENSGACT00000021422.1; ENSGACP00000021381.1; ENSGACG00000016202.1.
DR   GeneTree; ENSGT00940000158680; -.
DR   Proteomes; UP000007635; Unassembled WGS sequence.
DR   Bgee; ENSGACG00000016202; Expressed in spleen and 13 other cell types or tissues.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:UniProtKB-EC.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR012599; Propeptide_C1A.
DR   PANTHER; PTHR12411:SF978; CATHEPSIN B; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   Pfam; PF08127; Propeptide_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..331
FT                   /note="Cathepsin B"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018535403"
FT   DOMAIN          79..328
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   331 AA;  36200 MW;  CB09D2EBF91C8477 CRC64;
     MWHSAFLLLA ASLSVSLARP RFTPLSSEMV NYINKVNTTW KAGHNFHNVD YSYVKRLCGT
     LLKGPKLPVM VQYAGDLNLP KNFDSRQQWP DCPTLQEVRD QGSCGSCWAF GAAEAISDRV
     CIHSNAKVNV EISSEDLLSC CESCGMGCNG GYPSAAWDFW THEGLVSGGL YDSHIGCRPY
     TIPPCEHHVN GSRPSCSGEG GDTPQCVHQC EAGYTPSYKG DKHFGKTSYT VLSDEKQIQS
     EISKNGPVEA AFTVYEDFVL YKSGVYQHVS GSAVGGHAIK MLGWGEEDGV PYWLCANSWN
     TDWGDNGFFK ILRGSDHCGI ESEIVAGIPN L
//
DBGET integrated database retrieval system