GenomeNet

Database: UniProt
Entry: G3PUV2_GASAC
LinkDB: G3PUV2_GASAC
Original site: G3PUV2_GASAC 
ID   G3PUV2_GASAC            Unreviewed;       330 AA.
AC   G3PUV2;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   16-NOV-2011, sequence version 1.
DT   27-MAR-2024, entry version 57.
DE   RecName: Full=Cathepsin B {ECO:0000256|ARBA:ARBA00015559};
DE            EC=3.4.22.1 {ECO:0000256|ARBA:ARBA00012537};
OS   Gasterosteus aculeatus (Three-spined stickleback).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC   Gasterosteus.
OX   NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000021389.1, ECO:0000313|Proteomes:UP000007635};
RN   [1] {ECO:0000313|Ensembl:ENSGACP00000021389.1, ECO:0000313|Proteomes:UP000007635}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL   Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSGACP00000021389.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Hydrolysis of proteins with broad specificity for peptide
CC         bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule
CC         substrates (thus differing from cathepsin L). In addition to being an
CC         endopeptidase, shows peptidyl-dipeptidase activity, liberating C-
CC         terminal dipeptides.; EC=3.4.22.1;
CC         Evidence={ECO:0000256|ARBA:ARBA00001754};
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; G3PUV2; -.
DR   Ensembl; ENSGACT00000021430.1; ENSGACP00000021389.1; ENSGACG00000016202.1.
DR   GeneTree; ENSGT00940000158680; -.
DR   Proteomes; UP000007635; Unassembled WGS sequence.
DR   Bgee; ENSGACG00000016202; Expressed in spleen and 13 other cell types or tissues.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:UniProtKB-EC.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR012599; Propeptide_C1A.
DR   PANTHER; PTHR12411:SF978; CATHEPSIN B; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   Pfam; PF08127; Propeptide_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..330
FT                   /note="Cathepsin B"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018792930"
FT   DOMAIN          79..328
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   330 AA;  36100 MW;  2EA2EBF91C8477D2 CRC64;
     MWHSAFLLLA ASLSVSLARP RFTPLSSEMV NYINKVNTTW KAGHNFHNVD YSYVKRLCGT
     LLKGPKLPVM VQYAGDLNLP KNFDSRQQWP DCPTLQEVRD QGSCGSCWAF GAAEAISDRV
     CIHSNAKVNV EISSEDLLSC CESCGMGCNG GYPSAAWDFW THEGLVSGGL YDSHIGCRPY
     TIPPCEHHVN GSRPSCSGEG GDTPQCVHQC EAGYTPSYKG DKHFGKTSYT VLSDEKQIQS
     EISKNGPVEA AFTVYEDFVL YKSGVYQHVS GSAVGGHAIK MLGWGEEDGV PYWLCANSWN
     TDWGDNGFFK ILRGSDHCGI ESEIVAGIPK
//
DBGET integrated database retrieval system