GenomeNet

Database: UniProt
Entry: A0A093S2V1_9PASS
LinkDB: A0A093S2V1_9PASS
Original site: A0A093S2V1_9PASS 
ID   A0A093S2V1_9PASS        Unreviewed;       340 AA.
AC   A0A093S2V1;
DT   26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT   26-NOV-2014, sequence version 1.
DT   27-MAR-2024, entry version 39.
DE   RecName: Full=Cathepsin B {ECO:0000256|ARBA:ARBA00015559};
DE            EC=3.4.22.1 {ECO:0000256|ARBA:ARBA00012537};
GN   ORFNames=N305_10461 {ECO:0000313|EMBL:KFW77294.1};
OS   Manacus vitellinus (golden-collared manakin).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Passeriformes; Pipridae; Manacus.
OX   NCBI_TaxID=328815 {ECO:0000313|EMBL:KFW77294.1, ECO:0000313|Proteomes:UP000053258};
RN   [1] {ECO:0000313|EMBL:KFW77294.1, ECO:0000313|Proteomes:UP000053258}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_N305 {ECO:0000313|EMBL:KFW77294.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (JUN-2014) to the EMBL/GenBank/DDBJ databases.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Hydrolysis of proteins with broad specificity for peptide
CC         bonds. Preferentially cleaves -Arg-Arg-|-Xaa bonds in small molecule
CC         substrates (thus differing from cathepsin L). In addition to being an
CC         endopeptidase, shows peptidyl-dipeptidase activity, liberating C-
CC         terminal dipeptides.; EC=3.4.22.1;
CC         Evidence={ECO:0000256|ARBA:ARBA00001754};
CC   -!- SUBCELLULAR LOCATION: Lysosome {ECO:0000256|ARBA:ARBA00004371}.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KL669857; KFW77294.1; -; Genomic_DNA.
DR   RefSeq; XP_008917711.1; XM_008919463.2.
DR   AlphaFoldDB; A0A093S2V1; -.
DR   STRING; 328815.ENSMVIP00005019795; -.
DR   MEROPS; C01.060; -.
DR   GeneID; 103753723; -.
DR   CTD; 1508; -.
DR   OrthoDB; 808912at2759; -.
DR   Proteomes; UP000053258; Unassembled WGS sequence.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:UniProtKB-EC.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR012599; Propeptide_C1A.
DR   PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   Pfam; PF08127; Propeptide_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053258};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..17
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           18..340
FT                   /note="Cathepsin B"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018666920"
FT   DOMAIN          80..330
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   340 AA;  37597 MW;  037E640CFE2E6A95 CRC64;
     MWPSVSILCV LVAFANARSI PYFPPLSSDL VNHINKLNTT WKAGHNFPNA DMSYVKKLCG
     TFLGGAKLPE RVDFAADMEL PDNFDSRTQW PNCPTISEIR DQGSCGSCWA FGAVEAISDR
     ICVHTNAKVS VEVSAEDLLS CCGFECGMGC NGGYPSGAWR YWTERGLVSG GLYDSHVGCR
     PYSIPPCEHH VNGSRPPCTG EGGGTPRCSR HCEPGYSPSY KEDKHYGITS YGVPRSEKEI
     MAEIYKNGPV EGAFIVYEDF LMYKSGVYQH VSGEQVGGHA IRILGWGVEN DTPYWLVANS
     WNTDWGDNGF FKILRGEDHC GIESEVVAGI PRTEQYWKRM
//
DBGET integrated database retrieval system