GenomeNet

Database: UniProt
Entry: A0A3Q0KH54_SCHMA
LinkDB: A0A3Q0KH54_SCHMA
Original site: A0A3Q0KH54_SCHMA 
ID   A0A3Q0KH54_SCHMA        Unreviewed;       340 AA.
AC   A0A3Q0KH54;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   13-FEB-2019, sequence version 1.
DT   27-MAR-2024, entry version 20.
DE   SubName: Full=Cathepsin B-like peptidase (C01 family) {ECO:0000313|WBParaSite:Smp_067060.1};
OS   Schistosoma mansoni (Blood fluke).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC   Digenea; Strigeidida; Schistosomatoidea; Schistosomatidae; Schistosoma.
OX   NCBI_TaxID=6183 {ECO:0000313|Proteomes:UP000008854, ECO:0000313|WBParaSite:Smp_067060.1};
RN   [1] {ECO:0000313|Proteomes:UP000008854}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Puerto Rican {ECO:0000313|Proteomes:UP000008854};
RX   PubMed=22253936; DOI=10.1371/journal.pntd.0001455;
RA   Protasio A.V., Tsai I.J., Babbage A., Nichol S., Hunt M., Aslett M.A.,
RA   De Silva N., Velarde G.S., Anderson T.J., Clark R.C., Davidson C.,
RA   Dillon G.P., Holroyd N.E., LoVerde P.T., Lloyd C., McQuillan J.,
RA   Oliveira G., Otto T.D., Parker-Manuel S.J., Quail M.A., Wilson R.A.,
RA   Zerlotini A., Dunne D.W., Berriman M.;
RT   "A systematically improved high quality genome and transcriptome of the
RT   human blood fluke Schistosoma mansoni.";
RL   PLoS Negl. Trop. Dis. 6:E1455-E1455(2012).
RN   [2] {ECO:0000313|WBParaSite:Smp_067060.1}
RP   IDENTIFICATION.
RC   STRAIN=Puerto Rican {ECO:0000313|WBParaSite:Smp_067060.1};
RG   WormBaseParasite;
RL   Submitted (DEC-2018) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A3Q0KH54; -.
DR   STRING; 6183.A0A3Q0KH54; -.
DR   EnsemblMetazoa; Smp_067060.1; Smp_067060.1; Smp_067060.
DR   WBParaSite; Smp_067060.1; Smp_067060.1; Smp_067060.
DR   InParanoid; A0A3Q0KH54; -.
DR   Proteomes; UP000008854; Unassembled WGS sequence.
DR   ExpressionAtlas; A0A3Q0KH54; baseline.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR012599; Propeptide_C1A.
DR   PANTHER; PTHR12411:SF1000; CATHEPSIN B1, ISOFORM A; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   Pfam; PF08127; Propeptide_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008854};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..17
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           18..340
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018601151"
FT   DOMAIN          89..338
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   340 AA;  38540 MW;  5454DE7B59FEB150 CRC64;
     MLTSVLCIAS LITHLDAHIS IKNEKFKPLS DDIISYINEH PNAGWRAEKS NRFHSLDDAR
     IQMGARREEP DLRRKRRPTV DHNEWNVEIP SNFDSRKKWP GCKSIATIRD QSRCGSCWAF
     GAVEAMSDRS CIQSGGKQNV ELSAVDLLSC CESCGLGCEG GILGPAWDFW VKEGIVTGSS
     KENHTGCEPY PFPKCEHHTK GKYPPCGSKI YKTPRCKQTC QKKYKTPYTQ DKHRGKSSYN
     VKNDEKAIQK EIMKYGPVEA SFTVYEDFLN YKSGIYKHIT GEALGGHAIR IIGWGVENKT
     PYWLIANSWN EDWGENGYFR IVRGRDECFI ESEVIAGQIN
//
DBGET integrated database retrieval system