Database: UniProt
Entry: A0A6G0ICQ4_LARCR
ID   A0A6G0ICQ4_LARCR        Unreviewed;       337 AA.
AC   A0A6G0ICQ4;
DT   12-AUG-2020, integrated into UniProtKB/TrEMBL.
DT   12-AUG-2020, sequence version 1.
DT   27-MAR-2024, entry version 8.
DE   RecName: Full=Cathepsin S {ECO:0000256|ARBA:ARBA00039679};
DE            EC=3.4.22.27 {ECO:0000256|ARBA:ARBA00038916};
GN   ORFNames=D5F01_LYC13158 {ECO:0000313|EMBL:KAE8289275.1};
OS   Larimichthys crocea (Large yellow croaker) (Pseudosciaena crocea).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Sciaenidae; Larimichthys.
OX   NCBI_TaxID=215358 {ECO:0000313|EMBL:KAE8289275.1, ECO:0000313|Proteomes:UP000424527};
RN   [1] {ECO:0000313|EMBL:KAE8289275.1, ECO:0000313|Proteomes:UP000424527}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=JMULYC20181020 {ECO:0000313|EMBL:KAE8289275.1};
RC   TISSUE=Muscle {ECO:0000313|EMBL:KAE8289275.1};
RA   Xiao S.;
RT   "Chromosome genome assembly for large yellow croaker.";
RL   Submitted (JUL-2019) to the EMBL/GenBank/DDBJ databases.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Similar to cathepsin L, but with much less activity on Z-Phe-
CC         Arg-|-NHMec, and more activity on the Z-Val-Val-Arg-|-Xaa compound.;
CC         EC=3.4.22.27; Evidence={ECO:0000256|ARBA:ARBA00035956};
CC   -!- SUBCELLULAR LOCATION: Cytoplasmic vesicle, phagosome
CC       {ECO:0000256|ARBA:ARBA00004262}. Secreted
CC       {ECO:0000256|ARBA:ARBA00004613}.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KAE8289275.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; REGW02000012; KAE8289275.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A6G0ICQ4; -.
DR   Proteomes; UP000424527; Unassembled WGS sequence.
DR   GO; GO:0045335; C:phagocytic vesicle; IEA:UniProtKB-SubCell.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411:SF525; CATHEPSIN S; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Cytoplasmic vesicle {ECO:0000256|ARBA:ARBA00023329};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000424527};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW   Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           25..337
FT                   /note="Cathepsin S"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5026149708"
FT   DOMAIN          34..94
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          122..336
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   337 AA;  37015 MW;  A122B86A9A23F4AA CRC64;
     MSPTDQSLML GSLLLFSLCA GAAAMFDSKL DVHWLMWKKT HGKMYKNDVE DESRRELWEK
     NLVLITMHNL EASMGFHTYK LSMNHMGDLT PEEIMQSFAT LTPPADIQRA PSPFAGASGA
     AAPDTMDWRE KGCVTSVKMQ GACGSCWAFS AAGALEGQLA KMTGKLVDLS PQNLVDCSTK
     YGNHGCNGGF MHKAFQYVID NHGIDSDAAY PYTGRSQECR YSPRFRAANC SQYSFLPEGD
     EGALKEALAT IGPISVAIDA RRPRFAFYSS GVYDDPSCSQ DVNHGVLAVG YGTLNGQDYW
     LVKNSWGVKF GENGYIRMAR NKNDQCGIAL YGCYPIM
//
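
The FT lines above give feature positions as 1-based, inclusive residue ranges into the SQ block. The following is a minimal sketch of how those ranges map onto the sequence, assuming the SQ lines are copied verbatim from this entry. The names RAW_SQ, FEATURES and feature_sequence are illustrative only and are not part of any UniProt tool or API.

```python
# Sequence lines copied verbatim from the SQ block of this entry.
RAW_SQ = """
     MSPTDQSLML GSLLLFSLCA GAAAMFDSKL DVHWLMWKKT HGKMYKNDVE DESRRELWEK
     NLVLITMHNL EASMGFHTYK LSMNHMGDLT PEEIMQSFAT LTPPADIQRA PSPFAGASGA
     AAPDTMDWRE KGCVTSVKMQ GACGSCWAFS AAGALEGQLA KMTGKLVDLS PQNLVDCSTK
     YGNHGCNGGF MHKAFQYVID NHGIDSDAAY PYTGRSQECR YSPRFRAANC SQYSFLPEGD
     EGALKEALAT IGPISVAIDA RRPRFAFYSS GVYDDPSCSQ DVNHGVLAVG YGTLNGQDYW
     LVKNSWGVKF GENGYIRMAR NKNDQCGIAL YGCYPIM
"""

# Drop the layout whitespace (10-residue blocks, line breaks) to get one string.
SEQ = "".join(RAW_SQ.split())

# Feature table transcribed from the FT lines: (key, start, end, note).
FEATURES = [
    ("SIGNAL", 1, 24, ""),
    ("CHAIN", 25, 337, "Cathepsin S"),
    ("DOMAIN", 34, 94, "Cathepsin propeptide inhibitor"),
    ("DOMAIN", 122, 336, "Peptidase C1A papain C-terminal"),
]


def feature_sequence(seq: str, start: int, end: int) -> str:
    """Return the residues of a feature given 1-based inclusive coordinates."""
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return seq[start - 1:end]


if __name__ == "__main__":
    print(f"total length: {len(SEQ)} AA")
    for key, start, end, note in FEATURES:
        sub = feature_sequence(SEQ, start, end)
        print(f"{key:<7} {start:>3}..{end:<3} {len(sub):>3} AA  {note}")
```

The only subtlety is the off-by-one conversion between the 1-based inclusive FT ranges and Python's 0-based, end-exclusive slices. The total length printed should agree with the 337 AA stated on the ID and SQ lines.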