ID A0A6G0ICQ4_LARCR Unreviewed; 337 AA.
AC A0A6G0ICQ4;
DT 12-AUG-2020, integrated into UniProtKB/TrEMBL.
DT 12-AUG-2020, sequence version 1.
DT 27-MAR-2024, entry version 8.
DE RecName: Full=Cathepsin S {ECO:0000256|ARBA:ARBA00039679};
DE EC=3.4.22.27 {ECO:0000256|ARBA:ARBA00038916};
GN ORFNames=D5F01_LYC13158 {ECO:0000313|EMBL:KAE8289275.1};
OS Larimichthys crocea (Large yellow croaker) (Pseudosciaena crocea).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Sciaenidae; Larimichthys.
OX NCBI_TaxID=215358 {ECO:0000313|EMBL:KAE8289275.1, ECO:0000313|Proteomes:UP000424527};
RN [1] {ECO:0000313|EMBL:KAE8289275.1, ECO:0000313|Proteomes:UP000424527}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JMULYC20181020 {ECO:0000313|EMBL:KAE8289275.1};
RC TISSUE=Muscle {ECO:0000313|EMBL:KAE8289275.1};
RA Xiao S.;
RT "Chromosome genome assembly for large yellow croaker.";
RL Submitted (JUL-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Similar to cathepsin L, but with much less activity on Z-Phe-
CC Arg-|-NHMec, and more activity on the Z-Val-Val-Arg-|-Xaa compound.;
CC EC=3.4.22.27; Evidence={ECO:0000256|ARBA:ARBA00035956};
CC -!- SUBCELLULAR LOCATION: Cytoplasmic vesicle, phagosome
CC {ECO:0000256|ARBA:ARBA00004262}. Secreted
CC {ECO:0000256|ARBA:ARBA00004613}.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KAE8289275.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; REGW02000012; KAE8289275.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A6G0ICQ4; -.
DR Proteomes; UP000424527; Unassembled WGS sequence.
DR GO; GO:0045335; C:phagocytic vesicle; IEA:UniProtKB-SubCell.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF525; CATHEPSIN S; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Cytoplasmic vesicle {ECO:0000256|ARBA:ARBA00023329};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000424527};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..337
FT /note="Cathepsin S"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5026149708"
FT DOMAIN 34..94
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 122..336
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 337 AA; 37015 MW; A122B86A9A23F4AA CRC64;
MSPTDQSLML GSLLLFSLCA GAAAMFDSKL DVHWLMWKKT HGKMYKNDVE DESRRELWEK
NLVLITMHNL EASMGFHTYK LSMNHMGDLT PEEIMQSFAT LTPPADIQRA PSPFAGASGA
AAPDTMDWRE KGCVTSVKMQ GACGSCWAFS AAGALEGQLA KMTGKLVDLS PQNLVDCSTK
YGNHGCNGGF MHKAFQYVID NHGIDSDAAY PYTGRSQECR YSPRFRAANC SQYSFLPEGD
EGALKEALAT IGPISVAIDA RRPRFAFYSS GVYDDPSCSQ DVNHGVLAVG YGTLNGQDYW
LVKNSWGVKF GENGYIRMAR NKNDQCGIAL YGCYPIM
//