ID A0CM86_PARTE Unreviewed; 306 AA.
AC A0CM86;
DT 28-NOV-2006, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2006, sequence version 1.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=Chromosome undetermined scaffold_21, whole genome shotgun sequence {ECO:0000313|EMBL:CAK71903.1};
GN ORFNames=GSPATT00008382001 {ECO:0000313|EMBL:CAK71903.1};
OS Paramecium tetraurelia.
OC Eukaryota; Sar; Alveolata; Ciliophora; Intramacronucleata;
OC Oligohymenophorea; Peniculida; Parameciidae; Paramecium.
OX NCBI_TaxID=5888 {ECO:0000313|EMBL:CAK71903.1, ECO:0000313|Proteomes:UP000000600};
RN [1] {ECO:0000313|EMBL:CAK71903.1, ECO:0000313|Proteomes:UP000000600}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Stock d4-2 {ECO:0000313|EMBL:CAK71903.1,
RC ECO:0000313|Proteomes:UP000000600};
RX PubMed=17086204; DOI=10.1038/nature05230;
RG Genoscope;
RA Aury J.-M., Jaillon O., Duret L., Noel B., Jubin C., Porcel B.M.,
RA Segurens B., Daubin V., Anthouard V., Aiach N., Arnaiz O., Billaut A.,
RA Beisson J., Blanc I., Bouhouche K., Camara F., Duharcourt S., Guigo R.,
RA Gogendeau D., Katinka M., Keller A.-M., Kissmehl R., Klotz C., Koll F.,
RA Le Moue A., Lepere C., Malinsky S., Nowacki M., Nowak J.K., Plattner H.,
RA Poulain J., Ruiz F., Serrano V., Zagulski M., Dessen P., Betermier M.,
RA Weissenbach J., Scarpelli C., Schachter V., Sperling L., Meyer E.,
RA Cohen J., Wincker P.;
RT "Global trends of whole-genome duplications revealed by the ciliate
RT Paramecium tetraurelia.";
RL Nature 444:171-178(2006).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CT868108; CAK71903.1; -; Genomic_DNA.
DR RefSeq; XP_001439300.1; XM_001439263.1.
DR AlphaFoldDB; A0CM86; -.
DR STRING; 5888.A0CM86; -.
DR MEROPS; C01.099; -.
DR EnsemblProtists; CAK71903; CAK71903; GSPATT00008382001.
DR GeneID; 5025085; -.
DR KEGG; ptm:GSPATT00008382001; -.
DR eggNOG; KOG1543; Eukaryota.
DR HOGENOM; CLU_012184_1_3_1; -.
DR InParanoid; A0CM86; -.
DR OMA; SGFMFYS; -.
DR Proteomes; UP000000600; Partially assembled WGS sequence.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IBA:GO_Central.
DR GO; GO:0051603; P:proteolysis involved in protein catabolic process; IBA:GO_Central.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF642; PRO-CATHEPSIN H; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000000600};
KW Signal {ECO:0000256|SAM:SignalP}; Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..306
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018771801"
FT DOMAIN 29..85
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 108..306
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 306 AA; 35083 MW; 15805C9C9F557039 CRC64;
MQKILSLLTT ALLLSGLAFY SQNEESHSFK TWQKKYNKFY SSSEEAYRQI IFNQNVELIN
KHNSNPNKSY SMAINQFVDL TREEFQAIYL GKSTIVKTEN IELSARKNFE AVDWSSKLFP
IKDQFNCGSY WIFSAVGAVE AFLRVKKVLK WSLSEQQLVD CADSWGCYGG DADFALDYIV
NTGIVYELDY PYKGREGFCK VRREGEVKIS GRERIGSNED DIKQKVQEYP VSASVDCQGW
AYYSKGIFDE GCTDHRSNHD VVIVGFDKDG NWKIRNSWGV GWGEQGYMWL KSGNTCGIMN
RVDRAI
//