ID Q23GZ4_TETTS Unreviewed; 336 AA.
AC Q23GZ4;
DT 18-APR-2006, integrated into UniProtKB/TrEMBL.
DT 18-APR-2006, sequence version 1.
DT 27-MAR-2024, entry version 80.
DE SubName: Full=Papain family cysteine protease {ECO:0000313|EMBL:EAR95853.1};
GN ORFNames=TTHERM_00881440 {ECO:0000313|EMBL:EAR95853.1};
OS Tetrahymena thermophila (strain SB210).
OC Eukaryota; Sar; Alveolata; Ciliophora; Intramacronucleata;
OC Oligohymenophorea; Hymenostomatida; Tetrahymenina; Tetrahymenidae;
OC Tetrahymena.
OX NCBI_TaxID=312017 {ECO:0000313|EMBL:EAR95853.1, ECO:0000313|Proteomes:UP000009168};
RN [1] {ECO:0000313|Proteomes:UP000009168}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SB210 {ECO:0000313|Proteomes:UP000009168};
RX PubMed=16933976; DOI=10.1371/journal.pbio.0040286;
RA Eisen J.A., Coyne R.S., Wu M., Wu D., Thiagarajan M., Wortman J.R.,
RA Badger J.H., Ren Q., Amedeo P., Jones K.M., Tallon L.J., Delcher A.L.,
RA Salzberg S.L., Silva J.C., Haas B.J., Majoros W.H., Farzad M.,
RA Carlton J.M., Smith R.K. Jr., Garg J., Pearlman R.E., Karrer K.M., Sun L.,
RA Manning G., Elde N.C., Turkewitz A.P., Asai D.J., Wilkes D.E., Wang Y.,
RA Cai H., Collins K., Stewart B.A., Lee S.R., Wilamowska K., Weinberg Z.,
RA Ruzzo W.L., Wloga D., Gaertig J., Frankel J., Tsao C.-C., Gorovsky M.A.,
RA Keeling P.J., Waller R.F., Patron N.J., Cherry J.M., Stover N.A.,
RA Krieger C.J., del Toro C., Ryder H.F., Williamson S.C., Barbeau R.A.,
RA Hamilton E.P., Orias E.;
RT "Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a
RT model eukaryote.";
RL PLoS Biol. 4:1620-1642(2006).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG662702; EAR95853.1; -; Genomic_DNA.
DR RefSeq; XP_001016098.1; XM_001016098.3.
DR AlphaFoldDB; Q23GZ4; -.
DR SMR; Q23GZ4; -.
DR STRING; 312017.Q23GZ4; -.
DR MEROPS; C01.A54; -.
DR MEROPS; I29.003; -.
DR EnsemblProtists; EAR95853; EAR95853; TTHERM_00881440.
DR GeneID; 7839566; -.
DR KEGG; tet:TTHERM_00881440; -.
DR eggNOG; KOG1543; Eukaryota.
DR HOGENOM; CLU_012184_1_0_1; -.
DR InParanoid; Q23GZ4; -.
DR OMA; CAKRLDH; -.
DR OrthoDB; 808912at2759; -.
DR Proteomes; UP000009168; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF642; PRO-CATHEPSIN H; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000313|EMBL:EAR95853.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000009168};
KW Signal {ECO:0000256|SAM:SignalP}; Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..336
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018682660"
FT DOMAIN 30..87
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 127..335
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 336 AA; 37759 MW; 2DBCA6C2C5B2734C CRC64;
MNKKFIILSI IMLMPLCLAQ DISVEKLLAY NKWSSQNQRA YLNEDEKLYR QIVFFENLQK
IKEHNSNPNN TYSIHLNQFS DMTREEFAEK ILMKQDLIND YMKGIGQQAT HNNANNETQM
NSQNHTLAAS IDWRTKGAVT SVKDQGQCGS CWSFSAAALM ESFNFIQNKA LVNFSEQQLV
DCVTPENGYP SYGCKGGWPA TCLDYASKVG ITTLDKYPYV AVQKNCTVTG TNNGFKLKKW
IVIPNTSNDL KSALNFSPVS VLVDATNWDY YSSGIFNGCN QTNINLNHAV LAVGYDEKDN
WIVKNSWSAG WGEHGYIRLA PNNTCGILSS NIQVTA
//