GenomeNet

Database: UniProt
Entry: A0A196S922_BLAHN
LinkDB: A0A196S922_BLAHN
Original site: A0A196S922_BLAHN 
ID   A0A196S922_BLAHN        Unreviewed;       317 AA.
AC   A0A196S922;
DT   05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT   05-OCT-2016, sequence version 1.
DT   27-MAR-2024, entry version 19.
DE   SubName: Full=Cysteine protease {ECO:0000313|EMBL:OAO12856.1};
GN   ORFNames=AV274_5456 {ECO:0000313|EMBL:OAO12856.1};
OS   Blastocystis sp. subtype 1 (strain ATCC 50177 / NandII).
OC   Eukaryota; Sar; Stramenopiles; Bigyra; Opalozoa; Opalinata; Blastocystidae;
OC   Blastocystis.
OX   NCBI_TaxID=478820 {ECO:0000313|EMBL:OAO12856.1, ECO:0000313|Proteomes:UP000078348};
RN   [1] {ECO:0000313|EMBL:OAO12856.1, ECO:0000313|Proteomes:UP000078348}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 50177 / NandII {ECO:0000313|Proteomes:UP000078348};
RA   Gentekaki E., Curtis B., Stairs C., Eme L., Herman E., Klimes V.,
RA   Arias M.C., Elias M., Hilliou F., Klute M., Malik S.-B., Pightling A.,
RA   Rachubinski R., Salas D., Schlacht A., Suga H., Archibald J., Ball S.G.,
RA   Clark G., Dacks J., Van Der Giezen M., Tsaousis A., Roger A.;
RT   "Nuclear genome of Blastocystis sp. subtype 1 NandII.";
RL   Submitted (MAY-2016) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OAO12856.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LXWW01000506; OAO12856.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A196S922; -.
DR   STRING; 478820.A0A196S922; -.
DR   OrthoDB; 5472443at2759; -.
DR   Proteomes; UP000078348; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411:SF741; CATHEPSIN K; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000313|EMBL:OAO12856.1};
KW   Protease {ECO:0000313|EMBL:OAO12856.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000078348};
KW   Signal {ECO:0000256|SAM:SignalP}; Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT   SIGNAL          1..16
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           17..317
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018783596"
FT   DOMAIN          22..78
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          103..316
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   317 AA;  35278 MW;  019D9A832F27CCC9 CRC64;
     MKAIFFVLFA VALSTSLRDS QFFAFKSQYG KHYSSPEEQK YRLAVFNVNL NKIEAHNAKH
     LPWTLGVNKF ADITEEEFAY KFCGCAKDPK QRSDVMTPML GDAPKRVDWR EKGAVTPIKD
     QASCGSCWAF STTGTTEGAY FIHSGELVSL SEQQLVDCAK RPKYEAAGCG GGWPWSVLDY
     VSSHGLCKEE DYPYKGMDQE CHDDACKVAV KSVSKVQLPQ EDEVSLANAV ALTPVSIVLD
     ASAFQFYKGG IITQCTERIN HAVLAVGYDE DESGMKYWIV KNSWGENWGE KGYVRIEKDV
     GGMGRCAITY SSVYPTF
//
DBGET integrated database retrieval system