ID A0A196SN22_BLAHN Unreviewed; 670 AA.
AC A0A196SN22;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Xylem cysteine proteinase 2 {ECO:0000313|EMBL:OAO17264.1};
GN ORFNames=AV274_0975 {ECO:0000313|EMBL:OAO17264.1};
OS Blastocystis sp. subtype 1 (strain ATCC 50177 / NandII).
OC Eukaryota; Sar; Stramenopiles; Bigyra; Opalozoa; Opalinata; Blastocystidae;
OC Blastocystis.
OX NCBI_TaxID=478820 {ECO:0000313|EMBL:OAO17264.1, ECO:0000313|Proteomes:UP000078348};
RN [1] {ECO:0000313|EMBL:OAO17264.1, ECO:0000313|Proteomes:UP000078348}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50177 / NandII {ECO:0000313|Proteomes:UP000078348};
RA Gentekaki E., Curtis B., Stairs C., Eme L., Herman E., Klimes V.,
RA Arias M.C., Elias M., Hilliou F., Klute M., Malik S.-B., Pightling A.,
RA Rachubinski R., Salas D., Schlacht A., Suga H., Archibald J., Ball S.G.,
RA Clark G., Dacks J., Van Der Giezen M., Tsaousis A., Roger A.;
RT "Nuclear genome of Blastocystis sp. subtype 1 NandII.";
RL Submitted (MAY-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OAO17264.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LXWW01000037; OAO17264.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A196SN22; -.
DR STRING; 478820.A0A196SN22; -.
DR OrthoDB; 106254at2759; -.
DR Proteomes; UP000078348; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR PANTHER; PTHR36489:SF1; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR36489; PROTEIN-COUPLED RECEPTOR GPR1, PUTATIVE-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000078348};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..670
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008274730"
FT DOMAIN 261..453
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT REGION 114..137
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 495..670
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 495..647
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 670 AA; 74560 MW; 02CA4E395C4E0F48 CRC64;
MKSSFIFALV IAAALACEDG FMEVHFSRVS GDAQCQVRLS NEAMKTNGLV LVDKFTAYPM
AYCLNPGYYY IGCNGAATVN VKYLTYNKDH DLTAAPHYKV LKLDQVKEQW TSVPSGHKTR
DFDKSKPITG KPAYPADPTN KGSDFTCDQA GYLSYLHKWT PEMKCTSRDV TTEEFQKRLE
YFIDSCEKIH DWNMRDKYRM EFTFYADWHP DEFEEITTTK QRYSGIKTPV PSTIPVYNNS
NLRYLMPSLD PCDDVFEDRR LAENTLRSYI CKPQNCSVTW AFAVTTSIEY AIKKLYLEEY
DQIVEVALSA QELIDCVGKE HGVTGKVCDG LPLAWGFDYV FENGGLRLCY YHHTNTEDEC
QVIDDEKKYF INGYEKPTIY NKLGLFDLVM RGPTAVTLGL DPEFFQYYRN DREEGPYFDT
AFWRPSVYGV VVEYLQYAVE GQPEYAEWPF FAIESRLRAC YSFVFRLPIR ETTADANIAG
ISGFAIRPIV SELLPTPEPP TVPPTPTPTP TPTPTPTPTP TPTPTPTPTP TPTPTPTPTP
TPTPTPTPTP TPTPTPTPTP TPTPTPTPTP TPTPTPTPTP TPTPTPTATP TPTPTPTPTA
TPTPTVTPTP TATPTPTPTP TPTPTPTPTP TPTNTPTPTP TMTPTPTMTP TPTLQLREAD
HLLLGRDQGR
//