ID   F9W6D2_TRYCI            Unreviewed;       336 AA.
AC   F9W6D2;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   19-OCT-2011, sequence version 1.
DT   27-MAR-2024, entry version 41.
DE   SubName: Full=WGS project CAEQ00000000 data, annotated contig 144 {ECO:0000313|EMBL:CCD12737.1};
GN   ORFNames=TCIL3000_0_03600 {ECO:0000313|EMBL:CCD12737.1};
OS   Trypanosoma congolense (strain IL3000).
OC   Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC   Trypanosomatida; Trypanosomatidae; Trypanosoma; Nannomonas.
OX   NCBI_TaxID=1068625 {ECO:0000313|EMBL:CCD12737.1, ECO:0000313|Proteomes:UP000000702};
RN   [1] {ECO:0000313|Proteomes:UP000000702}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=IL3000 {ECO:0000313|Proteomes:UP000000702};
RA   Jackson A.P., Berry A., Allison H.C., Burton P., Anderson J., Aslett M.,
RA   Brown R., Corton N., Harris D., Hauser H., Gamble J., Gilderthorp R.,
RA   McQuillan J., Quail M.A., Sanders M., Van Tonder A., Ginger M.L.,
RA   Donelson J.E., Field M.C., Barry J.D., Berriman M., Hertz-Fowler C.;
RT   "Divergent evolution of antigenic variation in African trypanosomes.";
RL   Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:CCD12737.1, ECO:0000313|Proteomes:UP000000702}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=IL3000 {ECO:0000313|EMBL:CCD12737.1,
RC   ECO:0000313|Proteomes:UP000000702};
RX   PubMed=22331916; DOI=10.1073/pnas.1117313109;
RA   Jackson A.P., Berry A., Aslett M., Allison H.C., Burton P.,
RA   Vavrova-Anderson J., Brown R., Browne H., Corton N., Hauser H., Gamble J.,
RA   Gilderthorp R., Marcello L., McQuillan J., Otto T.D., Quail M.A.,
RA   Sanders M.J., van Tonder A., Ginger M.L., Field M.C., Barry J.D.,
RA   Hertz-Fowler C., Berriman M.;
RT   "Antigenic diversity is generated by distinct evolutionary mechanisms in
RT   African trypanosome species.";
RL   Proc. Natl. Acad. Sci. U.S.A. 109:3416-3421(2012).
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CCD12737.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAEQ01000850; CCD12737.1; -; Genomic_DNA.
DR   AlphaFoldDB; F9W6D2; -.
DR   SMR; F9W6D2; -.
DR   MEROPS; C01.098; -.
DR   VEuPathDB; TriTrypDB:TcIL3000_0_03600; -.
DR   OMA; DEKIPYW; -.
DR   OrthoDB; 808912at2759; -.
DR   Proteomes; UP000000702; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000000702};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..336
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018539101"
FT   DOMAIN          90..329
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   336 AA;  36991 MW;  0DD42B4868BB3316 CRC64;
     MRVYVALCLL STALVALGAS ALLAKDAPVL TKTFVDRINQ LNGGMWKAVY NGKMQNITFA
     EARRLTGAFR RKTSSLPPVR FTEEQLRTEL PESFDSAEKW PNCPTIREIA DQSACGSCWA
     VSTASAISDR HCTVGGVQQL RISAAHLLSC CKDCGDGCDG GYPDSAWEYY VSHGLASSYC
     QPYPFPHCGH HGGKGKKPPC SKYDFHTPKC NTTCTDKAIP LIKYRGNDSY VLLHGEDDFK
     RELYFNGPFV VAFQVYSDFL AYKTGVYRHV SGDFLGGHAV RIVGWGKLNG TPYWKIANSW
     DTDWGMNGHF LILRGNNECG IESTGYAGLP AIPRNA
//