Database: UniProt
Entry: F9W6D2_TRYCI
ID   F9W6D2_TRYCI            Unreviewed;       336 AA.
AC   F9W6D2;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   19-OCT-2011, sequence version 1.
DT   27-MAR-2024, entry version 41.
DE   SubName: Full=WGS project CAEQ00000000 data, annotated contig 144 {ECO:0000313|EMBL:CCD12737.1};
GN   ORFNames=TCIL3000_0_03600 {ECO:0000313|EMBL:CCD12737.1};
OS   Trypanosoma congolense (strain IL3000).
OC   Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
OC   Trypanosomatida; Trypanosomatidae; Trypanosoma; Nannomonas.
OX   NCBI_TaxID=1068625 {ECO:0000313|EMBL:CCD12737.1, ECO:0000313|Proteomes:UP000000702};
RN   [1] {ECO:0000313|Proteomes:UP000000702}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=IL3000 {ECO:0000313|Proteomes:UP000000702};
RA   Jackson A.P., Berry A., Allison H.C., Burton P., Anderson J., Aslett M.,
RA   Brown R., Corton N., Harris D., Hauser H., Gamble J., Gilderthorp R.,
RA   McQuillan J., Quail M.A., Sanders M., Van Tonder A., Ginger M.L.,
RA   Donelson J.E., Field M.C., Barry J.D., Berriman M., Hertz-Fowler C.;
RT   "Divergent evolution of antigenic variation in African trypanosomes.";
RL   Submitted (JUL-2011) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:CCD12737.1, ECO:0000313|Proteomes:UP000000702}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=IL3000 {ECO:0000313|EMBL:CCD12737.1,
RC   ECO:0000313|Proteomes:UP000000702};
RX   PubMed=22331916; DOI=10.1073/pnas.1117313109;
RA   Jackson A.P., Berry A., Aslett M., Allison H.C., Burton P.,
RA   Vavrova-Anderson J., Brown R., Browne H., Corton N., Hauser H., Gamble J.,
RA   Gilderthorp R., Marcello L., McQuillan J., Otto T.D., Quail M.A.,
RA   Sanders M.J., van Tonder A., Ginger M.L., Field M.C., Barry J.D.,
RA   Hertz-Fowler C., Berriman M.;
RT   "Antigenic diversity is generated by distinct evolutionary mechanisms in
RT   African trypanosome species.";
RL   Proc. Natl. Acad. Sci. U.S.A. 109:3416-3421(2012).
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CCD12737.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAEQ01000850; CCD12737.1; -; Genomic_DNA.
DR   AlphaFoldDB; F9W6D2; -.
DR   SMR; F9W6D2; -.
DR   MEROPS; C01.098; -.
DR   VEuPathDB; TriTrypDB:TcIL3000_0_03600; -.
DR   OMA; DEKIPYW; -.
DR   OrthoDB; 808912at2759; -.
DR   Proteomes; UP000000702; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   PANTHER; PTHR12411:SF895; CATHEPSIN B; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000000702};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..336
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018539101"
FT   DOMAIN          90..329
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   336 AA;  36991 MW;  0DD42B4868BB3316 CRC64;
     MRVYVALCLL STALVALGAS ALLAKDAPVL TKTFVDRINQ LNGGMWKAVY NGKMQNITFA
     EARRLTGAFR RKTSSLPPVR FTEEQLRTEL PESFDSAEKW PNCPTIREIA DQSACGSCWA
     VSTASAISDR HCTVGGVQQL RISAAHLLSC CKDCGDGCDG GYPDSAWEYY VSHGLASSYC
     QPYPFPHCGH HGGKGKKPPC SKYDFHTPKC NTTCTDKAIP LIKYRGNDSY VLLHGEDDFK
     RELYFNGPFV VAFQVYSDFL AYKTGVYRHV SGDFLGGHAV RIVGWGKLNG TPYWKIANSW
     DTDWGMNGHF LILRGNNECG IESTGYAGLP AIPRNA
//