GenomeNet

Database: UniProt
Entry: W7KBD7_PLAFO
LinkDB: W7KBD7_PLAFO
Original site: W7KBD7_PLAFO 
ID   W7KBD7_PLAFO            Unreviewed;       936 AA.
AC   W7KBD7;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 38.
DE   RecName: Full=Peptidase C1A papain C-terminal domain-containing protein {ECO:0000259|SMART:SM00645};
GN   ORFNames=PFNF54_00229 {ECO:0000313|EMBL:EWC90984.1};
OS   Plasmodium falciparum (isolate NF54).
OC   Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC   Plasmodiidae; Plasmodium; Plasmodium (Laverania).
OX   NCBI_TaxID=5843 {ECO:0000313|EMBL:EWC90984.1, ECO:0000313|Proteomes:UP000030673};
RN   [1] {ECO:0000313|EMBL:EWC90984.1, ECO:0000313|Proteomes:UP000030673}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=NF54 {ECO:0000313|EMBL:EWC90984.1,
RC   ECO:0000313|Proteomes:UP000030673};
RG   The Broad Institute Genome Sequencing Platform;
RG   The Broad Institute Genome Sequencing Center for Infectious Disease;
RA   Neafsey D., Cheeseman I., Volkman S., Adams J., Walker B., Young S.K.,
RA   Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Dewar J., Goldberg J., Griggs A.,
RA   Gujja S., Hansen M., Howarth C., Imamovic A., Larimer J., McCowan C.,
RA   Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA   Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT   "The Genome Sequence of Plasmodium falciparum NF54.";
RL   Submitted (FEB-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KE123721; EWC90984.1; -; Genomic_DNA.
DR   AlphaFoldDB; W7KBD7; -.
DR   EnsemblProtists; EWC90984; EWC90984; PFNF54_00229.
DR   OMA; EMKFKSP; -.
DR   Proteomes; UP000030673; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd02619; Peptidase_C1; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   PANTHER; PTHR12411:SF642; PRO-CATHEPSIN H; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
PE   3: Inferred from homology;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Membrane {ECO:0000256|ARBA:ARBA00023136};
KW   Reference proteome {ECO:0000313|Proteomes:UP000030673};
KW   Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT   DOMAIN          515..766
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
FT   REGION          1..92
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          815..847
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COILED          298..325
FT                   /evidence="ECO:0000256|SAM:Coils"
FT   COMPBIAS        31..70
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   936 AA;  108359 MW;  6BEEFF4A804317AC CRC64;
     MNVICCTNVI VGQEKPPPDS TVGANPGDER ESSGRVNNPA SGEQGTTNSP TEQPDQTRDR
     SSSVPQGSPR EPVSPENPNP VTQIPGNGGA LVTPIPLPKL TLEDSESSKS VIDIEVKSAL
     LKNYDGVKIT GPCRSYFRVM LVPHITVYVY ATYDRIQLEP KFGPSDLIDI NDLTNKCNKD
     SNKYFKLVLY IKNNILILKW KVQDKDSKPT NIDVDVKKYK IPKLDRPFTS IQVYTVNTEH
     GLIESKNYDI NSEIPEQCEA ISTNCFLNGS LDVENCYHCT LLAKKVDSNN ECFNYVSKEA
     KELINKNLEE KNKTFKGEDE DLDSNEQKLE ESIDNILSNI YKIYESKQDK ERKKSHYNNK
     KELVTIEELN SVLKIELLNY CKLLKEVDRS GMLDHHEIGN EIDIFNNLIR LLKAHPGEST
     YVLNEKLRNP ALCFKNIEEW LVNKKGLLLS NEKIQNLSTT NYNVTDLEES EYDYERFISD
     DMFEKDMNGV IDLSLFDNEK KLKSPYFRRN KYCNNEYCDR WKDKTGCISK IEVEEQGNCG
     LCWIFASKLH FETIRCMRGY GHFRSSALYV ANCSDRDSDE ICFVGSNPVE FLEIVEETGF
     LPLESDVPYY YTDAGNDCPE PEKNWINLWG STELLNHKRP RQRMTTKGYI SYESSYFSDN
     MDLFIKIIKR EIQNKGSVIA YIKTENVIDF DFNGKGVHNM CGDKEPDHAA NIIGYGNYID
     EEGEKKSYWL IRNSWGYYWG DEGNFRVDMY GPSYCKYNFI HTVVVFKVDL GIIEVPKKEK
     ESEYFSYFLK YTPNFLYNLF FNNYTTNDEY KLNNRLKTNQ HNNKKNKKDR YISAQDEPPT
     DNVESQAENN KKTEIYHILK HIKDKKIKRG LVKYESLLET KKDHSCSRTH SIDPEKHEEC
     NQFCIDNWKA CKDHYSPGYC LTKLYTKDDN CFFCNV
//
DBGET integrated database retrieval system