ID W7KBD7_PLAFO Unreviewed; 936 AA.
AC W7KBD7;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE RecName: Full=Peptidase C1A papain C-terminal domain-containing protein {ECO:0000259|SMART:SM00645};
GN ORFNames=PFNF54_00229 {ECO:0000313|EMBL:EWC90984.1};
OS Plasmodium falciparum (isolate NF54).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Laverania).
OX NCBI_TaxID=5843 {ECO:0000313|EMBL:EWC90984.1, ECO:0000313|Proteomes:UP000030673};
RN [1] {ECO:0000313|EMBL:EWC90984.1, ECO:0000313|Proteomes:UP000030673}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NF54 {ECO:0000313|EMBL:EWC90984.1,
RC ECO:0000313|Proteomes:UP000030673};
RG The Broad Institute Genome Sequencing Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Neafsey D., Cheeseman I., Volkman S., Adams J., Walker B., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Dewar J., Goldberg J., Griggs A.,
RA Gujja S., Hansen M., Howarth C., Imamovic A., Larimer J., McCowan C.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Plasmodium falciparum NF54.";
RL Submitted (FEB-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KE123721; EWC90984.1; -; Genomic_DNA.
DR AlphaFoldDB; W7KBD7; -.
DR EnsemblProtists; EWC90984; EWC90984; PFNF54_00229.
DR OMA; EMKFKSP; -.
DR Proteomes; UP000030673; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd02619; Peptidase_C1; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF642; PRO-CATHEPSIN H; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Membrane {ECO:0000256|ARBA:ARBA00023136};
KW Reference proteome {ECO:0000313|Proteomes:UP000030673};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT DOMAIN 515..766
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT REGION 1..92
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 815..847
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 298..325
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 31..70
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 936 AA; 108359 MW; 6BEEFF4A804317AC CRC64;
MNVICCTNVI VGQEKPPPDS TVGANPGDER ESSGRVNNPA SGEQGTTNSP TEQPDQTRDR
SSSVPQGSPR EPVSPENPNP VTQIPGNGGA LVTPIPLPKL TLEDSESSKS VIDIEVKSAL
LKNYDGVKIT GPCRSYFRVM LVPHITVYVY ATYDRIQLEP KFGPSDLIDI NDLTNKCNKD
SNKYFKLVLY IKNNILILKW KVQDKDSKPT NIDVDVKKYK IPKLDRPFTS IQVYTVNTEH
GLIESKNYDI NSEIPEQCEA ISTNCFLNGS LDVENCYHCT LLAKKVDSNN ECFNYVSKEA
KELINKNLEE KNKTFKGEDE DLDSNEQKLE ESIDNILSNI YKIYESKQDK ERKKSHYNNK
KELVTIEELN SVLKIELLNY CKLLKEVDRS GMLDHHEIGN EIDIFNNLIR LLKAHPGEST
YVLNEKLRNP ALCFKNIEEW LVNKKGLLLS NEKIQNLSTT NYNVTDLEES EYDYERFISD
DMFEKDMNGV IDLSLFDNEK KLKSPYFRRN KYCNNEYCDR WKDKTGCISK IEVEEQGNCG
LCWIFASKLH FETIRCMRGY GHFRSSALYV ANCSDRDSDE ICFVGSNPVE FLEIVEETGF
LPLESDVPYY YTDAGNDCPE PEKNWINLWG STELLNHKRP RQRMTTKGYI SYESSYFSDN
MDLFIKIIKR EIQNKGSVIA YIKTENVIDF DFNGKGVHNM CGDKEPDHAA NIIGYGNYID
EEGEKKSYWL IRNSWGYYWG DEGNFRVDMY GPSYCKYNFI HTVVVFKVDL GIIEVPKKEK
ESEYFSYFLK YTPNFLYNLF FNNYTTNDEY KLNNRLKTNQ HNNKKNKKDR YISAQDEPPT
DNVESQAENN KKTEIYHILK HIKDKKIKRG LVKYESLLET KKDHSCSRTH SIDPEKHEEC
NQFCIDNWKA CKDHYSPGYC LTKLYTKDDN CFFCNV
//