ID W7A0W2_9APIC Unreviewed; 1370 AA.
AC W7A0W2;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 22-FEB-2023, entry version 31.
DE RecName: Full=Peptidase C1A papain C-terminal domain-containing protein {ECO:0000259|SMART:SM00645};
GN ORFNames=C922_02817 {ECO:0000313|EMBL:EUD66832.1};
OS Plasmodium inui San Antonio 1.
OC Eukaryota; Sar; Alveolata; Apicomplexa; Aconoidasida; Haemosporida;
OC Plasmodiidae; Plasmodium; Plasmodium (Plasmodium).
OX NCBI_TaxID=1237626 {ECO:0000313|EMBL:EUD66832.1, ECO:0000313|Proteomes:UP000030640};
RN [1] {ECO:0000313|EMBL:EUD66832.1, ECO:0000313|Proteomes:UP000030640}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=San Antonio 1 {ECO:0000313|EMBL:EUD66832.1,
RC ECO:0000313|Proteomes:UP000030640};
RG The Broad Institute Genome Sequencing Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Neafsey D., Cheeseman I., Volkman S., Adams J., Walker B., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Dewar J., Goldberg J., Griggs A.,
RA Gujja S., Hansen M., Howarth C., Imamovic A., Larimer J., McCowan C.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Plasmodium inui San Antonio 1.";
RL Submitted (FEB-2013) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI965469; EUD66832.1; -; Genomic_DNA.
DR RefSeq; XP_008816638.1; XM_008818416.1.
DR EnsemblProtists; EUD66832; EUD66832; C922_02817.
DR GeneID; 20038091; -.
DR VEuPathDB; PlasmoDB:C922_02817; -.
DR OrthoDB; 240131at2759; -.
DR Proteomes; UP000030640; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd02619; Peptidase_C1; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR006970; PT.
DR PANTHER; PTHR36489:SF1; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR36489; PROTEIN-COUPLED RECEPTOR GPR1, PUTATIVE-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR Pfam; PF04886; PT; 4.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 4: Predicted;
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..1370
FT /note="Peptidase C1A papain C-terminal domain-containing
FT protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004887827"
FT DOMAIN 670..922
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT REGION 25..264
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 610..640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 969..1279
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 36..50
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..147
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 163..251
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 622..636
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1033..1048
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1055..1128
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1129..1237
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1370 AA; 150686 MW; C43F564692F65D08 CRC64;
MQLSLPFLFI ASAALLDNVI KCDEEVTIPD PPQPPDENPE SKDDPPGNWD PSPHKGDGAV
EPSVDENESG GEHGAVDQQT DHPTDQPTGH PTDQPTGHPT DQPTGHPADQ PTDQPTGHPA
DQPTDQPTDQ PTGDPADQST DQPTGHPADQ PTGHPADQPT GHPADQPTDQ PTDQPTNQPT
GHPTDQSTDQ PTGHPADQPT DQSTDQPTDQ STDQPTDQST DQPTDQPTNQ PADQPTNQPA
DQPTDQPLTQ PPVEVSEKAA AAAVRNPNEI EAKCSQLKDQ DGVKITGPCG AKFQMFLVPH
VTINVETETN TIYIGKKLDD VIITKKQHKV VSGKSSPLLQ FEENSNLLLN QCVNGKTFKF
VVIVKGEEII LKWKVYEKRP SETDNDKVDV RTFVLKNTDR PITAIQVHTA KGNEDSFLLE
SKSYFLKDDM PAKCDLIATN CFLSGNLDIE ACYKCTVLSE NTELDSPCFS YLPDDVKHNY
EKIKRKAQQN GDPKEVQFAV SIGNILQGMY KLGETGLNEL LSFDEADTSL KAELLNYCAS
MKEVDASGVL DSYELGTEED VFANLTRILR NHAGETKSML QNKLKNPAIC LKNADDWVER
KKGLLLPSLS HTHVEATPPA NAQEEETKKE DTPEGSEKIQ TNGYNSVINF VSSEETNMQS
TSFIDNMFCN DEYCDRTKDT NSCMAKIEAE DQGVCATSWV FASKMHLETI RCMKGHDHVA
SSALYVANCS NNEDKDKCQA PSNPLEFLDI LEETKFLPAE SDLPYSYTSV NNVCPELKSH
WKNLWANVKL LDPHNEPTSV STKGYTAYQS DHFKGNMDAF IKLVKSEVMN KGSVIAYVKA
TGALSYDLNG KKVLSLCGDE TPDIAVNIIG YGNYINGEGV KKSYWLLRNS WGKHWGDHGH
FKVDMYGPPG CEHNFIHTAA IFNVDVPLLE NLEKKRPMLY NYYMKSSPDF YNHILYKGVQ
TEEDREMGIS PEDKMISPVV SAQKSEADTL NGAEKSSVVV EGKENPAVGV GESTQEVESP
LEAVTDKSKE AEQQDEEEAD EESEEERKDK AEEEMQGEEQ EEGDDEEWEE EGEDEAEEEA
EEEEEAEEEE EEEEAEEAEE EEEAEEEEEA EEEEEEDDDE EPEEAGVEPE ERGADPEQAG
AEPEEEGAKL EEGGGKSEEG GGKSEERGAK SEEGGAKSEE GGAKSEEGGA KPEEGGAKSE
EGGAKPEEGG AKREEGGAKP EEGDSETAKK GMEAEEPSKV AVSDASPEGV KGPQTVASPS
ASVNPTPPPS STPAPTRKTS LIKVKQIMEV IHIIKHIKNG KLRLGIATYE DDLSIANKHD
CSRSYSRDPK KLPECIRFCH DEWNNCNGEF SPGYCLNQRR RKNDCFFCYV
//