ID A0A0L0C7H3_LUCCU Unreviewed; 345 AA.
AC A0A0L0C7H3;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE RecName: Full=Peptidase C1A papain C-terminal domain-containing protein {ECO:0000259|SMART:SM00645};
GN ORFNames=FF38_06488 {ECO:0000313|EMBL:KNC28196.1};
OS Lucilia cuprina (Green bottle fly) (Australian sheep blowfly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Oestroidea;
OC Calliphoridae; Luciliinae; Lucilia.
OX NCBI_TaxID=7375 {ECO:0000313|EMBL:KNC28196.1, ECO:0000313|Proteomes:UP000037069};
RN [1] {ECO:0000313|EMBL:KNC28196.1, ECO:0000313|Proteomes:UP000037069}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=LS {ECO:0000313|EMBL:KNC28196.1,
RC ECO:0000313|Proteomes:UP000037069};
RC TISSUE=Full body {ECO:0000313|EMBL:KNC28196.1};
RX PubMed=26108605; DOI=10.1038/ncomms8344;
RA Anstead C.A., Korhonen P.K., Young N.D., Hall R.S., Jex A.R., Murali S.C.,
RA Hughes D.S., Lee S.F., Perry T., Stroehlein A.J., Ansell B.R.,
RA Breugelmans B., Hofmann A., Qu J., Dugan S., Lee S.L., Chao H., Dinh H.,
RA Han Y., Doddapaneni H.V., Worley K.C., Muzny D.M., Ioannidis P.,
RA Waterhouse R.M., Zdobnov E.M., James P.J., Bagnall N.H., Kotze A.C.,
RA Gibbs R.A., Richards S., Batterham P., Gasser R.B.;
RT "Lucilia cuprina genome unlocks parasitic fly biology to underpin future
RT interventions.";
RL Nat. Commun. 6:7344-7344(2015).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KNC28196.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JRES01000812; KNC28196.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0L0C7H3; -.
DR STRING; 7375.A0A0L0C7H3; -.
DR EnsemblMetazoa; KNC28196; KNC28196; FF38_06488.
DR OMA; DEKIPYW; -.
DR Proteomes; UP000037069; Unassembled WGS sequence.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02620; Peptidase_C1A_CathepsinB; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR012599; Propeptide_C1A.
DR PANTHER; PTHR12411:SF1000; CATHEPSIN B1, ISOFORM A; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR Pfam; PF08127; Propeptide_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000037069};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..345
FT /note="Peptidase C1A papain C-terminal domain-containing
FT protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018752564"
FT DOMAIN 93..342
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 345 AA; 38427 MW; 132FE3F8A280A911 CRC64;
MRKHFLLILC VAFLTIGQVI ANLDAEHDLL SDEFIEIVRS KAKTWTAGRN FAKSVPRSHI
HRLMGVHPDA HKFALPEKRL VLGDYVGLAD GDIPDEFDAR NAWPNCPTIK EIRDQGSCGS
CWAFGAVEAM SDRVCIHSNA STHFHFSADD LVSCCHTCGF GCNGGFPGAA WAYWTRKGIV
SGGPYGSNQG CRPYEISPCE HHVNGTRPPC DGEHGKTPRC QHQCQKSYNV DYSKDKHFGS
KSYSVRRNVR DIQEEIMTNG PVEGAFTVYE DLILYKDGVY QHVHGRELGG HAIRMLGWGV
ENNTPYWLIA NSWNTDWGNN GYFKILRGED HCGIESSISA GLPKL
//