ID A0A182P7H9_9DIPT Unreviewed; 343 AA.
AC A0A182P7H9;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE RecName: Full=Cathepsin L {ECO:0008006|Google:ProtNLM};
OS Anopheles epiroticus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=199890 {ECO:0000313|EnsemblMetazoa:AEPI002881-PA, ECO:0000313|Proteomes:UP000075885};
RN [1] {ECO:0000313|Proteomes:UP000075885}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Epiroticus2 {ECO:0000313|Proteomes:UP000075885};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles epiroticus epiroticus2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AEPI002881-PA}
RP IDENTIFICATION.
RC STRAIN=Epiroticus2 {ECO:0000313|EnsemblMetazoa:AEPI002881-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182P7H9; -.
DR STRING; 199890.A0A182P7H9; -.
DR EnsemblMetazoa; AEPI002881-RA; AEPI002881-PA; AEPI002881.
DR VEuPathDB; VectorBase:AEPI002881; -.
DR OrthoDB; 5472948at2759; -.
DR Proteomes; UP000075885; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF971; CATHEPSIN L-RELATED; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670}; Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT SIGNAL 1..16
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 17..343
FT /note="Cathepsin L"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018780530"
FT DOMAIN 27..87
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 126..342
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 343 AA; 38554 MW; 28CC3AC4FD82BADE CRC64;
MKFLILILGF VAAANAISIF DLVKEEWTAF KLQHRKKYDS ESEERIRMKI YVQNKHKIAK
HNQRYDLGQE KFRLRVNKYA DMLHEEFVHT LNGYNRSISA KGQLLRGELK PIEEAVAWIE
PANVDVPKTI DWRTKGAVTA VKDQGHCGSC WSFSATGALE GQHFRKTGKL VSLSEQNLVD
CSQKYGNNGC NGGMMDFAFQ YVKDNKGIDT EKSYPYEAID DECHFNPKAV GATDKGFVDI
PQGDEKALMK AIATVGPVSV AIDASHESFQ FYSEGVYYEP QCDSEQLDHG VLAVGYGTSE
DGEDYWLVKN SWGTTWGDQG YVKMARNRDN HCGIATTASY PLV
//