GenomeNet

Database: UniProt
Entry: A0A182HHE1_ANOAR
LinkDB: A0A182HHE1_ANOAR
Original site: A0A182HHE1_ANOAR 
ID   A0A182HHE1_ANOAR        Unreviewed;       343 AA.
AC   A0A182HHE1;
DT   07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT   07-SEP-2016, sequence version 1.
DT   27-MAR-2024, entry version 29.
DE   RecName: Full=Cathepsin L {ECO:0008006|Google:ProtNLM};
OS   Anopheles arabiensis (Mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7173 {ECO:0000313|EnsemblMetazoa:AARA000644-PA.1, ECO:0000313|Proteomes:UP000075840};
RN   [1] {ECO:0000313|Proteomes:UP000075840}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Dongola {ECO:0000313|Proteomes:UP000075840};
RG   The Broad Institute Genomics Platform;
RA   Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA   Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA   Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA   Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA   Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Anopheles arabiensis DONG5_A.";
RL   Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EnsemblMetazoa:AARA000644-PA.1}
RP   IDENTIFICATION.
RC   STRAIN=Dongola {ECO:0000313|EnsemblMetazoa:AARA000644-PA.1};
RG   EnsemblMetazoa;
RL   Submitted (AUG-2022) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; APCN01002407; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; A0A182HHE1; -.
DR   EnsemblMetazoa; AARA000644-RA; AARA000644-PA; AARA000644.
DR   VEuPathDB; VectorBase:AARA000644; -.
DR   VEuPathDB; VectorBase:AARA21_004676; -.
DR   OrthoDB; 5472948at2759; -.
DR   Proteomes; UP000075840; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411:SF971; CATHEPSIN L-RELATED; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW   Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT   DOMAIN          27..87
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          126..342
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   343 AA;  38580 MW;  96403B7429C0E2B2 CRC64;
     MKFLILILGF VAAANAISIF ELVKEEWTAF KLQHRKKYDS ETEERIRMKI YVQNKHKIAK
     HNQRYDLGQE KFRLRVNKYA DLLHEEFVHT LNGFNRSVSG KGQLLRGELK PIEEAVTWIE
     PANVDVPTAM DWRTKGAVTP VKDQGHCGSC WSFSATGALE GQHFRKTGKL VSLSEQNLVD
     CSQKYGNNGC NGGMMDFAFQ YIKDNKGIDT EKSYPYEAID DECHYNPKAV GATDKGFVDI
     PQGNEKALMK ALATVGPVSV AIDASHESFQ FYSEGVYYEP QCDSEQLDHG VLAVGYGTTE
     DGEDYWLVKN SWGTTWGDQG YVKMARNRDN HCGIATTASY PLV
//
DBGET integrated database retrieval system