ID A0A182N3N5_9DIPT Unreviewed; 1520 AA.
AC A0A182N3N5;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 24-JAN-2024, entry version 36.
DE RecName: Full=Calpain {ECO:0008006|Google:ProtNLM};
OS Anopheles dirus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7168 {ECO:0000313|EnsemblMetazoa:ADIR002247-PA, ECO:0000313|Proteomes:UP000075884};
RN [1] {ECO:0000313|Proteomes:UP000075884}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=WRAIR2 {ECO:0000313|Proteomes:UP000075884};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Walton C., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles dirus WRAIR2.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ADIR002247-PA}
RP IDENTIFICATION.
RC STRAIN=WRAIR2 {ECO:0000313|EnsemblMetazoa:ADIR002247-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase C2 family.
CC {ECO:0000256|ARBA:ARBA00007623}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 7168.A0A182N3N5; -.
DR EnsemblMetazoa; ADIR002247-RA; ADIR002247-PA; ADIR002247.
DR VEuPathDB; VectorBase:ADIR002247; -.
DR OrthoDB; 142935at2759; -.
DR Proteomes; UP000075884; Unassembled WGS sequence.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0004198; F:calcium-dependent cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd00214; Calpain_III; 2.
DR CDD; cd00044; CysPc; 2.
DR Gene3D; 2.60.120.380; -; 2.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 2.
DR Gene3D; 1.10.238.10; EF-hand; 2.
DR InterPro; IPR033883; C2_III.
DR InterPro; IPR022684; Calpain_cysteine_protease.
DR InterPro; IPR022682; Calpain_domain_III.
DR InterPro; IPR022683; Calpain_III.
DR InterPro; IPR036213; Calpain_III_sf.
DR InterPro; IPR011992; EF-hand-dom_pair.
DR InterPro; IPR018247; EF_Hand_1_Ca_BS.
DR InterPro; IPR002048; EF_hand_dom.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR001300; Peptidase_C2_calpain_cat.
DR PANTHER; PTHR10183; CALPAIN; 1.
DR PANTHER; PTHR10183:SF379; CALPAIN-A-RELATED; 1.
DR Pfam; PF01067; Calpain_III; 2.
DR Pfam; PF13405; EF-hand_6; 2.
DR Pfam; PF00648; Peptidase_C2; 2.
DR PRINTS; PR00704; CALPAIN.
DR SMART; SM00720; calpain_III; 2.
DR SMART; SM00230; CysPc; 2.
DR SMART; SM00054; EFh; 2.
DR SUPFAM; SSF49758; Calpain large subunit, middle domain (domain III); 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 2.
DR SUPFAM; SSF47473; EF-hand; 2.
DR PROSITE; PS50203; CALPAIN_CAT; 2.
DR PROSITE; PS00018; EF_HAND_1; 2.
DR PROSITE; PS50222; EF_HAND_2; 2.
PE 3: Inferred from homology;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Hydrolase {ECO:0000256|PROSITE-ProRule:PRU00239};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|PROSITE-
KW ProRule:PRU00239}; Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807, ECO:0000256|PROSITE-
KW ProRule:PRU00239}.
FT DOMAIN 116..417
FT /note="Calpain catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50203"
FT DOMAIN 738..769
FT /note="EF-hand"
FT /evidence="ECO:0000259|PROSITE:PS50222"
FT DOMAIN 838..1139
FT /note="Calpain catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50203"
FT DOMAIN 1442..1473
FT /note="EF-hand"
FT /evidence="ECO:0000259|PROSITE:PS50222"
FT ACT_SITE 169
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00239"
FT ACT_SITE 330
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00239"
FT ACT_SITE 357
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00239"
FT ACT_SITE 891
FT /evidence="ECO:0000256|PIRSR:PIRSR622684-1,
FT ECO:0000256|PROSITE-ProRule:PRU00239"
FT ACT_SITE 1052
FT /evidence="ECO:0000256|PIRSR:PIRSR622684-1,
FT ECO:0000256|PROSITE-ProRule:PRU00239"
FT ACT_SITE 1079
FT /evidence="ECO:0000256|PIRSR:PIRSR622684-1,
FT ECO:0000256|PROSITE-ProRule:PRU00239"
SQ SEQUENCE 1520 AA; 173950 MW; FAB87A80A8F4922A CRC64;
MVERSNRSSL SKVIRSGLIV PERRKCGYRT PCETTTFVSA WCTGENQYCG VSYILVTLRD
KTLYDTVVRM SQHAVLRFIL WVVSARIDSA EPTYSPSSNH QSFYALKQQC LARRRLFEDP
DFPPTDASIA MPRLSGVRWR RATEISSAAQ FFVDGASRMD ICQGALNDCW LLTAATNLTS
HPWLFKRVLP VDNSFTAGQY AGIFHFRFWE FGQWVEVVID DRLPTDANGT LLFGRSANGD
EFWSALLEKA YAKFYGSYGA LDGGTAREAM QDLTGGLTEF YQPKKMAGRE EQLWDILRGG
SEMGSLFACN LKSDPTAENV ATKDGLLRGH SYSITKVHTI PGLLSDSDNT RLLRIRNPWG
NGVEWNGRWS DKSREWKSLD TAERKRVGLT LENDGEFWIE LADFMKHFDR LEVCHLSPEL
HSVEETTGEK ESPRGYRWEL SALDGQWIRD TTAGGNVSFL DTFSQNPQYT IHVQEGQTGS
GAVIALMQKY RRADALPSLT IGFIVYKVTR EDLRQKPVPR EFFQRNDHAI VGGSIFINAR
EISCRLTLDA GLYLVIPSTF EPGEEGEYLL RIFTARGNTL CENDAILCFG TLDDRITERG
QFFDTPRWAL LANAFYNRAD SSQQLDQTGL QDVLWQQFFR RSDVSSDSGR FAGQAHKRSG
TSPLRKFYSC VLQLCACFWM RSSRNKTRRV EEHTVESQNH RQKEELERIV SLLMSRIADG
QETIGYEQFR TVARDVYQWE TVFRLYDTDG SGTLDRRELR QALRSSGFNI NNRILCRLLR
MVVELNRPQI ELIDYVLCAA ECRHAIVYHI EHKPTYSPSS NHQTFYTLQQ QCLGRSQLFE
DPDFPPTDAS IAMPRLSGVR WRRATEISPA AQFFVDGASR MDICQGALND CWLLTAATNL
TSRPWLFKRV LPVDNSFTAG QYAGIFHFRF WQFGQWVEVV IDDRLPTDAN GELLFGRSSE
KNEFWSALLE KAYAKFYGSY GALNNGTARE AMQDLTGGLT EFYEPKMMAG KEEQLWDILH
GGWEMGSLFA CNLKSDPTGR NVTTKDGLLR GHSYSITKIH TISGLLSDSD NTRLLRIRNP
WGNGVEWNGC WSDRSHKWKS LDAAERKRIG LTIENDGEFW IELADFMKHF DRLEVCHLSP
ELHSIGETPG TAQPPKPYRW ELSALDGQWI RDTTAGGNVL QNRETFHQNP QYTIHVQDGQ
PGSGVVIALM QKHRRADGLQ SLAIGFIVYK VTREDLRQKP VPREFFQRQD HAFVHSSIFI
NAREISCRLT LDAGLYLVIP STFEPGEEGE YLLRIFTARG NTLCENDAIL SFGTLDNRIT
ERRHFLETVQ FLKLKSAFYN HVDSSHQLDP TGLQEILWQN IFRRRTINSS MSKFHSIVLQ
LFARFNMRYS STRRNGDQTG DSPNHQCQEL ERIVSLLMSR IADGQDTIGY EQFRTVARDV
CQWETVFRLY DTDGSGTLDR RELRQALRSS GFHINNRILC RLLRMVEELN RPQIELIDYV
LCVAECRHAI DEEVKNQGKY
//