ID A0A4W4FZI5_ELEEL Unreviewed; 539 AA.
AC A0A4W4FZI5;
DT 18-SEP-2019, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2019, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE SubName: Full=Si:dkey-228a15.1 {ECO:0000313|Ensembl:ENSEEEP00000029261.1};
GN Name=LOC113587460 {ECO:0000313|Ensembl:ENSEEEP00000029261.1};
OS Electrophorus electricus (Electric eel) (Gymnotus electricus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Gymnotiformes;
OC Gymnotoidei; Gymnotidae; Electrophorus.
OX NCBI_TaxID=8005 {ECO:0000313|Ensembl:ENSEEEP00000029261.1, ECO:0000313|Proteomes:UP000314983};
RN [1] {ECO:0000313|Proteomes:UP000314983}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24970089;
RA Gallant J.R., Traeger L.L., Volkening J.D., Moffett H., Chen P.H.,
RA Novina C.D., Phillips G.N.Jr., Anand R., Wells G.B., Pinch M., Guth R.,
RA Unguez G.A., Albert J.S., Zakon H.H., Samanta M.P., Sussman M.R.;
RT "Nonhuman genetics. Genomic basis for the convergent evolution of electric
RT organs.";
RL Science 344:1522-1525(2014).
RN [2] {ECO:0000313|Proteomes:UP000314983}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=28695212;
RA Traeger L.L., Sabat G., Barrett-Wilt G.A., Wells G.B., Sussman M.R.;
RT "A tail of two voltages: Proteomic comparison of the three electric organs
RT of the electric eel.";
RL Sci. Adv. 3:e1700523-e1700523(2017).
RN [3] {ECO:0000313|Ensembl:ENSEEEP00000029261.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A4W4FZI5; -.
DR Ensembl; ENSEEET00000029600.1; ENSEEEP00000029261.1; ENSEEEG00000014024.1.
DR GeneTree; ENSGT00940000163923; -.
DR Proteomes; UP000314983; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF975; 26-29KD-PROTEINASE; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000314983};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT DOMAIN 237..293
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 320..537
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 539 AA; 60497 MW; 3E8995B809592A69 CRC64;
MWNTRRLLEA TPLVGRTVPD FGNTYHVKGL ISLPYAEIKE PFEAWYDLEG KRSRIDYYHG
MVSTLQLSAL MTQGVGYKIT PVTTETEFNA MKCFQVNGTE EAPVLPQAAL PNIQGFQFER
IEYYAGVLCE VWKNVTIIGH KKNTYRLWVA HPDGGAAVAT PHHYEMMGYN TLLGSHYDKY
LIDYSDFKPH TDPKDFSLPE GISCGPFPGP GVEHQLLANP IQDLIHTSPV GHVHRLFGHF
KEKYERSYEN EMEHETREHH FVHNLRYVHS MNRAGLTFSL SVNHLADRSQ DELAIMRGRR
GRKTPNKAQP FPMELCSVTP PDSLDWRLYG AVTPVKDQAV CGSCWSFAST GALEGALFIK
TGERVTLSQQ MLVDCTWGFG NNGCDGGEEW RAFEWVMKHG GISTAESYGA YTGMNSLCHY
NESTLTARVK SYTNVTSGDM EALKVALFKN GPAAVSIDAS HRSFTFYSDG VYYEPSCKNG
MDDLDHAVLA VGYGMLDNQT YWLVKNSWST YWGNDGYVLM SMKDNNCGVA TAATYVTLV
//