ID A0A4W3KHJ8_CALMI Unreviewed; 330 AA.
AC A0A4W3KHJ8;
DT 18-SEP-2019, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2019, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE SubName: Full=Cathepsin L1-like {ECO:0000313|Ensembl:ENSCMIP00000046860.1};
GN Name=LOC103183579 {ECO:0000313|Ensembl:ENSCMIP00000046860.1};
OS Callorhinchus milii (Ghost shark).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC Holocephali; Chimaeriformes; Callorhinchidae; Callorhinchus.
OX NCBI_TaxID=7868 {ECO:0000313|Ensembl:ENSCMIP00000046860.1, ECO:0000313|Proteomes:UP000314986};
RN [1] {ECO:0000313|Proteomes:UP000314986}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=17185593; DOI=10.1126/science.1130708;
RA Venkatesh B., Kirkness E.F., Loh Y.H., Halpern A.L., Lee A.P., Johnson J.,
RA Dandona N., Viswanathan L.D., Tay A., Venter J.C., Strausberg R.L.,
RA Brenner S.;
RT "Ancient noncoding elements conserved in the human genome.";
RL Science 314:1892-1892(2006).
RN [2] {ECO:0000313|Proteomes:UP000314986}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=17407382;
RA Venkatesh B., Kirkness E.F., Loh Y.H., Halpern A.L., Lee A.P., Johnson J.,
RA Dandona N., Viswanathan L.D., Tay A., Venter J.C., Strausberg R.L.,
RA Brenner S.;
RT "Survey sequencing and comparative analysis of the elephant shark
RT (Callorhinchus milii) genome.";
RL PLoS Biol. 5:E101-E101(2007).
RN [3] {ECO:0000313|Proteomes:UP000314986}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24402279; DOI=10.1038/nature12826;
RG International Elephant Shark Genome Sequencing Consortium;
RA Venkatesh B., Lee A.P., Ravi V., Maurya A.K., Lian M.M., Swann J.B.,
RA Ohta Y., Flajnik M.F., Sutoh Y., Kasahara M., Hoon S., Gangu V., Roy S.W.,
RA Irimia M., Korzh V., Kondrychyn I., Lim Z.W., Tay B.H., Tohari S.,
RA Kong K.W., Ho S., Lorente-Galdos B., Quilez J., Marques-Bonet T.,
RA Raney B.J., Ingham P.W., Tay A., Hillier L.W., Minx P., Boehm T.,
RA Wilson R.K., Brenner S., Warren W.C.;
RT "Elephant shark genome provides unique insights into gnathostome
RT evolution.";
RL Nature 505:174-179(2014).
RN [4] {ECO:0000313|Ensembl:ENSCMIP00000046860.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A4W3KHJ8; -.
DR Ensembl; ENSCMIT00000047524.1; ENSCMIP00000046860.1; ENSCMIG00000019213.1.
DR GeneTree; ENSGT00940000153321; -.
DR Proteomes; UP000314986; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411:SF971; CATHEPSIN L-RELATED; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000314986};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..330
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5021376989"
FT DOMAIN 30..91
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 119..329
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 330 AA; 37123 MW; 36677A9E2AC057DD CRC64;
MNFLLRGCFV VFVLVKASTV SINPELDNEW LSWKSFHKKL YQEVRKDAAF RRNVWEQNLQ
YIEKHNTDYF MGKHTFAVGM NQFGDMTVEE FSNLLNRRNT RQYNNKTCNI FTADDKTDLP
QSVDWRPKGY VTGVKDQKAC GSCWAFCTTG VLEGQWFKKT GKLISLSEQY LMDCSQSVGN
HGCNGGSSIH SLLFIKEKGI NSEESYPYTA KGCTYNESVA TCQGVKMIRK NSERDLAAAV
ATVGPIAVGF DAGRASFMFY KSGIYYDEAC SKTVMDHGLL VVGYGTEAGE PYWIVKNSWG
VKWGNSGYFH TAKDKGNNCG IASYAHYPVV
//