GenomeNet

Database: UniProt
Entry: A0A4W3KHJ8_CALMI
ID   A0A4W3KHJ8_CALMI        Unreviewed;       330 AA.
AC   A0A4W3KHJ8;
DT   18-SEP-2019, integrated into UniProtKB/TrEMBL.
DT   18-SEP-2019, sequence version 1.
DT   27-MAR-2024, entry version 17.
DE   SubName: Full=Cathepsin L1-like {ECO:0000313|Ensembl:ENSCMIP00000046860.1};
GN   Name=LOC103183579 {ECO:0000313|Ensembl:ENSCMIP00000046860.1};
OS   Callorhinchus milii (Ghost shark).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
OC   Holocephali; Chimaeriformes; Callorhinchidae; Callorhinchus.
OX   NCBI_TaxID=7868 {ECO:0000313|Ensembl:ENSCMIP00000046860.1, ECO:0000313|Proteomes:UP000314986};
RN   [1] {ECO:0000313|Proteomes:UP000314986}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=17185593; DOI=10.1126/science.1130708;
RA   Venkatesh B., Kirkness E.F., Loh Y.H., Halpern A.L., Lee A.P., Johnson J.,
RA   Dandona N., Viswanathan L.D., Tay A., Venter J.C., Strausberg R.L.,
RA   Brenner S.;
RT   "Ancient noncoding elements conserved in the human genome.";
RL   Science 314:1892-1892(2006).
RN   [2] {ECO:0000313|Proteomes:UP000314986}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=17407382;
RA   Venkatesh B., Kirkness E.F., Loh Y.H., Halpern A.L., Lee A.P., Johnson J.,
RA   Dandona N., Viswanathan L.D., Tay A., Venter J.C., Strausberg R.L.,
RA   Brenner S.;
RT   "Survey sequencing and comparative analysis of the elephant shark
RT   (Callorhinchus milii) genome.";
RL   PLoS Biol. 5:E101-E101(2007).
RN   [3] {ECO:0000313|Proteomes:UP000314986}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24402279; DOI=10.1038/nature12826;
RG   International Elephant Shark Genome Sequencing Consortium;
RA   Venkatesh B., Lee A.P., Ravi V., Maurya A.K., Lian M.M., Swann J.B.,
RA   Ohta Y., Flajnik M.F., Sutoh Y., Kasahara M., Hoon S., Gangu V., Roy S.W.,
RA   Irimia M., Korzh V., Kondrychyn I., Lim Z.W., Tay B.H., Tohari S.,
RA   Kong K.W., Ho S., Lorente-Galdos B., Quilez J., Marques-Bonet T.,
RA   Raney B.J., Ingham P.W., Tay A., Hillier L.W., Minx P., Boehm T.,
RA   Wilson R.K., Brenner S., Warren W.C.;
RT   "Elephant shark genome provides unique insights into gnathostome
RT   evolution.";
RL   Nature 505:174-179(2014).
RN   [4] {ECO:0000313|Ensembl:ENSCMIP00000046860.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A4W3KHJ8; -.
DR   Ensembl; ENSCMIT00000047524.1; ENSCMIP00000046860.1; ENSCMIG00000019213.1.
DR   GeneTree; ENSGT00940000153321; -.
DR   Proteomes; UP000314986; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411:SF971; CATHEPSIN L-RELATED; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000314986};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW   Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..330
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5021376989"
FT   DOMAIN          30..91
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          119..329
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   330 AA;  37123 MW;  36677A9E2AC057DD CRC64;
     MNFLLRGCFV VFVLVKASTV SINPELDNEW LSWKSFHKKL YQEVRKDAAF RRNVWEQNLQ
     YIEKHNTDYF MGKHTFAVGM NQFGDMTVEE FSNLLNRRNT RQYNNKTCNI FTADDKTDLP
     QSVDWRPKGY VTGVKDQKAC GSCWAFCTTG VLEGQWFKKT GKLISLSEQY LMDCSQSVGN
     HGCNGGSSIH SLLFIKEKGI NSEESYPYTA KGCTYNESVA TCQGVKMIRK NSERDLAAAV
     ATVGPIAVGF DAGRASFMFY KSGIYYDEAC SKTVMDHGLL VVGYGTEAGE PYWIVKNSWG
     VKWGNSGYFH TAKDKGNNCG IASYAHYPVV
//
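The SQ header above declares a length (330 AA) that can be checked against the residues that follow it. Below is a minimal sketch of that check, assuming the entry is available as a plain-text string in the flat-file layout shown here. The function name `parse_sq_block` is hypothetical and not part of any UniProt or DBGET tooling. It reads only the SQ block and ignores every other line code.

```python
import re


def parse_sq_block(record: str):
    """Return (declared_length, sequence) from one flat-file entry.

    Assumes the layout shown in the entry above: an 'SQ' header line,
    then indented residue lines in blocks of ten, ended by '//'.
    """
    declared = None
    residues = []
    in_sequence = False
    for line in record.splitlines():
        if line.startswith("SQ "):
            # Header form: SQ   SEQUENCE   330 AA;  37123 MW;  ... CRC64;
            match = re.search(r"SEQUENCE\s+(\d+)\s+AA;", line)
            if match is None:
                raise ValueError("unrecognized SQ header: " + line)
            declared = int(match.group(1))
            in_sequence = True
        elif line.startswith("//"):
            # End-of-entry terminator
            break
        elif in_sequence:
            # Drop the indentation and the spaces between 10-residue blocks
            residues.append("".join(line.split()))
    if declared is None:
        raise ValueError("no SQ line found")
    return declared, "".join(residues)
```

Calling `parse_sq_block` on the text of this entry and comparing the declared length with `len(sequence)` is a quick way to detect truncated or mis-wrapped sequence lines after copying the record from a web page.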