GenomeNet

Database: UniProt
Entry: A0A669ETU5_ORENI
LinkDB: A0A669ETU5_ORENI
Original site: A0A669ETU5_ORENI 
ID   A0A669ETU5_ORENI        Unreviewed;       333 AA.
AC   A0A669ETU5;
DT   17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 1.
DT   27-MAR-2024, entry version 16.
DE   SubName: Full=Cathepsin L1 {ECO:0000313|Ensembl:ENSONIP00000074883.1};
GN   Name=LOC100704815 {ECO:0000313|Ensembl:ENSONIP00000074883.1};
OS   Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX   NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000074883.1, ECO:0000313|Proteomes:UP000005207};
RN   [1] {ECO:0000313|Proteomes:UP000005207}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG   Broad Institute Genome Assembly Team;
RG   Broad Institute Sequencing Platform;
RA   Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT   "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL   Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSONIP00000074883.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_003453254.1; XM_003453206.4.
DR   AlphaFoldDB; A0A669ETU5; -.
DR   Ensembl; ENSONIT00000069615.1; ENSONIP00000074883.1; ENSONIG00000003460.2.
DR   GeneID; 100704815; -.
DR   KEGG; onl:100704815; -.
DR   GeneTree; ENSGT00940000163885; -.
DR   OMA; MMEAFEY; -.
DR   OrthoDB; 5472948at2759; -.
DR   Proteomes; UP000005207; Linkage group LG19.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411:SF1011; CATHEPSIN L.1; 1.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW   Zymogen {ECO:0000256|ARBA:ARBA00023145}.
FT   SIGNAL          1..18
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           19..333
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5025627113"
FT   DOMAIN          26..86
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          118..332
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   333 AA;  37178 MW;  C0DF774A55D048E1 CRC64;
     MKLLLVVAAV LVVSSCASIS LEDMEFHAWK LKFKKSYDSP SEETHRKQVW LNNRKLVLIH
     NALADQGLKS FHLGMTYFAD MENQEYKKLI SQGCLGSFNA SLHRRGSTFN RLPKGTKLPK
     TVDWRKQGYV TKVKHQKECG SCWAFSATGA LEGQHFRKTR KLVSLSEQQL VDCSRSFGNH
     GCNGGWMNPA FQYIRYNGGL DTEDSYPYKA KDGICHYNPN SVGAICSGHV DVSPDEAALK
     QAVATIGPIS IAVDASHESF QLYQSGVYDE HRCNKKHVTH AMLVVGYGTE GGHDYWLIKN
     SWGLQWGDKG YIKMTRNKGN QCGIATAASY PLV
//
DBGET integrated database retrieval system