ID G5C2H0_HETGA Unreviewed; 462 AA.
AC G5C2H0;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 24-JAN-2024, entry version 57.
DE RecName: Full=Dipeptidyl peptidase 1 {ECO:0000256|ARBA:ARBA00014709};
DE EC=3.4.14.1 {ECO:0000256|ARBA:ARBA00012059};
DE AltName: Full=Cathepsin C {ECO:0000256|ARBA:ARBA00029779};
DE AltName: Full=Cathepsin J {ECO:0000256|ARBA:ARBA00029762};
DE AltName: Full=Dipeptidyl peptidase I {ECO:0000256|ARBA:ARBA00032961};
DE AltName: Full=Dipeptidyl transferase {ECO:0000256|ARBA:ARBA00030778};
GN Name=CTSC {ECO:0000313|EMBL:JAN98325.1};
GN ORFNames=GW7_12090 {ECO:0000313|EMBL:EHB15731.1};
OS Heterocephalus glaber (Naked mole rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Bathyergidae;
OC Heterocephalus.
OX NCBI_TaxID=10181 {ECO:0000313|EMBL:EHB15731.1, ECO:0000313|Proteomes:UP000006813};
RN [1] {ECO:0000313|EMBL:EHB15731.1, ECO:0000313|Proteomes:UP000006813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993625; DOI=10.1038/nature10533;
RA Kim E.B., Fang X., Fushan A.A., Huang Z., Lobanov A.V., Han L.,
RA Marino S.M., Sun X., Turanov A.A., Yang P., Yim S.H., Zhao X.,
RA Kasaikina M.V., Stoletzki N., Peng C., Polak P., Xiong Z., Kiezun A.,
RA Zhu Y., Chen Y., Kryukov G.V., Zhang Q., Peshkin L., Yang L., Bronson R.T.,
RA Buffenstein R., Wang B., Han C., Li Q., Chen L., Zhao W., Sunyaev S.R.,
RA Park T.J., Zhang G., Wang J., Gladyshev V.N.;
RT "Genome sequencing reveals insights into physiology and longevity of the
RT naked mole rat.";
RL Nature 479:223-227(2011).
RN [2] {ECO:0000313|EMBL:JAN98325.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Liver {ECO:0000313|EMBL:JAN98325.1};
RA Bens M., Sahm A., Jahn N., Morhart M., Holtze S., Hildebrandt T.B.,
RA Platzer M., Szafranski K.;
RT "FRAMA: From RNA-seq data to annotated mRNA assemblies.";
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Release of an N-terminal dipeptide, Xaa-Yaa-|-Zaa-, except
CC when Xaa is Arg or Lys, or Yaa or Zaa is Pro.; EC=3.4.14.1;
CC Evidence={ECO:0000256|ARBA:ARBA00000738};
CC -!- COFACTOR:
CC Name=chloride; Xref=ChEBI:CHEBI:17996;
CC Evidence={ECO:0000256|ARBA:ARBA00001923};
CC -!- SUBUNIT: Tetramer of heterotrimers consisting of exclusion domain,
CC heavy- and light chains. {ECO:0000256|ARBA:ARBA00011610}.
CC -!- SUBCELLULAR LOCATION: Lysosome {ECO:0000256|ARBA:ARBA00004371}.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH173078; EHB15731.1; -; Genomic_DNA.
DR EMBL; GEBF01005307; JAN98325.1; -; Transcribed_RNA.
DR RefSeq; XP_004868266.1; XM_004868209.2.
DR AlphaFoldDB; G5C2H0; -.
DR STRING; 10181.G5C2H0; -.
DR MEROPS; C01.070; -.
DR Ensembl; ENSHGLT00100017547; ENSHGLP00100017362; ENSHGLG00100012805.
DR GeneID; 101697606; -.
DR KEGG; hgl:101697606; -.
DR CTD; 1075; -.
DR eggNOG; KOG1543; Eukaryota.
DR InParanoid; G5C2H0; -.
DR OMA; HWDWRNV; -.
DR OrthoDB; 5475703at2759; -.
DR Proteomes; UP000006813; Unassembled WGS sequence.
DR Bgee; ENSHGLG00000001420; Expressed in liver and 10 other cell types or tissues.
DR GO; GO:0005764; C:lysosome; IEA:UniProtKB-SubCell.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0008239; F:dipeptidyl-peptidase activity; IEA:UniProtKB-EC.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02621; Peptidase_C1A_CathepsinC; 1.
DR Gene3D; 2.40.128.80; Cathepsin C, exclusion domain; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR039412; CatC.
DR InterPro; IPR014882; CathepsinC_exc.
DR InterPro; IPR036496; CathepsinC_exc_dom_sf.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF942; DIPEPTIDYL PEPTIDASE 1; 1.
DR Pfam; PF08773; CathepsinC_exc; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF75001; Dipeptidyl peptidase I (cathepsin C), exclusion domain; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Chloride {ECO:0000256|ARBA:ARBA00023214};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Lysosome {ECO:0000256|ARBA:ARBA00023228};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000006813};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..462
FT /note="Dipeptidyl peptidase 1"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007361741"
FT DOMAIN 230..457
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 462 AA; 52296 MW; E226E3171CF796E4 CRC64;
MGLRPHSLLA ALLLLLLGLR GVRCDTPANC TYPDLLGTWV FQVGPGGSGR DVNCSAMEPP
EKKVAVHLKK MDTAYDDFGN TGYFTIIYNQ GFEIVLNDYK WFAFFKYETK GSKVISYCQE
TMTGWVHDVL GRNWACFVGK KVESPENVNV NTAYLESHLE KYSNRLYKYD HKFVKAINAV
QKSWTATTYK EYETLTLREM ARRRGGHNQI IPRPKPAPLS AEIQQKILQL PKSWDWRDVH
GMNFVSPVRN QGYCGSCYSF ASMGMLEARI RILTNNTQTP ILSPQEVVSC SQYAQGCEGG
FPYLIAGKYA QDFGFVEESC FPYTGTDAPC KMKEDCMRYY TSEYHYVGGF YGGCNEALMK
LELVQHGPMA VAFEVCDDFM HYHKGIYHHT GLRDPFNPFE LTNHAVLLVG YGTDSANGMD
YWIVKNSWGT SWGEKGYFRI LRGTDECAIE SIAMAATPIP KL
//