GenomeNet

Database: UniProt
Entry: A0A1R3KYY0_9ROSI
LinkDB: A0A1R3KYY0_9ROSI
Original site: A0A1R3KYY0_9ROSI 
ID   A0A1R3KYY0_9ROSI        Unreviewed;       313 AA.
AC   A0A1R3KYY0;
DT   12-APR-2017, integrated into UniProtKB/TrEMBL.
DT   12-APR-2017, sequence version 1.
DT   27-MAR-2024, entry version 28.
DE   RecName: Full=Peptidase C1A, papain {ECO:0008006|Google:ProtNLM};
GN   ORFNames=COLO4_03345 {ECO:0000313|EMBL:OMP12270.1};
OS   Corchorus olitorius.
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX   NCBI_TaxID=93759 {ECO:0000313|EMBL:OMP12270.1, ECO:0000313|Proteomes:UP000187203};
RN   [1] {ECO:0000313|Proteomes:UP000187203}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. O-4 {ECO:0000313|Proteomes:UP000187203};
RA   Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA   Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA   Rashid M.M., Khan S.A., Rahman M.S., Alam M., Yahiya A.S., Khan M.S.,
RA   Azam M.S., Haque T., Lashkar M.Z.H., Akhand A.I., Morshed G., Roy S.,
RA   Uddin K.S., Rabeya T., Hossain A.S., Chowdhury A., Snigdha A.R.,
RA   Mortoza M.S., Matin S.A., Hoque S.M.E., Islam M.K., Roy D.K., Haider R.,
RA   Moosa M.M., Elias S.M., Hasan A.M., Jahan S., Shafiuddin M., Mahmood N.,
RA   Shommy N.S.;
RT   "Corchorus olitorius genome sequencing.";
RL   Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OMP12270.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AWUE01009607; OMP12270.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1R3KYY0; -.
DR   STRING; 93759.A0A1R3KYY0; -.
DR   OrthoDB; 5472443at2759; -.
DR   Proteomes; UP000187203; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   PANTHER; PTHR12411:SF357; OS01G0971400 PROTEIN; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Reference proteome {ECO:0000313|Proteomes:UP000187203};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           26..313
FT                   /note="Peptidase C1A, papain"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018532245"
FT   DOMAIN          48..104
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          133..310
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   313 AA;  35359 MW;  8DD640E97B885444 CRC64;
     MAISFLSKLS ILTFLASLSL FSALAHDFSI VGYSPEHLTS TDKLIELFEL WISKHGKIYE
     SIEEKLLRFE VFKDNLKHID RRNREISSYW LGLNEFADLS HEEFKSKYLG LKPEVFRKRQ
     SPGEFTYKDV SELPKSVDWR KKGAVTPVKN QGSCGSCWAF STVAAVEGIN KIVTGNLTSL
     SEQELIDCDT SFNNGCNGGL MDYAFEFIMA NGGLHKEEDY PYLMEEGTCE EKKEESEVVT
     INGYRDVPQN NEQSLLKALA HQPLSVAIEA SDRDFQFYSG VSSFLLLKPF LHTICNRSPD
     DISGENAQGN KQE
//
DBGET integrated database retrieval system