ID A0A118JYY1_CYNCS Unreviewed; 690 AA.
AC A0A118JYY1;
DT 13-APR-2016, integrated into UniProtKB/TrEMBL.
DT 13-APR-2016, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE SubName: Full=Cysteine peptidase, asparagine active site-containing protein {ECO:0000313|EMBL:KVH98169.1};
GN ORFNames=Ccrd_023621 {ECO:0000313|EMBL:KVH98169.1};
OS Cynara cardunculus var. scolymus (Globe artichoke) (Cynara scolymus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae;
OC Carduinae; Cynara.
OX NCBI_TaxID=59895 {ECO:0000313|EMBL:KVH98169.1, ECO:0000313|Proteomes:UP000243975};
RN [1] {ECO:0000313|EMBL:KVH98169.1, ECO:0000313|Proteomes:UP000243975}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2C {ECO:0000313|EMBL:KVH98169.1};
RX PubMed=26786968; DOI=10.1038/srep19427;
RA Scaglione D., Reyes-Chin-Wo S., Acquadro A., Froenicke L., Portis E.,
RA Beitel C., Tirone M., Mauro R., Lo Monaco A., Mauromicale G., Faccioli P.,
RA Cattivelli L., Rieseberg L., Michelmore R., Lanteri S.;
RT "The genome sequence of the outbreeding globe artichoke constructed de novo
RT incorporating a phase-aware low-pass sequencing strategy of F1 progeny.";
RL Sci. Rep. 6:19427-19427(2016).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KVH98169.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LEKV01003807; KVH98169.1; -; Genomic_DNA.
DR STRING; 59895.A0A118JYY1; -.
DR EnsemblPlants; KVH98169; KVH98169; Ccrd_023621.
DR Gramene; KVH98169; KVH98169; Ccrd_023621.
DR Proteomes; UP000243975; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd02248; Peptidase_C1A; 2.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 2.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF897; CYSTEINE PROTEINASES SUPERFAMILY PROTEIN-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 2.
DR Pfam; PF00112; Peptidase_C1; 2.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 2.
DR SMART; SM00645; Pept_C1; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 2.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 2.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 2.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 2.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000243975};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..690
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007159813"
FT DOMAIN 42..98
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 131..351
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT DOMAIN 369..425
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 458..674
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT UNSURE 47
FT /note="D or N"
FT /evidence="ECO:0000313|EMBL:KVH98169.1"
SQ SEQUENCE 690 AA; 77223 MW; C46C4ADC394F66D3 CRC64;
METTRLVLFS LVLVLALGFV SAGSFEYTED DLASDEAKWA LYERWRDHHK VPEQSDDEKQ
KRFIIFMDTV KRVDNHNKAK KPYLMELNKL XDLTXXEIVR TYTGAKLDEH HRMLSSRGNS
SRLKYADRHD LPAEVDWRQF NNVVVAPKNQ AQCGSCWAFA AMGALESAHG LKTGNLISLS
EQQIIDCDTE KNGGCNGGVP AYALNYVAKH GGMTTSECYP YNDPAGQTVC CGAKLQNRVV
HCQGFEDIPI DDEPAMMERV AEQPVSACLF VYEGFYGYKE GIYTGDDCVG EQNPHAINIV
GYGTTPEGCK YWIIKNSWGE DWGEKGYMRL AREVGNPRGA CSITMQTCFP SFDYHEEELE
TEESLWELYE RWRSHHRMAA ASRQEKHRRF NVFKSNLQHV HNTNRMNKPY KLKLNRFADM
TNHEFTSAYA GSKVSHHRML HGDRISNKGF MYANHDNVPA SIDWRKENAV TPVKDQGHCG
SCWAFSTVVA VEGLNQIKTK KLVSLSEQEL VDCDTGKNEG CDGGLMDLAF DFIKKNGGLT
TEDNYPYTAA AGSCKTVKEG VPTVSIDGHE DVPVNDEDAL MKAVANQPVS VAIDAGGSDF
QFYSQGVFTG KCGTQLNHGV AVVGYDMSDD GTKYWIVKNS WGADWGENGY IRMQRGVPEK
TGLCGIAMEA SYPIKNSDTN PKASFIRDEL
//