Database: UniProt
Entry: A0A118JYY1_CYNCS
ID   A0A118JYY1_CYNCS        Unreviewed;       690 AA.
AC   A0A118JYY1;
DT   13-APR-2016, integrated into UniProtKB/TrEMBL.
DT   13-APR-2016, sequence version 1.
DT   24-JAN-2024, entry version 18.
DE   SubName: Full=Cysteine peptidase, asparagine active site-containing protein {ECO:0000313|EMBL:KVH98169.1};
GN   ORFNames=Ccrd_023621 {ECO:0000313|EMBL:KVH98169.1};
OS   Cynara cardunculus var. scolymus (Globe artichoke) (Cynara scolymus).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae;
OC   Carduinae; Cynara.
OX   NCBI_TaxID=59895 {ECO:0000313|EMBL:KVH98169.1, ECO:0000313|Proteomes:UP000243975};
RN   [1] {ECO:0000313|EMBL:KVH98169.1, ECO:0000313|Proteomes:UP000243975}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=2C {ECO:0000313|EMBL:KVH98169.1};
RX   PubMed=26786968; DOI=10.1038/srep19427;
RA   Scaglione D., Reyes-Chin-Wo S., Acquadro A., Froenicke L., Portis E.,
RA   Beitel C., Tirone M., Mauro R., Lo Monaco A., Mauromicale G., Faccioli P.,
RA   Cattivelli L., Rieseberg L., Michelmore R., Lanteri S.;
RT   "The genome sequence of the outbreeding globe artichoke constructed de novo
RT   incorporating a phase-aware low-pass sequencing strategy of F1 progeny.";
RL   Sci. Rep. 6:19427-19427(2016).
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KVH98169.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LEKV01003807; KVH98169.1; -; Genomic_DNA.
DR   STRING; 59895.A0A118JYY1; -.
DR   EnsemblPlants; KVH98169; KVH98169; Ccrd_023621.
DR   Gramene; KVH98169; KVH98169; Ccrd_023621.
DR   Proteomes; UP000243975; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd02248; Peptidase_C1A; 2.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 2.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR025661; Pept_asp_AS.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   PANTHER; PTHR12411:SF897; CYSTEINE PROTEINASES SUPERFAMILY PROTEIN-RELATED; 1.
DR   Pfam; PF08246; Inhibitor_I29; 2.
DR   Pfam; PF00112; Peptidase_C1; 2.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 2.
DR   SMART; SM00645; Pept_C1; 2.
DR   SUPFAM; SSF54001; Cysteine proteinases; 2.
DR   PROSITE; PS00640; THIOL_PROTEASE_ASN; 2.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 2.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 2.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000243975};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..690
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5007159813"
FT   DOMAIN          42..98
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          131..351
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
FT   DOMAIN          369..425
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          458..674
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
FT   UNSURE          47
FT                   /note="D or N"
FT                   /evidence="ECO:0000313|EMBL:KVH98169.1"
SQ   SEQUENCE   690 AA;  77223 MW;  C46C4ADC394F66D3 CRC64;
     METTRLVLFS LVLVLALGFV SAGSFEYTED DLASDEAKWA LYERWRDHHK VPEQSDDEKQ
     KRFIIFMDTV KRVDNHNKAK KPYLMELNKL XDLTXXEIVR TYTGAKLDEH HRMLSSRGNS
     SRLKYADRHD LPAEVDWRQF NNVVVAPKNQ AQCGSCWAFA AMGALESAHG LKTGNLISLS
     EQQIIDCDTE KNGGCNGGVP AYALNYVAKH GGMTTSECYP YNDPAGQTVC CGAKLQNRVV
     HCQGFEDIPI DDEPAMMERV AEQPVSACLF VYEGFYGYKE GIYTGDDCVG EQNPHAINIV
     GYGTTPEGCK YWIIKNSWGE DWGEKGYMRL AREVGNPRGA CSITMQTCFP SFDYHEEELE
     TEESLWELYE RWRSHHRMAA ASRQEKHRRF NVFKSNLQHV HNTNRMNKPY KLKLNRFADM
     TNHEFTSAYA GSKVSHHRML HGDRISNKGF MYANHDNVPA SIDWRKENAV TPVKDQGHCG
     SCWAFSTVVA VEGLNQIKTK KLVSLSEQEL VDCDTGKNEG CDGGLMDLAF DFIKKNGGLT
     TEDNYPYTAA AGSCKTVKEG VPTVSIDGHE DVPVNDEDAL MKAVANQPVS VAIDAGGSDF
     QFYSQGVFTG KCGTQLNHGV AVVGYDMSDD GTKYWIVKNS WGADWGENGY IRMQRGVPEK
     TGLCGIAMEA SYPIKNSDTN PKASFIRDEL
//