ID A0A103XMI6_CYNCS Unreviewed; 401 AA.
AC A0A103XMI6;
DT 13-APR-2016, integrated into UniProtKB/TrEMBL.
DT 13-APR-2016, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Cysteine peptidase, asparagine active site-containing protein {ECO:0000313|EMBL:KVH93224.1};
GN ORFNames=Ccrd_004746 {ECO:0000313|EMBL:KVH93224.1};
OS Cynara cardunculus var. scolymus (Globe artichoke) (Cynara scolymus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae;
OC Carduinae; Cynara.
OX NCBI_TaxID=59895 {ECO:0000313|EMBL:KVH93224.1, ECO:0000313|Proteomes:UP000243975};
RN [1] {ECO:0000313|EMBL:KVH93224.1, ECO:0000313|Proteomes:UP000243975}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2C {ECO:0000313|EMBL:KVH93224.1};
RX PubMed=26786968; DOI=10.1038/srep19427;
RA Scaglione D., Reyes-Chin-Wo S., Acquadro A., Froenicke L., Portis E.,
RA Beitel C., Tirone M., Mauro R., Lo Monaco A., Mauromicale G., Faccioli P.,
RA Cattivelli L., Rieseberg L., Michelmore R., Lanteri S.;
RT "The genome sequence of the outbreeding globe artichoke constructed de novo
RT incorporating a phase-aware low-pass sequencing strategy of F1 progeny.";
RL Sci. Rep. 6:19427-19427(2016).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KVH93224.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LEKV01004791; KVH93224.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A103XMI6; -.
DR STRING; 59895.A0A103XMI6; -.
DR EnsemblPlants; KVH93224; KVH93224; Ccrd_004746.
DR Gramene; KVH93224; KVH93224; Ccrd_004746.
DR OMA; ALKHDQC; -.
DR Proteomes; UP000243975; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF972; PAPAIN FAMILY CYSTEINE PROTEASE-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000243975};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 21..40
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 46..64
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 88..144
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 171..396
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 401 AA; 44591 MW; 4F442AB1D5390147 CRC64;
MLNGSSRRWL RLPTLDTYIS IAAYIYTHTH TTFYSTFIAF LMDPRLYSLL VAFSLLIVAI
SFTAGDDTII RQVVGDGEYQ LNTEEDHFGD FKRKFRKSYA SQEEHDYRLS IFKTNLRRAK
RHQKLDPPAI HGVTQFSDMT PEEFRKHLGL RSRLKFPADA GKAPILPTDD LPEDFDWRDR
GAVTGVKNQG SCGSCWSFST TGALEGANFL ATGKLESLSE QQLVDCDHEC DPEEQGSCDS
GCNGGLMTSA FEYTLKAGGL MREKEYPYTA TDHGSCKFDK SKVVASVSNF SVVSLDEDQI
AANLVKHGPL AVAINAAYMQ TYIGGVSCPF VCSKRLDHGV LLVGYGAAGY APIRMKEKPY
WIIKNSWGES WGEKGYYKIC KGHNVCGVDS MVSTVVAAHR H
//