ID A0A1D5TI66_WHEAT Unreviewed; 352 AA.
AC A0A1D5TI66;
DT 30-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2016, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Cysteine proteinase {ECO:0008006|Google:ProtNLM};
GN ORFNames=CFC21_018573 {ECO:0000313|EMBL:KAF7003218.1}, CFC21_018574
GN {ECO:0000313|EMBL:KAF7003219.1}, CFC21_018575
GN {ECO:0000313|EMBL:KAF7003220.1}, CFC21_018576
GN {ECO:0000313|EMBL:KAF7003221.1}, CFC21_018577
GN {ECO:0000313|EMBL:KAF7003222.1};
OS Triticum aestivum (Wheat).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Pooideae; Triticodae; Triticeae; Triticinae; Triticum.
OX NCBI_TaxID=4565 {ECO:0000313|EnsemblPlants:TraesCS2A02G421800.1};
RN [1] {ECO:0000313|EMBL:KAF7003218.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaf {ECO:0000313|EMBL:KAF7003218.1};
RX PubMed=29069494;
RA Zimin A.V., Puiu D., Hall R., Kingan S., Clavijo B.J., Salzberg S.L.;
RT "The first near-complete assembly of the hexaploid bread wheat genome,
RT Triticum aestivum.";
RL Gigascience 6:1-7(2017).
RN [2] {ECO:0000313|EnsemblPlants:TraesCS2A02G421800.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Chinese Spring
RC {ECO:0000313|EnsemblPlants:TraesCS2A02G421800.1};
RX PubMed=30115783; DOI=10.1126/science.aar7191;
RG International wheat genome sequencing consortium (IWGSC);
RT "Shifting the limits in wheat research and breeding using a fully annotated
RT reference genome.";
RL Science 361:EAAR7191-EAAR7191(2018).
RN [3] {ECO:0000313|EnsemblPlants:TraesCS2A02G421800.1}
RP IDENTIFICATION.
RG EnsemblPlants;
RL Submitted (OCT-2018) to UniProtKB.
RN [4] {ECO:0000313|EMBL:KAF7003218.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaf {ECO:0000313|EMBL:KAF7003218.1};
RA Zimin A.V., Puiu D., Shumante A., Alonge M., Salzberg S.L.;
RT "The second near-complete assembly of the hexaploid bread wheat (Triticum
RT aestivum) genome.";
RL Submitted (MAR-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM022214; KAF7003218.1; -; Genomic_DNA.
DR EMBL; CM022214; KAF7003219.1; -; Genomic_DNA.
DR EMBL; CM022214; KAF7003220.1; -; Genomic_DNA.
DR EMBL; CM022214; KAF7003221.1; -; Genomic_DNA.
DR EMBL; CM022214; KAF7003222.1; -; Genomic_DNA.
DR SMR; A0A1D5TI66; -.
DR STRING; 4565.A0A1D5TI66; -.
DR EnsemblPlants; TraesCS2A02G421600.1; TraesCS2A02G421600.1; TraesCS2A02G421600.
DR EnsemblPlants; TraesCS2A02G421700.1; TraesCS2A02G421700.1; TraesCS2A02G421700.
DR EnsemblPlants; TraesCS2A02G421800.1; TraesCS2A02G421800.1; TraesCS2A02G421800.
DR EnsemblPlants; TraesCS2A02G421900.1; TraesCS2A02G421900.1; TraesCS2A02G421900.
DR Gramene; TraesCS2A02G421600.1; TraesCS2A02G421600.1; TraesCS2A02G421600.
DR Gramene; TraesCS2A02G421700.1; TraesCS2A02G421700.1; TraesCS2A02G421700.
DR Gramene; TraesCS2A02G421800.1; TraesCS2A02G421800.1; TraesCS2A02G421800.
DR Gramene; TraesCS2A02G421900.1; TraesCS2A02G421900.1; TraesCS2A02G421900.
DR Gramene; TraesCS2A03G1007900.1; TraesCS2A03G1007900.1.CDS; TraesCS2A03G1007900.
DR Gramene; TraesCS2A03G1008000.1; TraesCS2A03G1008000.1.CDS; TraesCS2A03G1008000.
DR Gramene; TraesCS2A03G1008100.1; TraesCS2A03G1008100.1.CDS; TraesCS2A03G1008100.
DR Gramene; TraesCS2A03G1008200.1; TraesCS2A03G1008200.1.CDS; TraesCS2A03G1008200.
DR Gramene; TraesKAR2A01G0425140.1; cds.TraesKAR2A01G0425140.1; TraesKAR2A01G0425140.
DR Gramene; TraesKAR2A01G0425180.1; cds.TraesKAR2A01G0425180.1; TraesKAR2A01G0425180.
DR Gramene; TraesKAR2A01G0425190.1; cds.TraesKAR2A01G0425190.1; TraesKAR2A01G0425190.
DR Gramene; TraesKAR2A01G0425210.1; cds.TraesKAR2A01G0425210.1; TraesKAR2A01G0425210.
DR Gramene; TraesROB_scaffold_544622_01G000100.1; TraesROB_scaffold_544622_01G000100.1; TraesROB_scaffold_544622_01G000100.
DR OMA; SASCGRE; -.
DR OrthoDB; 808912at2759; -.
DR Proteomes; UP000019116; Chromosome 2A.
DR Proteomes; UP000815260; Chromosome 2A.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IBA:GO_Central.
DR GO; GO:0051603; P:proteolysis involved in protein catabolic process; IBA:GO_Central.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF748; CYSTEINE PROTEINASE-LIKE; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000019116};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..34
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 35..352
FT /note="Cysteine proteinase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039972531"
FT DOMAIN 44..101
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 135..348
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 352 AA; 37120 MW; 3522F1C4C89BD31B CRC64;
MAPIHSNSSR RHDGTLLALL LALVAATAFV GAAAARGDAL AARHERWMAK YGRAYTDAAE
KLHRQEVFAA NARHVDAVNR AGNRTYTLGL NQFSDLTNEE FVEKHLGYRH QPGGLRPEDT
PVAAVNMSKA QFQSTPDSMD WRAQGAVTQV KNQASCGSCW AFAAVAATEG LVQIATGNLI
SMSEQQVLDC TGDTSTCKGG SVIAALRYVA ASGGLQPEAA YAYTGQRGAC RSVMPNSAAS
VGAPRWVGLN GDEDALRELA ASQPVAVGVE ADPDFQHYMS GVFVGSSSCG QNLNHAVTVV
GYGADGGGQE YWLVKNQWGT GWGEGGYMRL TRGNGGNCGM ATVAYYPTMN SS
//