ID A0A2U1PT88_ARTAN Unreviewed; 356 AA.
AC A0A2U1PT88;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE SubName: Full=Thiol protease {ECO:0000313|EMBL:PWA88976.1};
GN ORFNames=CTI12_AA093600 {ECO:0000313|EMBL:PWA88976.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA88976.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA88976.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA88976.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- SIMILARITY: Belongs to the FPP/GGPP synthase family.
CC {ECO:0000256|ARBA:ARBA00006706, ECO:0000256|RuleBase:RU004466}.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA88976.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01000761; PWA88976.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1PT88; -.
DR STRING; 35608.A0A2U1PT88; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0016740; F:transferase activity; IEA:UniProtKB-KW.
DR GO; GO:0008299; P:isoprenoid biosynthetic process; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 1.20.58.1980; -; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR Gene3D; 1.10.600.10; Farnesyl Diphosphate Synthase; 1.
DR InterPro; IPR000118; Granulin.
DR InterPro; IPR008949; Isoprenoid_synthase_dom_sf.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR000092; Polyprenyl_synt.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF950; PROBABLE THIOL PROTEASE-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR Pfam; PF00348; polyprenyl_synt; 1.
DR SMART; SM00277; GRAN; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF57277; Granulin repeat; 1.
DR SUPFAM; SSF48576; Terpenoid synthases; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000313|EMBL:PWA88976.1};
KW Protease {ECO:0000313|EMBL:PWA88976.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW Transferase {ECO:0000256|RuleBase:RU004466}.
FT DOMAIN 2..152
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT DOMAIN 142..199
FT /note="Granulins"
FT /evidence="ECO:0000259|SMART:SM00277"
SQ SEQUENCE 356 AA; 39089 MW; 8E74B9C3A8C99913 CRC64;
MDTAYRWIIK NGGIDSEADY PYTSANGRDV KCDKTKTAKT VVSLDSYVEV ESNEDAVFCA
VASTPVTIGI QGSAYDFQLY TGGVYNGQCS SSAYSIDHAV LIVGYGSQDG KDYWIVKNSW
GTYWGMECYI LMERNTNIKN GCGDFSYCAA DQTCCCIFEF YNLCLIHGCC GYADAVCCKN
SAACCPGDYP ICDVKAGYCY KNSAKTGYSI QPQTDCLDAA LITIFQGLAF QLIDDLLDFT
GTPSSLGKGS LSDIRHGIVT APIIYSMEEF PELQSVAGNC AVIVKHLCFV QISPSSADLG
ETVCSLNFAS RVRGVEHGPA RKQTDATKLF KYKQLVIFFK IHSSNWDNNY YPTKWK
//