GenomeNet

Database: UniProt
Entry: A0A2U1LB71_ARTAN
LinkDB: A0A2U1LB71_ARTAN
Original site: A0A2U1LB71_ARTAN 
ID   A0A2U1LB71_ARTAN        Unreviewed;       353 AA.
AC   A0A2U1LB71;
DT   18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT   18-JUL-2018, sequence version 1.
DT   27-MAR-2024, entry version 15.
DE   SubName: Full=Cysteine peptidase, asparagine active site-containing protein {ECO:0000313|EMBL:PWA46245.1};
GN   ORFNames=CTI12_AA510820 {ECO:0000313|EMBL:PWA46245.1};
OS   Artemisia annua (Sweet wormwood).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC   Artemisiinae; Artemisia.
OX   NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA46245.1, ECO:0000313|Proteomes:UP000245207};
RN   [1] {ECO:0000313|EMBL:PWA46245.1, ECO:0000313|Proteomes:UP000245207}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC   TISSUE=Leaf {ECO:0000313|EMBL:PWA46245.1};
RX   PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA   Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA   Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA   Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT   "The genome of Artemisia annua provides insight into the evolution of
RT   Asteraceae family and artemisinin biosynthesis.";
RL   Mol. Plant 11:776-788(2018).
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PWA46245.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; PKPP01010379; PWA46245.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A2U1LB71; -.
DR   STRING; 35608.A0A2U1LB71; -.
DR   Proteomes; UP000245207; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   PANTHER; PTHR12411:SF1013; CYSTEINE PROTEASE XCP2; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..353
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018716234"
FT   DOMAIN          49..105
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          135..350
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   353 AA;  39704 MW;  1EBB6C17FEAC1B43 CRC64;
     MTLFSSSRRS SFFVLFFALL SFSALAHEYS IVGYTPEDLT CIDKVINLFE SWVSKHGKFY
     DSLEEKLHRL EIFKDNLKHI DETNKKVSNY WLGLNEFADL SHEEFKNKFL GLKGELPEKR
     EESSEEFTYR DFVDLPKSVD WRKKGAVAPV KNQGSCGSCW AFSTVAAVEG INQIVTGNLT
     ELSEQELIDC DTSFNNGCNG GLMDYAFSFI VRNGGLHKEE EYPYIMSEGT CDEKKDVSER
     VTISGYHDVP RNNENSFLKA LANQPISVAI DASGRDFQFY SGGVFDGHCG TDLDHGVAAV
     GYGTSKGVDY VTVRNSWGPK WGEKGYIRMK RNTGKSEGMC GLYKMASYPT KQK
//
DBGET integrated database retrieval system