GenomeNet

Database: UniProt
Entry: A0A2U1KJJ3_ARTAN
ID   A0A2U1KJJ3_ARTAN        Unreviewed;       437 AA.
AC   A0A2U1KJJ3;
DT   18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT   18-JUL-2018, sequence version 1.
DT   27-MAR-2024, entry version 15.
DE   SubName: Full=Peptidase C1A {ECO:0000313|EMBL:PWA36898.1};
GN   ORFNames=CTI12_AA595490 {ECO:0000313|EMBL:PWA36898.1};
OS   Artemisia annua (Sweet wormwood).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC   Artemisiinae; Artemisia.
OX   NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA36898.1, ECO:0000313|Proteomes:UP000245207};
RN   [1] {ECO:0000313|EMBL:PWA36898.1, ECO:0000313|Proteomes:UP000245207}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC   TISSUE=Leaf {ECO:0000313|EMBL:PWA36898.1};
RX   PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA   Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA   Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA   Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT   "The genome of Artemisia annua provides insight into the evolution of
RT   Asteraceae family and artemisinin biosynthesis.";
RL   Mol. Plant 11:776-788(2018).
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PWA36898.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; PKPP01017516; PWA36898.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A2U1KJJ3; -.
DR   STRING; 35608.A0A2U1KJJ3; -.
DR   Proteomes; UP000245207; Unassembled WGS sequence.
DR   GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   PANTHER; PTHR12411:SF841; CYSTEINE PROTEASE-4; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..437
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018636608"
FT   DOMAIN          49..105
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          135..329
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
SQ   SEQUENCE   437 AA;  49056 MW;  2B800B560D7D4E49 CRC64;
     MAFIISSKKT SLFFLFVSVL ACSALAHEFS ILGYAPEDLT SIHKVVHLFE SWVAKHSKFY
     ESLDEKLHRF EIFMDNLKHI DDTNKKVSNY WLGLNEFADM SHEEFKSKYL GLKGELAERK
     ETSNEEFSYR NFVDLPKSVD WRKKGAVAPV KNQGQCGSCW AFSTVAAVEG INQIVTGNLT
     ELSEQELIDC DTTVNSGCNG GLMDYAFAYI MKAGGLHKEE EYPYIMSEGT CDDKKDISEK
     VTISGYHDVP RNNEDSFLKA LANQPISVAI EASGRDFQFY SGGVFDGHCG IDLDHGVAAV
     GYGTTKGLDY VILLKEKTND VLENNQVDHL NFFDEKDTKH LQVPMIEGRV TPKVYGGAYP
     SHESERDSTT SMGEKTYFEG NVGNNEIPES INVDLNFNYG SSDEDILDVQ QQQTVARKSK
     RSTKMPLSLM IMLWIVI
//