ID A0A2U1NVS1_ARTAN Unreviewed; 458 AA.
AC A0A2U1NVS1;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE SubName: Full=Granulin repeat cysteine protease family protein {ECO:0000313|EMBL:PWA77623.1};
GN ORFNames=CTI12_AA219740 {ECO:0000313|EMBL:PWA77623.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA77623.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA77623.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA77623.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA77623.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01002098; PWA77623.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1NVS1; -.
DR STRING; 35608.A0A2U1NVS1; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR Gene3D; 2.10.25.160; Granulin; 1.
DR InterPro; IPR000118; Granulin.
DR InterPro; IPR037277; Granulin_sf.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF937; CYSTEINE PROTEINASE RD21A; 1.
DR Pfam; PF00396; Granulin; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00277; GRAN; 1.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF57277; Granulin repeat; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022807};
KW Protease {ECO:0000313|EMBL:PWA77623.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..458
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018611724"
FT DOMAIN 48..104
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 135..350
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT DOMAIN 373..430
FT /note="Granulins"
FT /evidence="ECO:0000259|SMART:SM00277"
SQ SEQUENCE 458 AA; 50586 MW; 1A3066A4C51D9563 CRC64;
MKTTMTMTTL LLLLTLFTLS RAADMSIITY NHNHNLNLRT DDEVKALYES WLVQHGKTYN
ALGEKDRRFE IFKDNIRFID EHNSVERSYK VGLNKFADLS NEEYRTMYTG VKKNKNLGLN
KVKSDRYVLR SGESLPESVD WRDKGAVAAV KDQGSCGSCW AFSTTGAVEG INQIFTGDLI
SVSEQELVDC DTSYNQGCNG GLMDYAFEFI IKNGGIDTEE DYPYTGRDGR CDATRKNAKV
VSIDGYEDVP INDESALQKA VSNQPIAVAI EAGGREFQFY TSGIFTGSCG TDLDHGVLAV
GYGSEGGKDY WIVKNSWGAE WGESGYLKME RNIAEKTGKC GIAMEASYPT KTGQNPPNPG
PSPPSPVTPE VVCDEYSTCP EATTCCCIYE YYGYCFAWGC CPLEGASCCD DHYSCCPHDY
PVCNVRRGTC SKSKNSPLEV NAIKRILATP THLKRSSA
//