ID A0A445HQC3_GLYSO Unreviewed; 343 AA.
AC A0A445HQC3;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 11.
DE SubName: Full=Senescence-specific cysteine protease SAG39 {ECO:0000313|EMBL:RZB75585.1};
GN ORFNames=D0Y65_034173 {ECO:0000313|EMBL:RZB75585.1};
OS Glycine soja (Wild soybean).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3848 {ECO:0000313|EMBL:RZB75585.1, ECO:0000313|Proteomes:UP000289340};
RN [1] {ECO:0000313|EMBL:RZB75585.1, ECO:0000313|Proteomes:UP000289340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. W05 {ECO:0000313|Proteomes:UP000289340};
RC TISSUE=Hypocotyl of etiolated seedlings {ECO:0000313|EMBL:RZB75585.1};
RA Xie M., Chung C.Y.L., Li M.-W., Wong F.-L., Chan T.-F., Lam H.-M.;
RT "A high-quality reference genome of wild soybean provides a powerful tool
RT to mine soybean genomes.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RZB75585.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QZWG01000012; RZB75585.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A445HQC3; -.
DR SMR; A0A445HQC3; -.
DR OrthoDB; 5472443at2759; -.
DR Proteomes; UP000289340; Chromosome 12.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF1032; SENESCENCE-SPECIFIC CYSTEINE PROTEASE SAG12; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000313|EMBL:RZB75585.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000289340};
KW Signal {ECO:0000256|SAM:SignalP};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..343
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5019489545"
FT DOMAIN 39..96
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 125..342
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 343 AA; 38035 MW; 3CAA2975AF65C00F CRC64;
MVAKNHFYHI SLAMLLCMAF LAFQVTCRTL QDASMYERHE QWMTRYGKVY KDPQEREKRF
RIFKENVNYI EAFNNAANKR YKLAINQFAD LTNEEFIAPR NRFKGHMCSS IIRTTTFKYE
NVTAVPSTVD WRQKGAVTPI KDQGQCGCCW AFSAVAATEG IHALTSGKLI SLSEQELVDC
DTKGVDQGCE GGLMDDAFKF VIQNHGLNTE ANYPYKGVDG KCNANEAANN AATITGYEDV
PANNEKALQK AVANQPVSVA IDASGSDFQF YKSGVFTGSC GTELDHGVTA VGYGVSNDGT
EYWLVKNSWG TEWGEEGYIR MQRGVDSEEG LCGIAMQASY PTA
//