ID A0A251S120_HELAN Unreviewed; 338 AA.
AC A0A251S120;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE SubName: Full=Putative cysteine proteinases superfamily protein {ECO:0000313|EMBL:OTF92153.1};
GN ORFNames=HannXRQ_Chr16g0518721 {ECO:0000313|EMBL:OTF92153.1};
OS Helianthus annuus (Common sunflower).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae;
OC Heliantheae alliance; Heliantheae; Helianthus.
OX NCBI_TaxID=4232 {ECO:0000313|EMBL:OTF92153.1, ECO:0000313|Proteomes:UP000215914};
RN [1] {ECO:0000313|Proteomes:UP000215914}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. SF193 {ECO:0000313|Proteomes:UP000215914};
RX PubMed=28538728; DOI=10.1038/nature22380;
RA Badouin H., Gouzy J., Grassa C.J., Murat F., Staton S.E., Cottret L.,
RA Lelandais-Briere C., Owens G.L., Carrere S., Mayjonade B., Legrand L.,
RA Gill N., Kane N.C., Bowers J.E., Hubner S., Bellec A., Berard A.,
RA Berges H., Blanchet N., Boniface M.C., Brunel D., Catrice O., Chaidir N.,
RA Claudel C., Donnadieu C., Faraut T., Fievet G., Helmstetter N., King M.,
RA Knapp S.J., Lai Z., Le Paslier M.C., Lippi Y., Lorenzon L., Mandel J.R.,
RA Marage G., Marchand G., Marquand E., Bret-Mestries E., Morien E.,
RA Nambeesan S., Nguyen T., Pegot-Espagnet P., Pouilly N., Raftis F.,
RA Sallet E., Schiex T., Thomas J., Vandecasteele C., Vares D., Vear F.,
RA Vautrin S., Crespi M., Mangin B., Burke J.M., Salse J., Munos S.,
RA Vincourt P., Rieseberg L.H., Langlade N.B.;
RT "The sunflower genome provides insights into oil metabolism, flowering and
RT Asterid evolution.";
RL Nature 546:148-152(2017).
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM007905; OTF92153.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A251S120; -.
DR STRING; 4232.A0A251S120; -.
DR InParanoid; A0A251S120; -.
DR OMA; MYENYSS; -.
DR Proteomes; UP000215914; Chromosome 16.
DR GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IBA:GO_Central.
DR GO; GO:0051603; P:proteolysis involved in protein catabolic process; IBA:GO_Central.
DR CDD; cd02248; Peptidase_C1A; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR025661; Pept_asp_AS.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR InterPro; IPR039417; Peptidase_C1A_papain-like.
DR InterPro; IPR013201; Prot_inhib_I29.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR PANTHER; PTHR12411:SF768; CYSTEINE PROTEINASES SUPERFAMILY PROTEIN-RELATED; 1.
DR Pfam; PF08246; Inhibitor_I29; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR PRINTS; PR00705; PAPAIN.
DR SMART; SM00848; Inhibitor_I29; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS00640; THIOL_PROTEASE_ASN; 1.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Reference proteome {ECO:0000313|Proteomes:UP000215914}.
FT DOMAIN 37..94
FT /note="Cathepsin propeptide inhibitor"
FT /evidence="ECO:0000259|SMART:SM00848"
FT DOMAIN 123..337
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
SQ SEQUENCE 338 AA; 38551 MW; 361BAC9BAE3590F6 CRC64;
MGLSRDRTLI LYVLIVGMWV CQITSRTLSK AYVSEKHDQW MVEYGRVYKS NAEKEMRLNI
FKKNFELIES FNSFGNQSYK LAVNQFVDRT KDEYEAYAYG LMNPHDLELP LSTSFKYERV
SEVPYRLDWR MEGAVTKVKN QFQCGCCWAF TAIAAVEGIT QITTGKLLSL SEQQLVDCDR
NVNRGCHGGY YDLAFDYIAK NGINTENYYP YHAVDETCNT TKEAFRAATI TGYEKVPTNN
ETALLMAVSK QPVSVAIDIS CYEFRYYSRG VLTHHCGTNL SHGVTVVGYG VHYGIKYWLV
KNSWGPAWGH KGYIKMKRDV NFAEGLCGIA MRASYPIA
//