GenomeNet

Database: UniProt
Entry: A0A251UNX9_HELAN
LinkDB: A0A251UNX9_HELAN
Original site: A0A251UNX9_HELAN 
ID   A0A251UNX9_HELAN        Unreviewed;       499 AA.
AC   A0A251UNX9;
DT   22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT   22-NOV-2017, sequence version 1.
DT   27-MAR-2024, entry version 23.
DE   SubName: Full=Actinidain {ECO:0000313|EMBL:KAF5810867.1};
DE            EC=3.4.22.14 {ECO:0000313|EMBL:KAF5810867.1};
DE   SubName: Full=Putative granulin, Cysteine peptidase, histidine active site protein {ECO:0000313|EMBL:OTG24723.1};
GN   ORFNames=HannXRQ_Chr05g0139861 {ECO:0000313|EMBL:OTG24723.1},
GN   HanXRQr2_Chr04g0174741 {ECO:0000313|EMBL:KAF5810867.1};
OS   Helianthus annuus (Common sunflower).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   asterids; campanulids; Asterales; Asteraceae; Asteroideae;
OC   Heliantheae alliance; Heliantheae; Helianthus.
OX   NCBI_TaxID=4232 {ECO:0000313|EMBL:OTG24723.1, ECO:0000313|Proteomes:UP000215914};
RN   [1] {ECO:0000313|EMBL:KAF5810867.1, ECO:0000313|Proteomes:UP000215914}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. SF193 {ECO:0000313|Proteomes:UP000215914};
RC   TISSUE=Leaves {ECO:0000313|EMBL:KAF5810867.1};
RX   PubMed=28538728; DOI=10.1038/nature22380;
RA   Badouin H., Gouzy J., Grassa C.J., Murat F., Staton S.E., Cottret L.,
RA   Lelandais-Briere C., Owens G.L., Carrere S., Mayjonade B., Legrand L.,
RA   Gill N., Kane N.C., Bowers J.E., Hubner S., Bellec A., Berard A.,
RA   Berges H., Blanchet N., Boniface M.C., Brunel D., Catrice O., Chaidir N.,
RA   Claudel C., Donnadieu C., Faraut T., Fievet G., Helmstetter N., King M.,
RA   Knapp S.J., Lai Z., Le Paslier M.C., Lippi Y., Lorenzon L., Mandel J.R.,
RA   Marage G., Marchand G., Marquand E., Bret-Mestries E., Morien E.,
RA   Nambeesan S., Nguyen T., Pegot-Espagnet P., Pouilly N., Raftis F.,
RA   Sallet E., Schiex T., Thomas J., Vandecasteele C., Vares D., Vear F.,
RA   Vautrin S., Crespi M., Mangin B., Burke J.M., Salse J., Munos S.,
RA   Vincourt P., Rieseberg L.H., Langlade N.B.;
RT   "The sunflower genome provides insights into oil metabolism, flowering and
RT   Asterid evolution.";
RL   Nature 546:148-152(2017).
RN   [2] {ECO:0000313|EMBL:OTG24723.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   TISSUE=Leaves {ECO:0000313|EMBL:OTG24723.1};
RA   Langlade N., Munos S.;
RT   "Sunflower complete genome.";
RL   Submitted (FEB-2017) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:KAF5810867.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Leaves {ECO:0000313|EMBL:KAF5810867.1};
RA   Gouzy J., Langlade N., Munos S.;
RT   "Helianthus annuus Genome sequencing and assembly Release 2.";
RL   Submitted (JUN-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase C1 family.
CC       {ECO:0000256|ARBA:ARBA00008455}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; MNCJ02000319; KAF5810867.1; -; Genomic_DNA.
DR   EMBL; CM007894; OTG24723.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A251UNX9; -.
DR   STRING; 4232.A0A251UNX9; -.
DR   EnsemblPlants; mRNA:HanXRQr2_Chr04g0174741; mRNA:HanXRQr2_Chr04g0174741; HanXRQr2_Chr04g0174741.
DR   Gramene; mRNA:HanXRQr2_Chr04g0174741; mRNA:HanXRQr2_Chr04g0174741; HanXRQr2_Chr04g0174741.
DR   InParanoid; A0A251UNX9; -.
DR   OMA; SCEDAPY; -.
DR   OrthoDB; 5472443at2759; -.
DR   Proteomes; UP000215914; Chromosome 5.
DR   GO; GO:0005615; C:extracellular space; IBA:GO_Central.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IBA:GO_Central.
DR   GO; GO:0051603; P:proteolysis involved in protein catabolic process; IBA:GO_Central.
DR   CDD; cd02248; Peptidase_C1A; 1.
DR   Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR   Gene3D; 2.10.25.160; Granulin; 1.
DR   InterPro; IPR000118; Granulin.
DR   InterPro; IPR037277; Granulin_sf.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR000169; Pept_cys_AS.
DR   InterPro; IPR025660; Pept_his_AS.
DR   InterPro; IPR013128; Peptidase_C1A.
DR   InterPro; IPR000668; Peptidase_C1A_C.
DR   InterPro; IPR039417; Peptidase_C1A_papain-like.
DR   InterPro; IPR013201; Prot_inhib_I29.
DR   PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR   PANTHER; PTHR12411:SF950; PROBABLE THIOL PROTEASE-RELATED; 1.
DR   Pfam; PF00396; Granulin; 1.
DR   Pfam; PF08246; Inhibitor_I29; 1.
DR   Pfam; PF00112; Peptidase_C1; 1.
DR   PRINTS; PR00705; PAPAIN.
DR   SMART; SM00277; GRAN; 1.
DR   SMART; SM00848; Inhibitor_I29; 1.
DR   SMART; SM00645; Pept_C1; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   SUPFAM; SSF57277; Granulin repeat; 1.
DR   PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR   PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Hydrolase {ECO:0000313|EMBL:KAF5810867.1};
KW   Protease {ECO:0000256|ARBA:ARBA00022807};
KW   Reference proteome {ECO:0000313|Proteomes:UP000215914};
KW   Signal {ECO:0000256|SAM:SignalP};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..499
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018532008"
FT   DOMAIN          56..115
FT                   /note="Cathepsin propeptide inhibitor"
FT                   /evidence="ECO:0000259|SMART:SM00848"
FT   DOMAIN          149..369
FT                   /note="Peptidase C1A papain C-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00645"
FT   DOMAIN          399..456
FT                   /note="Granulins"
FT                   /evidence="ECO:0000259|SMART:SM00277"
FT   REGION          373..392
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   499 AA;  55131 MW;  6B487BF8421F0EE2 CRC64;
     MGTSNFILTL LLLIIYVSSL SSSSPISLSS TLPPEFSILH GQENDVLSTE RVSELFVKWK
     EVHGKVYDHQ EEEERRLGNF RNSLKFILER NSKRKSEWDH TVGLTKFADL SNEEFKEMYF
     SKSKGPKSEK LKVWGEKGNT TLNSGSCDAP ASLDWRDKGV VTPMKDQGQC GSCWAFSVTG
     SIEGAHAIAT GDLISLSEQE LVSCDTNDYG CDGGNMDTAY RWIIKNGGLN SEEAYPYTSS
     NGRDGKCDKE KSQISVVSIS SYVEVESNED AVLCAVAKQP VTIGICGSAY DFQLYTGGIY
     NGQCSSSAYS LDHAVLIVGY GTQDGEDYWI VKNQWGTYWG LEGYVLMKRG SDVNKNGVCG
     MYLEAMYPIP AEPSPPSPPA PPSPPHPPPP PPAPTPDKCG NFHYCAADQT CCCILEFYNY
     CFMYGCCGYT NAVCCKGSSY CCPSDYPVCD IQAGYCFKKS GGTFGVAAKK RQMAKHKMPW
     EKVEETVMEE YQPMVWKRK
//
DBGET integrated database retrieval system