ID A0A251U721_HELAN Unreviewed; 1018 AA.
AC A0A251U721;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Insulysin {ECO:0000313|EMBL:KAF5796026.1};
DE EC=3.4.24.56 {ECO:0000313|EMBL:KAF5796026.1};
DE SubName: Full=Putative insulinase (Peptidase family M16) family protein {ECO:0000313|EMBL:OTG19130.1};
GN ORFNames=HannXRQ_Chr08g0230671 {ECO:0000313|EMBL:OTG19130.1},
GN HanXRQr2_Chr08g0346621 {ECO:0000313|EMBL:KAF5796026.1};
OS Helianthus annuus (Common sunflower).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae;
OC Heliantheae alliance; Heliantheae; Helianthus.
OX NCBI_TaxID=4232 {ECO:0000313|EMBL:OTG19130.1, ECO:0000313|Proteomes:UP000215914};
RN [1] {ECO:0000313|EMBL:KAF5796026.1, ECO:0000313|Proteomes:UP000215914}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. SF193 {ECO:0000313|Proteomes:UP000215914};
RC TISSUE=Leaves {ECO:0000313|EMBL:KAF5796026.1};
RX PubMed=28538728; DOI=10.1038/nature22380;
RA Badouin H., Gouzy J., Grassa C.J., Murat F., Staton S.E., Cottret L.,
RA Lelandais-Briere C., Owens G.L., Carrere S., Mayjonade B., Legrand L.,
RA Gill N., Kane N.C., Bowers J.E., Hubner S., Bellec A., Berard A.,
RA Berges H., Blanchet N., Boniface M.C., Brunel D., Catrice O., Chaidir N.,
RA Claudel C., Donnadieu C., Faraut T., Fievet G., Helmstetter N., King M.,
RA Knapp S.J., Lai Z., Le Paslier M.C., Lippi Y., Lorenzon L., Mandel J.R.,
RA Marage G., Marchand G., Marquand E., Bret-Mestries E., Morien E.,
RA Nambeesan S., Nguyen T., Pegot-Espagnet P., Pouilly N., Raftis F.,
RA Sallet E., Schiex T., Thomas J., Vandecasteele C., Vares D., Vear F.,
RA Vautrin S., Crespi M., Mangin B., Burke J.M., Salse J., Munos S.,
RA Vincourt P., Rieseberg L.H., Langlade N.B.;
RT "The sunflower genome provides insights into oil metabolism, flowering and
RT Asterid evolution.";
RL Nature 546:148-152(2017).
RN [2] {ECO:0000313|EMBL:OTG19130.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Leaves {ECO:0000313|EMBL:OTG19130.1};
RA Langlade N., Munos S.;
RT "Sunflower complete genome.";
RL Submitted (FEB-2017) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:KAF5796026.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaves {ECO:0000313|EMBL:KAF5796026.1};
RA Gouzy J., Langlade N., Munos S.;
RT "Helianthus annuus Genome sequencing and assembly Release 2.";
RL Submitted (JUN-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase M16 family.
CC {ECO:0000256|ARBA:ARBA00007261, ECO:0000256|RuleBase:RU004447}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MNCJ02000323; KAF5796026.1; -; Genomic_DNA.
DR EMBL; CM007897; OTG19130.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A251U721; -.
DR STRING; 4232.A0A251U721; -.
DR EnsemblPlants; mRNA:HanXRQr2_Chr08g0346621; mRNA:HanXRQr2_Chr08g0346621; HanXRQr2_Chr08g0346621.
DR Gramene; mRNA:HanXRQr2_Chr08g0346621; mRNA:HanXRQr2_Chr08g0346621; HanXRQr2_Chr08g0346621.
DR InParanoid; A0A251U721; -.
DR OMA; INQVMEH; -.
DR OrthoDB; 129328at2759; -.
DR Proteomes; UP000215914; Chromosome 8.
DR GO; GO:0046872; F:metal ion binding; IEA:InterPro.
DR GO; GO:0004222; F:metalloendopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR Gene3D; 3.30.830.10; Metalloenzyme, LuxS/M16 peptidase-like; 4.
DR InterPro; IPR011249; Metalloenz_LuxS/M16.
DR InterPro; IPR011765; Pept_M16_N.
DR InterPro; IPR001431; Pept_M16_Zn_BS.
DR InterPro; IPR007863; Peptidase_M16_C.
DR InterPro; IPR032632; Peptidase_M16_M.
DR PANTHER; PTHR43690:SF18; INSULIN-DEGRADING ENZYME-RELATED; 1.
DR PANTHER; PTHR43690; NARDILYSIN; 1.
DR Pfam; PF00675; Peptidase_M16; 1.
DR Pfam; PF05193; Peptidase_M16_C; 2.
DR Pfam; PF16187; Peptidase_M16_M; 1.
DR SUPFAM; SSF63411; LuxS/MPP-like metallohydrolase; 4.
DR PROSITE; PS00143; INSULINASE; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000313|EMBL:KAF5796026.1};
KW Metalloprotease {ECO:0000256|ARBA:ARBA00023049};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Reference proteome {ECO:0000313|Proteomes:UP000215914};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 91..224
FT /note="Peptidase M16 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00675"
FT DOMAIN 250..430
FT /note="Peptidase M16 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF05193"
FT DOMAIN 437..721
FT /note="Peptidase M16 middle/third"
FT /evidence="ECO:0000259|Pfam:PF16187"
FT DOMAIN 726..911
FT /note="Peptidase M16 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF05193"
FT REGION 41..95
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 53..85
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1018 AA; 116157 MW; 061D71CD1B4BDEA5 CRC64;
MVVGASDFSS DDIVVKSPND RRLYRYIKLP NGLCALLVHD PDIYADGPPE TVTPDDVSED
DEDEDEEDDD EESGEEEDDD DEDEERGSAK SNAPQTKKAA AAMCVGMGSF CDPLEAQGLA
HFLEHMLFMG SAEFPDENEY DSYLSKHGGS SNAYTEVEHT CYHFEVKPEF LLGALKRFSQ
FFISPLVKTE AMDREVLAVD SEFNQALQSD ACRLQQLQCH TAAPGHAFNQ FFWGNKKSLV
DAMENGVNLR DQIFKLYNEF YHGGLLKLVV IGGESLDVLE SWVVELFDKV KTSNASKSEV
KPGLPVWRAG KIYRLEAVKD VHILDLSWTL PCLRKAYVKK AEDYLAHLIG HEGRGSLLFF
LKAEGWATSI SAGVGDDGMQ RSSVAYVFGM SIHLTDSGLE KIYEIIGFVY QYLKLLRQVG
PQEWIYRELQ DIANMDFTFT EEQPQDEYAA ELSANLLIYP PEHTIYGDYA YKEWDEEMIK
HVLSFFTPDN MRTDILSKSI NKSQDVKCEP WFGSHYTEED ISPSLLELWR DPPEINAALH
LPEKNEFIPQ DFSIRANNIS FDSMGTTPPK CILDEPLMKF WYKLDTTFRS PRANTYFRVT
LNGAYSGLKH VLLTELFLNL LKDKLNDVVY QASVAKLDTS ISLVSDKLEL KVYGFNDKLP
VLLSKILETA KSFVPADDRF VVIKEDMERN LRNANMKPLN HSSYLRLQLL CQSFWDVDEK
LGLLHKLSLA DLKAFIPELF SQLYIEGLCH GNLLEEEAKN VSDIFKKYFS VQPLPSEMRH
KDNILCLPPS ADLVRDVTVK NKLDTNSVVE LYYQIEPEVG SDLAKLKALI DLLDEIVEEP
LFNQLRTKEQ LGYVVDCSPR VTYRILGFCF RVQSSEYSPV YLQGRIDKFI NEMDGLLSEL
DDESFQNFKS GLIAKLLEKD PSLHYETNRY WGQITDHRYM FDLSAKEAEE VKRLEKSEIT
NWYNTYLRKA SPKCRRLAVR VWGCNTNINE SKTNLASVKV IDDLVAFKAS SAFYPAFC
//