ID A0A2A4JX11_HELVI Unreviewed; 897 AA.
AC A0A2A4JX11;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 24-JAN-2024, entry version 19.
DE RecName: Full=Peptidase S9 prolyl oligopeptidase catalytic domain-containing protein {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=B5V51_9208 {ECO:0000313|EMBL:PCG76547.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG76547.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG76547.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG76547.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG76547.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004606}; Single-
CC pass type II membrane protein {ECO:0000256|ARBA:ARBA00004606}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG76547.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01000415; PCG76547.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2A4JX11; -.
DR STRING; 7102.A0A2A4JX11; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR GO; GO:0008236; F:serine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR Gene3D; 3.40.50.1820; alpha/beta hydrolase; 2.
DR Gene3D; 2.140.10.30; Dipeptidylpeptidase IV, N-terminal domain; 2.
DR InterPro; IPR029058; AB_hydrolase.
DR InterPro; IPR001375; Peptidase_S9.
DR InterPro; IPR002469; Peptidase_S9B_N.
DR PANTHER; PTHR11731:SF200; DIPEPTIDYL PEPTIDASE FAMILY MEMBER 1; 1.
DR PANTHER; PTHR11731; PROTEASE FAMILY S9B,C DIPEPTIDYL-PEPTIDASE IV-RELATED; 1.
DR Pfam; PF00930; DPPIV_N; 1.
DR Pfam; PF00326; Peptidase_S9; 2.
DR SUPFAM; SSF53474; alpha/beta-Hydrolases; 2.
DR SUPFAM; SSF82171; DPP6 N-terminal domain-like; 1.
PE 4: Predicted;
KW Membrane {ECO:0000256|ARBA:ARBA00023136};
KW Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW Signal-anchor {ECO:0000256|ARBA:ARBA00022968};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989}.
FT DOMAIN 42..408
FT /note="Dipeptidylpeptidase IV N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00930"
FT DOMAIN 578..745
FT /note="Peptidase S9 prolyl oligopeptidase catalytic"
FT /evidence="ECO:0000259|Pfam:PF00326"
FT DOMAIN 814..880
FT /note="Peptidase S9 prolyl oligopeptidase catalytic"
FT /evidence="ECO:0000259|Pfam:PF00326"
FT REGION 427..452
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:PCG76547.1"
SQ SEQUENCE 897 AA; 98952 MW; 3D73F96D2A45C0F4 CRC64;
DEFIFRDSYG GISLMFAANQ TTKTLMPNTT FRVLEPASFS VSADRRFLLL AQNVRKIHTH
SYLARYTVYD ILTTESYPLT PLPDEVGGGV ITEGPSLLLA TWTPKGHGLI TVKDYDIYYR
PAPRSSTGYR VTESGVPGTI HNGVPDWLYE EEILGSRTAL WMSADGHMVL YATFNDSLVH
EQKYPWYGAA LDTDDPAKTY PEIRSVRYPK PGTNNPTVTL TVADIADPKH IRTRHLTPPK
VILEEGDYYF TSAQWVSLTE VCVVWLTRMQ NLSVVSVCKS PMWYCQEVYR ISSGLDGWVE
SAPSPLWSAG GGALVTLAPV RDGPAGLFRH IVRTEHNSHG PRALPLTHGS FDVVQLLAWD
HANQHIYYMG IPEGKPGQQH LYRVSSEAPR PGSPQKLPYC VTCNSQPSPS INLEYYGNLA
SSGDSTWGDG DWDEELPATS PSPTKKKKKK YPDGMPQNLP CLYHEAHFSP SSAYFVLECL
GPGVPTFSLH KTALPDPRLL AHLENNTVVK ERLAAIALPT PRTFSVQLSS GHAARVRLLL
PPGLREDEVT KYPLVLKVHG APGTQLVTER WSLDWGSLAA GAGAILASVD ARGAGGRGLG
AHHTLHRRLG TVELQDQLEV AEYLRDSLHF IDARRVAVWG RAHGGFLAAL ALASPLSVFH
CGIALTPIVR WRYYASAYAE RYMGFPNATG NYRGYADADV TKQAAALHDK MLLLVHGTAD
NNVHVQQSMA LARALADQGS MFRQQMLLLV HGTADNNVHV QQSMALARAL ADQGSMFRQQ
MLLLVHGTAD NNVHVQQSMA LARALADQGS MFRQQMLLLV HGTADNNVHV QQSMALAREL
ADQGSMFRQQ IYPDEGHSLS GVKRHLYRTM SSFLDDCFRK QVPPETKAGL RNGGNLD
//