GenomeNet

Database: UniProt
Entry: A0A2A4JZ21_HELVI
LinkDB: A0A2A4JZ21_HELVI
Original site: A0A2A4JZ21_HELVI 
ID   A0A2A4JZ21_HELVI        Unreviewed;       546 AA.
AC   A0A2A4JZ21;
DT   20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT   20-DEC-2017, sequence version 1.
DT   27-MAR-2024, entry version 16.
DE   RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
GN   ORFNames=B5V51_9048 {ECO:0000313|EMBL:PCG76632.1};
OS   Heliothis virescens (Tobacco budworm moth).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC   Noctuidae; Heliothinae; Heliothis.
OX   NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG76632.1, ECO:0000313|Proteomes:UP000218220};
RN   [1] {ECO:0000313|EMBL:PCG76632.1, ECO:0000313|Proteomes:UP000218220}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=HvINT- {ECO:0000313|EMBL:PCG76632.1};
RC   TISSUE=Whole body {ECO:0000313|EMBL:PCG76632.1};
RA   Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA   Gould F.;
RT   "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT   response to modern agricultural practices.";
RL   Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the peptidase S1 family. CLIP subfamily.
CC       {ECO:0000256|ARBA:ARBA00024195}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PCG76632.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NWSH01000406; PCG76632.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A2A4JZ21; -.
DR   STRING; 7102.A0A2A4JZ21; -.
DR   Proteomes; UP000218220; Unassembled WGS sequence.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd00190; Tryp_SPc; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001314; Peptidase_S1A.
DR   InterPro; IPR041515; PPAF-2-like_Clip.
DR   InterPro; IPR001254; Trypsin_dom.
DR   PANTHER; PTHR24258:SF150; FI16631P1-RELATED; 1.
DR   PANTHER; PTHR24258; SERINE PROTEASE-RELATED; 1.
DR   Pfam; PF18322; CLIP_1; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   PRINTS; PR00722; CHYMOTRYPSIN.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
PE   3: Inferred from homology;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..17
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           18..546
FT                   /note="Peptidase S1 domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012020097"
FT   DOMAIN          291..544
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   REGION          51..113
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        51..80
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        95..109
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   546 AA;  56424 MW;  B9A225EC99FE915E CRC64;
     MCLWILTACL GLLPANPSPV VVVNGVPAPI IVPTAAPTVA TTAATTAATT AATTAATTRP
     PTPAPTTAAT TAATTASPVT PPPTTAAPTT AAPTTSTPVT PPPTTAATTV TPAPPTTVTT
     AAATTLGGMD LSLIICIDPD VVCIFNPDET IAGPLPIDPR LGTTPAPSAA VNMPPISGYS
     AQLPTYDSAV KFPRQRRSLD DEHAVNMKDF LCSVGTGRKH SIVKRQSNCA CVPTGTCVRT
     TTNNGAGMID IRIVTPVATQ CPSGQEYCCT DSTTSTLINC GTLQSVPATA ITPAAGQANF
     GEFPWQALIL TKQNDYIAGG VLVDQLNVLT VTHRLLPYVV SGTAPNVKVR LGEWDAAGTY
     EPVPFQEYTV TKVFSHPSYN ANTLQYDIMV LRLSASVPIT PNVGAAVTIN RACLPPSSTT
     TYTGQRCWVS GWGKNMFGLQ GQYQQILKKV DVPIVAPATC QTQLQTARLG ATYVLDTTSF
     VCAGGEANKD SCTGDGGSGL VCQVNGQWVV VGLVSWGLGC ANANVPAAYV NIAGLLPWIQ
     QQIATT
//
DBGET integrated database retrieval system