ID A0A2A4J7H2_HELVI Unreviewed; 341 AA.
AC A0A2A4J7H2;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 24-JAN-2024, entry version 14.
DE RecName: Full=Cuticular protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=B5V51_5818 {ECO:0000313|EMBL:PCG67911.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG67911.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG67911.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG67911.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG67911.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG67911.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01002587; PCG67911.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2A4J7H2; -.
DR STRING; 7102.A0A2A4J7H2; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR GO; GO:0042302; F:structural constituent of cuticle; IEA:UniProtKB-UniRule.
DR InterPro; IPR000618; Insect_cuticle.
DR PANTHER; PTHR10380; CUTICLE PROTEIN; 1.
DR PANTHER; PTHR10380:SF234; CUTICULAR PROTEIN 97EA, ISOFORM A; 1.
DR Pfam; PF00379; Chitin_bind_4; 1.
DR PRINTS; PR01217; PRICHEXTENSN.
DR PROSITE; PS51155; CHIT_BIND_RR_2; 1.
PE 4: Predicted;
KW Cuticle {ECO:0000256|PROSITE-ProRule:PRU00497};
KW Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..341
FT /note="Cuticular protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012833619"
FT REGION 104..313
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 132..148
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 156..195
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 198..223
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 224..238
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 251..268
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 341 AA; 37925 MW; C6D7BC192B205523 CRC64;
MTPFTRLAVL ALVACAHAQY QDERAPRYIA SEPKAPASTP VPILKQINRH NEDGSYTYGY
EAADGSFKIE TKSPAGEVKG KYGYKDDTGK VRVIEYGANK YGFQPAGEGI TVAPPTLVDE
TRRDEGQRPG KQSQSRRNQN QYQPAPAQSI DYDYNDEPAP PPPPRPVPRP APRPAPQPQY
RPQPAPQPQY RPQPAPQYRA PVQQSHQQFG VPSPTQAPQY RPAPQSQGPT PPKPAFFAGA
SPVPAEDNFF NPEPQQPQRR QYQSPKQDFR PAPQFRDYNQ DNAAPAFPRQ QEYSSPSYSA
PRPPPPQKQG QTFSMLDELL KEYALPKNGA PALHDIVFGA Y
//