ID A0A2A4K7C2_HELVI Unreviewed; 230 AA.
AC A0A2A4K7C2;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 16.
DE RecName: Full=Cuticle protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=B5V51_10985 {ECO:0000313|EMBL:PCG80137.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG80137.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG80137.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG80137.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG80137.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG80137.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01000055; PCG80137.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2A4K7C2; -.
DR STRING; 7102.A0A2A4K7C2; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR GO; GO:0042302; F:structural constituent of cuticle; IEA:UniProtKB-UniRule.
DR InterPro; IPR031311; CHIT_BIND_RR_consensus.
DR InterPro; IPR000618; Insect_cuticle.
DR PANTHER; PTHR12236:SF79; CUTICULAR PROTEIN 50CB-RELATED; 1.
DR PANTHER; PTHR12236; STRUCTURAL CONTITUENT OF CUTICLE; 1.
DR Pfam; PF00379; Chitin_bind_4; 1.
DR PROSITE; PS00233; CHIT_BIND_RR_1; 1.
DR PROSITE; PS51155; CHIT_BIND_RR_2; 1.
PE 4: Predicted;
KW Cuticle {ECO:0000256|ARBA:ARBA00022460, ECO:0000256|PROSITE-
KW ProRule:PRU00497}; Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..230
FT /note="Cuticle protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012675276"
FT REGION 29..121
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 176..230
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 35..50
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 51..80
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 81..114
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 184..219
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 230 AA; 24519 MW; 83F7FCBBA50BA7AB CRC64;
MWLSSIVIST LFVMAMSAGN EYLPPSKGYN YEPPSIPFPT NNNGNKQPTP PTYRPTTPPR
PYLPPNPQPT PGPGPSPTPG PHDHPHDHHH HDHDHGNGNG NHDHGDHHHH EPGMPFDFSY
QVAEDGNDYS HNAISDGDIT RGEYRVALPD GRTQIVKYTA DWKNGFNAEV TYEGEARYPD
QPTGGYGSGN SGNGYGSGST GGNGYGSGSG GSGTGGQAYV PPSQGGGYQY
//