ID A0A2A4K1X0_HELVI Unreviewed; 836 AA.
AC A0A2A4K1X0;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:PCG78016.1};
DE Flags: Fragment;
GN ORFNames=B5V51_5558 {ECO:0000313|EMBL:PCG78016.1};
OS Heliothis virescens (Tobacco budworm moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Heliothinae; Heliothis.
OX NCBI_TaxID=7102 {ECO:0000313|EMBL:PCG78016.1, ECO:0000313|Proteomes:UP000218220};
RN [1] {ECO:0000313|EMBL:PCG78016.1, ECO:0000313|Proteomes:UP000218220}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HvINT- {ECO:0000313|EMBL:PCG78016.1};
RC TISSUE=Whole body {ECO:0000313|EMBL:PCG78016.1};
RA Fritz M.L., Deyonke A.M., Papanicolaou A., Micinski S., Westbrook J.,
RA Gould F.;
RT "Contemporary evolution of a Lepidopteran species, Heliothis virescens, in
RT response to modern agricultural practices.";
RL Submitted (SEP-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PCG78016.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NWSH01000251; PCG78016.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2A4K1X0; -.
DR STRING; 7102.A0A2A4K1X0; -.
DR Proteomes; UP000218220; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1100; FIBRILLAR COLLAGEN NC1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 7.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000218220};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 553..598
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 635..801
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 48..505
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 118..132
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 179..193
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 265..281
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 340..354
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 357..375
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 414..429
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 440..454
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 463..480
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:PCG78016.1"
SQ SEQUENCE 836 AA; 86306 MW; 52DD685C6E887B29 CRC64;
VNPDENRLAV VSHTSPFFIG DKGEKGENAL GQCRCSLNDI SQILEVMPEL KGPPGPQGPT
GADGTTGAPG KTGQMGEAGP PGSVGPKGDR GERGETGTPG PEGQLGPKGD PGADGAPGLQ
GPPGPPGPPG PITSALIEST GLYGSSNPGV LGTPGDRGPM GLPGPQGERG YQGNKGERGL
HGPKGDKGER GYVGLRGPHG AKGERGQPGR DGTPGLPGPH GRPAEKGEKG ARGLPGPPGP
PVSVVSENAI STDVTRTETA LMKGGKGDTG EKGDKGEKGL RGMEGPQGFP GTDGKPGERG
DIGPSGLPGT QGPPGLIGPK GDKGDAGPPG PVAISRDEAL VLTKGDKGES GPRGKRGHPG
PPGPRGPPGV PGPPGTPGNN GVSGDIGLPG WTGPPGAAGQ PGAPGPKGEK GDPGGAPLDL
EKVKGEKGDR GYVGAPGPPG KDGPRGPPGP PGTPATNIQY MTVPGPPGPP GPPGPPGVYP
NEVPDTLTDS PGINRLEPTG GKQRDPLQII RNLNHLVQYR QEPYGYRDPL DPVGDNADFE
DDEDGRTIVG TILFKTTDSL IRLGTNSPLG TLAYVIQEQA LLVRVNNGWQ YVAMGSLLAI
HTPPAGGPTR SPLQNILETS SLVHHKNPAI EGPVLRLAAL NEPHTGDMHG VSSSNYECRR
QSQRANMDGT FRAFISSRVQ TIDSIVSWVD REIPVVNTRG DVLFNSWGEM FDGSGALFAH
APRIYSFSGQ NVLMDPGWPT KAVWHGANPN GEPAMDAYCD AWHSSNPDKF GLASSLRSNK
LLDQETYSCS SRLIVLCVEA TPVDTVRRKK RSKYHVSEKL QFLNEIEDRN ETRRKL
//