ID A0A9J7EGB4_SPOLT Unreviewed; 1178 AA.
AC A0A9J7EGB4;
DT 28-JUN-2023, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2023, sequence version 1.
DT 28-JAN-2026, entry version 14.
DE SubName: Full=Collagen alpha-1(XVIII) chain-like isoform X8 {ECO:0000313|RefSeq:XP_022829973.1};
GN Name=LOC111358859 {ECO:0000313|RefSeq:XP_022829973.1};
OS Spodoptera litura (Asian cotton leafworm).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Amphipyrinae; Spodoptera.
OX NCBI_TaxID=69820 {ECO:0000313|Proteomes:UP000301870, ECO:0000313|RefSeq:XP_022829973.1};
RN [1] {ECO:0000313|RefSeq:XP_022829973.1}
RP IDENTIFICATION.
RC STRAIN=Ishihara {ECO:0000313|RefSeq:XP_022829973.1};
RC TISSUE=Whole body {ECO:0000313|RefSeq:XP_022829973.1};
RG RefSeq;
RL Submitted (AUG-2025) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_022829973.1; XM_022974205.1.
DR AlphaFoldDB; A0A9J7EGB4; -.
DR GeneID; 111358859; -.
DR Proteomes; UP000301870; Chromosome 28.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000301870};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 28..219
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 233..317
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 348..440
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 454..828
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 843..873
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..275
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 286..296
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 377..386
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 387..396
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 420..429
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 537..548
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 580..592
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 603..612
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 780..795
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 843..855
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1178 AA; 122801 MW; 625A0F875A044854 CRC64;
MSSAVFANDG LFGSKFQTGK HILFDIPEYD LLHAIGVPFS NPKTQYFDEG LDGFPAYGLM
PGSDIKSPYR LFMPEKLYSE FSITATVRPA NKDGGFLFSV VNPLETVVQL GVQLIPSGPG
LTNISLLYTD ANIYALSQTI ASFVVPSFAK KWSRFALRVT TDNVTLFLNC HEFDTVVVKR
NPLELVFDSA STLYVGQAGP LIRGAFHGAF QELKLYGSPT QAEVQCVNTF EVDQEEGDAE
GSGRYGTIPP FPPPPPGLDG GYSLRLKGEK GDRGPRGAPG ESIRGPPGPP GPPGPPGVSG
TPGVIAEISG SGDDRSSVKQ QIFGENYASL GQCGCNSSTI LALLQTAPEL KGPPGPPGIT
GADGRTGAPG LPGQPGTPGD RGPIGPRGDK GDRGDVGPRG PEGPAGTKGE AGVDGRPGIA
GPPGPPGPPG TSDYSNFDSN WKPRQIYKES LLGSYGGAIG RPGAPGPKGD AGQPGPMGPQ
GERGFPGQKG ERGQPGMGGG KGDRGYPGPQ GERGYKGDRG GPGLDGRPGI PGASGRPADK
GEKGERGEPG PPGPPALGMY SPEDAEFLTT GQRQAVVGSK GDKGEKGEKG GRGNDGPPGF
PGKDGKDGDR GDIGPSGMPG ISGPPGSPGL KGERGERGPP GPISVTMAGS DIVTIKGDKG
DAGLRGRRGR PGPSGPKGTA GTPGPPGPPG RPGDKGDIGL PGWINAKGRP GTLGPPGQPG
PVGPKGEKGD PGVNILDVSM FKGEKGERGF DGLPGLPGVP GPAGPPGQSG SLSEAIQYLP
GPPGPPGPPG PPGPPGVSIV GPKGEPGFTH YEEHPVHGNT KFYGRPMSKS PLDELKALKE
LKDLTNRDRD RDRHGPVQSH PAVHHTEEPN KSVPGAAVFQ TTEEMLKLAS SSAVGALAYV
VEEQALFVKV NSGWQYVLLG SLVTQSAPTH PTPAPAPPPP MPAASLVHKP SLSNLVESSP
VPGPSLHLAA LNEPLHGNMH GIRRADYACY RQGRRAGFRG TFRALLTSKI QNLNSIVRYS
DRHLPVVNTQ GEILFKSFSD MFDGNGALAA GARIYSFNGR NVLTDSHWPQ KVIWHGSRAN
GERALDSYCQ EWQNGDPTSR GLGSALHSHK LLAQERYPCS SHFIVLCVEV ASEITSRRKR
ETIRYNTTGI LDEDDYLYNA EEYQQLLNEI FAQPLREN
//