ID A0A9J7EFN5_SPOLT Unreviewed; 1164 AA.
AC A0A9J7EFN5;
DT 28-JUN-2023, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2023, sequence version 1.
DT 28-JAN-2026, entry version 14.
DE SubName: Full=Collagen alpha-1(XVIII) chain-like isoform X10 {ECO:0000313|RefSeq:XP_022829975.1};
GN Name=LOC111358859 {ECO:0000313|RefSeq:XP_022829975.1};
OS Spodoptera litura (Asian cotton leafworm).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Amphipyrinae; Spodoptera.
OX NCBI_TaxID=69820 {ECO:0000313|Proteomes:UP000301870, ECO:0000313|RefSeq:XP_022829975.1};
RN [1] {ECO:0000313|RefSeq:XP_022829975.1}
RP IDENTIFICATION.
RC STRAIN=Ishihara {ECO:0000313|RefSeq:XP_022829975.1};
RC TISSUE=Whole body {ECO:0000313|RefSeq:XP_022829975.1};
RG RefSeq;
RL Submitted (AUG-2025) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_022829975.1; XM_022974207.1.
DR AlphaFoldDB; A0A9J7EFN5; -.
DR GeneID; 111358859; -.
DR Proteomes; UP000301870; Chromosome 28.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000301870};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 28..219
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 250..331
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 362..454
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 468..856
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 280..289
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 300..310
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 391..400
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 401..410
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 434..443
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 551..562
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 594..606
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 617..626
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 794..809
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1164 AA; 121074 MW; 06228B48B59B9FE4 CRC64;
MSSAVFANDG LFGSKFQTGK HILFDIPEYD LLHAIGVPFS NPKTQYFDEG LDGFPAYGLM
PGSDIKSPYR LFMPEKLYSE FSITATVRPA NKDGGFLFSV VNPLETVVQL GVQLIPSGPG
LTNISLLYTD ANIYALSQTI ASFVVPSFAK KWSRFALRVT TDNVTLFLNC HEFDTVVVKR
NPLELVFDSA STLYVGQAGP LIRGAFHGAF QELKLYGSPT QAEVQCVNTF EESGDGDEID
IDNYLVDQEE GDAEGSGRYG TIPPFPPPPP GLDGGYSLRL KGEKGDRGPR GAPGESIRGP
PGPPGPPGPP GVSGTPGVIA EISGSGDDRS SVKQQIFGEN YASLGQCGCN SSTILALLQT
APELKGPPGP PGITGADGRT GAPGLPGQPG TPGDRGPIGP RGDKGDRGDV GPRGPEGPAG
TKGEAGVDGR PGIAGPPGPP GPPGTSDYSN FDSNWKPRQI YKESLLGSYG GAIGRPGAPG
PKGDAGQPGP MGPQGERGFP GQKGERGQPG MGGGKGDRGY PGPQGERGYK GDRGGPGLDG
RPGIPGASGR PADKGEKGER GEPGPPGPPA LGMYSPEDAE FLTTGQRQAV VGSKGDKGEK
GEKGGRGNDG PPGFPGKDGK DGDRGDIGPS GMPGISGPPG SPGLKGERGE RGPPGPISVT
MAGSDIVTIK GDKGDAGLRG RRGRPGPSGP KGTAGTPGPP GPPGRPGDKG DIGLPGWINA
KGRPGTLGPP GQPGPVGPKG EKGDPGVNIL DVSMFKGEKG ERGFDGLPGL PGVPGPAGPP
GQSGSLSEAI QYLPGPPGPP GPPGPPGPPG VSIVGPKGEP GFTHYEEHPV HGNTKFYGRP
RPVQSHPAVH HTEEPNKSVP GAAVFQTTEE MLKLASSSAV GALAYVVEEQ ALFVKVNSGW
QYVLLGSLVT QSAPTHPTPA PAPPPPMPAA SLVHKPSLSN LVESSPVPGP SLHLAALNEP
LHGNMHGIRR ADYACYRQGR RAGFRGTFRA LLTSKIQNLN SIVRYSDRHL PVVNTQGEIL
FKSFSDMFDG NGALAAGARI YSFNGRNVLT DSHWPQKVIW HGSRANGERA LDSYCQEWQN
GDPTSRGLGS ALHSHKLLAQ ERYPCSSHFI VLCVEVASEI TSRRKRETIR YNTTGILDED
DYLYNAEEYQ QLLNEIFAQP LREN
//