ID A0A9J7EK49_SPOLT Unreviewed; 1182 AA.
AC A0A9J7EK49;
DT 28-JUN-2023, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2023, sequence version 1.
DT 28-JAN-2026, entry version 14.
DE SubName: Full=Collagen alpha-1(XVIII) chain-like isoform X7 {ECO:0000313|RefSeq:XP_022829972.1};
GN Name=LOC111358859 {ECO:0000313|RefSeq:XP_022829972.1};
OS Spodoptera litura (Asian cotton leafworm).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Amphipyrinae; Spodoptera.
OX NCBI_TaxID=69820 {ECO:0000313|Proteomes:UP000301870, ECO:0000313|RefSeq:XP_022829972.1};
RN [1] {ECO:0000313|RefSeq:XP_022829972.1}
RP IDENTIFICATION.
RC STRAIN=Ishihara {ECO:0000313|RefSeq:XP_022829972.1};
RC TISSUE=Whole body {ECO:0000313|RefSeq:XP_022829972.1};
RG RefSeq;
RL Submitted (AUG-2025) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_022829972.1; XM_022974204.1.
DR AlphaFoldDB; A0A9J7EK49; -.
DR GeneID; 111358859; -.
DR Proteomes; UP000301870; Chromosome 28.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000301870};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 28..219
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 250..331
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 362..832
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 847..877
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 280..289
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 300..310
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 391..400
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 401..410
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 434..443
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 541..552
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 584..596
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 607..616
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 784..799
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 847..859
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1182 AA; 123036 MW; 3D34157AB8A840F6 CRC64;
MSSAVFANDG LFGSKFQTGK HILFDIPEYD LLHAIGVPFS NPKTQYFDEG LDGFPAYGLM
PGSDIKSPYR LFMPEKLYSE FSITATVRPA NKDGGFLFSV VNPLETVVQL GVQLIPSGPG
LTNISLLYTD ANIYALSQTI ASFVVPSFAK KWSRFALRVT TDNVTLFLNC HEFDTVVVKR
NPLELVFDSA STLYVGQAGP LIRGAFHGAF QELKLYGSPT QAEVQCVNTF EESGDGDEID
IDNYLVDQEE GDAEGSGRYG TIPPFPPPPP GLDGGYSLRL KGEKGDRGPR GAPGESIRGP
PGPPGPPGPP GVSGTPGVIA EISGSGDDRS SVKQQIFGEN YASLGQCGCN SSTILALLQT
APELKGPPGP PGITGADGRT GAPGLPGQPG TPGDRGPIGP RGDKGDRGDV GPRGPEGPAG
TKGEAGVDGR PGIAGPPGPP GPPGTSDYSN FDESLLGSYG GAIGRPGAPG PKGDAGQPGP
MGPQGERGFP GQKGERGQPG MGGGKGDRGY PGPQGERGYK GDRGGPGLDG RPGIPGASGR
PADKGEKGER GEPGPPGPPA LGMYSPEDAE FLTTGQRQAV VGSKGDKGEK GEKGGRGNDG
PPGFPGKDGK DGDRGDIGPS GMPGISGPPG SPGLKGERGE RGPPGPISVT MAGSDIVTIK
GDKGDAGLRG RRGRPGPSGP KGTAGTPGPP GPPGRPGDKG DIGLPGWINA KGRPGTLGPP
GQPGPVGPKG EKGDPGVNIL DVSMFKGEKG ERGFDGLPGL PGVPGPAGPP GQSGSLSEAI
QYLPGPPGPP GPPGPPGPPG VSIVGPKGEP GFTHYEEHPV HGNTKFYGRP MSKSPLDELK
ALKELKDLTN RDRDRDRHGP VQSHPAVHHT EEPNKSVPGA AVFQTTEEML KLASSSAVGA
LAYVVEEQAL FVKVNSGWQY VLLGSLVTQS APTHPTPAPA PPPPMPAASL VHKPSLSNLV
ESSPVPGPSL HLAALNEPLH GNMHGIRRAD YACYRQGRRA GFRGTFRALL TSKIQNLNSI
VRYSDRHLPV VNTQGEILFK SFSDMFDGNG ALAAGARIYS FNGRNVLTDS HWPQKVIWHG
SRANGERALD SYCQEWQNGD PTSRGLGSAL HSHKLLAQER YPCSSHFIVL CVEVASEITS
RRKRETIRYN TTGILDEDDY LYNAEEYQQL LNEIFAQPLR EN
//