ID A0A9J7J0I2_SPOLT Unreviewed; 1186 AA.
AC A0A9J7J0I2;
DT 28-JUN-2023, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2023, sequence version 1.
DT 28-JAN-2026, entry version 14.
DE SubName: Full=Collagen alpha-1(XVIII) chain-like isoform X5 {ECO:0000313|RefSeq:XP_022829970.1};
GN Name=LOC111358859 {ECO:0000313|RefSeq:XP_022829970.1};
OS Spodoptera litura (Asian cotton leafworm).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Noctuoidea;
OC Noctuidae; Amphipyrinae; Spodoptera.
OX NCBI_TaxID=69820 {ECO:0000313|Proteomes:UP000301870, ECO:0000313|RefSeq:XP_022829970.1};
RN [1] {ECO:0000313|RefSeq:XP_022829970.1}
RP IDENTIFICATION.
RC STRAIN=Ishihara {ECO:0000313|RefSeq:XP_022829970.1};
RC TISSUE=Whole body {ECO:0000313|RefSeq:XP_022829970.1};
RG RefSeq;
RL Submitted (AUG-2025) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_022829970.1; XM_022974202.1.
DR AlphaFoldDB; A0A9J7J0I2; -.
DR GeneID; 111358859; -.
DR Proteomes; UP000301870; Chromosome 28.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000301870};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT DOMAIN 22..213
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 244..325
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 356..448
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 462..836
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 851..881
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 274..283
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 294..304
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 385..394
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 395..404
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 428..437
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 545..556
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 588..600
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 611..620
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 788..803
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 851..863
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1186 AA; 123642 MW; 6A8E138B2DF962AE CRC64;
MSSAVFANDG LFGSKFQTDI PEYDLLHAIG VPFSNPKTQY FDEGLDGFPA YGLMPGSDIK
SPYRLFMPEK LYSEFSITAT VRPANKDGGF LFSVVNPLET VVQLGVQLIP SGPGLTNISL
LYTDANIYAL SQTIASFVVP SFAKKWSRFA LRVTTDNVTL FLNCHEFDTV VVKRNPLELV
FDSASTLYVG QAGPLIRGAF HGAFQELKLY GSPTQAEVQC VNTFEESGDG DEIDIDNYLV
DQEEGDAEGS GRYGTIPPFP PPPPGLDGGY SLRLKGEKGD RGPRGAPGES IRGPPGPPGP
PGPPGVSGTP GVIAEISGSG DDRSSVKQQI FGENYASLGQ CGCNSSTILA LLQTAPELKG
PPGPPGITGA DGRTGAPGLP GQPGTPGDRG PIGPRGDKGD RGDVGPRGPE GPAGTKGEAG
VDGRPGIAGP PGPPGPPGTS DYSNFDSNWK PRQIYKESLL GSYGGAIGRP GAPGPKGDAG
QPGPMGPQGE RGFPGQKGER GQPGMGGGKG DRGYPGPQGE RGYKGDRGGP GLDGRPGIPG
ASGRPADKGE KGERGEPGPP GPPALGMYSP EDAEFLTTGQ RQAVVGSKGD KGEKGEKGGR
GNDGPPGFPG KDGKDGDRGD IGPSGMPGIS GPPGSPGLKG ERGERGPPGP ISVTMAGSDI
VTIKGDKGDA GLRGRRGRPG PSGPKGTAGT PGPPGPPGRP GDKGDIGLPG WINAKGRPGT
LGPPGQPGPV GPKGEKGDPG VNILDVSMFK GEKGERGFDG LPGLPGVPGP AGPPGQSGSL
SEAIQYLPGP PGPPGPPGPP GPPGVSIVGP KGEPGFTHYE EHPVHGNTKF YGRPMSKSPL
DELKALKELK DLTNRDRDRD RHGPVQSHPA VHHTEEPNKS VPGAAVFQTT EEMLKLASSS
AVGALAYVVE EQALFVKVNS GWQYVLLGSL VTQSAPTHPT PAPAPPPPMP AASLVHKPSL
SNLVESSPVP GPSLHLAALN EPLHGNMHGI RRADYACYRQ GRRAGFRGTF RALLTSKIQN
LNSIVRYSDR HLPVVNTQGE ILFKSFSDMF DGNGALAAGA RIYSFNGRNV LTDSHWPQKV
IWHGSRANGE RALDSYCQEW QNGDPTSRGL GSALHSHKLL AQERYPCSSH FIVLCVEVAS
EITSRRKRET IRYNTTGILD EDDYLYNAEE YQQLLNEIFA QPLREN
//