ID A0A6J2X6Z4_SITOR Unreviewed; 1094 AA.
AC A0A6J2X6Z4;
DT 07-OCT-2020, integrated into UniProtKB/TrEMBL.
DT 07-OCT-2020, sequence version 1.
DT 28-JAN-2026, entry version 23.
DE SubName: Full=Collagen alpha-1(XVIII) chain-like isoform X7 {ECO:0000313|RefSeq:XP_030746998.1};
GN Name=LOC115875628 {ECO:0000313|RefSeq:XP_030746998.1};
OS Sitophilus oryzae (Rice weevil) (Curculio oryzae).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Coleoptera; Polyphaga; Cucujiformia;
OC Curculionidae; Dryophthorinae; Sitophilus.
OX NCBI_TaxID=7048 {ECO:0000313|Proteomes:UP000504635, ECO:0000313|RefSeq:XP_030746998.1};
RN [1] {ECO:0000313|RefSeq:XP_030746998.1}
RP IDENTIFICATION.
RC TISSUE=Gonads {ECO:0000313|RefSeq:XP_030746998.1};
RG RefSeq;
RL Submitted (AUG-2025) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_030746998.1; XM_030891138.1.
DR AlphaFoldDB; A0A6J2X6Z4; -.
DR GeneID; 115875628; -.
DR OrthoDB; 10060752at2759; -.
DR Proteomes; UP000504635; Unplaced.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000504635};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1094
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5026674851"
FT DOMAIN 43..234
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 243..312
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 341..412
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 427..740
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 243..257
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 277..288
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 299..308
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 364..373
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 396..408
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 540..555
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 678..689
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 716..728
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1094 AA; 113987 MW; FA5083BF2D3DF42A CRC64;
MTIWRLRLLL LPSIWIFLSP TLGEDEFALP PSFAQGESPG VFDYNLLDAV KIPLQDPKTQ
YVADSEDGFP AFGFKAGSDV KMARRQILPD VLSPEFAVLI SAKPRTKRGG FVFAVVNPLD
TVVELGVRIG PENGTFTVLS LFYTDYVAEV NSQIIANFTV PKFTNKWTKF AFRVTLENIT
LFFNCTEADT VSVQRKPLEL VFDSASTLYL AQAGPIIGEP YEGALAHVKI YRDPGRAAEQ
CKTEFQVERT DYPPFREEDS DGLGVFPPPP PPPDAAGRYR GEKGERGPKG PPGEAIRGPP
GPPGPPGPAA VEGSCSCNVT SILSSMGITL PYSPALPALP GKEGVPGLPG EPGRPGEKGS
VGPRGDKGER GEKGPPGPPG LQGSKGEPGE DGIPGAPGPP GPAGPPGPVE FENIDPIRVK
EAVMGGSMIR PGIPGAKGET GKPGNPGPKG DRGSVGPKGN PGQKGEGGDR GLPGERGPQG
PKGENGTPGM DGIPGNPGSP GKDGSKGEPG VSGPPGLPGI SISSDGLTDF VPGPPGIKGE
PGEHGPKGDP GRDGEPGLPG PAGFPGPKGD SGVDGALGPV GPPGSKGEKG ERGPPGSVIM
SNGNEQILTL KGEKGDMGRR GRRGRPGPQG PPGPAGKGGE IGLPGWMNGK GRPGATGIPG
QPGPKGEKGD SGSGMAQKGD KGDKGDRGSD GIPGKDGIPG LPATGSNDDA TRYVPVPGPP
GPPGPPGIPG LSITGPKGEP GEAIYKEAVY NLRPGPKGSL EELRAVKELK DLKDFRAGRS
TASPPLMATS DYSRGAAVPG AVTFRTREAM TRVSQDSPVG TLAYVMEEEA LLVRVNGGWQ
YIALGSLLPI NTPTPPTTSA PQQHPPFEAS NLINQLPSSP KGVVPFASRL PKMLRLAALN
EPATGDVHGV RGADYACYRE AHRAGLKGTF RAFLSSRTQN VDSIVRQADR KLPVSNLRGE
VLFNSWAEMF SGDAAPFPHP PRIFSYSGKN VMTDPSWPIK SVWHGALVDG TRALDTSCDA
WQTSSSSKVG LAGSLRGPRL LDQTPVTCDK KLIVLCIEAT SEVFVQRRRR AITDWTENRL
LTQEEYQHLI NSIN
//