GenomeNet

Database: UniProt
Entry: A0A6J3JZ31_9HYME
LinkDB: A0A6J3JZ31_9HYME
Original site: A0A6J3JZ31_9HYME 
ID   A0A6J3JZ31_9HYME        Unreviewed;      1174 AA.
AC   A0A6J3JZ31;
DT   07-OCT-2020, integrated into UniProtKB/TrEMBL.
DT   07-OCT-2020, sequence version 1.
DT   28-JAN-2026, entry version 24.
DE   SubName: Full=Collagen alpha-1(XVIII) chain isoform X2 {ECO:0000313|RefSeq:XP_033345374.1};
GN   Name=LOC117231243 {ECO:0000313|RefSeq:XP_033345374.1};
OS   Bombus vosnesenskii.
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Hymenoptera; Apocrita; Aculeata; Apoidea;
OC   Anthophila; Apidae; Bombus; Pyrobombus.
OX   NCBI_TaxID=207650 {ECO:0000313|Proteomes:UP000504631, ECO:0000313|RefSeq:XP_033345374.1};
RN   [1] {ECO:0000313|RefSeq:XP_033345374.1}
RP   IDENTIFICATION.
RC   TISSUE=Muscle {ECO:0000313|RefSeq:XP_033345374.1};
RG   RefSeq;
RL   Submitted (AUG-2025) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_033345374.1; XM_033489483.1.
DR   AlphaFoldDB; A0A6J3JZ31; -.
DR   GeneID; 117231243; -.
DR   CTD; 104327; -.
DR   Proteomes; UP000504631; Unplaced.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119,
KW   ECO:0000313|RefSeq:XP_033345374.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000504631};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..1174
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5027031602"
FT   DOMAIN          45..220
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          242..333
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          348..809
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1146..1174
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        275..287
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        290..305
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        484..502
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        529..541
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        571..601
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        761..771
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        779..792
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1152..1161
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1174 AA;  123606 MW;  EE4C09144D71BDB2 CRC64;
     MRPRLQWLIF VLFCVTRVNA DFFNEKKELE YDLLQASVAL TDDNNLYIDD GLDGFPSFGF
     RPGSEVKQPY RLYLPEKLPA EFTLVATFKP TSFRTSYLFA VLNPFETVVQ LGIRISDGPG
     TNQNVSLVYT NSDLHSHSEE VAKFTVPKLT KKWSKIVIRV STTDVTLYLN CHEMAKQRVT
     RIPLELMFDT ASTLYIAQAG PHIQEKYDGL LQSLKLYSGH PADLVRCTAD FNFNPDEDLG
     SGDINNDLID GLEDVDTNVP DIGRDDDEDR SEESNPPPFI TPPPPNPDYK GPKGEKGDKG
     DKGESVRGPP GPPGPPGQDE GPPGKKGEPG TCTCNATALM ASFTMPKMIQ GPKGEQGVPG
     QEGKQGQMGL TGAAGPPGER GLEGPQGPKG DKGDVGIAGP EGPQGQKGEP GRDGIPGEKG
     AQGPPGPPGK GEFSGYDPSW KPRGIYRTEG ITMRPGLPGQ KGEAGLPGSP GPKGETGIAG
     AKGYKGEPGH KGAKGDHGKE GPRGIQGFKG EPGAPGAPGL PGAPGENGRP AEKGDKGDTG
     PEGKPGPAGV PGPPGLPGLS GSGGVNVGEA MLREKGDKGE SGARGYKGDK GTKGEKGDKG
     DSGPAGIPGV NGIQGPQGNK GEPGKDGVPG APGVIGAKGE KGERGPPGAT AIASSGDYIT
     IKGEKGAEGK RGRRGRPGPP GPVGPPGKTG AMGEIGLPGW AGRPGTPGLP GPVGPVGPKG
     EKGEPGTPSP YGVSVGIKGD KGDDGFPGIP GQPGRDGQRG PPGPPGPPGP PSQGNYIPVP
     GPPGPPGPPG PPGLSLIGQK GEPGIGRSHI FGERDYYGVR QVPKNIKDFK HNMFFGTLQG
     PRTSLDELKA LRELKQLKEL KEHLGAATTA TRGPLESTTK IVPGAVTFQN TEAMTKMSAV
     SPVGTLAYII DEQALLVRVN NGWQYIALGS LLPITTPAPP TTSPPPVNPP FEASNLINQI
     PVKADGTGWY PRMLRMAALN EPFTGDMHGV RGADYACYRQ AKRAGLRGTF RAFLSSRVQN
     VDSIVRLGDR DLPIVNIKGD VLFNSWKEMF NGNGAYFSQN PRIYSFNGKN ILTDFAWPEK
     VAWHGSHKLG DRAMDTYCDA WHSSSSDRYG LGSPLTGGRL LEQVRYSCDN KFALLCIEVT
     SETTRRRRSV EVSEDEDEMS ENDYKEYLDS LMEN
//
DBGET integrated database retrieval system