ID A0AAQ4PW67_GASAC Unreviewed; 1117 AA.
AC A0AAQ4PW67;
DT 02-OCT-2024, integrated into UniProtKB/TrEMBL.
DT 02-OCT-2024, sequence version 1.
DT 28-JAN-2026, entry version 7.
DE RecName: Full=Collagen, type XV, alpha 1b {ECO:0008006|Google:ProtNLM};
OS Gasterosteus aculeatus aculeatus (three-spined stickleback).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC Gasterosteus.
OX NCBI_TaxID=481459 {ECO:0000313|Ensembl:ENSGACP00000043105.1, ECO:0000313|Proteomes:UP000007635};
RN [1] {ECO:0000313|Ensembl:ENSGACP00000043105.1, ECO:0000313|Proteomes:UP000007635}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Lake Benthic {ECO:0000313|Ensembl:ENSGACP00000043105.1,
RC ECO:0000313|Proteomes:UP000007635};
RX PubMed=33598708;
RA Nath S., Shaw D.E., White M.A.;
RT "Improved contiguity of the threespine stickleback genome using long-read
RT sequencing.";
RL G3 (Bethesda) 11:0-0(2021).
RN [2] {ECO:0000313|Ensembl:ENSGACP00000043105.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [3] {ECO:0000313|Ensembl:ENSGACP00000043105.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0AAQ4PW67; -.
DR Ensembl; ENSGACT00000068928.1; ENSGACP00000043105.1; ENSGACG00000014478.2.
DR GeneTree; ENSGT00940000164061; -.
DR Proteomes; UP000007635; Chromosome III.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW Secreted {ECO:0000256|ARBA:ARBA00022530}.
FT DOMAIN 836..884
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 940..1109
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 77..529
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 563..618
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 641..834
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 899..918
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 131..177
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 180..191
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 192..221
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 265..277
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 307..321
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 411..420
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 476..485
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 641..650
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 663..672
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 707..716
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 776..787
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 806..817
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 818..829
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 899..917
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1117 AA; 113869 MW; BDC2A4D038277E5F CRC64;
MTQQWTRFTV VVEHDEVRLY MDCREAVRTT FSRSPERLNF SHNSGVFVAN GGSTGLEKFL
GSIQQLVIND DPRAAEEQCE DDDSYASGFT SGDDALDDRE TEGVKKTHKR KHGTAKEEDS
LPVIAPPTEA QKAELDEDSG HQTRTEMLIR EHQTEEAGER LEDGNIRHGS KGERGDPGPK
GSPGPPGPPG RPGYSPLPGQ AQPGPRGTKG PQGTPGTPGL PGRDGQQGIK GDRGDPGQRG
SQGFSGLDGE AGPKGEKGEP GVGLPGPPGR PGTPGPPRSR SVPYGADALG SGFEELDSET
DLIRGHPGPP GPPGPPGPPG PLSSSDTVEG HSLVRAGPPG VPGRDGLRGK PGIPGPSGRD
GDPGLPGAVG AKGNQGLSGP TGPKGECGSE GKAGSPGLPG PSGPPGKRGP AGPPGPPGPP
ATTFFIEDME GSGKSDMRVR GPQGPPGIPG PQGAKGEDGT TGAPGLSVKG EPGDPGPEGG
QGPAGLPGAR GAKGETGIQG HKGDRGVDGL TVRGPPGAPG PPGPLLNLSD LFNVTNGNFN
FTEIRGAPGP MGPEGLPGRA GFPGPRGPKG DLGPAGVQGP AGFKGEKGEP GVTIAADGSR
LSAPKGPQGP KGIKGDRGFP GPVGTLGPIG PAGQKGEYGF PGRSGRSGMP GRKGDKGDCL
GIPGPPGPPG APGHPGRIIG LKGTVFPVRP RPHCKKGREA VTTGDSVRAK GDKGDEGIPG
EPGTPAPEIP DGLVGARGDQ GYHGQKGEKG DGGVPGPPGL PGRSGLVGPK GESIIGPSGP
VGPVGQPGAP GYGQPGPRGP PGPAGPRGMA TAPGVNVPGP PGPPGPPGSP GYANPVTIYK
TSHALSRETH RAAEGTLVYV SEKGGELYIK ARNGWRNIQL GDLIQSGPST SAVSQSLSRT
GAWSRPPKTQ SHSQELQEGS RGYQPIYNVL PQTFNDVPGL HLVALNSPLK GDMRGIRGAD
FQCYQQARSM GLTTTYRAFL SSHLQDLATI VRKADRTDMP VVNLRGEVLF SSWMSIFSGN
GGTFNPSTAI YSFDGRDVLS DPAWPEKLVW HGSNAVGIRL TTNYCEAWRT ADMAVTGQAA
LLQTGRLLGQ HTRSCSNHYI VLCIENTYVG NTHQRRT
//