GenomeNet

Database: UniProt
Entry: A0AAQ4RB93_GASAC
LinkDB: A0AAQ4RB93_GASAC
Original site: A0AAQ4RB93_GASAC 
ID   A0AAQ4RB93_GASAC        Unreviewed;      1414 AA.
AC   A0AAQ4RB93;
DT   02-OCT-2024, integrated into UniProtKB/TrEMBL.
DT   02-OCT-2024, sequence version 1.
DT   28-JAN-2026, entry version 7.
DE   SubName: Full=Collagen type XVIII alpha 1 chain a {ECO:0000313|Ensembl:ENSGACP00000059893.1};
OS   Gasterosteus aculeatus aculeatus (three-spined stickleback).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC   Gasterosteus.
OX   NCBI_TaxID=481459 {ECO:0000313|Ensembl:ENSGACP00000059893.1, ECO:0000313|Proteomes:UP000007635};
RN   [1] {ECO:0000313|Ensembl:ENSGACP00000059893.1, ECO:0000313|Proteomes:UP000007635}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Lake Benthic {ECO:0000313|Ensembl:ENSGACP00000059893.1,
RC   ECO:0000313|Proteomes:UP000007635};
RX   PubMed=33598708;
RA   Nath S., Shaw D.E., White M.A.;
RT   "Improved contiguity of the threespine stickleback genome using long-read
RT   sequencing.";
RL   G3 (Bethesda) 11:0-0(2021).
RN   [2] {ECO:0000313|Ensembl:ENSGACP00000059893.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSGACP00000059893.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_040057992.1; XM_040202058.1.
DR   Ensembl; ENSGACT00000055997.1; ENSGACP00000059893.1; ENSGACG00000002269.2.
DR   GeneID; 120834193; -.
DR   KEGG; gat:120834193; -.
DR   CTD; 564123; -.
DR   GeneTree; ENSGT00940000158212; -.
DR   Proteomes; UP000007635; Chromosome XVI.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   GO; GO:0001755; P:neural crest cell migration; IEA:Ensembl.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..1414
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5042878011"
FT   DOMAIN          117..306
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          76..98
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          312..952
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          966..1083
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1146..1237
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        84..96
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        348..357
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        374..390
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        438..450
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        461..488
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        531..541
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        565..576
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        582..593
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        640..656
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        664..673
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        689..700
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        751..760
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        869..878
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        900..912
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        966..977
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1014..1023
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1032..1043
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1062..1073
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1173..1183
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1203..1214
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1414 AA;  144918 MW;  AC6666067AD3B009 CRC64;
     MRTQFGVLFL LTLWVVYSDA WFWPRTGTTT FPPTVDHEGS GSPAFSGEPP SENIAGLGAE
     IIGEGQGIQK LVQARDETTD GPRPTTLMPT TLPETQLDSE TGTAVISSHI RQPEDNGVTL
     LQLIGNPPPS TIRQVLENDN SPGYVLDHNT NLGQLARAHL PNPFYRNFAL IFNLKPTTDG
     AAVIFSVTDG SQKIMYVGVK LSAVQGGNQD VILYYTEPNS EKSYEAARFP VPSMRDTWTR
     FAIAVRGDTV MFYLNCDTDP LVMAMERSPD EMELQAGTGV FVGHAGGADP DKFVGVISEL
     RVVGDPLAAE RLCEEDEDDS DMASGEGSGS EEPRQPKAPG KKLSTTTPLP PSPPIQQPPL
     TRREGDTSTT ETGPKGEKGD RGDPGLKGDR GPIGPKGDAS AGSGSRGGAR GEKGEPGETG
     LKGTAGFGYP GMKGDTGPPG PPGSPGPPGP ATEVLVGSGG SVSSRVPGPR GPAGPQGAAG
     PQGPPGADGE PGDPGEDGKT GAEGPPGFPG SPGNSGPKGD KGDRGDGQPG PRGPPGPPGP
     PGTGSTFVDM EASGFPDLES IRGLPGPPGP PGPPGPATDF VGSASSSGAF GPPGKDGNPG
     QPGLPGSDGL PGVSGPKGEK GDSGDLGLPG AVGEKGSRGD PGLPGTPGET GLAGLPGPIG
     PVGRPGPPGP PGPGYSVGFD DMEASGEGFR NRLPGERGVE GRQGPPGSPG LQGKDGVPGL
     PGQKGTDGLT GRDGRPGLDG FPGPQGPKGD TGNKGERGNP GRDGTGVPGP PGPPGQPGQI
     IYQRPDNGGP GLSGQAGLPG PSGPKGDRGD TGPPGYGDKG EKGEPGQVIG PDGNLVHLEG
     LTGPKGDRGT PGPVGPPGQY GPAGIKGEMG MPGRPGRPGV NGYKGEKGET GVGYGYPGVP
     GQPGPPGPPG PAIPLDRFNR YDDASRNYPA TKGEKGESGD QGPPGIPGTA SNFDIYTLKN
     QLKGERGDLG VKGEKGEPGG GGYYDPRFGG AQGPPGPPGK PGLQGPKGES VVGPPGPQGP
     PGQPGTGYDG RPGPPGPPGP PGPQGSSSLP GAYRPNYSLS VPGPPGPPGP PGSPARSSGV
     TTSRSYDTMI ATARREPEGS LIYILDKADL YLRVRDGVRQ VMLGNYNPFS RDLDNELVEV
     QPPPVILYPQ SHDQSHNNGA GHYSQGGSAV RPIEPPPQPP VDRYPPQYDP RFPDTRLTGQ
     TDGRSDTQRT ENRHPVTPPR RPSLPVQQPA GPVGPSAPGL HIIALNAPQT GNMRGIRGAD
     FLCFQQARAV GLKGTFRAFL SSKLQDLYTI VRRSDRDSTP IVNLKDQVLF SSWESLFGDD
     VSKMRENVPI YSFDGRDVVR DSAWPEKMVW HGSSNKGHRQ TDHYCETWRA GDHAVTGLAS
     SLQSGRLLQQ TSSSCSGSYV VLCIENAFTS HSKK
//
DBGET integrated database retrieval system