GenomeNet

Database: UniProt
Entry: G3PMX4_GASAC
LinkDB: G3PMX4_GASAC
Original site: G3PMX4_GASAC 
ID   G3PMX4_GASAC            Unreviewed;       730 AA.
AC   G3PMX4;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   16-NOV-2011, sequence version 1.
DT   27-MAR-2024, entry version 65.
DE   SubName: Full=Collagen, type VIII, alpha 1b {ECO:0000313|Ensembl:ENSGACP00000018957.1};
GN   Name=COL8A1 {ECO:0000313|Ensembl:ENSGACP00000018957.1};
OS   Gasterosteus aculeatus (Three-spined stickleback).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Eupercaria; Perciformes; Cottioidei; Gasterosteales; Gasterosteidae;
OC   Gasterosteus.
OX   NCBI_TaxID=69293 {ECO:0000313|Ensembl:ENSGACP00000018957.1, ECO:0000313|Proteomes:UP000007635};
RN   [1] {ECO:0000313|Ensembl:ENSGACP00000018957.1, ECO:0000313|Proteomes:UP000007635}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Lindblad-Toh K., Mauceli E., Grabherr M., Chang J.L., Lander E.S.;
RL   Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSGACP00000018957.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; G3PMX4; -.
DR   STRING; 69293.ENSGACP00000018957; -.
DR   Ensembl; ENSGACT00000018995.1; ENSGACP00000018957.1; ENSGACG00000014373.1.
DR   eggNOG; KOG3544; Eukaryota.
DR   GeneTree; ENSGT00940000158272; -.
DR   InParanoid; G3PMX4; -.
DR   OMA; GNEMPHL; -.
DR   TreeFam; TF334029; -.
DR   Proteomes; UP000007635; Unassembled WGS sequence.
DR   Bgee; ENSGACG00000014373; Expressed in pharyngeal gill and 1 other cell type or tissue.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   Gene3D; 2.60.120.40; -; 1.
DR   InterPro; IPR001073; C1q_dom.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR008983; Tumour_necrosis_fac-like_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF903; COLLAGEN ALPHA-1(VIII) CHAIN; 1.
DR   Pfam; PF00386; C1q; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   PRINTS; PR00007; COMPLEMNTC1Q.
DR   SMART; SM00110; C1Q; 1.
DR   SUPFAM; SSF49842; TNF-like; 1.
DR   PROSITE; PS50871; C1Q; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000007635};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..26
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           27..730
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003449720"
FT   DOMAIN          597..730
FT                   /note="C1q"
FT                   /evidence="ECO:0000259|PROSITE:PS50871"
FT   REGION          38..61
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          79..591
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        39..53
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        109..147
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        244..258
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        521..563
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   730 AA;  71528 MW;  7078416D4D079B10 CRC64;
     MAAAPLPPLC LPLVVLQLWS VHLAHGGAYY GHKLPPQQHQ PLPQYSDGYP QQQFLGNEMP
     HLPYGKEVPL LPQYGKELPQ LPLQKGKGRP LIEGKGGEGL REGPQGVQGP PGTPGPPGPQ
     GPPGLPGQGL PGLPGKTGPP GPQGYPGIGK PGVPGLPGKP GGPGLPGPKG ELGPYGGEGQ
     TGLPGPPGLP GPPGLPGISK PGGQGLPGQL GPLGEPGTKG PPGFPGLPGP KGEKGHDQPG
     LPGLKGPSGP PGPPGNVGIP GIGKPGFNGQ PGQQGVPGRP GAPGESGNAG PPGDRGQPGP
     PGVPGIGKPG KDGLTGQSGP LGRKGESGPP GLPGGPGLPG YGKPGYPGPK GHNGHDGLHG
     PPGLKGDKGH AGLPGVIGST GSSGRPGPPG PIGTPGSLGF PGQKGEDGVG GPKGYPGMKG
     ELGPPGLPGQ PGSSAGGGQP GPRGLQGPIG PKGEPGIRGS PGAPGGAGLT GSRGEGGKPG
     EKGYQGPQGI PGLTGPGGSL GPPGLPGPKG ETGLPGKPGY PGEGKPGPPG YIGPQGKPGP
     SGPSGLPGHP GPPGLPGPPG LPAASPDLGQ ILPVTGPYTG QKQGYKKPKN GGNIGANGVE
     MPAFTAELTN PFPLVGSPVV FDKLLYNGNQ NYNPQNGVFT CSVPGVYYFS YHVHCKGSNV
     WVALMKNNEP VMYTYDEYKK GLLDQASGSA VLPLRQGDTV HIQLPSDQAA GLYAGQYVHS
     TFSGYLLYPM
//
DBGET integrated database retrieval system