GenomeNet

Database: UniProt
Entry: A0A6P8GXN0_CLUHA
LinkDB: A0A6P8GXN0_CLUHA
Original site: A0A6P8GXN0_CLUHA 
ID   A0A6P8GXN0_CLUHA        Unreviewed;      1256 AA.
AC   A0A6P8GXN0;
DT   02-DEC-2020, integrated into UniProtKB/TrEMBL.
DT   02-DEC-2020, sequence version 1.
DT   28-JAN-2026, entry version 23.
DE   SubName: Full=Collagen alpha-1(XVIII) chain isoform X1 {ECO:0000313|RefSeq:XP_031440237.1};
GN   Name=col15a1a {ECO:0000313|RefSeq:XP_031440237.1};
OS   Clupea harengus (Atlantic herring).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Clupei; Clupeiformes; Clupeoidei;
OC   Clupeidae; Clupea.
OX   NCBI_TaxID=7950 {ECO:0000313|Proteomes:UP000515152, ECO:0000313|RefSeq:XP_031440237.1};
RN   [1] {ECO:0000313|RefSeq:XP_031440237.1}
RP   IDENTIFICATION.
RG   RefSeq;
RL   Submitted (AUG-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_031440237.1; XM_031584377.2.
DR   AlphaFoldDB; A0A6P8GXN0; -.
DR   GeneID; 105890075; -.
DR   KEGG; char:105890075; -.
DR   CTD; 792366; -.
DR   OrthoDB; 10060752at2759; -.
DR   Proteomes; UP000515152; Chromosome 17.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119,
KW   ECO:0000313|RefSeq:XP_031440237.1};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000515152};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..22
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           23..1256
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5027700072"
FT   DOMAIN          32..220
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          218..247
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          259..570
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          583..671
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          738..769
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          833..985
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1037..1056
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        280..289
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        299..317
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        335..344
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        417..426
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        457..471
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        554..564
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        621..630
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        631..642
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        643..657
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        862..872
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        884..895
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        939..950
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        960..969
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1046..1056
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1256 AA;  129099 MW;  5DDF0D98BC1BB873 CRC64;
     MMSRGSLWSF ALFLWYCHPT TAFLEERGTK GQLDLTELIG VPLPPDVAFI TGFEGFPAYS
     FGPGANVGRL ARSFVPDPFF RDFAIIVTAK PTTRQGGVLF AITDAMQKVV HLGVMLAAVE
     DGTQRVVLYY TEPGAATTQE AASFKMGELT GRWARFTLAV QGHEVRLYMD CEEYHRVAFQ
     RSAGQLTFQP SSGIFVGNAG NTGLERFVGS IQQLVLTPDP RAPDDQCEED DPYASGYGSG
     DDLDDTERMD EVKKLVEERE YTMPEDFLTM PVQAPPTEEA SSDDEDLEEG TSGQAIDLPD
     ERATEHTARV DTVHRETSLN PGLKGEKGDA GSAGPPGPPG PPGRSSPASG TSSGGGEPGP
     RGPQGPSGTP GAPGKEGEPG VQGRDGSPGQ AGPQGFPGLP GDAGPTGEKG DPGVGLPGPP
     GPPGPPGKST SPRYMDGLDG SGFEDFDSDT EVVRGPAGPP GPPGNPGPPG PSQALLPGQP
     GLKGAPGTDG KDGEMGTPGL PGADGRPGNP GAQGEKGDLG FPGSPGLKGD LGQEGPPGLP
     GPVGPEGPTG PAGQPGPPGP PGPPAQGFKF NLEDVEGSGE LSAMGAMLPK GPPGLPGQPG
     VIGLKGERGA DGQPGLSVKG DAGERGHEGE PGLPGLPGAK GQLGDEGHPG LKGEPGRDAL
     GLVGPPGPPG LPGPIINLQD LMLNDTEGLF NFSGILGPQG PMGPNGLKGE SGIPGVQGPS
     GVKGEKGEPG ITMAADGSLM TGPEGPKGVK GDSGVPGPLG EKGPIGPVGP KGEFGLPGRP
     GRPGMNGFKG DQGQAVFVHG PPGPPGPPGR PGMFNCPKGT VFPVPPRPHC KKGVNGESKG
     EGATCLTNSS KGEKGDRGLQ GIPGIPAPAI AILPKGFSPN RGDQGYHGEK GEKGEAGLPG
     LHGRSGLVGP KGESVMGPRG HPGIPGPPGV PGYGRDGPTG PPGPPGPRGP PGYGSALATP
     GPPGPPGPRG SPGVSGGSTG VKTYPSLQSM TQRSYLDLDG TMYFVTDAGR LYLKVPGGWK
     EIQLGKLLEV QSPIIPQDEA RPRPQSPGSS SSSMPQIHEG QALKLVALNT PLTGNIGSLT
     AADQACRAQA QAMGIRDQYR AFLSNHLQDL VDVIHPQYRR TLPIVNLRGE VLFDNYEHIF
     TKSSALPHGI PLFSFDGRDV MSDPFWPQKA IWHGSSPQGR RLQEKNCESW RAGDMAIVGQ
     ASFLYTGLLN QQSRSCSNQF VVLCVEASPE PSSYQEVRRG TRYAYYYRNP RSSHRT
//
DBGET integrated database retrieval system