GenomeNet

Database: UniProt
Entry: A0A6P8GN59_CLUHA
LinkDB: A0A6P8GN59_CLUHA
Original site: A0A6P8GN59_CLUHA 
ID   A0A6P8GN59_CLUHA        Unreviewed;      1252 AA.
AC   A0A6P8GN59;
DT   02-DEC-2020, integrated into UniProtKB/TrEMBL.
DT   02-DEC-2020, sequence version 1.
DT   28-JAN-2026, entry version 23.
DE   SubName: Full=Collagen alpha-1(XVIII) chain isoform X2 {ECO:0000313|RefSeq:XP_031440238.1};
GN   Name=col15a1a {ECO:0000313|RefSeq:XP_031440238.1};
OS   Clupea harengus (Atlantic herring).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Clupei; Clupeiformes; Clupeoidei;
OC   Clupeidae; Clupea.
OX   NCBI_TaxID=7950 {ECO:0000313|Proteomes:UP000515152, ECO:0000313|RefSeq:XP_031440238.1};
RN   [1] {ECO:0000313|RefSeq:XP_031440238.1}
RP   IDENTIFICATION.
RG   RefSeq;
RL   Submitted (AUG-2025) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_031440238.1; XM_031584378.1.
DR   AlphaFoldDB; A0A6P8GN59; -.
DR   GeneID; 105890075; -.
DR   CTD; 792366; -.
DR   Proteomes; UP000515152; Chromosome 17.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1034; COLLAGEN ALPHA-1(XVIII) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119,
KW   ECO:0000313|RefSeq:XP_031440238.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000515152};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          28..216
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          214..243
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          255..566
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          579..668
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          829..981
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1033..1052
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        276..285
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        295..313
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        331..340
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        413..422
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        453..467
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        550..560
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        617..626
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        627..638
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        639..653
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        858..868
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        880..891
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        935..946
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        956..965
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1042..1052
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1252 AA;  128448 MW;  9BB1D8DBFA29D380 CRC64;
     MQQRLAFHAS CCRPGVLISI NKRGTKGQLD LTELIGVPLP PDVAFITGFE GFPAYSFGPG
     ANVGRLARSF VPDPFFRDFA IIVTAKPTTR QGGVLFAITD AMQKVVHLGV MLAAVEDGTQ
     RVVLYYTEPG AATTQEAASF KMGELTGRWA RFTLAVQGHE VRLYMDCEEY HRVAFQRSAG
     QLTFQPSSGI FVGNAGNTGL ERFVGSIQQL VLTPDPRAPD DQCEEDDPYA SGYGSGDDLD
     DTERMDEVKK LVEEREYTMP EDFLTMPVQA PPTEEASSDD EDLEEGTSGQ AIDLPDERAT
     EHTARVDTVH RETSLNPGLK GEKGDAGSAG PPGPPGPPGR SSPASGTSSG GGEPGPRGPQ
     GPSGTPGAPG KEGEPGVQGR DGSPGQAGPQ GFPGLPGDAG PTGEKGDPGV GLPGPPGPPG
     PPGKSTSPRY MDGLDGSGFE DFDSDTEVVR GPAGPPGPPG NPGPPGPSQA LLPGQPGLKG
     APGTDGKDGE MGTPGLPGAD GRPGNPGAQG EKGDLGFPGS PGLKGDLGQE GPPGLPGPVG
     PEGPTGPAGQ PGPPGPPGPP AQGFKFNLED VEGSGELSAM GAMLPKGPPG LPGQPGVIGL
     KGERGADGQP GLSVKGDAGE RGHEGEPGLP GLPGAKGQLG DEGHPGLKGE PGRDALGLVG
     PPGPPGLPGP IINLQDLMLN DTEGLFNFSG ILGPQGPMGP NGLKGESGIP GVQGPSGVKG
     EKGEPGITMA ADGSLMTGPE GPKGVKGDSG VPGPLGEKGP IGPVGPKGEF GLPGRPGRPG
     MNGFKGDQGQ AVFVHGPPGP PGPPGRPGMF NCPKGTVFPV PPRPHCKKGV NGESKGEGAT
     CLTNSSKGEK GDRGLQGIPG IPAPAIAILP KGFSPNRGDQ GYHGEKGEKG EAGLPGLHGR
     SGLVGPKGES VMGPRGHPGI PGPPGVPGYG RDGPTGPPGP PGPRGPPGYG SALATPGPPG
     PPGPRGSPGV SGGSTGVKTY PSLQSMTQRS YLDLDGTMYF VTDAGRLYLK VPGGWKEIQL
     GKLLEVQSPI IPQDEARPRP QSPGSSSSSM PQIHEGQALK LVALNTPLTG NIGSLTAADQ
     ACRAQAQAMG IRDQYRAFLS NHLQDLVDVI HPQYRRTLPI VNLRGEVLFD NYEHIFTKSS
     ALPHGIPLFS FDGRDVMSDP FWPQKAIWHG SSPQGRRLQE KNCESWRAGD MAIVGQASFL
     YTGLLNQQSR SCSNQFVVLC VEASPEPSSY QEVRRGTRYA YYYRNPRSSH RT
//
DBGET integrated database retrieval system