GenomeNet

Database: UniProt
Entry: A0A672SUN3_SINGR
LinkDB: A0A672SUN3_SINGR
Original site: A0A672SUN3_SINGR 
ID   A0A672SUN3_SINGR        Unreviewed;       953 AA.
AC   A0A672SUN3;
DT   17-JUN-2020, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 1.
DT   28-JAN-2026, entry version 23.
DE   SubName: Full=Collagen alpha-1(XVIII) chain-like {ECO:0000313|Ensembl:ENSSGRP00000106004.1};
GN   Name=col15a1b {ECO:0000313|Ensembl:ENSSGRP00000106004.1};
OS   Sinocyclocheilus grahami (Dianchi golden-line fish) (Barbus grahami).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Cypriniformes;
OC   Cyprinidae; Cyprininae; Sinocyclocheilus.
OX   NCBI_TaxID=75366 {ECO:0000313|Ensembl:ENSSGRP00000106004.1, ECO:0000313|Proteomes:UP000472262};
RN   [1] {ECO:0000313|Ensembl:ENSSGRP00000106004.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [2] {ECO:0000313|Ensembl:ENSSGRP00000106004.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A672SUN3; -.
DR   Ensembl; ENSSGRT00000112662.1; ENSSGRP00000106004.1; ENSSGRG00000052401.1.
DR   Proteomes; UP000472262; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000472262};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..17
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           18..953
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5025357098"
FT   DOMAIN          34..222
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          222..503
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          571..725
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        293..307
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        325..340
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        394..410
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        453..468
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        491..500
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        594..627
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        709..719
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   953 AA;  99669 MW;  DF047DD43F50A7D4 CRC64;
     MKLWLLSWLV GAHLALSSRS SALHVMEERG SKGHLVLTEL VGVPLPPSVS FITGYEGFPA
     YNFGPNANVG RLTQSFVPEP FFMDFAIIVT IKPSNSRGGV LFAITDPSQM IVHLGLALTP
     VEDKTQRIVL YYSEPGLADT MEVASFKVPD MTQQWYRFTL TVENEEVRLY MDCEEYHSAP
     LKRSQQPLSF KQGSGIFVAN AGSTGLERFV GSIQQLVIKP DPRAAEEQCE EDDPSLQASG
     DRSGDGDYDE EEHGRHEVIF GQTNTYRPTY PVQAPPTVSP DMDEGEFSGH VTPIDERLLR
     GTYKTDETGE STGDGLKGER GEPGPAGPPG PPGSPGPSLP PTHSGQPGQR GPQGPMGPPG
     RQGRPGKDGQ PYGFDALGSG FEDVDIDTER LRGPPGPPGP PGKPGPPGPN GPLGALLPGP
     PGTPGKDGRD GQPGLRGENG DPGLIGPMGP RGVPGPPGIP GPPGPPGPLS NDMMNTLKGE
     RGRDGLSITG PPGPPGPPGP SINFQDLLLN DTAAKLNLTK IRGPPGPMGP EGLPGRAGFP
     GPRGPKGEIG FPGIQGPPGL KGERGEPGVS IAADGSVIPG LRGPRGPKGM KGDIGPSGQP
     GIVGPIGPPG QKGEYGLPGR PGRAGIAGRK GDKGDSSGPP VMFVSKGAKG NNGHKGQNGE
     KGDPGLPGPP GLPGRTGLVG PKGDSIVGPP GDTGPPGHPG LPGYGIPGPQ GPPGPPGPAG
     TPSVRTYKNS QTLIRETSQA AEGTLAYVLD KSELYIRELK SFLPDHVFLQ NAHSMPALHL
     VALNAPFTGD MHGIRGADYQ CYQQARARGL TSTYRAFLSS HLQDLSSIVK KGDRFGLPVV
     NLKGDALFGS WMAMFSGDGA VFDPLTPIYS FDGRNVMTDQ AWPQKLVWHG SNTAGIRMTT
     SYCEAWRTGD MAVTGQASLL QTGRLLGQHT RSCSNHFIVL CIENSYIQNP GRN
//
DBGET integrated database retrieval system