ID A0A3P8UAI1_CYNSE Unreviewed; 991 AA.
AC A0A3P8UAI1;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 28-JAN-2026, entry version 31.
DE SubName: Full=Collagen alpha-1(XVIII) chain-like {ECO:0000313|Ensembl:ENSCSEP00000000298.1};
OS Cynoglossus semilaevis (Tongue sole).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Carangaria; Pleuronectiformes; Pleuronectoidei; Cynoglossidae;
OC Cynoglossinae; Cynoglossus.
OX NCBI_TaxID=244447 {ECO:0000313|Ensembl:ENSCSEP00000000298.1, ECO:0000313|Proteomes:UP000265120};
RN [1] {ECO:0000313|Ensembl:ENSCSEP00000000298.1, ECO:0000313|Proteomes:UP000265120}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=24487278; DOI=10.1038/ng.2890;
RA Chen S., Zhang G., Shao C., Huang Q., Liu G., Zhang P., Song W., An N.,
RA Chalopin D., Volff J.N., Hong Y., Li Q., Sha Z., Zhou H., Xie M., Yu Q.,
RA Liu Y., Xiang H., Wang N., Wu K., Yang C., Zhou Q., Liao X., Yang L.,
RA Hu Q., Zhang J., Meng L., Jin L., Tian Y., Lian J., Yang J., Miao G.,
RA Liu S., Liang Z., Yan F., Li Y., Sun B., Zhang H., Zhang J., Zhu Y., Du M.,
RA Zhao Y., Schartl M., Tang Q., Wang J.;
RT "Whole-genome sequence of a flatfish provides insights into ZW sex
RT chromosome evolution and adaptation to a benthic lifestyle.";
RL Nat. Genet. 46:253-260(2014).
RN [2] {ECO:0000313|Ensembl:ENSCSEP00000000298.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (MAY-2025) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P8UAI1; -.
DR STRING; 244447.ENSCSEP00000000307; -.
DR Ensembl; ENSCSET00000000322.1; ENSCSEP00000000298.1; ENSCSEG00000000219.1.
DR Ensembl; ENSCSET00000000331.1; ENSCSEP00000000307.1; ENSCSEG00000000219.1.
DR GeneTree; ENSGT00940000158302; -.
DR OMA; AREANFR; -.
DR Proteomes; UP000265120; Chromosome 3.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1118; LAMININ G DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000265120}.
FT DOMAIN 742..789
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 821..987
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 1..417
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 478..555
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 640..664
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 689..743
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..15
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 16..28
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 31..44
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 54..65
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 114..126
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..148
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 175..188
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 227..245
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 297..309
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 366..384
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 387..402
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 408..417
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 498..513
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 695..709
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 723..732
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 991 AA; 99525 MW; A594B5A3636E6898 CRC64;
MPVGSDDEDY GDAEEASGHE EKREKEEEAP TTATGRSGIL SSVFTGPPGA RGYPGPPGPR
GPPGPAGKAA TDGRPGPEGP QGPQGRMGLP GTSGKDGQLG SKGEPGLQGV PGVQGIPGLQ
GEPGPQGEKG DAGIGLPGPP GPQGPPGPVG KLSLYSEGSG YSDFDGDVEG LIRGPPGPPG
PPGRPGPPGN STAGVLSGPP GAPGKNGIDG QRGEPGLPVI GEWFSGSGSG SAFGSGFDSG
TGEGINGKDG EPGSTGAKGE QGAPGVAGQT GPKGDQGHPG IPGLQGSQGV EGQTGPRGPP
GLPGPPGRPG LPGSGLMLDF EDMEGSGVLN GFGPFSKGQQ GQPGVPGLKG KSGLPGIPGT
PGQKGDTGFP GTPGLPGLNG KNGTEGPKGD RGDPGLKGEP GRDGVGLPGP PGPPGPAGPV
INLQELLLND TDGVFNFSEP HALMGIQGPK GDIGSPGIQG PPGLKGEKGE PGIIKTSDGT
LVSGLPGPAG PRGVKGDLGL PGPSGVQGPV GPQGTKGELG LPGRPGRAGV MGSKGEKGDS
GGLQGPPGRP GPPGRPGIFN CPKGMGLRST PSFSCMLHME SGKLKARCHK PTELNPNGTI
SAGNCQTGAK GEKGERGLPG MPAPPATYFH RGISLPVGEQ GVKGQKGEKG ETGSPGLPGI
PGRQGLMGPK GESGVGHPGL PGYQGVPGLP GIGRPGPPGL PGPPGPPGLP ASRYGSVSIA
GPAGPPGPPG PPGTSGNFGS LKTFPSRESM MQQTVRDAEG TLTYVTDTGS LFLKVSQGWK
EIQLGSLIYL STNIIPQDQP DVAYQVRGDT VKRLPSVSNR LNLVALNQPH SGDMLGLDMA
DRMCYEQAKA MGLATNYRAF ISSHRQDLVH VVYPNFRDTL PVTNLRGEVM FWSWKSIFNG
DGAPLNSRIP IYSFDGRDVL ADPFWPQKNI WHGSNSRGYR VLDKHCETWA TDHVSVMGQS
SKLTSGLLLG QQTRSCSNEF IVLCIETHKS A
//