ID A0A8W4FCK3_PIG Unreviewed; 1187 AA.
AC A0A8W4FCK3;
DT 14-DEC-2022, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2022, sequence version 1.
DT 28-JAN-2026, entry version 17.
DE SubName: Full=Collagen type XV alpha 1 chain {ECO:0000313|Ensembl:ENSSSCP00000076033.1};
GN Name=COL15A1 {ECO:0000313|Ensembl:ENSSSCP00000076033.1};
OS Sus scrofa (Pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Suina; Suidae; Sus.
OX NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000076033.1, ECO:0000313|Proteomes:UP000008227};
RN [1] {ECO:0000313|Ensembl:ENSSSCP00000076033.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Duroc {ECO:0000313|Ensembl:ENSSSCP00000076033.1};
RX PubMed=32543654;
RA Warr A., Affara N., Aken B., Beiki H., Bickhart D.M., Billis K., Chow W.,
RA Eory L., Finlayson H.A., Flicek P., Giron C.G., Griffin D.K., Hall R.,
RA Hannum G., Hourlier T., Howe K., Hume D.A., Izuogu O., Kim K., Koren S.,
RA Liu H., Manchanda N., Martin F.J., Nonneman D.J., O'Connor R.E.,
RA Phillippy A.M., Rohrer G.A., Rosen B.D., Rund L.A., Sargent C.A.,
RA Schook L.B., Schroeder S.G., Schwartz A.S., Skinner B.M., Talbot R.,
RA Tseng E., Tuggle C.K., Watson M., Smith T.P.L., Archibald A.L.;
RT "An improved pig reference genome sequence to enable pig genetics and
RT genomics research.";
RL Gigascience 9:giaa051-giaa051(2020).
RN [2] {ECO:0000313|Ensembl:ENSSSCP00000076033.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [3] {ECO:0000313|Ensembl:ENSSSCP00000076033.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A8W4FCK3; -.
DR Ensembl; ENSSSCT00000092789.1; ENSSSCP00000076033.1; ENSSSCG00000005380.5.
DR GeneTree; ENSGT00940000158302; -.
DR Proteomes; UP000008227; Chromosome 1.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd00247; Endostatin-like; 1.
DR CDD; cd00110; LamG; 1.
DR FunFam; 3.40.1620.70:FF:000002; Collagen alpha 1 (XV) chain; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR Pfam; PF13385; Laminin_G_3; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 1: Evidence at protein level;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Proteomics identification {ECO:0007829|PeptideAtlas:A0A8W4FCK3};
KW Reference proteome {ECO:0000313|Proteomes:UP000008227};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 18..206
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT DOMAIN 67..205
FT /note="Laminin G"
FT /evidence="ECO:0000259|SMART:SM00282"
FT REGION 201..229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 245..297
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 354..392
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 476..734
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 767..904
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 509..518
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 544..554
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 580..590
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 655..669
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 704..718
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 792..805
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 826..839
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1187 AA; 121974 MW; 1E1348AE0C5460EB CRC64;
MCDAWPLSLP AELASQGHLD LTELIGVPLP SSVSFVTGYG GFPAYSFGPG ANVGRPARTL
IPPTFFRDFA ISVTVKPSSA RGGVLFAITD AFQKVIYLGL RLSAVEDGRQ RVILYYTEPG
SQVSHEAASF PVPVMTHRWN RFAVVVQGEE ASLLVECEEQ GHVSFPRSSQ ALAFEPSAGI
FVGNAGATGL ERFTGSIQQL TVHSDPRTPE ELCEAEESSA SGEASGLQET DGVAEMVEAV
TYTQAPSEPA EVEPINMPPT PSPLSEDAEL SGEPVPEGTQ GTTNLSAFPH SSPEQGSGEI
LNDTLERVHT VDGAPVTDTG SGDGAFLHVI EESPHTEEGL AATAAAVESE VTISTAGEAE
TESVPTEGPT LSMSTKDPGE EVTLGPDDEE GSAVTAAEEA EVLVSSPGEA EAGSVPTGEL
TLSMSTQDPG EGGPVHEESL TTAVTAKAPL STFEEEASRV PTDGLAPLIP TVAPEQVFTS
GPGDDDLAAA ATEEPLISSG AEELSGVPPE GPPLPLPTVA PERGGAPGEE GEGLPGPPPP
TRPAGPTAGA EAEGSSLGWG LDIGSGSGDP VRSEELLRGP PGPPGPPGLP GIPGRPGTDV
FVGPPGSPGE DGPAGEPGPP GRIISLSSSL QGDPGSRGLP GPPGKNGQVG TPGVMGPPGP
PGPPGPPGPG CAMGQGFEDT EGSGSIRLLH EPRISGPVAS VGPKGEKGDQ GPKGERGMDG
ASIVGPPGPR GPPGRIEVLS SVSITRSECG CVFPAFAKGD RSLTTTSGPS GLLGMQGGEK
GEPGAILTGD VPLERLRGKK GEPGEHGAPG PMGPKGPPGH KGEFGLPGRP GRPGLNGLKG
AKGDRGVMMP VSATLGPPPH AKLGTGPEAS VKGEKGSWGL PGSKGEKGDQ GAQGPPGPPV
DPAYLRHFLN SLKGENGDRG MKGEKGDTYE GFSVSGPPGL PGSPGLVVSR DPSTSSSLKS
CSFPRSPQSF SNMDDMLQKA HLVIEGTFIY LRDSTEFFIR VRDGWKKLQL GELIPLPADS
LPPPALSGNP HQPQLPLTSI SSVNYDRPAL HLVALNTPSS GDLRADFQCF QQARAAGLLS
TYRAFLSSHL QDLSTIVRKA ERYSLPIVNL KGQVLFNNWD SIFSGHGGQF NTHVPIYSFD
GRDVMTDPSW PQKVVWHGSS THGVRLVDQY CEAWRTADVA VMGLAPR
//