GenomeNet

Database: UniProt
Entry: A0A8W4FNM4_PIG
LinkDB: A0A8W4FNM4_PIG
Original site: A0A8W4FNM4_PIG 
ID   A0A8W4FNM4_PIG          Unreviewed;      1220 AA.
AC   A0A8W4FNM4;
DT   14-DEC-2022, integrated into UniProtKB/TrEMBL.
DT   14-DEC-2022, sequence version 1.
DT   28-JAN-2026, entry version 17.
DE   SubName: Full=Collagen type XV alpha 1 chain {ECO:0000313|Ensembl:ENSSSCP00000080514.1};
GN   Name=COL15A1 {ECO:0000313|Ensembl:ENSSSCP00000080514.1};
OS   Sus scrofa (Pig).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Suina; Suidae; Sus.
OX   NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000080514.1, ECO:0000313|Proteomes:UP000008227};
RN   [1] {ECO:0000313|Ensembl:ENSSSCP00000080514.1}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Duroc {ECO:0000313|Ensembl:ENSSSCP00000080514.1};
RX   PubMed=32543654;
RA   Warr A., Affara N., Aken B., Beiki H., Bickhart D.M., Billis K., Chow W.,
RA   Eory L., Finlayson H.A., Flicek P., Giron C.G., Griffin D.K., Hall R.,
RA   Hannum G., Hourlier T., Howe K., Hume D.A., Izuogu O., Kim K., Koren S.,
RA   Liu H., Manchanda N., Martin F.J., Nonneman D.J., O'Connor R.E.,
RA   Phillippy A.M., Rohrer G.A., Rosen B.D., Rund L.A., Sargent C.A.,
RA   Schook L.B., Schroeder S.G., Schwartz A.S., Skinner B.M., Talbot R.,
RA   Tseng E., Tuggle C.K., Watson M., Smith T.P.L., Archibald A.L.;
RT   "An improved pig reference genome sequence to enable pig genetics and
RT   genomics research.";
RL   Gigascience 9:giaa051-giaa051(2020).
RN   [2] {ECO:0000313|Ensembl:ENSSSCP00000080514.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSSSCP00000080514.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A8W4FNM4; -.
DR   Ensembl; ENSSSCT00000095865.1; ENSSSCP00000080514.1; ENSSSCG00000005380.5.
DR   GeneTree; ENSGT00940000158302; -.
DR   Proteomes; UP000008227; Chromosome 1.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   CDD; cd00110; LamG; 1.
DR   FunFam; 3.40.1620.70:FF:000002; Collagen alpha 1 (XV) chain; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF912; COLLAGEN ALPHA-1(XV) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   Pfam; PF13385; Laminin_G_3; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   1: Evidence at protein level;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:A0A8W4FNM4};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008227};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          18..206
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   DOMAIN          67..205
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|SMART:SM00282"
FT   REGION          201..229
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          245..297
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          354..392
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          476..732
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          770..851
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          899..933
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        509..518
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        544..554
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        580..590
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        655..669
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        704..718
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        811..824
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1220 AA;  125434 MW;  CEB0E73007C245F1 CRC64;
     MCDAWPLSLP AELASQGHLD LTELIGVPLP SSVSFVTGYG GFPAYSFGPG ANVGRPARTL
     IPPTFFRDFA ISVTVKPSSA RGGVLFAITD AFQKVIYLGL RLSAVEDGRQ RVILYYTEPG
     SQVSHEAASF PVPVMTHRWN RFAVVVQGEE ASLLVECEEQ GHVSFPRSSQ ALAFEPSAGI
     FVGNAGATGL ERFTGSIQQL TVHSDPRTPE ELCEAEESSA SGEASGLQET DGVAEMVEAV
     TYTQAPSEPA EVEPINMPPT PSPLSEDAEL SGEPVPEGTQ GTTNLSAFPH SSPEQGSGEI
     LNDTLERVHT VDGAPVTDTG SGDGAFLHVI EESPHTEEGL AATAAAVESE VTISTAGEAE
     TESVPTEGPT LSMSTKDPGE EVTLGPDDEE GSAVTAAEEA EVLVSSPGEA EAGSVPTGEL
     TLSMSTQDPG EGGPVHEESL TTAVTAKAPL STFEEEASRV PTDGLAPLIP TVAPEQVFTS
     GPGDDDLAAA ATEEPLISSG AEELSGVPPE GPPLPLPTVA PERGGAPGEE GEGLPGPPPP
     TRPAGPTAGA EAEGSSLGWG LDIGSGSGDP VRSEELLRGP PGPPGPPGLP GIPGRPGTDV
     FVGPPGSPGE DGPAGEPGPP GRIISLSSSL QGDPGSRGLP GPPGKNGQVG TPGVMGPPGP
     PGPPGPPGPG CAMGQGFEDT EGSGSIRLLH EPRISGPVAS VGPKGEKGDQ GPKGERGMDG
     ASIVGPPGPR GPPGRIEVLS SSLINITHGF MNLSDIPELV GPPGPEGIPG LPGFPGPRGP
     KGDTGVPGFP GLKGEQGEKG EPGAILTGDV PLERLRGKKG EPGEHGAPGP MGPKGPPGHK
     GEFGLPGRPG RPGLNGLKGA KGDRGVMMPV SATLGPPWGW GGGDTSAVFP IPVRPHCKTP
     VSSVKGEKGS WGLPGSKGEK GDQGAQGPPG PPVDPAYLRH FLNSLKGENG DRGMKGEKGD
     TYEGFSVSGP PGLPGSPGLV VSRDPSTSSS LKSCSFPRSP QSFSNMDDML QKAHLVIEGT
     FIYLRDSTEF FIRVRDGWKK LQLGELIPLP ADSLPPPALS GNPHQPQLPL TSISSVNYDR
     PALHLVALNT PSSGDLRADF QCFQQARAAG LLSTYRAFLS SHLQDLSTIV RKAERYSLPI
     VNLKGQVLFN NWDSIFSGHG GQFNTHVPIY SFDGRDVMTD PSWPQKVVWH GSSTHGVRLV
     DQYCEAWRTA DVAVMGLAPR
//
DBGET integrated database retrieval system