GenomeNet

Database: UniProt
Entry: A0A287BQN5_PIG
LinkDB: A0A287BQN5_PIG
Original site: A0A287BQN5_PIG 
ID   A0A287BQN5_PIG          Unreviewed;      1485 AA.
AC   A0A287BQN5;
DT   22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT   14-DEC-2022, sequence version 3.
DT   28-JAN-2026, entry version 45.
DE   SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSSSCP00000058425.3};
GN   Name=COL18A1 {ECO:0000313|Ensembl:ENSSSCP00000058425.3,
GN   ECO:0000313|VGNC:VGNC:86869};
OS   Sus scrofa (Pig).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Suina; Suidae; Sus.
OX   NCBI_TaxID=9823 {ECO:0000313|Ensembl:ENSSSCP00000058425.3, ECO:0000313|Proteomes:UP000008227};
RN   [1] {ECO:0000313|Proteomes:UP000008227}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Duroc {ECO:0000313|Proteomes:UP000008227};
RG   Porcine genome sequencing project;
RL   Submitted (NOV-2009) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSSSCP00000058425.3}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Duroc {ECO:0000313|Ensembl:ENSSSCP00000058425.3};
RX   PubMed=32543654;
RA   Warr A., Affara N., Aken B., Beiki H., Bickhart D.M., Billis K., Chow W.,
RA   Eory L., Finlayson H.A., Flicek P., Giron C.G., Griffin D.K., Hall R.,
RA   Hannum G., Hourlier T., Howe K., Hume D.A., Izuogu O., Kim K., Koren S.,
RA   Liu H., Manchanda N., Martin F.J., Nonneman D.J., O'Connor R.E.,
RA   Phillippy A.M., Rohrer G.A., Rosen B.D., Rund L.A., Sargent C.A.,
RA   Schook L.B., Schroeder S.G., Schwartz A.S., Skinner B.M., Talbot R.,
RA   Tseng E., Tuggle C.K., Watson M., Smith T.P.L., Archibald A.L.;
RT   "An improved pig reference genome sequence to enable pig genetics and
RT   genomics research.";
RL   Gigascience 9:giaa051-giaa051(2020).
RN   [3] {ECO:0000313|Ensembl:ENSSSCP00000058425.3}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [4] {ECO:0000313|Ensembl:ENSSSCP00000058425.3}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSSSCT00000057398.3; ENSSSCP00000058425.3; ENSSSCG00000038089.3.
DR   VGNC; VGNC:86869; COL18A1.
DR   GeneTree; ENSGT00940000158212; -.
DR   Proteomes; UP000008227; Chromosome 13.
DR   Bgee; ENSSSCG00000038089; Expressed in liver and 39 other cell types or tissues.
DR   ExpressionAtlas; A0A287BQN5; baseline and differential.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR010363; DUF959_COL18_N.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1060; COLLAGEN ALPHA-1(IV) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 1.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06121; DUF959; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   Pfam; PF13385; Laminin_G_3; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   1: Evidence at protein level;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:A0A287BQN5};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008227};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          269..457
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          1..39
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          79..265
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          458..683
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          752..955
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          990..1013
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1026..1106
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1179..1208
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1312..1346
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1391..1485
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        128..140
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        153..167
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        198..208
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        209..232
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        239..249
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        468..478
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        521..533
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        536..561
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        565..574
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        601..616
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        663..677
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        786..796
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        824..839
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        841..856
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1070..1086
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1322..1333
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1402..1424
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1485 AA;  152456 MW;  5F0F2353D7B65F97 CRC64;
     MNGELAAGPG AGGGSGRAPP PDGRGRPLLP GAHSAMAPDP SGRPGHLGLL LLLACCLAAT
     WADPFPWLWF SEATMPKATP ASVPQATPAP VPPGSSPEQR PETPPRTWTP GMAPPTRGKP
     RPALSLVWRG WRGARARAPL SPRPPREPGP QGGEHRGCGS QDPERGSGHP ALHPAVGRHH
     APRELRRRGD ASPPTPTDPL TTPGPSSTLW ESGTTLWLSS GAPSSPDTQR TEAGTQPLPT
     QPPPSPGGPL APFRAPSVPP PAPEGEAVEV GLQQMLGEPL PRQVALVHDP DVGPAYEFDG
     ISGVGQAALS LVPRTFFHDF SLLMRVRPAS AGAGVLFAVT DAAQAVVLVG VKLAAARGGQ
     QQVQLLYTEP GATRTRTAAS FTLPALHGRW THLALSVDGA HVALFVDCEE VQLEPLSRSL
     RALQLEPDAR LFVAQAGRAD PDKFQGMISE LRVRGDPQVS PLQCRYGDED EDDSDNEASG
     DLGSGLAEEA GERLGAPSGA PLRPRLPEAP PVTSPPLAGA GDEEDSRTEE VEEATTVPSP
     GAQTLPGSGA VTTWDGSSWS PGDSLEERGL KGQEEPGAQG LPGQASPQGP TGPVMRSPEA
     QPVPGPQGPP GPPGPPGKDG APGRDGEPVS AQVPGDTGPQ GFPGTPGDVG PKGEKGDPGV
     GPRGPPGPQG PPGPPGPSFR HDKLTFIDME GSGFGGDLES LRVSVPWPSS PTVEPVGCAE
     AWREGAGATF GCTACRDPSV FFCVQGPRGF PGPPGPPGVP GLPGEPGRFG MNSSDVPGPA
     GACGHTGCPG PTHTSSPGPP GFPGRDGQAG QTGQKGSVGS KGDPGPVGAP GNPGLAGVPG
     PAGPPGPPGP PGPPGPGLAA GFDDMEGSGG PFWWAAQGAD RPQGPAGPKG DLGSKGQPGL
     PGPKGEKGEP GTVFSPDGRP LTPAQKGAKV RGAWGPYGRP GHKGEIGFPG RPGRPGMNGL
     KGEKGEPGDA SVGFSLRVSV LQAVGVGGKG TPRPGRLAPL ARVPELTGGS PSELKHGSFR
     GWWQMQPRPS SLLVPGLGTQ GPKGESIRGQ PGPPGPQGPP GIGYEGRQGP PGPPGPPGPP
     GPPSFPGPYR QSKPGRERVL GSRDPTGLGF QQVRVWATYQ TMLDKVPEVP EGWLMFVAER
     EELYVRVRNG FRKVLVTAAS HPQDNEVAAL QPPVVQLHEG NPYPRREPPQ PTARAWRGDD
     IRASPPRLPV AQPYPGAPHH GAYVHPRPVH PTGSPAHTHH DFQPVLHLVA LNSPQSGGLR
     GIRGADFQCF QQARAVGLAG TFRAFLSSRL QDLYSIVRRA DRAAVPIVNL RVGGRPSRPR
     GPEGGGGRAG GGSSPRHPME LPRGFTFPVH SIPRAVTAQG SPGQGGRARS LSRAVFRGFF
     HEQKYVLSGC SEHPVPQDQR GARSSPGGPG LAAGAQQAGA ASAGRRGGRR RPGVGPSQTA
     PASLRPPSRL SLGQEVTSQD DLTSPRRTRC CSPAGRPCSR ALRAS
//
DBGET integrated database retrieval system