GenomeNet

Database: UniProt
Entry: K7BUN1_PANTR
LinkDB: K7BUN1_PANTR
Original site: K7BUN1_PANTR 
ID   K7BUN1_PANTR            Unreviewed;      1339 AA.
AC   K7BUN1; A0A2J8M8R5;
DT   09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT   09-JAN-2013, sequence version 1.
DT   28-JAN-2026, entry version 64.
DE   SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSPTRP00000092192.1};
DE   SubName: Full=Collagen, type XVIII, alpha 1 {ECO:0000313|EMBL:JAA17789.1};
GN   Name=COL18A1 {ECO:0000313|EMBL:JAA17789.1,
GN   ECO:0000313|Ensembl:ENSPTRP00000092192.1,
GN   ECO:0000313|VGNC:VGNC:10689};
OS   Pan troglodytes (Chimpanzee).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC   Pan.
OX   NCBI_TaxID=9598 {ECO:0000313|EMBL:JAA17789.1};
RN   [1] {ECO:0000313|Ensembl:ENSPTRP00000092192.1, ECO:0000313|Proteomes:UP000002277}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=16136131; DOI=10.1038/nature04072;
RG   Chimpanzee sequencing and analysis consortium;
RT   "Initial sequence of the chimpanzee genome and comparison with the human
RT   genome.";
RL   Nature 437:69-87(2005).
RN   [2] {ECO:0000313|Ensembl:ENSPTRP00000092192.1, ECO:0000313|Proteomes:UP000002277}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=16136134; DOI=10.1038/nature04101;
RA   Hughes J.F., Skaletsky H., Pyntikova T., Minx P.J., Graves T., Rozen S.,
RA   Wilson R.K., Page D.C.;
RT   "Conservation of Y-linked genes during human evolution revealed by
RT   comparative sequencing in chimpanzee.";
RL   Nature 437:100-103(2005).
RN   [3] {ECO:0000313|EMBL:JAA17789.1}
RP   NUCLEOTIDE SEQUENCE.
RC   TISSUE=Smooth vascular {ECO:0000313|EMBL:JAA17789.1};
RA   Maudhoo M.D., Meehan D.T., Norgren R.B.Jr.;
RT   "De novo assembly of the reference chimpanzee transcriptome from NextGen
RT   mRNA sequences.";
RL   Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN   [4] {ECO:0000313|Ensembl:ENSPTRP00000092192.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (MAY-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AACZ04000578; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AACZ04000579; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AACZ04000580; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AACZ04068940; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; GABF01004356; JAA17789.1; -; mRNA.
DR   Ensembl; ENSPTRT00000098662.1; ENSPTRP00000092192.1; ENSPTRG00000014005.6.
DR   VGNC; VGNC:10689; COL18A1.
DR   GeneTree; ENSGT00940000158212; -.
DR   OMA; VQDQHQN; -.
DR   Proteomes; UP000002277; Chromosome 21.
DR   Bgee; ENSPTRG00000014005; Expressed in liver and 21 other cell types or tissues.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-ARBA.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   2: Evidence at transcript level;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:JAA17789.1};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..33
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           34..1339
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5015095871"
FT   DOMAIN          41..229
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          230..1028
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1096..1140
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        257..266
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        302..323
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        347..374
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        383..395
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        400..416
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        418..431
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        447..459
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        489..499
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        515..527
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        531..546
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        590..606
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        638..650
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        680..694
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        702..711
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        726..738
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        749..758
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        839..853
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        867..881
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        906..926
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        938..947
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        983..999
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1009..1021
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1104..1120
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1126..1136
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1339 AA;  135889 MW;  32FD0868ACE13E2E CRC64;
     MAPRCPWPWP RRRRLLDVLA PLVLLLGVRA ASAEPERVSE EVGLLQLLGD PPPQQVTQTD
     DPDVGLAYVF GPDANSGQVA RYHFPSLFFR DFSLLFHIRP ATEGPGVLFA ITDSAQAVVL
     LGVKLSGVQD GHQDISLLYT EPGAGQTHTA ASFRLPAFVG QWTHLALSVA GGFVALYVDC
     EEFQRMPLAR SSQGLELEPG AGLFVAQAGG ADPDKFQGVI AELKVRRDPQ VSPMHCLDEE
     GDDSDGASGD SGSGLGDTRE LLREETGVAL KPRLPTPPPV TTPPLAGGSS TEDSRSEEIE
     EQTTVASLGA QTLPGSDSVS TWDGSVRTPG GRVKEGGLKG QKGEPGVPGP PGRAGPPGSP
     CLPGPPGLPC PVSPLGPAGP ALQPVPGPQG PPGPPGRDGT PGRDGEPGDP GEDGKPGDTG
     PQGFPGTPGD VGPKGDKGDP GVGERGPPGP QGPPGPPGPS FRHDKLTFID MEGSGFGGDL
     EALRGPRGFP GPPGPPGVPG LPGEPGRFGV NSSDVPGPAG LPGVPGREGP PGFPGLPGPP
     GPPGREGPPG RTGQKGSLGE AGAPGHKGSK GDPGPAGARG ESGLAGAPGP AGPPGPPGPP
     GPPGPGLPAG FDDMEGSGGP FWSTARSAAG PQGPPGLPGL KGDPGVPGLP GAKGEVGADG
     VPGFPGLPGR EGIAGPQGPK GDRGSQGEKG DPGKDGVGQP GLPGPPGPPG PVVYVSEQDG
     SVLSVPGPEG RPGFAGFPGP AGPKGNLGSK GEQGSPGPKG EKGEPGSIFS PDGGALGPAQ
     KGAKGEPGFR GPPGPYGRPG YKGEIGFPGR PGRPGMNGLK GEKGEPGDAS LGFGMRGMPG
     PPGPPGPPGP PGTPVYDSNV FAESSRPGPP GLPGNQGPPG PKGAKGEVGP PGPPGQFPFD
     FLQLEAEMKG EKGDRGDAGQ KGERGEPGGG GFFSSSLPGP PGPPGPRGYP GIPGPKGESI
     RGQPGPPGPQ GPPGIGYEGR QGPPGPPGPP GPPGPPSFPG PHRQTISVPG PPGPPGPPGP
     PGTMGASSGV RLWATRQAML GQVHEVPEGW LIFVAEQEEL YVRVRNGFRK VQLEARTPLP
     RGTDNEVAAL QPPVVQLHDS NPYPRREHPH PTARPWRADD ILASPPRLPE PQPYPGAPHH
     SSYVHLRPAR PTSPPAHTHR DFQPVLHLVA LNSPLSGSMR GIRGADFQCF QQARAVGLAG
     TFRAFLSSRL QDLYSIVRRA DRTAVPIVNL KDELLFPSWE ALFSGSEGPL KPGARIFSFD
     GKDVLRHPTW PQKSVWHGSD PNGRRLTESY CETWRTEAPS ATGQASSLLG GRLLGQSAVS
     CHHAYIVLCI ENSFMTASK
//
DBGET integrated database retrieval system