GenomeNet

Database: UniProt
Entry: A0A8C3N5C8_GEOPR
LinkDB: A0A8C3N5C8_GEOPR
Original site: A0A8C3N5C8_GEOPR 
ID   A0A8C3N5C8_GEOPR        Unreviewed;      1688 AA.
AC   A0A8C3N5C8; A0A8U8ALI7;
DT   19-JAN-2022, integrated into UniProtKB/TrEMBL.
DT   03-MAY-2023, sequence version 2.
DT   28-JAN-2026, entry version 19.
DE   SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSCPVP00000013821.2};
GN   Name=COL18A1 {ECO:0000313|Ensembl:ENSCPVP00000013821.2};
OS   Geospiza parvula (Small tree-finch) (Camarhynchus parvulus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Telluraves; Australaves;
OC   Passeriformes; Thraupidae; Camarhynchus.
OX   NCBI_TaxID=87175 {ECO:0000313|Ensembl:ENSCPVP00000013821.2, ECO:0000313|Proteomes:UP000694382};
RN   [1] {ECO:0000313|Ensembl:ENSCPVP00000013821.2}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA   Enbody D E., Pettersson E M.;
RL   Submitted (FEB-2020) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSCPVP00000013821.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSCPVP00000013821.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSCPVT00000014444.2; ENSCPVP00000013821.2; ENSCPVG00000010097.2.
DR   Proteomes; UP000694382; Chromosome 7.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 5.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000694382};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..1688
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5043994763"
FT   DOMAIN          381..569
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          37..153
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          206..241
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          274..309
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          568..1120
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1132..1371
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        121..137
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        283..298
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        576..585
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        622..632
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        654..667
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        744..760
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        789..803
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        831..843
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        872..883
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        943..955
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        981..991
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1030..1044
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1050..1059
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1188..1201
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1205..1216
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1276..1285
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1308..1317
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1328..1341
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1351..1363
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1688 AA;  173225 MW;  A34D1F0100C2425F CRC64;
     MAPPRLGFLL LLACCFSCSE TQLLDWLWDG TKTTGSPALP SEGIPESEPP TSAAAPAPSP
     SPGMWGGEVA GNVTTPRQQE PDPATAAPVS EGAAAKTTEQ WDRNATGPVE STAWPSIAPL
     HSTSPAPGQQ AASSPTDDPQ LLGTGTHKDP QLLASGTTED LQFLGSGNTE DPQLLWSGTT
     KDLQLLETGT TEHLLLLRMR TTEDPQFLGS VTPKDPQLLG SGTTENPQLP RMGTTEDPQS
     GATEDLQLLG LGTTEDPQLL GVVSTTDLQL LVAGHTKDPQ PLGTGTTEDP QPLGTGTTKD
     AELPGTRTTE NLQLLVMKNT ELLRTVTTEN PELLGNTENP QLLRTVTPED PQFLGTGTPT
     METQVSLQRP AGLQPESLSA EISLLELIGD PPTEEIHRIY GPDNNPGYVF GPNANTGQVA
     RYHLPSPFYR DFSLLFHIQP TTPRAGVLFA VTDSSQSIIY VGVKLSELRA GQQQIIFYYT
     EPGSPSSYPA ATFTVPTLLN QWTRFAISVE EEEVVLYLDC EEYERVRFGR SPDEMELEEG
     SGLFVAQAGG ADPDKYQGVI ADLKLRGDPR AAERQCEEEE DDTEVSGDFG SGMDGGQRPS
     GKVEGVPGLL DAVPVTSPPV AGGSGPRSGG GSPQQAERTR AEETLRVSTG GAGRKGEKGE
     KGERGLKGDS GTSGIVGPGS VKGQKGEKGD LGVKGSAGFG YPGSKGQKGE PGDPGLPGTL
     SRHAAGSVVE QVTGPPGTPG KDGAPGRDGE PGDPGEDGKP GEMGPPGFPG MPGEPGLKGE
     KGDPGVGPRG PPGPPGPPGP PGPSSKNDKL TFIDMEGSGF AGDLESLRGP RGPPGPPGPP
     GVPGLPGEPG RFGMNRTELP GPPGLPGRDG SPGPPGPVGP QGPPGRDGEA GQPGPKGEQG
     DMGDLGLPGL PGPKGNKGET GPAGPPGEMG LAGLPGPVGP RGQPGPPGPP GPPGPGYEAG
     FGDMEGSGLS FTPGPPGPEG PQGVPGLPGL KGEVGSPGQP GLPGPKGDAG MPGVDGRPGL
     EGFPGPQGPK GDKGSTGEKG ERGQDGVGLP GPPGPPGPPG QVISVSGEDQ SLVAFPGPEG
     RPGRAGFPGP VGPKGDQGSP GPQGAPGLKG EKGEPGVIIS PDGTVVTAKV KGEKGEPGLQ
     GPMGPSGPPG RAGMKGEIGF PGRPGRPGMN GLKGEKGDPA DVLGLRGPPG PPGPPGPPGP
     PGSIVYSNGN TFSDSSHPAF PGFHQFSGQK GEKGDTGPPG PPGQFPYDVS HFGTSLRGDK
     GDAGPKGQKG EPGSTPLYSP GVSGLPGPPG PQGYPGLPGP KGDSIVGPPG PPGPQGPPGI
     GYEGRQGPPG PPGPPGPPSF PGPHRQAVSI PGPPGPPGPP GPPGTGGISL GLQAMPTYQA
     MLSAVHELPE GGLVFLTDRQ ELYVRLRGGF RRVLLEEHNL IPSSALDNEV YDKPPTLHYA
     GPQPRGPLHP LRNHVPSATA RPWRGDEVVA NQHRLPEQPL LHHQHELING YYIHRRPDPA
     PVAAHVHQDF QPALHLVALN TPLSGGMRGI RGADFQCFQQ ARQVGLAGTF RAFLSSRLQD
     LYSIVRRADR TAVPIVNLRD EVLFNNWEAL FTGSGAPLRT GARILSFDGR DVLRDAGWPQ
     KSVWHGSDAK GRRLPESYCE TWRTEERAVT GQASSLASGK LLEQAASSCQ QAFIVLCIEN
     SFMTATKK
//
DBGET integrated database retrieval system