ID H0YYN3_TAEGU Unreviewed; 1443 AA.
AC H0YYN3;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 28-JAN-2026, entry version 84.
DE RecName: Full=Collagen alpha-1(XVIII) chain {ECO:0000256|ARBA:ARBA00069367};
GN Name=COL18A1 {ECO:0000313|Ensembl:ENSTGUP00000003411.2};
OS Taeniopygia guttata (Zebra finch) (Poephila guttata).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Neoaves; Telluraves; Australaves;
OC Passeriformes; Passeroidea; Estrildidae; Estrildinae; Taeniopygia.
OX NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000003411.2, ECO:0000313|Proteomes:UP000007754};
RN [1] {ECO:0000313|Ensembl:ENSTGUP00000003411.2, ECO:0000313|Proteomes:UP000007754}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20360741; DOI=10.1038/nature08819;
RA Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W.,
RA Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A.,
RA Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P.,
RA Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., London S.E.,
RA Li Y., Lin Y.C., George J., Sweedler J., Southey B., Gunaratne P.,
RA Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., Itoh Y., Whitney O.,
RA Pfenning A.R., Howard J., Volker M., Skinner B.M., Griffin D.K., Ye L.,
RA McLaren W.M., Flicek P., Quesada V., Velasco G., Lopez-Otin C.,
RA Puente X.S., Olender T., Lancet D., Smit A.F., Hubley R., Konkel M.K.,
RA Walker J.A., Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z.,
RA Eichler E.E., Stapley J., Slate J., Ekblom R., Birkhead T., Burke T.,
RA Burt D., Scharff C., Adam I., Richard H., Sultan M., Soldatov A.,
RA Lehrach H., Edwards S.V., Yang S.P., Li X., Graves T., Fulton L.,
RA Nelson J., Chinwalla A., Hou S., Mardis E.R., Wilson R.K.;
RT "The genome of a songbird.";
RL Nature 464:757-762(2010).
RN [2] {ECO:0000313|Ensembl:ENSTGUP00000003411.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [3] {ECO:0000313|Ensembl:ENSTGUP00000003411.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC -!- FUNCTION: May regulate extracellular matrix-dependent motility and
CC morphogenesis of endothelial and non-endothelial cells; the function
CC requires homotrimerization and implicates MAPK signaling.
CC {ECO:0000256|ARBA:ARBA00053766}.
CC -!- FUNCTION: Probably plays a major role in determining the retinal
CC structure as well as in the closure of the neural tube.
CC {ECO:0000256|ARBA:ARBA00054383}.
CC -!- SUBUNIT: Forms homotrimers. Recombinant non-collagenous domain 1 has
CC stronger affinity to NID1, HSPG2 and laminin-1:NID1 complex and lower
CC affinity to FBLN1 and FBLN2 than endostatin.
CC {ECO:0000256|ARBA:ARBA00065165}.
CC -!- SUBUNIT: Monomeric. Interacts with KDR/VEGFR2. Interacts with the
CC ITGA5:ITGB1 complex. Interacts with NID1, HSPG2, laminin-1:NID1
CC complex, FBLN1 and FBLN2. {ECO:0000256|ARBA:ARBA00064471}.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix, basement membrane {ECO:0000256|ARBA:ARBA00004302}.
CC -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC {ECO:0000256|ARBA:ARBA00061275}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00090}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 59729.ENSTGUP00000024291; -.
DR Ensembl; ENSTGUT00000003445.2; ENSTGUP00000003411.2; ENSTGUG00000003295.2.
DR GeneTree; ENSGT00940000158212; -.
DR HOGENOM; CLU_398801_0_0_1; -.
DR Proteomes; UP000007754; Chromosome 7.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0009653; P:anatomical structure morphogenesis; IEA:UniProtKB-ARBA.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0042127; P:regulation of cell population proliferation; IEA:UniProtKB-ARBA.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 1.10.2000.10:FF:000017; Alpha 1 type XVIII collagen; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 1.10.2000.10; Frizzled cysteine-rich domain; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050938; Collagen_Structural_Proteins.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR020067; Frizzled_dom.
DR InterPro; IPR036790; Frizzled_dom_sf.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR37456:SF4; COLLAGEN ALPHA-1(XXIII) CHAIN; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR Pfam; PF01392; Fz; 1.
DR SMART; SM00063; FRI; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF63501; Frizzled cysteine-rich domain; 1.
DR PROSITE; PS50038; FZ; 1.
PE 3: Inferred from homology;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00090}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Reference proteome {ECO:0000313|Proteomes:UP000007754};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 20..138
FT /note="FZ"
FT /evidence="ECO:0000259|PROSITE:PS50038"
FT REGION 335..875
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 887..1126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 343..352
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 389..399
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 422..434
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 547..561
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 589..601
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 630..641
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 677..693
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 701..713
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 739..749
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 788..802
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 943..956
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 960..971
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1012..1026
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1031..1040
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1063..1072
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1083..1096
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1106..1118
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 35..81
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
FT DISULFID 72..110
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
SQ SEQUENCE 1443 AA; 148373 MW; 528A14BD27BAE977 CRC64;
MQLSRSDSAG LTSSFLGLIP AAGRCQPIPT RLPFCSVLGT SHVRLPNYLQ HSGEEEIRAA
LHEWEGLLES RCHHYLEWFL CLLLLPGCSP SVPVTPPPCQ GFCEAVRDLC WTHLAGRRLP
LPCDALPEED EGVSCVFINA SAESLSAEIS LLELIGDPPT EEIHRIYGPD NNPGYVFGPN
ANTGQVARYH LPSPFYRDFS LLFHIQPTTP RAGVLFAVTD ASQSIIYVGV KLSELRAGQQ
QIIFYYTEPG SPSSYPAATF TVPTLLNQWT RFAISVEEEE VVLYLDCEEH ERVRFERSPD
EMELEEGSGL FVAQAGGADP DKYQGVIADL KLRGDPRAAE RQCEEEEDDA EVSGDFGSGM
EGGQQSLSKA EGVPGLVDAV PVTSPPVVGG SGPRSGGGSL QQAERTRAEE TLRVPTGGTG
QKGEKGEKGE RGLKGDSGTS GIVGPGSVKG QKGEKGDLGV KGSAGFGYPG SKGQKGEPGN
PGPPGTLSRH ADGSVVEQVT GPPGTPGKDG APGRDGEPGE MGPPGFPGMP GEPGLKGEKG
DPGMGPRGPP GPPGPPGPPG PSSKNDKLTF IDMEGSGFGG DLESLRGPRG PPGPPGPPGV
PGLPGEPGRF GMNCTDLPGP PGLPGRDGIP GPPGPVGPQG PPGRDGEVGQ PGPKGEQGDV
GDLGLPGLPG PKGNKGEMGP AGPPGEMGLA GLPGPIGPRG QPGPPGPPGP PGPGYEAGFG
DMEGSGLSFT PGPPGPEGPQ GVPGLPGVKG EVGSPGQPGL PGPKGDAGMP GMDGRPGPEG
FPGPQGPKGD KGSTGEKGER GQDGVGLPGP PGPPGQVISV SSEDKSLVAF PGPEGRPGRA
GFPGPVGPKG DQGSTGPQGA PGLKGEKGEP GVIISPDGTV VTAKVKGEKG EPGLQGPTGP
SGPPGRAGMK GEIGFPGRPG RPGMNGLKGE KGDPADVLGL RGPPGPPGPP GPPGPPGSIV
YNNGNTFSDS SHPAFRGFHQ FSGQKGEKGD TGPPGPPGQF PYDASHFGTS LRGDKGDAGP
KGEKGEPGST PLYGPGVSGL PGPPGPQGYP GLPGPKGDSI VGPPGPPGPQ GPPGIGYEGR
QGPPGPPGPP GPPSFPGPHR QAVSIPGPPG PPGPPGPPGS SGMSLGLQAM PTYQAMLSAA
HELPEGSLVF LRDREELHVR LRGGFRRVLL EEHNLIPSSA LDNEVYDKPP TLHYAGPQPR
GPLHPLRNHV PSATARPWHG DEVVANQHRL PEQPLLHHQH ELINGYYIHR RPDPAPVAAH
VHQDFQPALH LVALNTPLSG GMRGIRGADF QCFQQARQVG LAGTFRAFLS SRLQDLYSIV
RRADRAAVPI VNLRDEVLFS NWEALFTGSG APLRTGARIL SFDGRDVLRD AGWPQKSVWH
GSDAKGRRLP ESYCETWRTE ERAVVGQASS LASGKLLEQA ASSCQHAFIV LCIENSFMTA
TKK
//