GenomeNet

Database: UniProt
Entry: H0YYN3_TAEGU
LinkDB: H0YYN3_TAEGU
Original site: H0YYN3_TAEGU 
ID   H0YYN3_TAEGU            Unreviewed;      1443 AA.
AC   H0YYN3;
DT   22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT   17-JUN-2020, sequence version 2.
DT   28-JAN-2026, entry version 84.
DE   RecName: Full=Collagen alpha-1(XVIII) chain {ECO:0000256|ARBA:ARBA00069367};
GN   Name=COL18A1 {ECO:0000313|Ensembl:ENSTGUP00000003411.2};
OS   Taeniopygia guttata (Zebra finch) (Poephila guttata).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Neoaves; Telluraves; Australaves;
OC   Passeriformes; Passeroidea; Estrildidae; Estrildinae; Taeniopygia.
OX   NCBI_TaxID=59729 {ECO:0000313|Ensembl:ENSTGUP00000003411.2, ECO:0000313|Proteomes:UP000007754};
RN   [1] {ECO:0000313|Ensembl:ENSTGUP00000003411.2, ECO:0000313|Proteomes:UP000007754}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=20360741; DOI=10.1038/nature08819;
RA   Warren W.C., Clayton D.F., Ellegren H., Arnold A.P., Hillier L.W.,
RA   Kunstner A., Searle S., White S., Vilella A.J., Fairley S., Heger A.,
RA   Kong L., Ponting C.P., Jarvis E.D., Mello C.V., Minx P., Lovell P.,
RA   Velho T.A., Ferris M., Balakrishnan C.N., Sinha S., Blatti C., London S.E.,
RA   Li Y., Lin Y.C., George J., Sweedler J., Southey B., Gunaratne P.,
RA   Watson M., Nam K., Backstrom N., Smeds L., Nabholz B., Itoh Y., Whitney O.,
RA   Pfenning A.R., Howard J., Volker M., Skinner B.M., Griffin D.K., Ye L.,
RA   McLaren W.M., Flicek P., Quesada V., Velasco G., Lopez-Otin C.,
RA   Puente X.S., Olender T., Lancet D., Smit A.F., Hubley R., Konkel M.K.,
RA   Walker J.A., Batzer M.A., Gu W., Pollock D.D., Chen L., Cheng Z.,
RA   Eichler E.E., Stapley J., Slate J., Ekblom R., Birkhead T., Burke T.,
RA   Burt D., Scharff C., Adam I., Richard H., Sultan M., Soldatov A.,
RA   Lehrach H., Edwards S.V., Yang S.P., Li X., Graves T., Fulton L.,
RA   Nelson J., Chinwalla A., Hou S., Mardis E.R., Wilson R.K.;
RT   "The genome of a songbird.";
RL   Nature 464:757-762(2010).
RN   [2] {ECO:0000313|Ensembl:ENSTGUP00000003411.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSTGUP00000003411.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- FUNCTION: May regulate extracellular matrix-dependent motility and
CC       morphogenesis of endothelial and non-endothelial cells; the function
CC       requires homotrimerization and implicates MAPK signaling.
CC       {ECO:0000256|ARBA:ARBA00053766}.
CC   -!- FUNCTION: Probably plays a major role in determining the retinal
CC       structure as well as in the closure of the neural tube.
CC       {ECO:0000256|ARBA:ARBA00054383}.
CC   -!- SUBUNIT: Forms homotrimers. Recombinant non-collagenous domain 1 has
CC       stronger affinity to NID1, HSPG2 and laminin-1:NID1 complex and lower
CC       affinity to FBLN1 and FBLN2 than endostatin.
CC       {ECO:0000256|ARBA:ARBA00065165}.
CC   -!- SUBUNIT: Monomeric. Interacts with KDR/VEGFR2. Interacts with the
CC       ITGA5:ITGB1 complex. Interacts with NID1, HSPG2, laminin-1:NID1
CC       complex, FBLN1 and FBLN2. {ECO:0000256|ARBA:ARBA00064471}.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix, basement membrane {ECO:0000256|ARBA:ARBA00004302}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00090}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 59729.ENSTGUP00000024291; -.
DR   Ensembl; ENSTGUT00000003445.2; ENSTGUP00000003411.2; ENSTGUG00000003295.2.
DR   GeneTree; ENSGT00940000158212; -.
DR   HOGENOM; CLU_398801_0_0_1; -.
DR   Proteomes; UP000007754; Chromosome 7.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0009653; P:anatomical structure morphogenesis; IEA:UniProtKB-ARBA.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0042127; P:regulation of cell population proliferation; IEA:UniProtKB-ARBA.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 1.10.2000.10:FF:000017; Alpha 1 type XVIII collagen; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.10.2000.10; Frizzled cysteine-rich domain; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050938; Collagen_Structural_Proteins.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR020067; Frizzled_dom.
DR   InterPro; IPR036790; Frizzled_dom_sf.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR37456:SF4; COLLAGEN ALPHA-1(XXIII) CHAIN; 1.
DR   PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   Pfam; PF01392; Fz; 1.
DR   SMART; SM00063; FRI; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF63501; Frizzled cysteine-rich domain; 1.
DR   PROSITE; PS50038; FZ; 1.
PE   3: Inferred from homology;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00090}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007754};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT   DOMAIN          20..138
FT                   /note="FZ"
FT                   /evidence="ECO:0000259|PROSITE:PS50038"
FT   REGION          335..875
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          887..1126
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        343..352
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        389..399
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        422..434
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        547..561
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        589..601
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        630..641
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        677..693
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        701..713
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        739..749
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        788..802
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        943..956
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        960..971
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1012..1026
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1031..1040
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1063..1072
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1083..1096
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1106..1118
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        35..81
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
FT   DISULFID        72..110
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00090"
SQ   SEQUENCE   1443 AA;  148373 MW;  528A14BD27BAE977 CRC64;
     MQLSRSDSAG LTSSFLGLIP AAGRCQPIPT RLPFCSVLGT SHVRLPNYLQ HSGEEEIRAA
     LHEWEGLLES RCHHYLEWFL CLLLLPGCSP SVPVTPPPCQ GFCEAVRDLC WTHLAGRRLP
     LPCDALPEED EGVSCVFINA SAESLSAEIS LLELIGDPPT EEIHRIYGPD NNPGYVFGPN
     ANTGQVARYH LPSPFYRDFS LLFHIQPTTP RAGVLFAVTD ASQSIIYVGV KLSELRAGQQ
     QIIFYYTEPG SPSSYPAATF TVPTLLNQWT RFAISVEEEE VVLYLDCEEH ERVRFERSPD
     EMELEEGSGL FVAQAGGADP DKYQGVIADL KLRGDPRAAE RQCEEEEDDA EVSGDFGSGM
     EGGQQSLSKA EGVPGLVDAV PVTSPPVVGG SGPRSGGGSL QQAERTRAEE TLRVPTGGTG
     QKGEKGEKGE RGLKGDSGTS GIVGPGSVKG QKGEKGDLGV KGSAGFGYPG SKGQKGEPGN
     PGPPGTLSRH ADGSVVEQVT GPPGTPGKDG APGRDGEPGE MGPPGFPGMP GEPGLKGEKG
     DPGMGPRGPP GPPGPPGPPG PSSKNDKLTF IDMEGSGFGG DLESLRGPRG PPGPPGPPGV
     PGLPGEPGRF GMNCTDLPGP PGLPGRDGIP GPPGPVGPQG PPGRDGEVGQ PGPKGEQGDV
     GDLGLPGLPG PKGNKGEMGP AGPPGEMGLA GLPGPIGPRG QPGPPGPPGP PGPGYEAGFG
     DMEGSGLSFT PGPPGPEGPQ GVPGLPGVKG EVGSPGQPGL PGPKGDAGMP GMDGRPGPEG
     FPGPQGPKGD KGSTGEKGER GQDGVGLPGP PGPPGQVISV SSEDKSLVAF PGPEGRPGRA
     GFPGPVGPKG DQGSTGPQGA PGLKGEKGEP GVIISPDGTV VTAKVKGEKG EPGLQGPTGP
     SGPPGRAGMK GEIGFPGRPG RPGMNGLKGE KGDPADVLGL RGPPGPPGPP GPPGPPGSIV
     YNNGNTFSDS SHPAFRGFHQ FSGQKGEKGD TGPPGPPGQF PYDASHFGTS LRGDKGDAGP
     KGEKGEPGST PLYGPGVSGL PGPPGPQGYP GLPGPKGDSI VGPPGPPGPQ GPPGIGYEGR
     QGPPGPPGPP GPPSFPGPHR QAVSIPGPPG PPGPPGPPGS SGMSLGLQAM PTYQAMLSAA
     HELPEGSLVF LRDREELHVR LRGGFRRVLL EEHNLIPSSA LDNEVYDKPP TLHYAGPQPR
     GPLHPLRNHV PSATARPWHG DEVVANQHRL PEQPLLHHQH ELINGYYIHR RPDPAPVAAH
     VHQDFQPALH LVALNTPLSG GMRGIRGADF QCFQQARQVG LAGTFRAFLS SRLQDLYSIV
     RRADRAAVPI VNLRDEVLFS NWEALFTGSG APLRTGARIL SFDGRDVLRD AGWPQKSVWH
     GSDAKGRRLP ESYCETWRTE ERAVVGQASS LASGKLLEQA ASSCQHAFIV LCIENSFMTA
     TKK
//
DBGET integrated database retrieval system