ID G1L4W3_AILME Unreviewed; 1460 AA.
AC G1L4W3;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 28-JAN-2026, entry version 63.
DE SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSAMEP00000001923.2};
GN Name=COL18A1 {ECO:0000313|Ensembl:ENSAMEP00000001923.2};
OS Ailuropoda melanoleuca (Giant panda).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ailuropoda.
OX NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000001923.2, ECO:0000313|Proteomes:UP000008912};
RN [1] {ECO:0000313|Ensembl:ENSAMEP00000001923.2, ECO:0000313|Proteomes:UP000008912}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=20010809; DOI=10.1038/nature08696;
RA Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., Li B.,
RA Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., Jian M., Li J.,
RA Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., Ryder O.A.,
RA Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., Guo X., Wang B.,
RA Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., Wang G., Yu C., Nie W.,
RA Wang J., Wu Z., Liang H., Min J., Wu Q., Cheng S., Ruan J., Wang M.,
RA Shi Z., Wen M., Liu B., Ren X., Zheng H., Dong D., Cook K., Shan G.,
RA Zhang H., Kosiol C., Xie X., Lu Z., Zheng H., Li Y., Steiner C.C.,
RA Lam T.T., Lin S., Zhang Q., Li G., Tian J., Gong T., Liu H., Zhang D.,
RA Fang L., Ye C., Zhang J., Hu W., Xu A., Ren Y., Zhang G., Bruford M.W.,
RA Li Q., Ma L., Guo Y., An N., Hu Y., Zheng Y., Shi Y., Li Z., Liu Q.,
RA Chen Y., Zhao J., Qu N., Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X.,
RA Vinar T., Wang Y., Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y.,
RA Wang X., Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L.,
RA Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., Wang J.,
RA Wang J.;
RT "The sequence and de novo assembly of the giant panda genome.";
RL Nature 463:311-317(2010).
RN [2] {ECO:0000313|Ensembl:ENSAMEP00000001923.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [3] {ECO:0000313|Ensembl:ENSAMEP00000001923.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC {ECO:0000256|ARBA:ARBA00061275}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSAMET00000002000.2; ENSAMEP00000001923.2; ENSAMEG00000001767.2.
DR GeneTree; ENSGT00940000158212; -.
DR Proteomes; UP000008912; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR010363; DUF959_COL18_N.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06121; DUF959; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00282; LamG; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Reference proteome {ECO:0000313|Proteomes:UP000008912};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..1460
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5030170550"
FT DOMAIN 213..401
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT DOMAIN 262..400
FT /note="Laminin G"
FT /evidence="ECO:0000259|SMART:SM00282"
FT REGION 42..102
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 128..211
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 402..1149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1206..1279
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..155
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 190..207
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 436..454
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 501..513
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 515..530
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 535..551
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 580..594
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 624..634
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 650..662
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 666..678
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 679..688
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 689..710
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 725..740
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 815..829
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 837..846
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 964..979
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1034..1052
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1064..1076
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1106..1122
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1132..1144
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1244..1254
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1266..1275
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1460 AA; 148335 MW; 84376E49307A3067 CRC64;
MARHPGSHLG PLLLLVYCLA AAQANLLNLN WLWSTGTDGP TAGLVSEPPG SPHVQPREDT
TTHVAPQHGP TEQGRSPASS ELPVERLETG QGRASATGPD LKEENIAGVG AKILNVAQGI
RSFVQLWDDS TPTTNSPGAE TAAPATPTAP LTLPGPSSPP QENGTALWLD SWTLSSPGSQ
RIEAGTQPVP TQPPPFPGGP SVPPPSPESL GTEVGLLQLL GEPPPQQVTQ VDDPDVGPAF
VFGPDANSGQ VARYHFPSPF FRDFSLHFHV RPATKDAGVL FAITDAAQAV VSVGVKLSGV
RDGHQHIQIL YTEPGATRTH TAASFRLPAF TGQWTRFALS VDGATVALFV DCDLFQRVPL
VRSPRGLELE PGAGLFVAQA GGADPDKFQG MIAELRVRGD PQVSPMHCLE EDDEDSGGVS
GDFGSGLEET REPLREPPGP SPKPSLPEAP PVTSPPLAAG KGVEDSRTEE IEEETTVSSL
GARTLPGLDT VTAESVQSFG DRVEEGPAER SPDARPVPGP QGPPGPPGPP GKDGAPGRDG
EPGDPGEDGR PGDPGPQGFP GTPGDVGPKG EKGDPGVGPR GPPGPQGPPG PPGPSFRHDK
LTFIDMEGSG FGGDLESLRG PRGFPGPPGP PGVPGLPGEP GRFGMNSSDV PGPAGLPGVP
GRDGPPGLPG IPGPPGPPGK DGGPGKTGQK GSPGEVGAPG PKGSKGDPGP AGTPGENGLA
GAPGPAGPPG PPGPPGPPGP GLAAGFDDME GSGGPFWSMA RGASGPQGPA GLPGVKGDPG
VTGPPGAKGE VGADGPPGFP GLPGREGTAG AQGPKGEKGT QGEKGDPGRD GVGQPGLPGP
PGPPGPVVYV SEQDRALASV PGPEGPAGPK GDLGSKGQRG LPGPKGEKGE PGPVFSPDGS
MLAPAQKGAK GEPGFRGPPG PYGRPGHKGE IGFPGRPGRP GMNGLKGEKG EPGHASVGFG
VRGPPGPPGP PGPPGPPGTP VYDNNAFMES GRPGPPGLPG LQGPSGPKGD KGDVGPPGPP
GQFPFDLLQL GAEMKGEKGD RGDTGQKGER GEPGGGGFFS SSVPGPPGPP GYPGIPGPKG
ESIRGQPGPP GPQGPPGIGH EGRQGPPGPP GPPGPPGPPS FPGPYRQTIS VPGPPGPPGP
PGPPGTMGIS SGVRIWATYQ TMLDKVPEVP EGWLIFVAER EELYVRVRNG LRKVLLEART
PLPRGTDNEV AALQPPAGNP YPRRELTHST ARPWRADDIL ANPPRLPDPQ PYPGAPHHGS
YLHFQPAHPT SGPTHTHTHQ DYQPVLHLVA LNSPQPGGMR GIRGADFQCF QQARAVGLAG
TFRAFLSSRL QDLYSVVRRA DRTGVPVVNL RDEVLFPSWE ALFSGSEGQL KPGARIFSFD
GRDVLQHPAW PQKSVWHGSD PGGRRLTDSY CETWRTEAAA ATGQASSLLA GRLLGQKAAS
CHNAFIVLCI ENSFMTSSSK
//