GenomeNet

Database: UniProt
Entry: G1L4W3_AILME
LinkDB: G1L4W3_AILME
Original site: G1L4W3_AILME 
ID   G1L4W3_AILME            Unreviewed;      1460 AA.
AC   G1L4W3;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   02-JUN-2021, sequence version 2.
DT   28-JAN-2026, entry version 63.
DE   SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSAMEP00000001923.2};
GN   Name=COL18A1 {ECO:0000313|Ensembl:ENSAMEP00000001923.2};
OS   Ailuropoda melanoleuca (Giant panda).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ailuropoda.
OX   NCBI_TaxID=9646 {ECO:0000313|Ensembl:ENSAMEP00000001923.2, ECO:0000313|Proteomes:UP000008912};
RN   [1] {ECO:0000313|Ensembl:ENSAMEP00000001923.2, ECO:0000313|Proteomes:UP000008912}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=20010809; DOI=10.1038/nature08696;
RA   Li R., Fan W., Tian G., Zhu H., He L., Cai J., Huang Q., Cai Q., Li B.,
RA   Bai Y., Zhang Z., Zhang Y., Wang W., Li J., Wei F., Li H., Jian M., Li J.,
RA   Zhang Z., Nielsen R., Li D., Gu W., Yang Z., Xuan Z., Ryder O.A.,
RA   Leung F.C., Zhou Y., Cao J., Sun X., Fu Y., Fang X., Guo X., Wang B.,
RA   Hou R., Shen F., Mu B., Ni P., Lin R., Qian W., Wang G., Yu C., Nie W.,
RA   Wang J., Wu Z., Liang H., Min J., Wu Q., Cheng S., Ruan J., Wang M.,
RA   Shi Z., Wen M., Liu B., Ren X., Zheng H., Dong D., Cook K., Shan G.,
RA   Zhang H., Kosiol C., Xie X., Lu Z., Zheng H., Li Y., Steiner C.C.,
RA   Lam T.T., Lin S., Zhang Q., Li G., Tian J., Gong T., Liu H., Zhang D.,
RA   Fang L., Ye C., Zhang J., Hu W., Xu A., Ren Y., Zhang G., Bruford M.W.,
RA   Li Q., Ma L., Guo Y., An N., Hu Y., Zheng Y., Shi Y., Li Z., Liu Q.,
RA   Chen Y., Zhao J., Qu N., Zhao S., Tian F., Wang X., Wang H., Xu L., Liu X.,
RA   Vinar T., Wang Y., Lam T.W., Yiu S.M., Liu S., Zhang H., Li D., Huang Y.,
RA   Wang X., Yang G., Jiang Z., Wang J., Qin N., Li L., Li J., Bolund L.,
RA   Kristiansen K., Wong G.K., Olson M., Zhang X., Li S., Yang H., Wang J.,
RA   Wang J.;
RT   "The sequence and de novo assembly of the giant panda genome.";
RL   Nature 463:311-317(2010).
RN   [2] {ECO:0000313|Ensembl:ENSAMEP00000001923.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [3] {ECO:0000313|Ensembl:ENSAMEP00000001923.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSAMET00000002000.2; ENSAMEP00000001923.2; ENSAMEG00000001767.2.
DR   GeneTree; ENSGT00940000158212; -.
DR   Proteomes; UP000008912; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR010363; DUF959_COL18_N.
DR   InterPro; IPR001791; Laminin_G.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06121; DUF959; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00282; LamG; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000008912};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..24
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           25..1460
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5030170550"
FT   DOMAIN          213..401
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   DOMAIN          262..400
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|SMART:SM00282"
FT   REGION          42..102
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          128..211
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          402..1149
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1206..1279
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        137..155
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        190..207
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        436..454
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        501..513
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        515..530
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        535..551
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        580..594
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        624..634
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        650..662
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        666..678
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        679..688
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        689..710
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        725..740
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        815..829
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        837..846
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        964..979
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1034..1052
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1064..1076
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1106..1122
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1132..1144
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1244..1254
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1266..1275
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1460 AA;  148335 MW;  84376E49307A3067 CRC64;
     MARHPGSHLG PLLLLVYCLA AAQANLLNLN WLWSTGTDGP TAGLVSEPPG SPHVQPREDT
     TTHVAPQHGP TEQGRSPASS ELPVERLETG QGRASATGPD LKEENIAGVG AKILNVAQGI
     RSFVQLWDDS TPTTNSPGAE TAAPATPTAP LTLPGPSSPP QENGTALWLD SWTLSSPGSQ
     RIEAGTQPVP TQPPPFPGGP SVPPPSPESL GTEVGLLQLL GEPPPQQVTQ VDDPDVGPAF
     VFGPDANSGQ VARYHFPSPF FRDFSLHFHV RPATKDAGVL FAITDAAQAV VSVGVKLSGV
     RDGHQHIQIL YTEPGATRTH TAASFRLPAF TGQWTRFALS VDGATVALFV DCDLFQRVPL
     VRSPRGLELE PGAGLFVAQA GGADPDKFQG MIAELRVRGD PQVSPMHCLE EDDEDSGGVS
     GDFGSGLEET REPLREPPGP SPKPSLPEAP PVTSPPLAAG KGVEDSRTEE IEEETTVSSL
     GARTLPGLDT VTAESVQSFG DRVEEGPAER SPDARPVPGP QGPPGPPGPP GKDGAPGRDG
     EPGDPGEDGR PGDPGPQGFP GTPGDVGPKG EKGDPGVGPR GPPGPQGPPG PPGPSFRHDK
     LTFIDMEGSG FGGDLESLRG PRGFPGPPGP PGVPGLPGEP GRFGMNSSDV PGPAGLPGVP
     GRDGPPGLPG IPGPPGPPGK DGGPGKTGQK GSPGEVGAPG PKGSKGDPGP AGTPGENGLA
     GAPGPAGPPG PPGPPGPPGP GLAAGFDDME GSGGPFWSMA RGASGPQGPA GLPGVKGDPG
     VTGPPGAKGE VGADGPPGFP GLPGREGTAG AQGPKGEKGT QGEKGDPGRD GVGQPGLPGP
     PGPPGPVVYV SEQDRALASV PGPEGPAGPK GDLGSKGQRG LPGPKGEKGE PGPVFSPDGS
     MLAPAQKGAK GEPGFRGPPG PYGRPGHKGE IGFPGRPGRP GMNGLKGEKG EPGHASVGFG
     VRGPPGPPGP PGPPGPPGTP VYDNNAFMES GRPGPPGLPG LQGPSGPKGD KGDVGPPGPP
     GQFPFDLLQL GAEMKGEKGD RGDTGQKGER GEPGGGGFFS SSVPGPPGPP GYPGIPGPKG
     ESIRGQPGPP GPQGPPGIGH EGRQGPPGPP GPPGPPGPPS FPGPYRQTIS VPGPPGPPGP
     PGPPGTMGIS SGVRIWATYQ TMLDKVPEVP EGWLIFVAER EELYVRVRNG LRKVLLEART
     PLPRGTDNEV AALQPPAGNP YPRRELTHST ARPWRADDIL ANPPRLPDPQ PYPGAPHHGS
     YLHFQPAHPT SGPTHTHTHQ DYQPVLHLVA LNSPQPGGMR GIRGADFQCF QQARAVGLAG
     TFRAFLSSRL QDLYSVVRRA DRTGVPVVNL RDEVLFPSWE ALFSGSEGQL KPGARIFSFD
     GRDVLQHPAW PQKSVWHGSD PGGRRLTDSY CETWRTEAAA ATGQASSLLA GRLLGQKAAS
     CHNAFIVLCI ENSFMTSSSK
//
DBGET integrated database retrieval system