ID F7GIS1_MACMU Unreviewed; 1519 AA.
AC F7GIS1;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2019, sequence version 3.
DT 28-JAN-2026, entry version 71.
DE SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSMMUP00000005961.4};
GN Name=COL18A1 {ECO:0000313|Ensembl:ENSMMUP00000005961.4,
GN ECO:0000313|VGNC:VGNC:81291};
OS Macaca mulatta (Rhesus macaque).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000005961.4, ECO:0000313|Proteomes:UP000006718};
RN [1] {ECO:0000313|Proteomes:UP000006718}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=17573 {ECO:0000313|Proteomes:UP000006718};
RX PubMed=17431167; DOI=10.1126/science.1139247;
RA Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M.,
RA Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., Wilson R.K.,
RA Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., Hardison R.C.,
RA Makova K.D., Miller W., Milosavljevic A., Palermo R.E., Siepel A.,
RA Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J.,
RA Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., Dinh H.H.,
RA Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., Godfrey J.,
RA Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., Jhangiani S.N.,
RA Joshi V., Khan Z.M., Kirkness E.F., Cree A., Fowler R.G., Lee S.,
RA Lewis L.R., Li Z., Liu Y.-S., Moore S.M., Muzny D., Nazareth L.V.,
RA Ngo D.N., Okwuonu G.O., Pai G., Parker D., Paul H.A., Pfannkoch C.,
RA Pohl C.S., Rogers Y.-H.C., Ruiz S.J., Sabo A., Santibanez J.,
RA Schneider B.W., Smith S.M., Sodergren E., Svatek A.F., Utterback T.R.,
RA Vattathil S., Warren W., White C.S., Chinwalla A.T., Feng Y., Halpern A.L.,
RA Hillier L.W., Huang X., Minx P., Nelson J.O., Pepin K.H., Qin X.,
RA Sutton G.G., Venter E., Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P.,
RA Jones S.M., Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L.,
RA Csuros M., Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H.,
RA Liu Y., Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E.,
RA Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J.,
RA Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J.,
RA Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A.,
RA Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., Denby A.,
RA Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., Marklein A.,
RA Nielsen R., Vallender E.J., Clark A.G., Ferguson B., Hernandez R.D.,
RA Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., Pu L.-L., Ren Y.,
RA Smith D.G., Wheeler D.A., Schenck I., Ball E.V., Chen R., Cooper D.N.,
RA Giardine B., Hsu F., Kent W.J., Lesk A., Nelson D.L., O'brien W.E.,
RA Pruefer K., Stenson P.D., Wallace J.C., Ke H., Liu X.-M., Wang P.,
RA Xiang A.P., Yang F., Barber G.P., Haussler D., Karolchik D., Kern A.D.,
RA Kuhn R.M., Smith K.E., Zwieg A.S.;
RT "Evolutionary and biomedical insights from the rhesus macaque genome.";
RL Science 316:222-234(2007).
RN [2] {ECO:0000313|Ensembl:ENSMMUP00000005961.4}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000005961.4};
RA Graves T., Eichler E.E., Wilson R.K.;
RL Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSMMUP00000005961.4}
RP IDENTIFICATION.
RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000005961.4};
RG Ensembl;
RL Submitted (AUG-2025) to UniProtKB.
RN [4] {ECO:0000313|Ensembl:ENSMMUP00000005961.4}
RP IDENTIFICATION.
RC STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000005961.4};
RG Ensembl;
RL Submitted (SEP-2025) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC {ECO:0000256|ARBA:ARBA00061275}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_028701068.1; XM_028845235.1.
DR SMR; F7GIS1; -.
DR FunCoup; F7GIS1; 424.
DR STRING; 9544.ENSMMUP00000005961; -.
DR PaxDb; 9544-ENSMMUP00000005962; -.
DR Ensembl; ENSMMUT00000006338.4; ENSMMUP00000005961.4; ENSMMUG00000004472.4.
DR GeneID; 721919; -.
DR CTD; 80781; -.
DR VEuPathDB; HostDB:ENSMMUG00000004472; -.
DR VGNC; VGNC:81291; COL18A1.
DR eggNOG; KOG3546; Eukaryota.
DR GeneTree; ENSGT00940000158212; -.
DR HOGENOM; CLU_354363_0_0_1; -.
DR InParanoid; F7GIS1; -.
DR OMA; VQDQHQN; -.
DR Proteomes; UP000006718; Chromosome 3.
DR Bgee; ENSMMUG00000004472; Expressed in liver and 21 other cell types or tissues.
DR ExpressionAtlas; F7GIS1; baseline.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-ARBA.
DR GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IBA:GO_Central.
DR GO; GO:0001525; P:angiogenesis; IBA:GO_Central.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR GO; GO:0001886; P:endothelial cell morphogenesis; IBA:GO_Central.
DR GO; GO:0001501; P:skeletal system development; IBA:GO_Central.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR010363; DUF959_COL18_N.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF906; COLLAGEN ALPHA-2(IX) CHAIN; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06121; DUF959; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 3: Inferred from homology;
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Reference proteome {ECO:0000313|Proteomes:UP000006718};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1519
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5023926016"
FT DOMAIN 221..409
FT /note="Thrombospondin-like N-terminal"
FT /evidence="ECO:0000259|SMART:SM00210"
FT REGION 47..104
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 149..220
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 409..434
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 449..1208
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1276..1297
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 57..73
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 157..184
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 482..503
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 527..554
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 563..574
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 580..596
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 598..611
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 627..639
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 669..679
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 695..707
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 711..726
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 770..786
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 818..829
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 860..874
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 882..891
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 906..918
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 927..944
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1019..1033
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1047..1061
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1086..1106
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1118..1133
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1163..1179
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1189..1201
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1519 AA; 154114 MW; CF000BC8E060D845 CRC64;
MAPDPSGCRV LLLLFCCLAA AQADLLNLNW LWFNNKDTSQ AATTIPEPQG LLPVQPTADT
TTHVVPQDGS TEPATAPGSP EPPSELLEDS QGTPTSAESP DMPEENIAGV GAKILNVAQG
IRSFVQLWND TVPTESLARA ETRVVEAPVG TLALPGPSST PQENGTTLWP SSGAPSSPGT
HTTEAGTLPA PTPSPPSLGR PWAPLTGPSV PPPSSERVGE EVGLLQLLGD PPPQQITQTD
DPDVGLAYVF GPDANSGQVA RYHFPSLFFR DFSLLFHIQP ATEGPGVLFA ITDSAQAVVS
LGVKLSGVRD GHQDISLLYT EPGAGQTHTA ASFRLPAFVG QWTHLALSVE GGYVALYVDC
EEFQRMPLAR SSRGLELEPG AGLFVAQAGG ADPDKFQGMI AELKVRGDPQ VGPMHCLDEE
GDDSDGASGD FGSGLADTQE LLREEMGTAL KPRLPTPPPV TAPPLAGGSS TEDSRSEEIE
EQTTVTSLGA QTLPGSDSVS TWDGSVRTPG GRVKEGGLKG QKGEPGIPGP PGRAGPPGSP
CLPGPPGLPC PVSPLGPVGP ALQPVPGPQG PPGLPGRDGT PGRDGEPGDP GEDGKPGDTG
PQGFPGTPGD VGPKGDKGDP GVGARGPPGP QGPPGPPGPS FRHDKLTFID MEGSGFGGDL
EALRGPRGFP GPPGPPGVPG LPGEPGRFGV NGSDVPGPAG LPGVPGREGP PGFPGLPGPP
GPPGKEGPPG RMGQKGSLGE AGAPGHKGSK GDPGPAGARG ESGLAGAPGP AGPPGPPGPP
GPPGPGLPAG FDDMEGSGGP FWSTARGADG PQGPPGLPGL KGDPGVPGLR GAKGEVGANG
APGFPGLPGR EGTAGPQGPK GDRGSRGEKG DPGKDGVGQP GLPGPPGPPG PVVYVSEQDG
AVLSVPGPEG RPGFAGFPGP TGPKGDLGSK GERGSPGPKG EKGEPGSVFS PDGSALGPAQ
KGAKGEPGFR GPPGPYGRPG HKGEIGFPGR PGRPGMNGLK GEKGEPGDAH LGFGMRGMPG
PPGPPGPPGP PGTPVYDSNV FAESSRPGPP GLPGNQGPPG PKGTKGEVGP PGPPGQFPFD
FLQLEAEMKG EKGDRGDAGQ KGERGEPGGG GFFGSSLPGP PGPPGPPGYP GIPGPKGESI
RGQPGPPGPQ GPPGIGYEGR QGPPGPPGPP GPPGPPSFPG PHRQTISVPG PPGPPGPPGP
PGTMGTSSGV RLWATRQAML GQVHEVPEGW LIFVAEQEEL YVRVRNGFRK VQLEPRTPLP
RGTDNEVAAL QPPVVQLHDS NPYPRREFPH PTARPWRADD ILASPPRLPE PQPYPGAPHH
SSYVHLRPAL PTSPPAHTHR DFQPVLHLVA LNSPLPGGMR GIRGADFQCF QQARAVGLVG
TFRAFLSSRL QDLYSIVRRA DRAAVPIVNL KDELLFPSWE ALFAGSEGPL KPGARIFSFD
GKDVLRHPTW PQKSVWHGSD PSGRRLTESY CETWRTESPS VTGQASSLLG GRLLGQNAAS
CHHAYIVLCI ENSFMTASK
//