Database: UniProt
Entry: F7GIS1_MACMU
ID   F7GIS1_MACMU            Unreviewed;      1519 AA.
AC   F7GIS1;
DT   27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT   11-DEC-2019, sequence version 3.
DT   28-JAN-2026, entry version 71.
DE   SubName: Full=Collagen type XVIII alpha 1 chain {ECO:0000313|Ensembl:ENSMMUP00000005961.4};
GN   Name=COL18A1 {ECO:0000313|Ensembl:ENSMMUP00000005961.4,
GN   ECO:0000313|VGNC:VGNC:81291};
OS   Macaca mulatta (Rhesus macaque).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC   Cercopithecidae; Cercopithecinae; Macaca.
OX   NCBI_TaxID=9544 {ECO:0000313|Ensembl:ENSMMUP00000005961.4, ECO:0000313|Proteomes:UP000006718};
RN   [1] {ECO:0000313|Proteomes:UP000006718}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=17573 {ECO:0000313|Proteomes:UP000006718};
RX   PubMed=17431167; DOI=10.1126/science.1139247;
RA   Gibbs R.A., Rogers J., Katze M.G., Bumgarner R., Weinstock G.M.,
RA   Mardis E.R., Remington K.A., Strausberg R.L., Venter J.C., Wilson R.K.,
RA   Batzer M.A., Bustamante C.D., Eichler E.E., Hahn M.W., Hardison R.C.,
RA   Makova K.D., Miller W., Milosavljevic A., Palermo R.E., Siepel A.,
RA   Sikela J.M., Attaway T., Bell S., Bernard K.E., Buhay C.J.,
RA   Chandrabose M.N., Dao M., Davis C., Delehaunty K.D., Ding Y., Dinh H.H.,
RA   Dugan-Rocha S., Fulton L.A., Gabisi R.A., Garner T.T., Godfrey J.,
RA   Hawes A.C., Hernandez J., Hines S., Holder M., Hume J., Jhangiani S.N.,
RA   Joshi V., Khan Z.M., Kirkness E.F., Cree A., Fowler R.G., Lee S.,
RA   Lewis L.R., Li Z., Liu Y.-S., Moore S.M., Muzny D., Nazareth L.V.,
RA   Ngo D.N., Okwuonu G.O., Pai G., Parker D., Paul H.A., Pfannkoch C.,
RA   Pohl C.S., Rogers Y.-H.C., Ruiz S.J., Sabo A., Santibanez J.,
RA   Schneider B.W., Smith S.M., Sodergren E., Svatek A.F., Utterback T.R.,
RA   Vattathil S., Warren W., White C.S., Chinwalla A.T., Feng Y., Halpern A.L.,
RA   Hillier L.W., Huang X., Minx P., Nelson J.O., Pepin K.H., Qin X.,
RA   Sutton G.G., Venter E., Walenz B.P., Wallis J.W., Worley K.C., Yang S.-P.,
RA   Jones S.M., Marra M.A., Rocchi M., Schein J.E., Baertsch R., Clarke L.,
RA   Csuros M., Glasscock J., Harris R.A., Havlak P., Jackson A.R., Jiang H.,
RA   Liu Y., Messina D.N., Shen Y., Song H.X.-Z., Wylie T., Zhang L., Birney E.,
RA   Han K., Konkel M.K., Lee J., Smit A.F.A., Ullmer B., Wang H., Xing J.,
RA   Burhans R., Cheng Z., Karro J.E., Ma J., Raney B., She X., Cox M.J.,
RA   Demuth J.P., Dumas L.J., Han S.-G., Hopkins J., Karimpour-Fard A.,
RA   Kim Y.H., Pollack J.R., Vinar T., Addo-Quaye C., Degenhardt J., Denby A.,
RA   Hubisz M.J., Indap A., Kosiol C., Lahn B.T., Lawson H.A., Marklein A.,
RA   Nielsen R., Vallender E.J., Clark A.G., Ferguson B., Hernandez R.D.,
RA   Hirani K., Kehrer-Sawatzki H., Kolb J., Patil S., Pu L.-L., Ren Y.,
RA   Smith D.G., Wheeler D.A., Schenck I., Ball E.V., Chen R., Cooper D.N.,
RA   Giardine B., Hsu F., Kent W.J., Lesk A., Nelson D.L., O'brien W.E.,
RA   Pruefer K., Stenson P.D., Wallace J.C., Ke H., Liu X.-M., Wang P.,
RA   Xiang A.P., Yang F., Barber G.P., Haussler D., Karolchik D., Kern A.D.,
RA   Kuhn R.M., Smith K.E., Zwieg A.S.;
RT   "Evolutionary and biomedical insights from the rhesus macaque genome.";
RL   Science 316:222-234(2007).
RN   [2] {ECO:0000313|Ensembl:ENSMMUP00000005961.4}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000005961.4};
RA   Graves T., Eichler E.E., Wilson R.K.;
RL   Submitted (JAN-2019) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|Ensembl:ENSMMUP00000005961.4}
RP   IDENTIFICATION.
RC   STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000005961.4};
RG   Ensembl;
RL   Submitted (AUG-2025) to UniProtKB.
RN   [4] {ECO:0000313|Ensembl:ENSMMUP00000005961.4}
RP   IDENTIFICATION.
RC   STRAIN=17573 {ECO:0000313|Ensembl:ENSMMUP00000005961.4};
RG   Ensembl;
RL   Submitted (SEP-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- SIMILARITY: Belongs to the multiplexin collagen family.
CC       {ECO:0000256|ARBA:ARBA00061275}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   RefSeq; XP_028701068.1; XM_028845235.1.
DR   SMR; F7GIS1; -.
DR   FunCoup; F7GIS1; 424.
DR   STRING; 9544.ENSMMUP00000005961; -.
DR   PaxDb; 9544-ENSMMUP00000005962; -.
DR   Ensembl; ENSMMUT00000006338.4; ENSMMUP00000005961.4; ENSMMUG00000004472.4.
DR   GeneID; 721919; -.
DR   CTD; 80781; -.
DR   VEuPathDB; HostDB:ENSMMUG00000004472; -.
DR   VGNC; VGNC:81291; COL18A1.
DR   eggNOG; KOG3546; Eukaryota.
DR   GeneTree; ENSGT00940000158212; -.
DR   HOGENOM; CLU_354363_0_0_1; -.
DR   InParanoid; F7GIS1; -.
DR   OMA; VQDQHQN; -.
DR   Proteomes; UP000006718; Chromosome 3.
DR   Bgee; ENSMMUG00000004472; Expressed in liver and 21 other cell types or tissues.
DR   ExpressionAtlas; F7GIS1; baseline.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005788; C:endoplasmic reticulum lumen; IEA:UniProtKB-ARBA.
DR   GO; GO:0031012; C:extracellular matrix; IBA:GO_Central.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IBA:GO_Central.
DR   GO; GO:0001525; P:angiogenesis; IBA:GO_Central.
DR   GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR   GO; GO:0001886; P:endothelial cell morphogenesis; IBA:GO_Central.
DR   GO; GO:0001501; P:skeletal system development; IBA:GO_Central.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.10.100.10:FF:000008; collagen alpha-1(XVIII) chain isoform X1; 1.
DR   FunFam; 3.40.1620.70:FF:000003; Collagen type XVIII alpha 1; 1.
DR   FunFam; 2.60.120.200:FF:000039; Collagen XV alpha 1 chain; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR010363; DUF959_COL18_N.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF906; COLLAGEN ALPHA-2(IX) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06121; DUF959; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   3: Inferred from homology;
KW   Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Hydroxylation {ECO:0000256|ARBA:ARBA00023278};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000006718};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..23
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           24..1519
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5023926016"
FT   DOMAIN          221..409
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          47..104
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          149..220
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          409..434
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          449..1208
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1276..1297
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        57..73
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        157..184
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        482..503
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        527..554
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        563..574
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        580..596
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        598..611
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        627..639
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        669..679
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        695..707
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        711..726
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        770..786
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        818..829
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        860..874
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        882..891
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        906..918
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        927..944
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1019..1033
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1047..1061
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1086..1106
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1118..1133
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1163..1179
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1189..1201
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1519 AA;  154114 MW;  CF000BC8E060D845 CRC64;
     MAPDPSGCRV LLLLFCCLAA AQADLLNLNW LWFNNKDTSQ AATTIPEPQG LLPVQPTADT
     TTHVVPQDGS TEPATAPGSP EPPSELLEDS QGTPTSAESP DMPEENIAGV GAKILNVAQG
     IRSFVQLWND TVPTESLARA ETRVVEAPVG TLALPGPSST PQENGTTLWP SSGAPSSPGT
     HTTEAGTLPA PTPSPPSLGR PWAPLTGPSV PPPSSERVGE EVGLLQLLGD PPPQQITQTD
     DPDVGLAYVF GPDANSGQVA RYHFPSLFFR DFSLLFHIQP ATEGPGVLFA ITDSAQAVVS
     LGVKLSGVRD GHQDISLLYT EPGAGQTHTA ASFRLPAFVG QWTHLALSVE GGYVALYVDC
     EEFQRMPLAR SSRGLELEPG AGLFVAQAGG ADPDKFQGMI AELKVRGDPQ VGPMHCLDEE
     GDDSDGASGD FGSGLADTQE LLREEMGTAL KPRLPTPPPV TAPPLAGGSS TEDSRSEEIE
     EQTTVTSLGA QTLPGSDSVS TWDGSVRTPG GRVKEGGLKG QKGEPGIPGP PGRAGPPGSP
     CLPGPPGLPC PVSPLGPVGP ALQPVPGPQG PPGLPGRDGT PGRDGEPGDP GEDGKPGDTG
     PQGFPGTPGD VGPKGDKGDP GVGARGPPGP QGPPGPPGPS FRHDKLTFID MEGSGFGGDL
     EALRGPRGFP GPPGPPGVPG LPGEPGRFGV NGSDVPGPAG LPGVPGREGP PGFPGLPGPP
     GPPGKEGPPG RMGQKGSLGE AGAPGHKGSK GDPGPAGARG ESGLAGAPGP AGPPGPPGPP
     GPPGPGLPAG FDDMEGSGGP FWSTARGADG PQGPPGLPGL KGDPGVPGLR GAKGEVGANG
     APGFPGLPGR EGTAGPQGPK GDRGSRGEKG DPGKDGVGQP GLPGPPGPPG PVVYVSEQDG
     AVLSVPGPEG RPGFAGFPGP TGPKGDLGSK GERGSPGPKG EKGEPGSVFS PDGSALGPAQ
     KGAKGEPGFR GPPGPYGRPG HKGEIGFPGR PGRPGMNGLK GEKGEPGDAH LGFGMRGMPG
     PPGPPGPPGP PGTPVYDSNV FAESSRPGPP GLPGNQGPPG PKGTKGEVGP PGPPGQFPFD
     FLQLEAEMKG EKGDRGDAGQ KGERGEPGGG GFFGSSLPGP PGPPGPPGYP GIPGPKGESI
     RGQPGPPGPQ GPPGIGYEGR QGPPGPPGPP GPPGPPSFPG PHRQTISVPG PPGPPGPPGP
     PGTMGTSSGV RLWATRQAML GQVHEVPEGW LIFVAEQEEL YVRVRNGFRK VQLEPRTPLP
     RGTDNEVAAL QPPVVQLHDS NPYPRREFPH PTARPWRADD ILASPPRLPE PQPYPGAPHH
     SSYVHLRPAL PTSPPAHTHR DFQPVLHLVA LNSPLPGGMR GIRGADFQCF QQARAVGLVG
     TFRAFLSSRL QDLYSIVRRA DRAAVPIVNL KDELLFPSWE ALFAGSEGPL KPGARIFSFD
     GKDVLRHPTW PQKSVWHGSD PSGRRLTESY CETWRTESPS VTGQASSLLG GRLLGQNAAS
     CHHAYIVLCI ENSFMTASK
//