GenomeNet

Database: UniProt
Entry: A0A8B6F6L9_MYTGA
LinkDB: A0A8B6F6L9_MYTGA
Original site: A0A8B6F6L9_MYTGA 
ID   A0A8B6F6L9_MYTGA        Unreviewed;      1417 AA.
AC   A0A8B6F6L9;
DT   29-SEP-2021, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 1.
DT   28-JAN-2026, entry version 15.
DE   SubName: Full=Collagen, type XVIII, alpha {ECO:0000313|EMBL:VDI44943.1};
GN   ORFNames=MGAL_10B039461 {ECO:0000313|EMBL:VDI44943.1};
OS   Mytilus galloprovincialis (Mediterranean mussel).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Bivalvia;
OC   Autobranchia; Pteriomorphia; Mytilida; Mytiloidea; Mytilidae; Mytilinae;
OC   Mytilus.
OX   NCBI_TaxID=29158 {ECO:0000313|EMBL:VDI44943.1, ECO:0000313|Proteomes:UP000596742};
RN   [1] {ECO:0000313|EMBL:VDI44943.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Alioto T., Alioto T.;
RL   Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:VDI44943.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; UYJE01006319; VDI44943.1; -; Genomic_DNA.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000596742; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 7.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:VDI44943.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000596742};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525}.
FT   DOMAIN          79..272
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          281..442
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          458..1150
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        312..341
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        347..363
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        372..384
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        460..487
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        536..546
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        547..564
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        565..574
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        575..589
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        633..658
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        742..789
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        870..881
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        937..946
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1131..1140
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1141..1150
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1417 AA;  143544 MW;  ACA424A434A91EB9 CRC64;
     MEKEGFNWPA AFNCDNFQTS GLCVHDEKNT ELLAERSQEI FRGTTLEPYP GFFHTRTTRG
     LQQPPTSTTA KNFIPEETEV DLLAAIGLPD PSKGVTYDVG LEGFPAFKLD KSSYIRKAAE
     TYFTDRIGKI NDFAIGITVK VFSKDGILFA VKNIYETVIA FGVRITATGN GKHNVILYYK
     EDHQYGQMSV TIGNFTVDNI DRNSFTKLAI KVQGDTVTLF DNCEEVGRQQ VLNRQPLQFS
     QGSPLYIGQG GPNFEDTSFE GVIQELKYYT NPAKAEELDC LSLDSGSGSG DKDDDEDRFK
     PVITLPPPTA KPGEKGEKGD VGEPGKDGKD GKDGEKGEKG DIGPVGPVGP PGPPPSLTPP
     PPEMLEDMMG AKGERGETGP RGEQGETGID GFPGQKGEKG DPGNNGPEGP AGIQGPKGDP
     GDRGLPGIGQ VGPQGDPGLP ASVNTQEIVD IILSQVHMDQ GQKGEKGETG DRGENGSDGS
     DGREGIPGRD GLPGLDGVKG EKGDTVVGPE GPRGLVGPQG ENGLPGSNGV AGPVGPMGPP
     GPPGLPGPGG LNIEGSGDGD GEPGLPGPPG PVGPMGPQGI QGVVGPMGPA GHKGDAGIIG
     TPGIPGLPGT PGTPGVFAGN ISEMIGKPGK DGAPGLQGEP GLPGEYGPRG FDGPVGPAGD
     KGDKGDAGEP GPRGQAGING EPGATGPIGP QGIQGLSGND GSVGPQGLPG PPGPPGSGYD
     GPRRGDTGLV DIEGSGDCGC EPGTPGIQGP QGPPGKQGLQ GPAGIPGIPG EIGRQGPIGE
     PGIRGPAGPK GDSGKDGIHG TPGEEGIPGK PGLPGRQGDP GLNGQNGPKG DTGEPGKSIQ
     GPTGATGSPG VQGPAGPPGP PGKTIIETDG GSGDIDGGLG VRGPAQKGDK GDDGATGPAG
     PRGVDGVPGI PGEPGNPGLP GLQGLRGEDG LVGLRGSDGK DGKDGEAGPE GPSGPPGVNG
     EKGERGLPGP IGVLPDGVTG GVGVKGEKGD TGEPGPMGPE GPQGPVGPRG FRGPKGDIGM
     TGLPGYRGLR GRKGDRGRPG FNGLRGYKGD SGPVGPAGPP GPGFGGSGEI LKGAKGDLGE
     RGLPGIPGTP GIGAQGPRGD QGEPGRNGLP GFQGPPGPAG KAGTGEVIRG PPGPPGPPGP
     SGVGTNGGGG GGGGAVTYRN YNELMSVARI LPVGTMCYLV EEETVYLRVN KGFRVIETKF
     IKLGETVPLP PITTPAAGVR PLPTEEASNI IVLPEGEMQS DQPRLYIFAL NAPLSGRMGG
     IRGADAKCYK QAKKAGKKGT YRAFLSTKTQ DLTSIVYYDI DKSVPIVNMK DQILFNSWKQ
     LLNGHGGYFN TQRHIYSFDG EDIMTSPRWP QKIVWHGSTS NGDKVDDMTC NHWHSSSPRS
     MGYASSLLDG KLVDMKEYSC SNRFIVLCVE NTSRPMK
//
DBGET integrated database retrieval system