GenomeNet

Database: UniProt
Entry: A0A8B6F802_MYTGA
LinkDB: A0A8B6F802_MYTGA
Original site: A0A8B6F802_MYTGA 
ID   A0A8B6F802_MYTGA        Unreviewed;      1712 AA.
AC   A0A8B6F802;
DT   29-SEP-2021, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 1.
DT   28-JAN-2026, entry version 17.
DE   SubName: Full=Collagen, type XVIII, alpha {ECO:0000313|EMBL:VDI44935.1};
GN   ORFNames=MGAL_10B039461 {ECO:0000313|EMBL:VDI44935.1};
OS   Mytilus galloprovincialis (Mediterranean mussel).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Bivalvia;
OC   Autobranchia; Pteriomorphia; Mytilida; Mytiloidea; Mytilidae; Mytilinae;
OC   Mytilus.
OX   NCBI_TaxID=29158 {ECO:0000313|EMBL:VDI44935.1, ECO:0000313|Proteomes:UP000596742};
RN   [1] {ECO:0000313|EMBL:VDI44935.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Alioto T., Alioto T.;
RL   Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC   -!- COFACTOR:
CC       Name=heme; Xref=ChEBI:CHEBI:30413;
CC         Evidence={ECO:0000256|ARBA:ARBA00001971};
CC   -!- SIMILARITY: Belongs to the cytochrome P450 family.
CC       {ECO:0000256|ARBA:ARBA00010617}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:VDI44935.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; UYJE01006319; VDI44935.1; -; Genomic_DNA.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000596742; Unassembled WGS sequence.
DR   GO; GO:0005594; C:collagen type IX trimer; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0020037; F:heme binding; IEA:InterPro.
DR   GO; GO:0005506; F:iron ion binding; IEA:InterPro.
DR   GO; GO:0004497; F:monooxygenase activity; IEA:UniProtKB-KW.
DR   GO; GO:0016705; F:oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen; IEA:InterPro.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   FunFam; 1.10.630.10:FF:000182; Cytochrome P450 3A4; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.10.630.10; Cytochrome P450; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR001128; Cyt_P450.
DR   InterPro; IPR002401; Cyt_P450_E_grp-I.
DR   InterPro; IPR036396; Cyt_P450_sf.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF918; COLLAGEN ALPHA-1(IX) CHAIN; 1.
DR   Pfam; PF01391; Collagen; 7.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   Pfam; PF00067; p450; 1.
DR   PRINTS; PR00463; EP450I.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   SUPFAM; SSF48264; Cytochrome P450; 1.
PE   3: Inferred from homology;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:VDI44935.1};
KW   Heme {ECO:0000256|ARBA:ARBA00022617}; Iron {ECO:0000256|ARBA:ARBA00023004};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Monooxygenase {ECO:0000256|ARBA:ARBA00023033};
KW   Oxidoreductase {ECO:0000256|ARBA:ARBA00023002};
KW   Reference proteome {ECO:0000313|Proteomes:UP000596742};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..19
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           20..1712
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5032312324"
FT   DOMAIN          24..217
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          226..387
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          403..1095
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        257..286
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        292..308
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        317..329
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        405..432
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        481..491
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        492..509
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        510..519
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        520..534
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        578..603
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        687..734
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        815..826
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        882..891
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1076..1085
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1086..1095
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1712 AA;  177552 MW;  60F727C4F747266F CRC64;
     MWHFQVIFPL VCIFVQVYTQ SETEVDLLAA IGLPDPSKGV TYDVGLEGFP AFKLDKSSYI
     RKAAETYFTD RIGKINDFAI GITVKVFSKD GILFAVKNIY ETVIAFGVRI TATGNGKHNV
     ILYYKEDHQY GQMSVTIGNF TVDNIDRNSF TKLAIKVQGD TVTLFDNCEE VGRQQVLNRQ
     PLQFSQGSPL YIGQGGPNFE DTSFEGVIQE LKYYTNPAKA EELDCLSLDS GSGSGDKDDD
     EDRFKPVITL PPPTAKPGEK GEKGDVGEPG KDGKDGKDGE KGEKGDIGPV GPVGPPGPPP
     SLTPPPPEML EDMMGAKGER GETGPRGEQG ETGIDGFPGQ KGEKGDPGNN GPEGPAGIQG
     PKGDPGDRGL PGIGQVGPQG DPGLPASVNT QEIVDIILSQ VHMDQGQKGE KGETGDRGEN
     GSDGSDGREG IPGRDGLPGL DGVKGEKGDT VVGPEGPRGL VGPQGENGLP GSNGVAGPVG
     PMGPPGPPGL PGPGGLNIEG SGDGDGEPGL PGPPGPVGPM GPQGIQGVVG PMGPAGHKGD
     AGIIGTPGIP GLPGTPGTPG VFAGNISEMI GKPGKDGAPG LQGEPGLPGE YGPRGFDGPV
     GPAGDKGDKG DAGEPGPRGQ AGINGEPGAT GPIGPQGIQG LSGNDGSVGP QGLPGPPGPP
     GSGYDGPRRG DTGLVDIEGS GDCGCEPGTP GIQGPQGPPG KQGLQGPAGI PGIPGEIGRQ
     GPIGEPGIRG PAGPKGDSGK DGIHGTPGEE GIPGKPGLPG RQGDPGLNGQ NGPKGDTGEP
     GKSIQGPTGA TGSPGVQGPA GPPGPPGKTI IETDGGSGDI DGGLGVRGPA QKGDKGDDGA
     TGPAGPRGVD GVPGIPGEPG NPGLPGLQGL RGEDGLVGLR GSDGKDGKDG EAGPEGPSGP
     PGVNGEKGER GLPGPIGVLP DGVTGGVGVK GEKGDTGEPG PMGPEGPQGP VGPRGFRGPK
     GDIGMTGLPG YRGLRGRKGD RGRPGFNGLR GYKGDSGPVG PAGPPGPGFG GSGEILKGAK
     GDLGERGLPG IPGTPGIGAQ GPRGDQGEPG RNGLPGFQGP PGPAGKAGTG EVIRGPPGPP
     GPPGPSGVGT NGGGGGGGGA VTYRNYNELM SVARILPVGT MCYLVEEETV YLRVNKGFRV
     IELGETVPLP PITTPAAGVR PLPTEEASNI IVLPEGEMQS DQPRLYIFAL NAPLSGRMGG
     IRGADAKCYK QAKKAGKKGT YRAFLSTKTQ DLTSIVYYDI DKSVPIVNMK DQILFNSWKQ
     LLNGHGGYFN TQRHIYSFDG EDIMTSPRWP QKIYTQIFVQ SFGVRIVSLC NHPANRLTSP
     MIGQPNMTNR TRSIGYQGAN GLCADAVHLK YFITIGSPGP KGQQNQASSQ QFVDMFSTII
     NIILHKDCFD LLNIPTGFHG LDIELVKKYG SVVGTYVGHF PSILISDPDM IKEIWVKDFK
     HFSDRPELAY AGEMAYSAVT LAKGEHWKFL RHVLSPSFSP AKLKQMFSSI EDCSDQLIYQ
     MKKQSQRNKH VNMEQICQGF SMDVICGTAF GLKVDSQENP NNEFIKMAKE MTSLKFSSPL
     FLLQAFFPEL RHVFSKFEIR PFPSDAIDFF MDITKTAIQK RKDNKMTRPD FLQLMINNYK
     GERDADIKNG TIEMYKQRGI TETEILANSL IFFLDGYDTT ASGLIFSIYR MAVHQEFQER
     LFKEIDHEIG KVINTLSFCF CTDIVHCYNY VN
//
DBGET integrated database retrieval system