GenomeNet

Database: UniProt
Entry: A0A812EVB9_ACAPH
LinkDB: A0A812EVB9_ACAPH
Original site: A0A812EVB9_ACAPH 
ID   A0A812EVB9_ACAPH        Unreviewed;      1160 AA.
AC   A0A812EVB9;
DT   29-SEP-2021, integrated into UniProtKB/TrEMBL.
DT   29-SEP-2021, sequence version 1.
DT   28-JAN-2026, entry version 14.
DE   SubName: Full=COL18A {ECO:0000313|EMBL:CAE1328304.1};
GN   ORFNames=SPHA_77843 {ECO:0000313|EMBL:CAE1328304.1};
OS   Acanthosepion pharaonis (Pharaoh cuttlefish) (Sepia pharaonis).
OC   Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Cephalopoda;
OC   Coleoidea; Decapodiformes; Sepiida; Sepiina; Sepiidae; Acanthosepion.
OX   NCBI_TaxID=158019 {ECO:0000313|EMBL:CAE1328304.1, ECO:0000313|Proteomes:UP000597762};
RN   [1] {ECO:0000313|EMBL:CAE1328304.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=Farmed {ECO:0000313|EMBL:CAE1328304.1};
RA   Li R., Bekaert M.;
RL   Submitted (JAN-2021) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CAE1328304.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAHIKZ030005524; CAE1328304.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A812EVB9; -.
DR   SMR; A0A812EVB9; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000597762; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1089; COLLAGEN TYPE III, PUTATIVE-RELATED; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000597762}.
FT   DOMAIN          814..862
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          908..1074
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          136..563
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          575..812
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        138..148
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        177..187
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        190..205
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        230..242
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        254..284
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        285..295
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        339..349
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        403..412
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        452..461
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        508..518
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        581..596
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        654..663
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        668..677
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        678..687
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        696..719
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        749..761
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        785..794
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        802..812
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1160 AA;  121190 MW;  41D650CD650CF7FB CRC64;
     MVVKIEKPGI SYIFAVVDSR EMVIQLGVRV ASLSNNAHNV SLYYTPLGVG APYTIRSNVL
     ASFRLYNLLG HWAQINFRVK DDTVHVYVNG TRLGEHIVQR SPLSIEDGST IYIGQAGRLL
     RGTFVGALQE LKLHSNPDEA EYNDDAYVDD GSGSGSGTDD TDDLINKVSR PPYPRYFKGD
     KGEKGERGPP GLPPMISPPP PPPPSARKGE PGIPGAIGLP GVVGQKGQKG EAGGYSGHGA
     VGPKGEKGDI GPKGNPGTGL MGPPGQPGKP GIGLKGEPGV GIPGERGSPG IIGPRGLPGP
     PGPPGRGGYH STLIDSDSSG DFEYSGVRGV QGPPGPAGRP GYPGEPGRAG VDGKDGVNGW
     PGEPGPEGPK GESGRAGYPG EPGIMGQKGE PGRDGANGIP GAHGPPGPPG PPGLSSSVVS
     GDDLFEDLSI DASGERSYTR RGVVGEPGLP GHKGEKGDRG ENGYPGETGV QGEKGEIGYN
     GIDGRKGEKG EPGVGLRGPP GMPGPPGKSE DTEFKDMTSK IMGMKGDKGD SGRNGLDGIP
     GLPGQDGEPG RPGPPGKPGM MMEDTREGCR ECVGEKGDIG PVGPKGDVGR PGPRGVRGSK
     GDIGMPGFPG RPGVQGRKGQ PGYGLKGRKG EQGLPGPPGV SIYAGSNGHG GSMKGEKGDR
     GYTGREGPPG PPGPPGFPSG LPNPGSGDLD MTFKFFKGDK GDRGRDGIPG RINDIDLSRL
     RGPQGPQGRP GPPGPPGLSG NGLTADGLRA PKGERGPKGP KGEPGIGIRG PAGVPGHPGL
     SIPGRPGPPG PPGPAGSGSS STYNDRASSG SNGVVTFRNE EELQSVALSV PVGTLAFVMD
     TEMLLIRLKE SWAQIEYGNR IPLPTPKPEA LKLERHTRMT QKPPIPIQPA RHPGSFSLLR
     SHEKGSRLHL IALNEPMRGD MNGISGADGA CYRQAKYSQY KGTYRAFLSS KVQDMRTLIF
     FKKDRSIPIV NLMGEVLASS WNDLFDRFGG FFNSSVPIYS FDGSNVLNSI RWPQKILWHG
     TDPDGKRVKE NICDGWISDS QTKTGLGSSL VDNKLLGSEE YTCNNAFVVL CISISSGDIH
     NGVAREENPS ELMFMPKDQT LSKMINSNDF FFLSFRISLP LILKHFFFLN HFAIFPTKFK
     PSLQQEMIST PSLCPVLLLV
//
DBGET integrated database retrieval system