GenomeNet

Database: UniProt
Entry: A0A4Y2DSU5_ARAVE
LinkDB: A0A4Y2DSU5_ARAVE
Original site: A0A4Y2DSU5_ARAVE 
ID   A0A4Y2DSU5_ARAVE        Unreviewed;       819 AA.
AC   A0A4Y2DSU5;
DT   18-SEP-2019, integrated into UniProtKB/TrEMBL.
DT   18-SEP-2019, sequence version 1.
DT   28-JAN-2026, entry version 17.
DE   SubName: Full=Collagen alpha-1(XVIII) chain {ECO:0000313|EMBL:GBM18784.1};
GN   Name=COL15A1 {ECO:0000313|EMBL:GBM18784.1};
GN   ORFNames=AVEN_30191_2 {ECO:0000313|EMBL:GBM18784.1};
OS   Araneus ventricosus (Orbweaver spider) (Epeira ventricosa).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Araneae;
OC   Araneomorphae; Entelegynae; Araneoidea; Araneidae; Araneus.
OX   NCBI_TaxID=182803 {ECO:0000313|EMBL:GBM18784.1, ECO:0000313|Proteomes:UP000499080};
RN   [1] {ECO:0000313|EMBL:GBM18784.1, ECO:0000313|Proteomes:UP000499080}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=31182776;
RA   Kono N., Nakamura H., Ohtoshi R., Moran D.A.P., Shinohara A., Yoshida Y.,
RA   Fujiwara M., Mori M., Tomita M., Arakawa K.;
RT   "Orb-weaving spider Araneus ventricosus genome elucidates the spidroin gene
RT   catalogue.";
RL   Sci. Rep. 9:8380-8380(2019).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GBM18784.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BGPR01000411; GBM18784.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A4Y2DSU5; -.
DR   Proteomes; UP000499080; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1109; COLLAGEN ALPHA-4(IV) CHAIN-LIKE; 1.
DR   Pfam; PF01391; Collagen; 4.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:GBM18784.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000499080}.
FT   DOMAIN          539..587
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          618..783
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..71
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          87..190
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          215..361
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          436..473
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          501..534
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        22..35
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        46..59
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        120..147
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        149..159
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        181..190
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        216..225
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        233..243
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        281..291
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        299..310
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        335..349
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        436..447
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        453..465
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        507..519
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   819 AA;  83928 MW;  8BF4C564EAFA6420 CRC64;
     MPGPPGEIGL PGLPGPKGEK GATGTTGLPG LMGLKGEPGL DGIPGAPGPR GLPGPPGPPA
     SATSHRGSIF DLDDTFGGGV YGAPGDGPFT SYLGRPGPQG IPGSPGVPGP RGERGFPGQK
     GDRGDLGPRG YKGDKGSPGE RGSKGEHGIP GARGIAGIPG HDGIKGAKGE SGSPGVGLPG
     PRGPPGPPGP VGYYGVSKEF SPLQLENEDS TVIISKGDKG DRGEDGFPGT HGLKGEKGDK
     GDTGMDGMPG PSGPKGDTGD QGPMGPPGPV SHIAPDGTLI IEEKGEKGDR GRRGKRGYPG
     PPGPPGPPGE PGIVPNLPGF PGRPGTPGLP GQKGEPGEAT KGEKGDRGEP GPPAVGGYVN
     PDGLEIITEI KGEKGEKGDL GPIGPTGSPG VPGVPGEMGP LGLPGMKGLK GDPGEPGPPG
     PVMYIDETDD KYVYVPGPPG PPGPVGPPGK SMPGPPGPPG PPGPAGPLDA LWTNFTNSNG
     KKGNFGMKGL KAFMKSEGFL AKGIIGPPGP PGPPGPPGPT GSVSGDDSKR SQPAVVPGAV
     TLKNVDSLLR VSEISPLGTL GFVLDEETLL VRVSGGWQYV ALGSLVPLPS ATTTTTSTTP
     APLNPPVGGM KADTAPRLRM AALNQPYTGD MHGVRGADYE CYRQSRRANL RGTFRAFLAS
     RVQNLDSIVR HKDSDLPIVN IKGEVLFNSW KDLFAGTAAP FSYPPRIYSF DGRNVLTDNA
     WPHKMVWHGS DRLGVREMEA YCDAWHSEGT TKVGVASSLL RHRLLDQEKH PCDRSFIVLC
     IEATSQDDFK KRRRRGLERD EELLSAHEYA KVLQRMVRS
//
DBGET integrated database retrieval system