ID A0A4Y2DSU5_ARAVE Unreviewed; 819 AA.
AC A0A4Y2DSU5;
DT 18-SEP-2019, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2019, sequence version 1.
DT 28-JAN-2026, entry version 17.
DE SubName: Full=Collagen alpha-1(XVIII) chain {ECO:0000313|EMBL:GBM18784.1};
GN Name=COL15A1 {ECO:0000313|EMBL:GBM18784.1};
GN ORFNames=AVEN_30191_2 {ECO:0000313|EMBL:GBM18784.1};
OS Araneus ventricosus (Orbweaver spider) (Epeira ventricosa).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Araneae;
OC Araneomorphae; Entelegynae; Araneoidea; Araneidae; Araneus.
OX NCBI_TaxID=182803 {ECO:0000313|EMBL:GBM18784.1, ECO:0000313|Proteomes:UP000499080};
RN [1] {ECO:0000313|EMBL:GBM18784.1, ECO:0000313|Proteomes:UP000499080}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=31182776;
RA Kono N., Nakamura H., Ohtoshi R., Moran D.A.P., Shinohara A., Yoshida Y.,
RA Fujiwara M., Mori M., Tomita M., Arakawa K.;
RT "Orb-weaving spider Araneus ventricosus genome elucidates the spidroin gene
RT catalogue.";
RL Sci. Rep. 9:8380-8380(2019).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GBM18784.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BGPR01000411; GBM18784.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A4Y2DSU5; -.
DR Proteomes; UP000499080; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1109; COLLAGEN ALPHA-4(IV) CHAIN-LIKE; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:GBM18784.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000499080}.
FT DOMAIN 539..587
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 618..783
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 1..71
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 87..190
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 215..361
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 436..473
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 501..534
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 22..35
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 46..59
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 120..147
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 149..159
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 181..190
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 216..225
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 233..243
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 281..291
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 299..310
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 335..349
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 436..447
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 453..465
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 507..519
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 819 AA; 83928 MW; 8BF4C564EAFA6420 CRC64;
MPGPPGEIGL PGLPGPKGEK GATGTTGLPG LMGLKGEPGL DGIPGAPGPR GLPGPPGPPA
SATSHRGSIF DLDDTFGGGV YGAPGDGPFT SYLGRPGPQG IPGSPGVPGP RGERGFPGQK
GDRGDLGPRG YKGDKGSPGE RGSKGEHGIP GARGIAGIPG HDGIKGAKGE SGSPGVGLPG
PRGPPGPPGP VGYYGVSKEF SPLQLENEDS TVIISKGDKG DRGEDGFPGT HGLKGEKGDK
GDTGMDGMPG PSGPKGDTGD QGPMGPPGPV SHIAPDGTLI IEEKGEKGDR GRRGKRGYPG
PPGPPGPPGE PGIVPNLPGF PGRPGTPGLP GQKGEPGEAT KGEKGDRGEP GPPAVGGYVN
PDGLEIITEI KGEKGEKGDL GPIGPTGSPG VPGVPGEMGP LGLPGMKGLK GDPGEPGPPG
PVMYIDETDD KYVYVPGPPG PPGPVGPPGK SMPGPPGPPG PPGPAGPLDA LWTNFTNSNG
KKGNFGMKGL KAFMKSEGFL AKGIIGPPGP PGPPGPPGPT GSVSGDDSKR SQPAVVPGAV
TLKNVDSLLR VSEISPLGTL GFVLDEETLL VRVSGGWQYV ALGSLVPLPS ATTTTTSTTP
APLNPPVGGM KADTAPRLRM AALNQPYTGD MHGVRGADYE CYRQSRRANL RGTFRAFLAS
RVQNLDSIVR HKDSDLPIVN IKGEVLFNSW KDLFAGTAAP FSYPPRIYSF DGRNVLTDNA
WPHKMVWHGS DRLGVREMEA YCDAWHSEGT TKVGVASSLL RHRLLDQEKH PCDRSFIVLC
IEATSQDDFK KRRRRGLERD EELLSAHEYA KVLQRMVRS
//