ID A0A8X6MNJ1_NEPPI Unreviewed; 706 AA.
AC A0A8X6MNJ1;
DT 14-DEC-2022, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2022, sequence version 1.
DT 28-JAN-2026, entry version 10.
DE SubName: Full=Collagen alpha-1(XV) chain {ECO:0000313|EMBL:GFS69691.1};
GN Name=COL15A1 {ECO:0000313|EMBL:GFS69691.1};
GN ORFNames=NPIL_186442 {ECO:0000313|EMBL:GFS69691.1};
OS Nephila pilipes (Giant wood spider) (Nephila maculata).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Araneae;
OC Araneomorphae; Entelegynae; Araneoidea; Nephilidae; Nephila.
OX NCBI_TaxID=299642 {ECO:0000313|EMBL:GFS69691.1, ECO:0000313|Proteomes:UP000887013};
RN [1] {ECO:0000313|EMBL:GFS69691.1}
RP NUCLEOTIDE SEQUENCE.
RA Kono N., Nakamura H., Mori M., Yoshida Y., Ohtoshi R., Malay A.D.,
RA Moran D.A.P., Tomita M., Numata K., Arakawa K.;
RT "Multicomponent nature underlies the extraordinary mechanical properties of
RT spider dragline silk.";
RL Submitted (AUG-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GFS69691.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BMAW01000572; GFS69691.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A8X6MNJ1; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000887013; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 2.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:GFS69691.1};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000887013};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 32..51
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 449..497
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 505..670
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 56..103
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 121..397
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 410..446
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..133
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 189..199
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 207..218
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 243..257
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 266..275
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 276..285
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 344..353
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 361..375
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 417..429
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 706 AA; 74061 MW; 9A8A7A0EC9C45C83 CRC64;
MQIRGDACMG SCAIALKPES LRCRIGCQLD DGLMILLVVV YCLQWSFYIP LGRQDPKYTG
SPRFPGPRGE RVPGQKGEPG AQGIGIAGPR GPPGPPGPIG YYGVSKEFSP LQLENEGATV
IVSKGDKGDR GEDGAYGNPG VKGEKGNKGD SGMDGTSGAP GPKGDVGDQG PPGAPGPVSH
LAPDGTLIIE EKGEKGDRGR RGKRGYPGPP GPPGPPGEPG IISELPGFPG RPGAPGHAGQ
KGEPGEAVKG EKGDRGEPGP PGNGGYMNPE GLEIITEIKG EKGGKGDIGP VGPTGSPGIT
GVPGETGPMG LPGMKGLKGD VGEPGPPGPV VYIDETDEKY IYVPGPPGPP GPEGPLGKSS
PGPPGPPGPP GPPSPLFSGP QWANFTNGNG RKGNFGMKGL KQLMKSEGFL MKGLTGPPGP
PGPPGPPGSP GSFTGEDSKR NQPTVVPGAV TLKNVDSLLR VSEISPLGTL GFVLDEETLL
VRVSGGWQYV ALGSLVPLPS ATTTLRMAAL NQPYTGDMHG VRGADYECYR QSRRANLRGT
FRAFLASRVQ NLDSIVRHKD SDLPIVNIKG EVLFNSWKDL FAGTAAPFSY PPRIYSFDGR
NVLTDNAWPH KLVWHGSDRL GVREMEAYCD AWHSEGTTKV GVASSLLRHR LLDQEKHPCD
RSFIVLCIEA TSQDDFKKRR KRGIKIEEIP FNAQEYSKVL QRIVRS
//