GenomeNet

Database: UniProt
Entry: A0A8X6MNJ1_NEPPI
ID   A0A8X6MNJ1_NEPPI        Unreviewed;       706 AA.
AC   A0A8X6MNJ1;
DT   14-DEC-2022, integrated into UniProtKB/TrEMBL.
DT   14-DEC-2022, sequence version 1.
DT   28-JAN-2026, entry version 10.
DE   SubName: Full=Collagen alpha-1(XV) chain {ECO:0000313|EMBL:GFS69691.1};
GN   Name=COL15A1 {ECO:0000313|EMBL:GFS69691.1};
GN   ORFNames=NPIL_186442 {ECO:0000313|EMBL:GFS69691.1};
OS   Nephila pilipes (Giant wood spider) (Nephila maculata).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Chelicerata; Arachnida; Araneae;
OC   Araneomorphae; Entelegynae; Araneoidea; Nephilidae; Nephila.
OX   NCBI_TaxID=299642 {ECO:0000313|EMBL:GFS69691.1, ECO:0000313|Proteomes:UP000887013};
RN   [1] {ECO:0000313|EMBL:GFS69691.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Kono N., Nakamura H., Mori M., Yoshida Y., Ohtoshi R., Malay A.D.,
RA   Moran D.A.P., Tomita M., Numata K., Arakawa K.;
RT   "Multicomponent nature underlies the extraordinary mechanical properties of
RT   spider dragline silk.";
RL   Submitted (AUG-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:GFS69691.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; BMAW01000572; GFS69691.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A8X6MNJ1; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000887013; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 2.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:GFS69691.1};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000887013};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        32..51
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          449..497
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          505..670
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          56..103
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          121..397
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          410..446
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        124..133
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        189..199
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        207..218
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        243..257
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        266..275
FT                   /note="Low complexity"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        276..285
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        344..353
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        361..375
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        417..429
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   706 AA;  74061 MW;  9A8A7A0EC9C45C83 CRC64;
     MQIRGDACMG SCAIALKPES LRCRIGCQLD DGLMILLVVV YCLQWSFYIP LGRQDPKYTG
     SPRFPGPRGE RVPGQKGEPG AQGIGIAGPR GPPGPPGPIG YYGVSKEFSP LQLENEGATV
     IVSKGDKGDR GEDGAYGNPG VKGEKGNKGD SGMDGTSGAP GPKGDVGDQG PPGAPGPVSH
     LAPDGTLIIE EKGEKGDRGR RGKRGYPGPP GPPGPPGEPG IISELPGFPG RPGAPGHAGQ
     KGEPGEAVKG EKGDRGEPGP PGNGGYMNPE GLEIITEIKG EKGGKGDIGP VGPTGSPGIT
     GVPGETGPMG LPGMKGLKGD VGEPGPPGPV VYIDETDEKY IYVPGPPGPP GPEGPLGKSS
     PGPPGPPGPP GPPSPLFSGP QWANFTNGNG RKGNFGMKGL KQLMKSEGFL MKGLTGPPGP
     PGPPGPPGSP GSFTGEDSKR NQPTVVPGAV TLKNVDSLLR VSEISPLGTL GFVLDEETLL
     VRVSGGWQYV ALGSLVPLPS ATTTLRMAAL NQPYTGDMHG VRGADYECYR QSRRANLRGT
     FRAFLASRVQ NLDSIVRHKD SDLPIVNIKG EVLFNSWKDL FAGTAAPFSY PPRIYSFDGR
     NVLTDNAWPH KLVWHGSDRL GVREMEAYCD AWHSEGTTKV GVASSLLRHR LLDQEKHPCD
     RSFIVLCIEA TSQDDFKKRR KRGIKIEEIP FNAQEYSKVL QRIVRS
//
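
The feature table (FT lines) and the SQ header in the record above follow a fixed line layout, so they can be read mechanically. The following is a minimal sketch, not a full flat-file parser: the names `parse_features` and `declared_length` are hypothetical helpers introduced for illustration, the embedded `RECORD` text is a short excerpt copied from the entry above, and the code assumes plain `start..end` locations only (uncertain or open-ended positions such as `<1` or `?` are not handled).

```python
import re

# Excerpt copied from the entry above (ID line, two features, SQ header).
RECORD = """\
ID   A0A8X6MNJ1_NEPPI        Unreviewed;       706 AA.
FT   TRANSMEM        32..51
FT                   /note="Helical"
FT   DOMAIN          505..670
FT                   /note="Collagenase NC10/endostatin"
SQ   SEQUENCE   706 AA;  74061 MW;  9A8A7A0EC9C45C83 CRC64;
"""

# A feature header line: "FT", three spaces, the feature key, then "start..end".
FT_KEY = re.compile(r"^FT   (\S+)\s+(\d+)\.\.(\d+)\s*$")
# A qualifier continuation line carrying the /note text.
FT_NOTE = re.compile(r'^FT\s+/note="(.*)"\s*$')


def parse_features(text):
    """Return features as dicts with key, start, end (1-based, inclusive), note."""
    features = []
    for line in text.splitlines():
        m = FT_KEY.match(line)
        if m:
            features.append({
                "key": m.group(1),
                "start": int(m.group(2)),
                "end": int(m.group(3)),
                "note": None,
            })
            continue
        m = FT_NOTE.match(line)
        if m and features:
            # Attach the note to the most recently opened feature.
            features[-1]["note"] = m.group(1)
    return features


def declared_length(text):
    """Return the residue count stated on the SQ header line, or None."""
    for line in text.splitlines():
        if line.startswith("SQ   "):
            return int(line.split()[2])
    return None


if __name__ == "__main__":
    total = declared_length(RECORD)
    for f in parse_features(RECORD):
        span = f["end"] - f["start"] + 1
        print(f["key"], f["start"], f["end"], f["note"], f"{span} of {total} residues")
```

Coordinates are kept 1-based and inclusive, as in the record itself, so a feature's length is `end - start + 1`; converting to 0-based slices is left to the caller.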