GenomeNet

Database: UniProt
Entry: A0A1D2MSN4_ORCCI
ID   A0A1D2MSN4_ORCCI        Unreviewed;       619 AA.
AC   A0A1D2MSN4;
DT   30-NOV-2016, integrated into UniProtKB/TrEMBL.
DT   30-NOV-2016, sequence version 1.
DT   27-MAR-2024, entry version 30.
DE   SubName: Full=Collagen alpha-1(XV) chain {ECO:0000313|EMBL:ODM96026.1};
GN   ORFNames=Ocin01_10652 {ECO:0000313|EMBL:ODM96026.1};
OS   Orchesella cincta (Springtail) (Podura cincta).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC   Entomobryomorpha; Entomobryoidea; Orchesellidae; Orchesellinae; Orchesella.
OX   NCBI_TaxID=48709 {ECO:0000313|EMBL:ODM96026.1, ECO:0000313|Proteomes:UP000094527};
RN   [1] {ECO:0000313|EMBL:ODM96026.1, ECO:0000313|Proteomes:UP000094527}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   TISSUE=Mixed pool {ECO:0000313|EMBL:ODM96026.1};
RX   PubMed=27289101; DOI=10.1093/gbe/evw134;
RA   Faddeeva-Vakhrusheva A., Derks M.F., Anvar S.Y., Agamennone V., Suring W.,
RA   Smit S., van Straalen N.M., Roelofs D.;
RT   "Gene Family Evolution Reflects Adaptation to Soil Environmental Stressors
RT   in the Genome of the Collembolan Orchesella cincta.";
RL   Genome Biol. Evol. 8:2106-2117(2016).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ODM96026.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LJIJ01000592; ODM96026.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A1D2MSN4; -.
DR   STRING; 48709.A0A1D2MSN4; -.
DR   OMA; TGNMHGV; -.
DR   OrthoDB; 5363002at2759; -.
DR   Proteomes; UP000094527; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1105; MULTIPLEXIN, ISOFORM R; 1.
DR   Pfam; PF01391; Collagen; 1.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:ODM96026.1};
KW   Reference proteome {ECO:0000313|Proteomes:UP000094527}.
FT   DOMAIN          311..359
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          399..563
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          1..87
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          101..205
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          218..307
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        28..42
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        224..238
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        291..307
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   619 AA;  65116 MW;  64AFA7DF573527BC CRC64;
     MYKRYYKNRV VNDGGERGSI GPKGERGQIG ARGEKGERGH SGPKGDAGAS GIDGLPGAPG
     KDGKKGDAGP AGPPGSPGIA TGDGGSISID EIMNSIETIK GERGDLGPVG PQGERGLPGL
     PGMKGDSGER GPPGPASAVQ MDGSYAPAKG EKGDRGRRGK RGKPGLPGPP GVVGELGVPG
     WPEIDGGRAG PPGLAIQGPK GDKGEPAVLP ENFFNYEVSG LRGLPGPPGP PGPPGPPGKG
     NDNEIGGSGP LYVPVPGPPG PKGEPGLPGL SVVGEQGDVG PVGPPGPPGQ SGSGDGSTSS
     SSSSNNVVPG AVVFLSRDAM IKMSEFSPIG TISFVKDEES LFTRVNEGWK QILLGDLIKA
     PNMAQVEPKT ELPVPRPPPE TSNLVNKVEG PSRGMNQIRL AALNDPSNGD MRGVRGADYS
     CYRQARNSNL QGTFRAFLSS WVQNLDSIVK YSDRNLPVVN TKGEILFSSW MDIFHMGGKF
     VTKPPQIHSF NGRNVFTEFH WPQKLIWHGA DTTGSRVANS YCDAWHTDAV TNVGLASDLL
     KQELMGQEKV SCNHKLIVLC IEIASQHHYR KKRDVLPRDI PFSNDDEVQI LTSKYSNRLH
     NLTFDEYTQL IDDYDQGSK
//
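The SQ line declares a length of 619 AA, and the FT lines give residue ranges that should fall inside that length. A minimal sketch of how one might check both against the record text follows. It assumes the standard UniProtKB flat-file layout seen above (two-letter line codes, sequence lines between `SQ` and `//`). The function names `parse_sequence` and `feature_ranges` are hypothetical helpers written for this illustration, not part of any UniProt or GenomeNet tool.

```python
import re


def parse_sequence(record):
    """Return (declared_length, residues) from a UniProt flat-file record."""
    declared = None
    residues = []
    in_sequence = False
    for line in record.splitlines():
        if line.startswith("SQ "):
            # The SQ header carries the declared length, e.g. "SEQUENCE   619 AA;"
            match = re.search(r"SEQUENCE\s+(\d+)\s+AA", line)
            if match is None:
                raise ValueError("SQ line lacks a declared length")
            declared = int(match.group(1))
            in_sequence = True
        elif line.startswith("//"):
            # "//" terminates the record
            in_sequence = False
        elif in_sequence:
            # Sequence lines are blocks of 10 residues separated by spaces
            residues.append("".join(line.split()))
    if declared is None:
        raise ValueError("no SQ line found")
    return declared, "".join(residues)


def feature_ranges(record):
    """Yield (key, start, end) for FT lines with a simple 'start..end' location."""
    for line in record.splitlines():
        match = re.match(r"FT\s{3}(\S+)\s+(\d+)\.\.(\d+)\s*$", line)
        if match:
            yield match.group(1), int(match.group(2)), int(match.group(3))
```

Given the entry text as a string, comparing `len(residues)` with the declared length and checking that every feature satisfies `1 <= start <= end <= declared` is a quick sanity test for a copied or truncated record. The sketch only handles plain `start..end` locations; fuzzy or single-position locations would need additional handling.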