ID A0A1D2MSN4_ORCCI Unreviewed; 619 AA.
AC A0A1D2MSN4;
DT 30-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2016, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE SubName: Full=Collagen alpha-1(XV) chain {ECO:0000313|EMBL:ODM96026.1};
GN ORFNames=Ocin01_10652 {ECO:0000313|EMBL:ODM96026.1};
OS Orchesella cincta (Springtail) (Podura cincta).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC Entomobryomorpha; Entomobryoidea; Orchesellidae; Orchesellinae; Orchesella.
OX NCBI_TaxID=48709 {ECO:0000313|EMBL:ODM96026.1, ECO:0000313|Proteomes:UP000094527};
RN [1] {ECO:0000313|EMBL:ODM96026.1, ECO:0000313|Proteomes:UP000094527}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Mixed pool {ECO:0000313|EMBL:ODM96026.1};
RX PubMed=27289101; DOI=10.1093/gbe/evw134;
RA Faddeeva-Vakhrusheva A., Derks M.F., Anvar S.Y., Agamennone V., Suring W.,
RA Smit S., van Straalen N.M., Roelofs D.;
RT "Gene Family Evolution Reflects Adaptation to Soil Environmental Stressors
RT in the Genome of the Collembolan Orchesella cincta.";
RL Genome Biol. Evol. 8:2106-2117(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ODM96026.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LJIJ01000592; ODM96026.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1D2MSN4; -.
DR STRING; 48709.A0A1D2MSN4; -.
DR OMA; TGNMHGV; -.
DR OrthoDB; 5363002at2759; -.
DR Proteomes; UP000094527; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1105; MULTIPLEXIN, ISOFORM R; 1.
DR Pfam; PF01391; Collagen; 1.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119, ECO:0000313|EMBL:ODM96026.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000094527}.
FT DOMAIN 311..359
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 399..563
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 1..87
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 101..205
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 218..307
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 28..42
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 224..238
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 291..307
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 619 AA; 65116 MW; 64AFA7DF573527BC CRC64;
MYKRYYKNRV VNDGGERGSI GPKGERGQIG ARGEKGERGH SGPKGDAGAS GIDGLPGAPG
KDGKKGDAGP AGPPGSPGIA TGDGGSISID EIMNSIETIK GERGDLGPVG PQGERGLPGL
PGMKGDSGER GPPGPASAVQ MDGSYAPAKG EKGDRGRRGK RGKPGLPGPP GVVGELGVPG
WPEIDGGRAG PPGLAIQGPK GDKGEPAVLP ENFFNYEVSG LRGLPGPPGP PGPPGPPGKG
NDNEIGGSGP LYVPVPGPPG PKGEPGLPGL SVVGEQGDVG PVGPPGPPGQ SGSGDGSTSS
SSSSNNVVPG AVVFLSRDAM IKMSEFSPIG TISFVKDEES LFTRVNEGWK QILLGDLIKA
PNMAQVEPKT ELPVPRPPPE TSNLVNKVEG PSRGMNQIRL AALNDPSNGD MRGVRGADYS
CYRQARNSNL QGTFRAFLSS WVQNLDSIVK YSDRNLPVVN TKGEILFSSW MDIFHMGGKF
VTKPPQIHSF NGRNVFTEFH WPQKLIWHGA DTTGSRVANS YCDAWHTDAV TNVGLASDLL
KQELMGQEKV SCNHKLIVLC IEIASQHHYR KKRDVLPRDI PFSNDDEVQI LTSKYSNRLH
NLTFDEYTQL IDDYDQGSK
//