ID A0A8R2ALT8_BOMMO Unreviewed; 942 AA.
AC A0A8R2ALT8;
DT 12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT 12-OCT-2022, sequence version 1.
DT 28-JAN-2026, entry version 13.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:XP_004932979.3};
OS Bombyx mori (Silk moth).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea;
OC Bombycidae; Bombycinae; Bombyx.
OX NCBI_TaxID=7091 {ECO:0000313|EnsemblMetazoa:XP_004932979.3, ECO:0000313|Proteomes:UP000005204};
RN [1] {ECO:0000313|Proteomes:UP000005204}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=p50T {ECO:0000313|Proteomes:UP000005204};
RX PubMed=19121390; DOI=10.1016/j.ibmb.2008.11.004;
RG International Silkworm Genome Consortium;
RT "The genome of a lepidopteran model insect, the silkworm Bombyx mori.";
RL Insect Biochem. Mol. Biol. 38:1036-1045(2008).
RN [2] {ECO:0000313|EnsemblMetazoa:XP_004932979.3}
RP IDENTIFICATION.
RC STRAIN=p50T (Dazao) {ECO:0000313|EnsemblMetazoa:XP_004932979.3};
RG EnsemblMetazoa;
RL Submitted (JUN-2022) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_004932979.3; XM_004932922.5.
DR AlphaFoldDB; A0A8R2ALT8; -.
DR EnsemblMetazoa; XM_004932922.4; XP_004932979.3; LOC101737883.
DR GeneID; 101737883; -.
DR KEGG; bmor:101737883; -.
DR CTD; 104327; -.
DR Proteomes; UP000005204; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050938; Collagen_Structural_Proteins.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR37456:SF6; COLLAGEN ALPHA-1(XXIII) CHAIN-LIKE ISOFORM X2; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01391; Collagen; 4.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000005204};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..942
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5035891570"
FT DOMAIN 641..688
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 730..895
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 61..139
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 162..600
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 89..98
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 109..122
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 123..132
FT /note="Low complexity"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 201..216
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 234..245
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 341..352
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 382..394
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 405..414
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 574..586
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 942 AA; 96698 MW; B902282F7E844969 CRC64;
MNMSVRGLLI FVVVQGAFQE LKLYGSPSQA EVQCVNTFEE IDKSGEGDEI DIDNFLVDQE
EDSEGSGRYG TIPPFPPPPP GLDGYSLRLR GDKGERGPRG PPGESIRGPP GPPGPPGPPG
PPGTATESSG SGDDQIFGEN YASLGQCGCN ASTIMSLLQT APELRGSPGP PGMIGADGRA
GPPGMPGQPG TPGERGPVGP RGDKGDRGDA GTRGHEGQPG PKGEAGVDGR PGTAGPPGPP
GPPGTPDYNN FEESLMGAYG SAIGRPGAPG PKGDSGQPGP MGPQGERGFP GQKGERGQNG
AVGAKGDRGH PGPQGDRGVK GDRGNPGFDG RPGIPGANGR PADKGEKGER GEPGPPGSPP
AGVFNAADPE FMVAGSQVTG AKGDKGEKGE KGTRGNDGPP GFPGKDGKQG ERGDIGPSGL
PGMAGSPGPQ GFKGDRGERG PPGPISVASA GTDIITIKGD KGEPGARGRR GRSGPGGARG
ATGAPGPPGP AGRPGDKGDT GLPGWMGRPG TLGPPGAPGP VGPKGEKGDP GVNILDVSMF
KGEKGDRGFD GVPGVQGPPG PPGKPASEPV QYLPGPPGPP GPPGSPGTPG VSVVGPKGEP
GVSYYEEGPV HGSPKFYGRP GFSARSQPAD ESSAQKTVPG AAVFQTTEEM MKLASSSPVG
ALAYVAEEQA LFVKVNSGWQ YVLLGSLVTQ SKPPPTPAPA PLPMPAASLV HVPPISNLVE
NSPTPIGPSL RMAALNDPLS GNMHGVRRAD YACYRQARRA GLRGTFRAFL TSRIQNLDST
VRYADRHLPV LNTQGDVLFK SFSDIFDGSG GIVAGTPRIY SFNGKNIITD SHWPQKLIWH
GSHASGERAL DTFCEEWQSN DPTNKGMAAS LYSHKILSQE RYSCNNHFAV LCIEATSNVS
VRRKRDTHRF NTTEDSDEDY EMRPDEYEEL INDIIAQPLR YN
//