GenomeNet

Database: UniProt
Entry: A0A8R2HPJ5_BOMMO
ID   A0A8R2HPJ5_BOMMO        Unreviewed;      1174 AA.
AC   A0A8R2HPJ5;
DT   12-OCT-2022, integrated into UniProtKB/TrEMBL.
DT   12-OCT-2022, sequence version 1.
DT   28-JAN-2026, entry version 12.
DE   RecName: Full=Thrombospondin-like N-terminal domain-containing protein {ECO:0000259|SMART:SM00210};
OS   Bombyx mori (Silk moth).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Bombycoidea;
OC   Bombycidae; Bombycinae; Bombyx.
OX   NCBI_TaxID=7091 {ECO:0000313|EnsemblMetazoa:XP_021208109.2, ECO:0000313|Proteomes:UP000005204};
RN   [1] {ECO:0000313|Proteomes:UP000005204}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=p50T {ECO:0000313|Proteomes:UP000005204};
RX   PubMed=19121390; DOI=10.1016/j.ibmb.2008.11.004;
RG   International Silkworm Genome Consortium;
RT   "The genome of a lepidopteran model insect, the silkworm Bombyx mori.";
RL   Insect Biochem. Mol. Biol. 38:1036-1045(2008).
RN   [2] {ECO:0000313|EnsemblMetazoa:XP_021208109.2}
RP   IDENTIFICATION.
RC   STRAIN=p50T (Dazao) {ECO:0000313|EnsemblMetazoa:XP_021208109.2};
RG   EnsemblMetazoa;
RL   Submitted (JUN-2022) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A8R2HPJ5; -.
DR   SMR; A0A8R2HPJ5; -.
DR   EnsemblMetazoa; XM_021352434.2; XP_021208109.2; LOC101737883.
DR   Proteomes; UP000005204; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050938; Collagen_Structural_Proteins.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR048287; TSPN-like_N.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR37456:SF6; COLLAGEN ALPHA-1(XXIII) CHAIN-LIKE ISOFORM X2; 1.
DR   PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SMART; SM00210; TSPN; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000005204};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..1174
FT                   /note="Thrombospondin-like N-terminal domain-containing
FT                   protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5035901257"
FT   DOMAIN          41..232
FT                   /note="Thrombospondin-like N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00210"
FT   REGION          245..325
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          355..446
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          464..808
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          843..870
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        277..286
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        297..310
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        394..409
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        427..438
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        544..555
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        585..597
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        608..617
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        780..792
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        843..852
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        856..868
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1174 AA;  122653 MW;  207E3D6D464D0D75 CRC64;
     MESFRYISLP LLLLVTACLA TDGLFGSSMY QNGEPFIDIP EYDLLHAIGV PFSNPKTQYF
     DEGLDGFPAY GLMPGSDIKS PYRLFMPEKL YSEFSITATV RPANKDGGFL FSVVNPLETV
     VQLGVQLIPS GPGLMNISLL YTDANIYAFS QTIASFVVPT FAKKWSRFAL KVSLENVTLY
     LNCHEFDTVV VRRNPLELVF DSASTLYVGQ AGPLIRGAFH GAFQELKLYG SPSQAEVQCV
     NTFEVDQEED SEGSGRYGTI PPFPPPPPGL DGYSLRLRGD KGERGPRGPP GESIRGPPGP
     PGPPGPPGPP GTATESSGSG DDRSSVKQIF GENYASLGQC GCNASTIMSL LQTAPELRGS
     PGPPGMIGAD GRAGPPGMPG QPGTPGERGP VGPRGDKGDR GDAGTRGHEG QPGPKGEAGV
     DGRPGTAGPP GPPGPPGTPD YNNFESNWKP RQIYKESLMG AYGSAIGRPG APGPKGDSGQ
     PGPMGPQGER GFPGQKGERG QNGAVGAKGD RGHPGPQGDR GVKGDRGNPG FDGRPGIPGA
     NGRPADKGEK GERGEPGPPG SPPAGVFNAA DPEFMVAGSQ VTGAKGDKGE KGEKGTRGND
     GPPGFPGKDG KQGERGDIGP SGLPGMAGSP GPQGFKGDRG ERGPPGPISV ASAGTDIITI
     KGDKGEPGAR GRRGRSGPGG ARGATGAPGP PGPAGRPGDK GDTGLPGWMN SKGRPGTLGP
     PGAPGPVGPK GEKGDPGVNI LDVSMFKGEK GDRGFDGVPG VQGPPGPPGK PASEPVQYLP
     GPPGPPGPPG SPGTPGVSVV GPKGEPGVSY YEEGPVHGSP KFYGRPVLKS PLDELKALKE
     LKDLKDKERD RGGFSARSQP ADESSAQKTV PGAAVFQTTE EMMKLASSSP VGALAYVAEE
     QALFVKVNSG WQYVLLGSLV TQSKPPPTPA PAPLPMPAAS LVHVPPISNL VENSPTPIGP
     SLRMAALNDP LSGNMHGVRR ADYACYRQAR RAGLRGTFRA FLTSRIQNLD STVRYADRHL
     PVLNTQGDVL FKSFSDIFDG SGGIVAGTPR IYSFNGKNII TDSHWPQKLI WHGSHASGER
     ALDTFCEEWQ SNDPTNKGMA ASLYSHKILS QERYSCNNHF AVLCIEATSN VSVRRKRDTH
     RFNTTEDSDE DYEMRPDEYE ELINDIIAQP LRYN
//