ID I3JKV6_ORENI Unreviewed; 925 AA.
AC I3JKV6;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 24-JAN-2024, entry version 74.
DE SubName: Full=Thrombospondin-4-B {ECO:0000313|Ensembl:ENSONIP00000009500.2};
GN Name=LOC100696639 {ECO:0000313|Ensembl:ENSONIP00000009500.2};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000009500.2, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000009500.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum
CC {ECO:0000256|ARBA:ARBA00004240}. Sarcoplasmic reticulum
CC {ECO:0000256|ARBA:ARBA00004369}. Secreted, extracellular space,
CC extracellular matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- SIMILARITY: Belongs to the thrombospondin family.
CC {ECO:0000256|ARBA:ARBA00009456}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; I3JKV6; -.
DR STRING; 8128.ENSONIP00000077190; -.
DR Ensembl; ENSONIT00000009506.2; ENSONIP00000009500.2; ENSONIG00000007533.2.
DR eggNOG; ENOG502QRK8; Eukaryota.
DR GeneTree; ENSGT00940000155227; -.
DR HOGENOM; CLU_009257_1_1_1; -.
DR TreeFam; TF324917; -.
DR Proteomes; UP000005207; Linkage group LG7.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 2.
DR Gene3D; 1.20.5.10; -; 1.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.10.25.10; Laminin; 4.
DR Gene3D; 4.10.1080.10; TSP type-3 repeat; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR003367; Thrombospondin_3-like_rpt.
DR InterPro; IPR017897; Thrombospondin_3_rpt.
DR InterPro; IPR008859; Thrombospondin_C.
DR InterPro; IPR024665; TSP/COMP_coiled-coil.
DR InterPro; IPR046970; TSP/COMP_coiled-coil_sf.
DR InterPro; IPR028974; TSP_type-3_rpt.
DR InterPro; IPR048287; TSPN-like_N.
DR PANTHER; PTHR10199; THROMBOSPONDIN; 1.
DR PANTHER; PTHR10199:SF92; THROMBOSPONDIN-4; 1.
DR Pfam; PF11598; COMP; 1.
DR Pfam; PF07645; EGF_CA; 2.
DR Pfam; PF02412; TSP_3; 6.
DR Pfam; PF05735; TSP_C; 1.
DR SMART; SM00181; EGF; 4.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00210; TSPN; 1.
DR SUPFAM; SSF58006; Assembly domain of cartilage oligomeric matrix protein; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF103647; TSP type-3 repeat; 3.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01187; EGF_CA; 1.
DR PROSITE; PS51234; TSP3; 3.
DR PROSITE; PS51236; TSP_CTER; 1.
PE 3: Inferred from homology;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00634}; Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Mitogen {ECO:0000256|ARBA:ARBA00023246};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207};
KW Sarcoplasmic reticulum {ECO:0000256|ARBA:ARBA00022951};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Unfolded protein response {ECO:0000256|ARBA:ARBA00023230}.
FT DOMAIN 313..350
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 410..452
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 486..521
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT REPEAT 545..580
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT REPEAT 682..717
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT DOMAIN 721..910
FT /note="TSP C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51236"
FT REGION 483..555
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 580..671
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 492..506
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 515..555
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 597..618
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 648..662
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 925 AA; 101528 MW; 9143F09CDF7D0C15 CRC64;
FSAPLRVVNI HVSSSITHVL SRFPAFLSVS TVYDLLTSPD CLPDLLQGGL TGQGVNEAFI
LTSFKLQPKT GTTVFGLYNP RDNSKYFEFT VMGKLNRAVL RYLRSDKRMA SVTFNNLVLA
DGQQHRLLFH LRGLQQQQQQ QQGPGGMELH LDCRLVETVR DLPAAFQGLP AGYGTVELKT
MQAREQESLD ELKLVVGDSF ENVASLQDCH FQQRDSVQTL GVNTKQLSNQ MLELTKVINE
LKDVLIQQVK ETSFLRNTIS ECQACGLGGT EVVKQRCAPG VCFRDDMCTE TAEGVECGPC
PDGYTGDGFN CDDVDECQFN PCFPGVRCVN TAPGFRCDAC PLGYTGLPVE GVGIVYAQTN
KQVCDDIDEC KGPDNGGCTA NSICHNSVGS YHCGSCKSGF TGDQVKGCKP EISCGNSLTN
PCDVNAQCIV ERDGSISCQC GIGWAGNGYL CGKDMDIDGY PDEKLKCKDP NCRKDNCVFV
PNSGQEDADR DGYGDACDED ADGDGIPNEQ DNCRLKPNVD QRNSDKDSHG DACDNCRLVE
NPDQRDTDGD GKGDACDDDM DGDGLKNFLD NCQRVQNRDQ LDRDGDGVGD ACDSCPDIPN
PNQSDVDNDL VGDSCDTNQD SDGDGHQDTK DNCPLVINSS QLDTDKDGLG DECDDDDDND
GIPDVLPPGP DNCRLVPNPD QIDDNNDGVG DICESDFDQD KVIDRIDNCP ENAEITLTDF
RAYQTVVLDP EGDAQIDPNW VVLNQSDIFI YTCYTAFSGV DFEGTFHVNT VTDDDYAGFI
FGYQDSSSFY VVMWKQTEQT YWQATPFRAV AEPGIQLKAV KSRSGPGEHL RNSLWHTGDT
TDQVRLLWKD PRNVEGQHCV ARFYEGTELV ADSGVTIDTT MRGGRLGVFC FSQENIIWSN
LKYRCNDTIP EDFQEFSTQH GVDSL
//