ID L8J119_9CETA Unreviewed; 1148 AA.
AC L8J119;
DT 03-APR-2013, integrated into UniProtKB/TrEMBL.
DT 03-APR-2013, sequence version 1.
DT 27-MAR-2024, entry version 48.
DE SubName: Full=Thrombospondin-1 {ECO:0000313|EMBL:ELR61299.1};
DE Flags: Fragment;
GN ORFNames=M91_12734 {ECO:0000313|EMBL:ELR61299.1};
OS Bos mutus (wild yak).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Bovinae; Bos.
OX NCBI_TaxID=72004 {ECO:0000313|EMBL:ELR61299.1, ECO:0000313|Proteomes:UP000011080};
RN [1] {ECO:0000313|EMBL:ELR61299.1, ECO:0000313|Proteomes:UP000011080}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=yakQH1 {ECO:0000313|Proteomes:UP000011080};
RX PubMed=22751099; DOI=10.1038/ng.2343;
RA Qiu Q., Zhang G., Ma T., Qian W., Wang J., Ye Z., Cao C., Hu Q., Kim J.,
RA Larkin D.M., Auvil L., Capitanu B., Ma J., Lewin H.A., Qian X., Lang Y.,
RA Zhou R., Wang L., Wang K., Xia J., Liao S., Pan S., Lu X., Hou H., Wang Y.,
RA Zang X., Yin Y., Ma H., Zhang J., Wang Z., Zhang Y., Zhang D., Yonezawa T.,
RA Hasegawa M., Zhong Y., Liu W., Zhang Y., Huang Z., Zhang S., Long R.,
RA Yang H., Wang J., Lenstra J.A., Cooper D.N., Wu Y., Wang J., Shi P.,
RA Wang J., Liu J.;
RT "The yak genome and adaptation to life at high altitude.";
RL Nat. Genet. 44:946-949(2012).
CC -!- SIMILARITY: Belongs to the thrombospondin family.
CC {ECO:0000256|ARBA:ARBA00009456}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH880460; ELR61299.1; -; Genomic_DNA.
DR AlphaFoldDB; L8J119; -.
DR STRING; 72004.ENSBMUP00000000400; -.
DR Proteomes; UP000011080; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008201; F:heparin binding; IEA:UniProtKB-KW.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd00054; EGF_CA; 1.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 6.20.200.20; -; 1.
DR Gene3D; 2.10.25.10; Laminin; 3.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 3.
DR Gene3D; 4.10.1080.10; TSP type-3 repeat; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR024731; EGF_dom.
DR InterPro; IPR003367; Thrombospondin_3-like_rpt.
DR InterPro; IPR017897; Thrombospondin_3_rpt.
DR InterPro; IPR008859; Thrombospondin_C.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR028974; TSP_type-3_rpt.
DR InterPro; IPR048287; TSPN-like_N.
DR InterPro; IPR001007; VWF_dom.
DR PANTHER; PTHR10199; THROMBOSPONDIN; 1.
DR PANTHER; PTHR10199:SF78; THROMBOSPONDIN-1; 1.
DR Pfam; PF12947; EGF_3; 1.
DR Pfam; PF00090; TSP_1; 3.
DR Pfam; PF02412; TSP_3; 6.
DR Pfam; PF05735; TSP_C; 1.
DR Pfam; PF00093; VWC; 1.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00209; TSP1; 3.
DR SMART; SM00210; TSPN; 1.
DR SMART; SM00214; VWC; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 1.
DR SUPFAM; SSF57603; FnI-like domain; 1.
DR SUPFAM; SSF103647; TSP type-3 repeat; 3.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 3.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS50092; TSP1; 3.
DR PROSITE; PS51234; TSP3; 2.
DR PROSITE; PS51236; TSP_CTER; 1.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 1.
PE 3: Inferred from homology;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00634}; Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Heparin-binding {ECO:0000256|ARBA:ARBA00022674};
KW Reference proteome {ECO:0000313|Proteomes:UP000011080};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..1148
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003993150"
FT DOMAIN 316..373
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 547..587
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 646..690
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REPEAT 727..762
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT REPEAT 786..821
FT /note="TSP type-3"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00634"
FT DOMAIN 938..1148
FT /note="TSP C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51236"
FT REGION 839..927
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 841..868
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 883..927
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1148
FT /evidence="ECO:0000313|EMBL:ELR61299.1"
SQ SEQUENCE 1148 AA; 126925 MW; 7E21FFBCE64E5C4D CRC64;
MGLAWGLGVL LLLHACGSNR IPESGGDNSV FDIFELTGAA RKGSGRRLVK GPDPSSPAFR
IEDANLIPPV PDKKFQDLVD AVRAEKGFLL LASLRQMKKT RGTLLAVERK DHSGQVFSVV
SNGKAGTLDL SLTVQGKQHV VSVEEALLAT GQWKSITLFV QEDRAQLYID CEKMENVELD
VPIQSIFTRD LASIARLRIA KGGVNDNFQG VLQNVRFVFG TTPEDILRNK GCSSSTSVFV
TLDNNVVNGS SPAIRTDYIG HKTKDLQAIC GISCDELSSM VLELRGLRTI VTTLQDSIRK
VTEENKELAN ELRRPPLCYH NGVQYRTGDE WTVDSCTECR CQNSVTICKK VSCPIMPCSN
ATVPDGECCP RCWPSDSADD GWSPWSEWTS CSVTCGNGIQ QRGRSCDSLN NRCEGSSVQT
RTCHIQECDK RFKQDGGWSH WSPWSSCSVT CGDGVITRIR LCNSPSPQMN GKPCEGKARE
TKACQKDSCP INGGWGPWSP WDICSVTCGG GVQKRSRLCN NPTPQFGGKD CVGDVTENQI
CNKQDCPIDG CLSNPCFAGV QCTSYPDGSW KCGACPPGYS GDGVECKDVD ECKEVPDACF
NHNGEHRCEN TDPGYNCLPC PPRFTGSQPF GRGVEHATAN KQVCKPRNPC TDGTHDCNKN
AKCNYLGHYS DPMYRCECKP GYAGNGIICG EDTDLDGWPN EDLLCVANAT YHCRKDNCPN
LPNSGQEDYD KDGIGDACDD DDDNDKIPDD RDNCPFHYNP AQYDYDRDDV GDRCDNCPYN
HNPDQADTDN NGEGDACAAD IDGDGILNER DNCQYVYNVD QKDTDMDGVG DQCDNCPLEH
NPDQLDSDSD RIGDTCDNNQ DIDEDGHQNN LDNCPYVPNA NQADHDKDGK GDACDHDDDN
DGSDGRGDAC KDDFDQDKVP DIDDICPENV DISETDFRRF QMIPLDPKGT SQNDPNWVVR
HQGKELVQTV NCDPGLAVGY DEFNAVDFSG TFFINTERDD DYAGFVFGYQ SSSRFYVVMW
KQVTQSYWDT NPTRAQGYSG LSVKVVNSTT GPGEHLRNAL WHTGNTSGQV RTLWHDPRHI
GWKDFTAYRW HLSHRPKTGF IRVVMYEGKK IMADSGPIYD KTYAGGRLGL FVFSQEMVFF
SDLKYECR
//