ID I3N7P1_ICTTR Unreviewed; 4921 AA.
AC I3N7P1;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 74.
DE RecName: Full=SCO-spondin {ECO:0000256|ARBA:ARBA00020523};
GN Name=Sspo {ECO:0000313|Ensembl:ENSSTOP00000020387.2};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000020387.2, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000020387.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space
CC {ECO:0000256|ARBA:ARBA00004239}.
CC -!- SIMILARITY: Belongs to the thrombospondin family.
CC {ECO:0000256|ARBA:ARBA00009456}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01045246; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01045247; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01045248; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01045249; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR Ensembl; ENSSTOT00000028591.2; ENSSTOP00000020387.2; ENSSTOG00000023004.2.
DR GeneTree; ENSGT00940000155829; -.
DR HOGENOM; CLU_223278_0_0_1; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0030414; F:peptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd00112; LDLa; 8.
DR CDD; cd19941; TIL; 9.
DR Gene3D; 2.10.70.10; Complement Module, domain 1; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.10.25.10; Laminin; 11.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 9.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 23.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR036201; Pacifastin_dom_sf.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339:SF388; -; 1.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR Pfam; PF08742; C8; 3.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF00057; Ldl_recept_a; 8.
DR Pfam; PF01826; TIL; 8.
DR Pfam; PF00090; TSP_1; 22.
DR Pfam; PF00093; VWC; 1.
DR Pfam; PF00094; VWD; 3.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00832; C8; 3.
DR SMART; SM00041; CT; 1.
DR SMART; SM00181; EGF; 7.
DR SMART; SM00231; FA58C; 1.
DR SMART; SM00192; LDLa; 10.
DR SMART; SM00209; TSP1; 25.
DR SMART; SM00214; VWC; 6.
DR SMART; SM00215; VWC_out; 8.
DR SMART; SM00216; VWD; 2.
DR SUPFAM; SSF57603; FnI-like domain; 5.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 9.
DR SUPFAM; SSF57283; PMP inhibitors; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 11.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 23.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS50022; FA58C_3; 1.
DR PROSITE; PS01209; LDLRA_1; 3.
DR PROSITE; PS50068; LDLRA_2; 9.
DR PROSITE; PS50092; TSP1; 26.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 3.
PE 3: Inferred from homology;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Reference proteome {ECO:0000313|Proteomes:UP000005215};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..4921
FT /note="SCO-spondin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012768193"
FT DOMAIN 145..227
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 427..587
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 865..1035
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1823..1883
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 1922..2079
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 4760..4816
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 4815..4914
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 1391..1413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2111..2197
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1395..1413
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2177..2197
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1249..1264
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1269..1281
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1276..1294
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1288..1303
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1305..1317
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1312..1330
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1324..1339
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1345..1357
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1352..1370
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1418..1430
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1425..1443
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1437..1452
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2086..2098
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2093..2111
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2105..2120
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2238..2250
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2245..2263
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2257..2272
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2295..2307
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2302..2320
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2314..2329
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 4921 AA; 522932 MW; 0ABFD6BD681FCCFD CRC64;
MLLLAVLFGM LWALANGRWC EQTETIHVEE EVAPRKEDLV PCASLYHYTR LGWRLDLPRS
GHEGLSGPPA PGLCRIYKPP ETRPATWNRT VRACCPGWGG AQCTDALMED SPGGHCFATW
QCQPLGGSAN TSTENLEDPA VVVRLSLQWQ GDWLVLSGGL GVVVRLDRSS SVSISVDHEH
SGQTQGLCGL YNGRPEDDFS EPGGGLAMLA ATFGNSWKLP SSEPGCLDAV EVDQGCEGPL
KGTEAGMDAG QLRAEAQDVC HQLLESPFWQ CHAQVSPDEY HEACLFAYCA GAALGRQEGR
QEAVCATFAN YAQACARRRI HVLWRKPGFC ERLCPGGQLY SDCVSSCPPS CSAVGQEEEG
SCREECVSGC ECPPGLFWDG ALCVPASRCP CYHRRQRYAP GDTVHQLCNP CVCQDGHWHC
AQALCPAECA VGGDGHYLTF DGRSFFFRGH SGCHYSLVQV GTHPSVLAAP SIYLPPPPHH
QAPFPGAVLV DGQDVGLPWI GTAGLSISRA SSTFLLLRWP GVHVLWGVAD PAAYITLAPH
HAHQVQGLCG TFTWNQQDDF LTPAGDVETS TADFASRFQV AGNGKCSSGD SVLLFPCSTH
SQHLAFAEAT CAILHGPAFQ DCHRLVDREP FQLRCLAAVC GCAPGRDCLC AVLSAYAHRC
AQEGALLSWR NQTLCPVLCP GGQEYQECAP VCGHHCGEPE DCKELTGCVA GCNCPPGLLW
DPEGQCVPPS LCSCQLGAHR YSPGSVTMKD CNRCICQERG LWNCTAHHCA LQRAFCPGEL
VHAPGACLLT CDSPSANHSC PMGSTDGCVC PPGTVLLDER CVPPDLCPCR HSGQWYPPNA
TIQDNCNTCV CQGRQWHCTG QLCSGWCQAS GAPHYVTFDG LTLTFPGACE YLLVREASGR
FSVSAQNLPC GASGLTCTKT LAVRLEGTTV HMLRGRAVTV NGVSVTPPKA YTGPGISLHR
AGLFLLLTTR LGLTLLWDGG TRVLIQLSPH FRGRVAGLCG DFDGDASNDL RSRQGVLEPT
AELAAHSWRL SALCPEPGDP PHPCTVNTHR AAWARARCGV MLQQLFVPCH TEVPPQQHYE
WCVYDTCGCD SGGDCECLCS AIAAYADECA RHGHHVRWRS QELCPLQCEG GQVYEACGPT
CPPACPTHHA EPAWHCRAIA CVEGCFCPEG TLLHGGICME PSACPCEWGG SFFPPGTVLQ
KDCGNCTCQE SQWHCGGDSA PCEELVPDCA EGESPCQGSG HCIPHEWLCD NQDDCGDGSD
EEGCATPGCV EGQMSCRSGH CLPLSLLCDG QDDCGDGTDE QGCPCPHDSL ACADGRCLPL
TLLCDGHPDC PDAADEESCL GQVNCIPGEV SCMDGTCVGT IQLCDGVWDC PDGADEGPGY
CPLPSLPVPP ASTMPGPSTG SLEGASSPLG SASPARSCGP LEFPCGSGEC TPRGWRCDQE
EDCADGSDEL GCGGPCALHH APCARGSHCV SPGELCDGVP QCPDGSDEGP DACGESRSLN
RTEFPCPEYS CPNGDCLGFH LVCDGQPNCE LVGEAGLSPE EQGCGTWGPW SPWGPCSQTC
GLGVQSQSRS CSPSGLLVLQ HCPGPANQSQ ACFTKACPVD GEWSSWSSWS LCSEPCGGTR
TRQRQCHPPQ NGGQACALLP GGPHSTHQTG EWKGEGTGGG RGLLGGHLSL LLCLPLQSLT
PLSEPCPQDG CPNGQCVWPR QCPCLVDGAR YWPGQHIKAN CQLCVCQDGR PRRCREYPVP
CILPGYPAGH SPLPPGPNSC PLSVNCGWSS WSPWAECLGP CGSQSIQWSF RSPNNPRLSG
RGRQCRGIHR KARRYQCQTQ PCEGCEQQGH THQVGERWRG GPCKVCQCLH MGTVHCSPYC
PLGSCPQGWI LVEGVGESCC HCALPENQTV PPTATPAPAP APSPQTGPPL VTYVLPPLGD
ACYSPLGLAL PPGSLWTWSR QLEPPTWAAL LGAPTKGPGP RAGAYAEWHS GPPFLQLDLL
QPRNLTGVIV WGPTSSSPPG SSFSLQFGTD GLHWQDYRDI LPGTLSPPKS FPRNWEDVAS
KVWAFSRMVQ ARYIRVWPHS VPHSDRQPGF FLWVELLGLS PPVPPCPGAG HRCANGDCAL
RGVPCDGAAD CEHGSDEEGC GPLGHSTART PTFSTQPGPL PPQPSEVREL GSPHSFSPLP
TEKRPVSPAP ASKAPHPSSG ESVQTGSNTP TFHAGFQSLT PGMAATTEAQ GHEVPPQLAM
VAPTGQRVAP SPFPPVRCSP GQLPCKVLGC VEPEQLCDGK EDCLDGDDER HCASPMPFMV
PTTVLPGLPA SRTLCSQSQL SCGSGECLPT ERRCDLRPDC QDGSDEDGCV DCVLAPWSGW
SGCSRSCGLG LTFQRRELLR PPLPGGSCPL DQLRRQPCFV QACPVAGAWA EWETWGPCSV
SCGGGHQSRQ RSCMDPPPKN GGAPCPGASW ERAPCGLQPC TGDTDCGLGR VHVNAELCQK
GLVPPCPPSC LDPETNGSCT GPCVEGCRCP PGLLLHDSRC LPLSECPCLV GEELKPPGMS
FLLDNCSQCI CERGTLVCEP GACPQPCGWS AWSSWTPCDR SCGSGVRARF RSPSNPPAAF
GGAPCEGDKQ ELQVCHVDCG TEVLGWTPWT SWSSCSQSCL VPGGVPGWRH RSRLCPGLRD
TSCPGEATQE EPCSPPVCPV PSSWGAWASW SACSAPCNGG IQIRGRSCSG SAPGNPACRG
PHSQTRDCNT QPCTAQCPED MVFRSAEQCH QDGGPCPQLC LAQDSGVECT GFCTPSCTCP
PGLFLHNASC LPRSQCPCQL HGKLYAPGAV ARLDSCNNCT CISGEMACTS ELCPVACGWS
PWTPWSPCSR SCDVGIRRRF RAGTSPPAAF GGAECQGPNI EAEFCSLRPC QSPGGEWGPW
ASCSVPCGGG YRNRTRGSGP RVLMVFSTCN LQPCAGPVPG MCPRGQLWLD CAQGPASCAE
LSAPRETKQT CHPGCYCPHG MHLLNNVCVP AQACPCAHEG RLHPPGSAVL RPCENCSCIS
GLITNCTSWP CEEGQPTWSP WTPWSVCSAS CGPARRHRHR FCDRPSSVAP STLALGPSPA
TPTPLCPGPE AEEEPCLLPG CDRAGGWGPW GPWSSCSRSC GGGLRSRTRA CDQPPPQGLG
DFCEGPQAQG EACQAQPCPV TDCTAIEGAQ YSPCGPPCPR SCDDLVHCVW HCQPGCYCPP
GQVLSADGTL CVHPGHCGCL DLLTGERHRP GAQLARPDGS GCTPGRLNCS ELPCPVPGGW
CPWSEWTACF QPCRGQTRTR SRACACPAPQ HGGALCPGEM GAQHQRETCP SSTTCPAWSP
WGPWSPCDAC LGQSHRSREC SQPPTSEGGR PCPGVHWQSE GSSRAGVWGG DRWGALSCLE
PPCSLLRPSG IPENQSRSAG SGLSSWESLE PGEVVTGPCN NCTCVAGILQ CWEVPSCPGA
GVWGPWGPWE DCSVSCGGGE QLRSRHCARP PCPGLARQSR TCHTQVCREA GCPAGRLYRE
CLPNGGCPFS CAHVAGRVAC FSDGCEEGCH CPEGTFQHGL ACVQECPCML TALLLRELGA
SSTDAGVHAA SLGEEGQPLR PGDELSPGQM FRMGCSNCSC AHGKLSCSIE DCPEAPSFSP
WDPWGPCSRS CGGLGTRTRS RQCVHPALAT GGQGCQGPRQ DLEYCLSPDC SGAEGSTAEP
VTGLPGGWGP WAPWSPCSRS CTDPTHPAWR SRTRLCLTNC TVGRPSQERP CNLPSCTGPL
CSSPGCGPGN CSWTSWGPWE PCSRSCGVGQ QHRLRAYRPP GPGGHWCPDI LTAFQERRFC
SLRACPVPGG WSRWSPWSWC DRSCGGGQSL RSRSCSSPPP KNGGAPCVGE KHHARLCNPM
PCGIAVVTAA SHPYWASVSK AGGPQGPAPA LKSHLPTGSL EQDSSCVPVG HCECTDAQGH
SWAPGSQHQD ACNNCSCQAG QLSCTAQPCP PPAHCAWSRW SAWSPCSHSC GPRGLQSRFR
SSTSGSWAPE CQEEQSQSQP CPQPPCPPLC LHEGRVHTLG DSWLQGGCQQ CSCTPEGIIC
EDTKCPGGWG SWTLWSLWSD CPVSCGGGNQ VRTRACVVPD PHHREPLCQG PDTQTQPCGQ
QPCQPLLEAC SWGPWGPCSR SCGSGLASRS GSCPCLPAEA EPTCNGTFPH LDTQACYAGP
CLEECLWSGW SSWTRCSCQV LVQQRYRHQG PAPGGTAEGP PCTRLDGHFR PCPTGNCSED
SCTPPFEFQA CGSPCAGLCA THLSRQLCQD LPACQPGCYC PKGLLEQAGA CIPPEQCNCW
HISEEGAEVT LAPGDHLQLG CKECECWHGE LRCTSGGCAG LLPLSSWSEW SPCGPCLPRS
ALTRTSRTTL EEHWPPNTTG LWPPSTSLLV SEQHRHRLCL DPETGRPWAG DPQLCTAPLN
QQRLCPDPEA CQDSCQWNPW GPWSPCQVPC SGGFRMRWRE AGGLPGGACR GPWAQTQSCN
MGPCPGKSCE AKDTVPTLEC ANQCPRSCMD LWDRVQCLQG PCSPGCRCPP GQLVQDGHCV
PISSCRCGLP SANASWELAP AQVVQLDCHN CTCINGSLVC PHLECPTLGP WSAWSKCSVA
CGGGTMDRHR SCKEHPQGAL CQAQDMKQQQ DCNLQPCPGE HPGPRGLPSL PPSLHHCWIQ
APTYLSGACP PQLLHNGTCV PPAACPCTQL LLPWGLTLTL EEQAQELPPG AVLTRNCTRC
TCQDGAFSCS PIDCQECPPG EMWQHVGPEE LGPCEWTCQE TNTTAAQGNC SAVQTPGCIC
QEGYFRSQAG PCVPADQCEC WHHGHLHLLG SEWQEDCESC QCLRGRSVCT RHCPLLNCAQ
DEVTVQEPGS CCPTCRRETL EAQSASCRHL TELRNLTKGP CHLDQVEVSY CSGHCPSSTN
VMPEEPYLQS QCDCCSYRLD PDSPVRILNL QCPDGRTEPV VLPVINSCQC SACQAGDFSK
R
//