ID A0A2K6L3M6_RHIBE Unreviewed; 5135 AA.
AC A0A2K6L3M6;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=SCO-spondin {ECO:0000256|ARBA:ARBA00020523};
OS Rhinopithecus bieti (Black snub-nosed monkey) (Pygathrix bieti).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Colobinae; Rhinopithecus.
OX NCBI_TaxID=61621 {ECO:0000313|Ensembl:ENSRBIP00000018117.1, ECO:0000313|Proteomes:UP000233180};
RN [1] {ECO:0000313|Ensembl:ENSRBIP00000018117.1, ECO:0000313|Proteomes:UP000233180}
RP NUCLEOTIDE SEQUENCE.
RA Wu, C.-I. and Zhang, Y.;
RT "Genome of Rhinopithecus bieti.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSRBIP00000018117.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space
CC {ECO:0000256|ARBA:ARBA00004239}.
CC -!- SIMILARITY: Belongs to the thrombospondin family.
CC {ECO:0000256|ARBA:ARBA00009456}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00039}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 61621.ENSRBIP00000018117; -.
DR Ensembl; ENSRBIT00000041976.1; ENSRBIP00000018117.1; ENSRBIG00000033043.1.
DR GeneTree; ENSGT00940000155829; -.
DR OMA; QTKNELC; -.
DR Proteomes; UP000233180; Unplaced.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0098797; C:plasma membrane protein complex; IEA:UniProt.
DR GO; GO:0030414; F:peptidase inhibitor activity; IEA:InterPro.
DR GO; GO:0007155; P:cell adhesion; IEA:UniProtKB-KW.
DR CDD; cd00057; FA58C; 1.
DR CDD; cd00112; LDLa; 8.
DR CDD; cd19941; TIL; 15.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.10.25.10; Laminin; 14.
DR Gene3D; 4.10.400.10; Low-density Lipoprotein Receptor; 8.
DR Gene3D; 2.20.100.10; Thrombospondin type-1 (TSP1) repeat; 24.
DR InterPro; IPR006207; Cys_knot_C.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR036055; LDL_receptor-like_sf.
DR InterPro; IPR023415; LDLR_class-A_CS.
DR InterPro; IPR002172; LDrepeatLR_classA_rpt.
DR InterPro; IPR036201; Pacifastin_dom_sf.
DR InterPro; IPR036084; Ser_inhib-like_sf.
DR InterPro; IPR002919; TIL_dom.
DR InterPro; IPR000884; TSP1_rpt.
DR InterPro; IPR036383; TSP1_rpt_sf.
DR InterPro; IPR014853; VWF/SSPO/ZAN-like_Cys-rich_dom.
DR InterPro; IPR001007; VWF_dom.
DR InterPro; IPR001846; VWF_type-D.
DR PANTHER; PTHR11339; EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED; 1.
DR PANTHER; PTHR11339:SF396; SCO-SPONDIN; 1.
DR Pfam; PF08742; C8; 3.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF00057; Ldl_recept_a; 6.
DR Pfam; PF01826; TIL; 11.
DR Pfam; PF00090; TSP_1; 21.
DR Pfam; PF00094; VWD; 3.
DR PRINTS; PR00261; LDLRECEPTOR.
DR SMART; SM00832; C8; 3.
DR SMART; SM00231; FA58C; 1.
DR SMART; SM00192; LDLa; 9.
DR SMART; SM00209; TSP1; 24.
DR SMART; SM00214; VWC; 7.
DR SMART; SM00215; VWC_out; 10.
DR SMART; SM00216; VWD; 3.
DR SUPFAM; SSF57603; FnI-like domain; 3.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF57424; LDL receptor-like module; 9.
DR SUPFAM; SSF57283; PMP inhibitors; 1.
DR SUPFAM; SSF57567; Serine protease inhibitors; 13.
DR SUPFAM; SSF82895; TSP-1 type 1 repeat; 23.
DR PROSITE; PS01225; CTCK_2; 1.
DR PROSITE; PS01286; FA58C_2; 1.
DR PROSITE; PS50022; FA58C_3; 1.
DR PROSITE; PS01209; LDLRA_1; 4.
DR PROSITE; PS50068; LDLRA_2; 9.
DR PROSITE; PS50092; TSP1; 25.
DR PROSITE; PS01208; VWFC_1; 1.
DR PROSITE; PS50184; VWFC_2; 2.
DR PROSITE; PS51233; VWFD; 3.
PE 3: Inferred from homology;
KW Calcium {ECO:0000256|ARBA:ARBA00022837};
KW Cell adhesion {ECO:0000256|ARBA:ARBA00022889};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00124}; EGF-like domain {ECO:0000256|ARBA:ARBA00022536};
KW Reference proteome {ECO:0000313|Proteomes:UP000233180};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022525}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..5135
FT /note="SCO-spondin"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014409532"
FT DOMAIN 193..362
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 563..736
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1014..1184
FT /note="VWFD"
FT /evidence="ECO:0000259|PROSITE:PS51233"
FT DOMAIN 1956..2016
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 2056..2215
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 4974..5030
FT /note="VWFC"
FT /evidence="ECO:0000259|PROSITE:PS50184"
FT DOMAIN 5041..5128
FT /note="CTCK"
FT /evidence="ECO:0000259|PROSITE:PS01225"
FT REGION 1513..1545
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2299..2329
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3468..3496
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1514..1528
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1530..1545
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2302..2316
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 1418..1430
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1425..1443
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1437..1452
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1477..1489
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1484..1502
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1550..1562
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1557..1575
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1569..1584
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 1652..1670
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2225..2237
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2232..2250
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2244..2259
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2382..2394
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2389..2407
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2401..2416
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2449..2461
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2456..2474
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
FT DISULFID 2468..2483
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00124"
SQ SEQUENCE 5135 AA; 546369 MW; D3DAD20EC1C1111B CRC64;
MLLPALLFGM MWALADGRWC EWTETIHVEE EVAPRQEDLV PCANLDHYSR LGWRLDLPWS
GRAGLTRSPA PGLCPIYKPP ETRPAKWNRT VRACCPGWGG AHCTEALAKA SPEGHCFAMW
QCQLQAGSAN ASAGSLEECC AWPWGRSWRD GSSQACLSCS SRHLPGGASS PALLQPLAGA
VGQLWSQHQR PSATCASWSG FHYRTFDGRH YHFLGRCTYL LAGAADSTWA VHLTPGDHCP
QPGHCQLVTM GPEKVLIQAG NVSVKGQLVP EGQSWLLHGL SLQWLGDWLV LSGGLGVVVR
LDRAGSISIS VDHELWGQTQ GLCGLYNGQP EDDFMEPGGG PAMLAATFGN SWKLPDSEPG
CLDAVEVAQG CDGPLGLTEA DVEPGHLRAE AQDVCHQLLE GPFGQCHAQV SPDEYHEACL
FAYCAAAMAG SRQEGQRQAV CATFASYAQA CARRHIHIRW RKPGFCERLC PGGQLYSDCA
SLCPPSCEAV GQGEEESCRE ECVSGCECPQ GLFWNGTLCV PAAHCPCYYR RQRYAPRDTV
RQLCNPCVCR DGRWHCAQAL CPAECAVGGD GHYLTFDGRS YSFRGGQGCR YSLVQDYVKG
QLLILLEHGA CDSGSCLHAI SVSLEDTHIQ LRDSGAVLVN GQDVGLPWIG AEGLSVRQTS
SAFLLLRWPG AQVLWGLSDP AAYITLDPRH AHQVQGLCGT FTQNQQDDFL TPAGDVETSI
AAFASKFQVA GQGSCPSGDS APLSPCTTHS QRHTFAETAC AILHSSVFQE CHRLVDREPF
YLRCLAAVCG CDPGRDCLCP VLSAYARHCA QEGASPPWRN QTLCPVLCPG GQEYRECAPA
CGQHCGEPED CGALGSCVAS CNCPLGLLWD PEGQCVPPSL CPCQLGARRY APGSATMKEC
NRCVCQERGL WNCTARRCPL QQAFCPRELV YAPGACLLTC DNPTANHSCP AGSADGCVCP
PGMVLLDERC VPPDLCPCRH SGQWYPPNAT IQEDCNICVC RGRQWHCTGQ RCSGRCQASG
APHYVTFDGL AFTFPGACEY LLVREASGLF TVSAQNLPCG ASGLTCTKAL AVRLEGTVVH
MLRGRAVTVN GLRVTPPKVY TGPGLSLHRA GLFLLLSTRL GLTLLWDGGT RVLVQLSPQF
RGRVAGLCGD FDGDASNDLR SRQGVLEPTA ELAAHSWRLS TLCPEPGDLP HPCAVNTHRA
GWARARCGVL LQPLFASCHA EVPPQQHYEW CLHDACGCDS GGDCECLCSA IATYADECAR
HGYHVRWRSQ ELCPLQCEGG QVYEACGPTC PPTCHEQHPE PGWHCQVVAC VEGCFCPEGT
LLHGGACLEP ASCPCEWGSN SFPLGSVLQK ECGNCTCQEG QWHCGGDGGH CEEPVPGCAE
GEALCQENGH CVPHGWLCDN QDDCGDGSDE EGECLCSCVE GLLACADGRC LTPGLLCDGH
PDCPDAADEE SCLGELPISS RAPTSWGTPN SKRQVTCVPG EVSCVDGSCL GAIQLCDGVW
DCPDGADEGP GHCPLPSLPT PPAGTLPGPS PGSLDTTPSS LASASPAPSC GPFEFRCGSG
ECIPRGWRCD QEEDCPDGSD EHGCGEPCAP HDAPCAHGPH CVSREQLCDG VRQCPDGSDE
DPDACGEAPG GPAGWARDAA HTPWLPCPEY TCPNGTCIGF KLVCDGQPDC GGSGQAGPSP
EEQGCGAWGP WSPWGPCSRT CGPGGQGRSR RCSPLGLLVL QHCPGPEHQS QTCFTAACPV
DGEWSAWSPW SVCSEPCRGT MTRQRQCHPP QNGGRTCAAL PGGPHSTRQT KPCPQDSCPN
ATCSGELMFQ PCAPCPLTCD DISSQVMCPP DRPCGSPGCW CPEGQVLGSE GWCVWPRQCP
CLVDGARYWP GQRIKADCRL CICQDGRPRR CRLNPDCAVD CGWSSWSPWA ECLGPCGSQS
IQWSFRSPNN PRPSGRGRQC RGIHRKARRC QTEPCEGCEH QGQVHRVGER WRGGPCRVCQ
CLHNLTARCS PYCPLGSCPQ GWVLVEGTGE SCCHCALPGE NQTVQPMATP AAAPAPSPQI
RFPLATYILP PPGDPCYSPL GLAGLAEGSL HASSQQLEHP TQAALMGAPT QEPSPHGRRA
GGYAYAKWHT RPHYLQLDLL QPRNLTGIIV PETGSSNASA ASFSLQFSSN GLRWLDYHDI
LPGILPLPKL FPRHWDDLDP AVWTFGQMVQ ARFVRVWPRD AHHSDVPLRV ELLGCEPGSP
PAPLCPGVGL RCASGECALR GSLCDGVLDC KDGSDEEGCV LPPEGTGRFH STAKTLALSS
AQPGQLLHWL REGLAETERW PPGQESPTSP TETRPVSPGP ASGVPHHGES MQMVTTTPIS
QMEARTLPPG MAAVMVLTPH TVTPATPAGQ SVAPGPFPPV QCGPGQMPCE VLGCVEQAQV
CDGREDCLDG SDERHCGELL EGLLSSASTV PFTVPTMALP GLPASRALCS PSQLSCGSGE
CLSAERRCDL RPDCQDGSDE DGCVDCVLAP WSIWSSCSRS CGLGLTFQRQ ELLRPPLPGG
SCPPDRFRSQ SCFVQACPVA GAWAMWEAWG PCSVSCGGGH QSRRRSCVDP PPKNSGAPCP
GPSQERVPCG LQPCSGGTDC ELGRVYVSAD LCQKGLVPLC PPSCLDPKAN RSCSGHCVEG
CRCPSGLLLH DTRCLPLSEC PCLVGEELKW PGVSFVLANC SQCVCEKGEL LCQPGGCPLP
CGWSAWSSWA PCDRSCGSGV RARFRSPSNP PAAWGGAPCE GDRQELQGCH TECGTEVLGW
TPWTSWSSCS QSCLAPGGGP GWRSRSRLCP SPGDSSCPGE ATQEEPCSPP VCPVPSIWGL
WAPWSTCSAP CDGGIQTRGR SCSSLAPGDT SCPGPHSQTR DCNTQPCTAQ CPENMVFRSA
EQCRQEGGPC PRLCLTHGPG IECSGFCAPG CACPPGLFLH NASCLPRSQC PCQLHGQLYA
PGAMARLDSC NNCTCVSGEM ACTSEHCPVA CGWSPWTPWS LCSRSCNVGI RRRFRAGTAP
PAAFGGAECQ GPTMEAEFCS LRPCRGPGGE WGPWSPCSVP CGGGYRNRTR GSGLHSPMEF
STCGLQPCTG PVPGVCPRGK QWLDCAQGPA SCAELSASRG TNKTCHPGCH CPSGMLLLNN
VCVPTQDCPC AHEGHLYPPG STVVRPCENC SCVSGLIANC SSWTCVEGEP TWSPWTPWSQ
CSASCGPARR HRHRFCARSP SAAPSTVAPL SLPATHTPLC PGPEAEEEPC LLPGCDRAGG
WGPWGPGLRS RTRACDQPHP RASGITARGH GPRGSPSCLP SVTNCTAIEG AEYSPCGPPC
PRSCDDLVHC VWRCQPGCYC PPGQVLSSNG AICVQPGHCG CLDLLTGRWH HPGAQLARPD
GCNHCTCLEG RLNCTDLPCP VPGGWCPWSE WTMCSQPCRN QTRSRSRACA CPTPQYGGAP
CTGETGEAGA QHQREACPSS ATCPVDGAWG PWGPWSSCDK CLGQSHRSRA CSRPPTPEGG
RPCPGSHTQS RPCQDNSTRC TDCGGGQSLH PCGQPCPRSC QDLSPGSVCQ PGSAGCQPSC
GCPLGQLSQD GLCIPLARCR CQYQPGAMGI PENQSRSAGS RFSSWESLEP GEVVTGPCDN
CTCVAGILQC QEVPGCPDPG VWSSWGPWED CSVSCGGGEQ PRSRRCARPP CPGPARQSRT
CSTQVCREAG CPAGRLYREC QPGEGCPFSC AHITQQMGCF SEGCEEGCHC PEGTFQHRLA
CVQECPCVLT AWLLQELGAI RGDPGQSLGP GDELGSGQTL HTSCGNCSCA HGTLSCSLED
CFEADGGFGP WSPWGPCSRS CGGLGTRTRS RQCVLPMPVP SGQGCRGPRQ GLEYCPSPDC
PGAEGSTVEP VTGLPGGWGP WSSWSPCSRS CTDPARPAWR SRTRLCLANC TMGDPLQERP
CNLPSCTELP LCPGPGCGAG NCSWTSWAPW EPCSRSCGVG QQRRLRAYRP PGPSGHWCPD
ILTAYQERRF CNLRACPVPG GWSRWSPWSW CDRSCGGGQS LRSRSCSSPP PKNGGAPCAG
ERHQARLCNP TPCEAGCPAG MEVVTCANHC PRRCSDLQEG IVCQDDQVCQ KGCRCPKGSL
EQDGGCVPIG HCDCTDAQGH IWAPGSQHQD ACNNCSCQAG RLFCTAQPCP PPTHCAWSRW
SAWSPCSHSC GPGGQQSRFR SSTSGSWAPE CREEQSQSQP CPQPSCPPLC LQGTRPRTLG
DSWLHGECQQ CSCTPEGVIC EDTECAVPEA WTLWSSWSDC PVSCGGGNQV RTRACRAAAP
HHGSPPCLGP DIQTQPCGQQ PCPGLLEACS WGPWGPCSRS CGLGLASRSG SCPCLIAKAD
PTCNGTFLHL DTQGCYPGPC PEECVWSSWS SWTRCSCQVL VQQRYRHQGP ASQGARAGAP
CTQLDGQFRP CLIGNCSEDS CTPPFEFHAC GSPCAGLCAT HLSYQLCQDL PPCQPGCYSG
SWGNFQEKGG PLPTLPLLLI SCNCWHTSAA GARITLAPGD RLQLGCKECE CRRGELHCTS
QGCQGLLPLS EWSEWSPCGP CLPPSTLAPA SRTALEERWL QDATSLSPTS APLLASEQHR
HRLCLDPATG RPWTGAPHLC TVPLRQQRLC PDPGACPDSC QWSLWGPWSP CQVPCSGGFR
LRWRGTEAPT GGGCRGPWAQ TESCNRGPCP GESCEARNTV LTLDCANQCP RSCADLWDRV
QCLQGPCRPG CRCPPGQLVQ DGHCVPISSC RCGLPSANAS WELAPDQVVQ LDCQNCTCVN
GSLVCPHQEC PVLGPWSAWS SCSAPCGGGT MERHRSCEGG PGMAPCQAQD TEQWQECNLQ
PCPECPPGQV LSACATSCPR LCWHLQPGAI CVQEPCQPGC GCPGGQLLHN GTCMPPAACP
CTQHSLPWGL TLTLEEQAQE LPPGTVLTWN CTRCVCHGGA FSCSLIDCQE CPPGEMWQQV
APGELGLCEQ TCQEMNATET RSNCSSAQAS GCVCQPGHFR SQAGPCVPED LCECWHLGHP
HLPGSEWQEA CESCLCRSGR PVCTQRCSSL TCPQGEEMVL EPGSCCPSCR MEAPEEQLPS
CQLLTELRNF TKGTCYLDKV EVSYCSGYCL SSTHVMPEEP YLQSQCDCCS YRLDPESPVR
ILNLRCLGGH TEPVVLPVIH SCQCSSCQGG DFSKH
//