ID A0A3P9ARZ3_9CICH Unreviewed; 455 AA.
AC A0A3P9ARZ3;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=SPARC (osteonectin), cwcv and kazal like domains proteoglycan 1 {ECO:0000313|Ensembl:ENSMZEP00005000441.1};
GN Name=SPOCK1 {ECO:0000313|Ensembl:ENSMZEP00005000441.1};
OS Maylandia zebra (zebra mbuna).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Maylandia; Maylandia zebra complex.
OX NCBI_TaxID=106582 {ECO:0000313|Ensembl:ENSMZEP00005000441.1, ECO:0000313|Proteomes:UP000265160};
RN [1] {ECO:0000313|Ensembl:ENSMZEP00005000441.1, ECO:0000313|Proteomes:UP000265160}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25186727; DOI=10.1038/nature13726;
RA Brawand D., Wagner C.E., Li Y.I., Malinsky M., Keller I., Fan S.,
RA Simakov O., Ng A.Y., Lim Z.W., Bezault E., Turner-Maier J., Johnson J.,
RA Alcazar R., Noh H.J., Russell P., Aken B., Alfoldi J., Amemiya C.,
RA Azzouzi N., Baroiller J.F., Barloy-Hubler F., Berlin A., Bloomquist R.,
RA Carleton K.L., Conte M.A., D'Cotta H., Eshel O., Gaffney L., Galibert F.,
RA Gante H.F., Gnerre S., Greuter L., Guyon R., Haddad N.S., Haerty W.,
RA Harris R.M., Hofmann H.A., Hourlier T., Hulata G., Jaffe D.B., Lara M.,
RA Lee A.P., MacCallum I., Mwaiko S., Nikaido M., Nishihara H.,
RA Ozouf-Costaz C., Penman D.J., Przybylski D., Rakotomanga M., Renn S.C.P.,
RA Ribeiro F.J., Ron M., Salzburger W., Sanchez-Pulido L., Santos M.E.,
RA Searle S., Sharpe T., Swofford R., Tan F.J., Williams L., Young S., Yin S.,
RA Okada N., Kocher T.D., Miska E.A., Lander E.S., Venkatesh B., Fernald R.D.,
RA Meyer A., Ponting C.P., Streelman J.T., Lindblad-Toh K., Seehausen O.,
RA Di Palma F.;
RT "The genomic substrate for adaptive radiation in African cichlid fish.";
RL Nature 513:375-381(2014).
RN [2] {ECO:0000313|Ensembl:ENSMZEP00005000441.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC matrix {ECO:0000256|ARBA:ARBA00004498}.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00500}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P9ARZ3; -.
DR STRING; 106582.ENSMZEP00005000441; -.
DR Ensembl; ENSMZET00005000502.1; ENSMZEP00005000441.1; ENSMZEG00005000438.1.
DR GeneTree; ENSGT00940000158371; -.
DR Proteomes; UP000265160; LG2.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR CDD; cd00104; KAZAL_FS; 1.
DR CDD; cd00191; TY; 1.
DR Gene3D; 3.30.60.30; -; 1.
DR Gene3D; 1.10.238.10; EF-hand; 1.
DR Gene3D; 4.10.800.10; Thyroglobulin type-1; 1.
DR InterPro; IPR011992; EF-hand-dom_pair.
DR InterPro; IPR002350; Kazal_dom.
DR InterPro; IPR036058; Kazal_dom_sf.
DR InterPro; IPR019577; SPARC/Testican_Ca-bd-dom.
DR InterPro; IPR000716; Thyroglobulin_1.
DR InterPro; IPR036857; Thyroglobulin_1_sf.
DR PANTHER; PTHR13866; SPARC OSTEONECTIN; 1.
DR PANTHER; PTHR13866:SF17; TESTICAN-1; 1.
DR Pfam; PF07648; Kazal_2; 1.
DR Pfam; PF10591; SPARC_Ca_bdg; 1.
DR Pfam; PF00086; Thyroglobulin_1; 1.
DR SMART; SM00280; KAZAL; 1.
DR SMART; SM00211; TY; 1.
DR SUPFAM; SSF47473; EF-hand; 1.
DR SUPFAM; SSF100895; Kazal-type serine protease inhibitors; 1.
DR SUPFAM; SSF57610; Thyroglobulin type-1 domain; 1.
DR PROSITE; PS51465; KAZAL_2; 1.
DR PROSITE; PS00484; THYROGLOBULIN_1_1; 1.
DR PROSITE; PS51162; THYROGLOBULIN_1_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00500}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00022974};
KW Heparan sulfate {ECO:0000256|ARBA:ARBA00023207};
KW Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW Reference proteome {ECO:0000313|Proteomes:UP000265160};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..455
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018208871"
FT DOMAIN 141..192
FT /note="Kazal-like"
FT /evidence="ECO:0000259|PROSITE:PS51465"
FT DOMAIN 324..390
FT /note="Thyroglobulin type-1"
FT /evidence="ECO:0000259|PROSITE:PS51162"
FT REGION 59..90
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 383..455
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 60..79
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 438..455
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 362..369
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
SQ SEQUENCE 455 AA; 51034 MW; 6A86A525BBB1CC88 CRC64;
MFFFLLPVVA LLLSRETVVS ANNNEKWLGT VAQYKDRSWN RFRDDDYFKS WAPAKSLDQE
RVAEPGRYGD PRTKAHSSSA KKTVQDGPDA TKDPCLKVHC PPHKICVSHD YQTAICTNHK
QPGHSVKPRK GSAGHKHRLE AGAHGKCRLC SALQSTPVCG SDGHTYSSEC KLEFQSCLSG
KKISVKCDGL CPCLPSQELS KPLHNGEKAA CTDTELHSLS ARLKDWFGVL HLDANRDLKS
SDSFDSTTGH FDTSILPICK DSLGWMFNKL DMNFDLLLDQ SELSAIYLDK YELCMKPLFN
SCDSFKDGKL SNNEWCYCFQ KPDGLPCQTE KSRIQSQSRR KSLIGTYIPR CTDEGYFKPT
QCHGSTGQCW CVDKYGNEIA GSRKQGNPNC DEDQETSGDF GSGGAVILLD DQEEEPSQTG
RSRQKKRRGR IHPRGTIEDD EDEEEDKDDE IGYVW
//