GenomeNet

Database: UniProt
Entry: A0A3P9ARZ3_9CICH
LinkDB: A0A3P9ARZ3_9CICH
Original site: A0A3P9ARZ3_9CICH 
ID   A0A3P9ARZ3_9CICH        Unreviewed;       455 AA.
AC   A0A3P9ARZ3;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   13-FEB-2019, sequence version 1.
DT   27-MAR-2024, entry version 22.
DE   SubName: Full=SPARC (osteonectin), cwcv and kazal like domains proteoglycan 1 {ECO:0000313|Ensembl:ENSMZEP00005000441.1};
GN   Name=SPOCK1 {ECO:0000313|Ensembl:ENSMZEP00005000441.1};
OS   Maylandia zebra (zebra mbuna).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Haplochromini; Maylandia; Maylandia zebra complex.
OX   NCBI_TaxID=106582 {ECO:0000313|Ensembl:ENSMZEP00005000441.1, ECO:0000313|Proteomes:UP000265160};
RN   [1] {ECO:0000313|Ensembl:ENSMZEP00005000441.1, ECO:0000313|Proteomes:UP000265160}
RP   NUCLEOTIDE SEQUENCE.
RX   PubMed=25186727; DOI=10.1038/nature13726;
RA   Brawand D., Wagner C.E., Li Y.I., Malinsky M., Keller I., Fan S.,
RA   Simakov O., Ng A.Y., Lim Z.W., Bezault E., Turner-Maier J., Johnson J.,
RA   Alcazar R., Noh H.J., Russell P., Aken B., Alfoldi J., Amemiya C.,
RA   Azzouzi N., Baroiller J.F., Barloy-Hubler F., Berlin A., Bloomquist R.,
RA   Carleton K.L., Conte M.A., D'Cotta H., Eshel O., Gaffney L., Galibert F.,
RA   Gante H.F., Gnerre S., Greuter L., Guyon R., Haddad N.S., Haerty W.,
RA   Harris R.M., Hofmann H.A., Hourlier T., Hulata G., Jaffe D.B., Lara M.,
RA   Lee A.P., MacCallum I., Mwaiko S., Nikaido M., Nishihara H.,
RA   Ozouf-Costaz C., Penman D.J., Przybylski D., Rakotomanga M., Renn S.C.P.,
RA   Ribeiro F.J., Ron M., Salzburger W., Sanchez-Pulido L., Santos M.E.,
RA   Searle S., Sharpe T., Swofford R., Tan F.J., Williams L., Young S., Yin S.,
RA   Okada N., Kocher T.D., Miska E.A., Lander E.S., Venkatesh B., Fernald R.D.,
RA   Meyer A., Ponting C.P., Streelman J.T., Lindblad-Toh K., Seehausen O.,
RA   Di Palma F.;
RT   "The genomic substrate for adaptive radiation in African cichlid fish.";
RL   Nature 513:375-381(2014).
RN   [2] {ECO:0000313|Ensembl:ENSMZEP00005000441.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular
CC       matrix {ECO:0000256|ARBA:ARBA00004498}.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00500}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; A0A3P9ARZ3; -.
DR   STRING; 106582.ENSMZEP00005000441; -.
DR   Ensembl; ENSMZET00005000502.1; ENSMZEP00005000441.1; ENSMZEG00005000438.1.
DR   GeneTree; ENSGT00940000158371; -.
DR   Proteomes; UP000265160; LG2.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-KW.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   CDD; cd00104; KAZAL_FS; 1.
DR   CDD; cd00191; TY; 1.
DR   Gene3D; 3.30.60.30; -; 1.
DR   Gene3D; 1.10.238.10; EF-hand; 1.
DR   Gene3D; 4.10.800.10; Thyroglobulin type-1; 1.
DR   InterPro; IPR011992; EF-hand-dom_pair.
DR   InterPro; IPR002350; Kazal_dom.
DR   InterPro; IPR036058; Kazal_dom_sf.
DR   InterPro; IPR019577; SPARC/Testican_Ca-bd-dom.
DR   InterPro; IPR000716; Thyroglobulin_1.
DR   InterPro; IPR036857; Thyroglobulin_1_sf.
DR   PANTHER; PTHR13866; SPARC OSTEONECTIN; 1.
DR   PANTHER; PTHR13866:SF17; TESTICAN-1; 1.
DR   Pfam; PF07648; Kazal_2; 1.
DR   Pfam; PF10591; SPARC_Ca_bdg; 1.
DR   Pfam; PF00086; Thyroglobulin_1; 1.
DR   SMART; SM00280; KAZAL; 1.
DR   SMART; SM00211; TY; 1.
DR   SUPFAM; SSF47473; EF-hand; 1.
DR   SUPFAM; SSF100895; Kazal-type serine protease inhibitors; 1.
DR   SUPFAM; SSF57610; Thyroglobulin type-1 domain; 1.
DR   PROSITE; PS51465; KAZAL_2; 1.
DR   PROSITE; PS00484; THYROGLOBULIN_1_1; 1.
DR   PROSITE; PS51162; THYROGLOBULIN_1_2; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00500}; Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00022974};
KW   Heparan sulfate {ECO:0000256|ARBA:ARBA00023207};
KW   Proteoglycan {ECO:0000256|ARBA:ARBA00022974};
KW   Reference proteome {ECO:0000313|Proteomes:UP000265160};
KW   Secreted {ECO:0000256|ARBA:ARBA00022525};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..21
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           22..455
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018208871"
FT   DOMAIN          141..192
FT                   /note="Kazal-like"
FT                   /evidence="ECO:0000259|PROSITE:PS51465"
FT   DOMAIN          324..390
FT                   /note="Thyroglobulin type-1"
FT                   /evidence="ECO:0000259|PROSITE:PS51162"
FT   REGION          59..90
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          383..455
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        60..79
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        438..455
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        362..369
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00500"
SQ   SEQUENCE   455 AA;  51034 MW;  6A86A525BBB1CC88 CRC64;
     MFFFLLPVVA LLLSRETVVS ANNNEKWLGT VAQYKDRSWN RFRDDDYFKS WAPAKSLDQE
     RVAEPGRYGD PRTKAHSSSA KKTVQDGPDA TKDPCLKVHC PPHKICVSHD YQTAICTNHK
     QPGHSVKPRK GSAGHKHRLE AGAHGKCRLC SALQSTPVCG SDGHTYSSEC KLEFQSCLSG
     KKISVKCDGL CPCLPSQELS KPLHNGEKAA CTDTELHSLS ARLKDWFGVL HLDANRDLKS
     SDSFDSTTGH FDTSILPICK DSLGWMFNKL DMNFDLLLDQ SELSAIYLDK YELCMKPLFN
     SCDSFKDGKL SNNEWCYCFQ KPDGLPCQTE KSRIQSQSRR KSLIGTYIPR CTDEGYFKPT
     QCHGSTGQCW CVDKYGNEIA GSRKQGNPNC DEDQETSGDF GSGGAVILLD DQEEEPSQTG
     RSRQKKRRGR IHPRGTIEDD EDEEEDKDDE IGYVW
//
DBGET integrated database retrieval system