ID A0A3P9CIT9_9CICH Unreviewed; 2335 AA.
AC A0A3P9CIT9;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE SubName: Full=Chondroitin sulfate proteoglycan 4 {ECO:0000313|Ensembl:ENSMZEP00005022213.1};
OS Maylandia zebra (zebra mbuna).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Maylandia; Maylandia zebra complex.
OX NCBI_TaxID=106582 {ECO:0000313|Ensembl:ENSMZEP00005022213.1, ECO:0000313|Proteomes:UP000265160};
RN [1] {ECO:0000313|Ensembl:ENSMZEP00005022213.1, ECO:0000313|Proteomes:UP000265160}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25186727; DOI=10.1038/nature13726;
RA Brawand D., Wagner C.E., Li Y.I., Malinsky M., Keller I., Fan S.,
RA Simakov O., Ng A.Y., Lim Z.W., Bezault E., Turner-Maier J., Johnson J.,
RA Alcazar R., Noh H.J., Russell P., Aken B., Alfoldi J., Amemiya C.,
RA Azzouzi N., Baroiller J.F., Barloy-Hubler F., Berlin A., Bloomquist R.,
RA Carleton K.L., Conte M.A., D'Cotta H., Eshel O., Gaffney L., Galibert F.,
RA Gante H.F., Gnerre S., Greuter L., Guyon R., Haddad N.S., Haerty W.,
RA Harris R.M., Hofmann H.A., Hourlier T., Hulata G., Jaffe D.B., Lara M.,
RA Lee A.P., MacCallum I., Mwaiko S., Nikaido M., Nishihara H.,
RA Ozouf-Costaz C., Penman D.J., Przybylski D., Rakotomanga M., Renn S.C.P.,
RA Ribeiro F.J., Ron M., Salzburger W., Sanchez-Pulido L., Santos M.E.,
RA Searle S., Sharpe T., Swofford R., Tan F.J., Williams L., Young S., Yin S.,
RA Okada N., Kocher T.D., Miska E.A., Lander E.S., Venkatesh B., Fernald R.D.,
RA Meyer A., Ponting C.P., Streelman J.T., Lindblad-Toh K., Seehausen O.,
RA Di Palma F.;
RT "The genomic substrate for adaptive radiation in African cichlid fish.";
RL Nature 513:375-381(2014).
RN [2] {ECO:0000313|Ensembl:ENSMZEP00005022213.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00122}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 106582.ENSMZEP00005022213; -.
DR Ensembl; ENSMZET00005022955.1; ENSMZEP00005022213.1; ENSMZEG00005016645.1.
DR GeneTree; ENSGT00940000154091; -.
DR Proteomes; UP000265160; LG7.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR039005; CSPG_rpt.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45739:SF8; TNFR-CYS DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF16184; Cadherin_3; 11.
DR Pfam; PF02210; Laminin_G_2; 2.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR PROSITE; PS51854; CSPG; 10.
DR PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000265160};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2183..2206
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 1..158
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 168..352
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REPEAT 384..481
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 612..711
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 729..824
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 844..938
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 970..1062
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1078..1177
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1392..1482
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1497..1599
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1628..1727
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1815..1904
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REGION 2065..2094
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2106..2167
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2289..2335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2106..2120
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2298..2328
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2335 AA; 258464 MW; C9B94513811493BD CRC64;
TSIQTLIHVR FRTSSQSGLL LLAAGHTDFM LLELISGYLQ VRLDLGSGEH SLHSEKGIHL
SDLAWHTVDL THTRHNITMT VDQNSHTGLQ IQGPDLELGV EDGLLVGGTA GLKHLYLHNI
SSGFRGCMDE VVFNQHDLLS SLRPHSGYKT VHEVSLGCSP QFSATEEDPV SFFSSEAFMS
LPPWDVLQEG VFECELHPAA KEEEDGIVLY SSDNQGGFVA IEIVRGHLVA SVGGGEGSKT
ELHSLTNVNS NHTWYTIQLH LLPHSVQLKV GKELVKASLS PEIQVLQLTG PLFVGGLDEA
ARGQARRARL ISVPSGGEGG GSFKGCLSEI RVNTQKTGLP HATVTKDVTV GCRTGQAPQR
ASLTSPTDLP GVDITTAQTN AKRSPNFLML RKLEVSEGAA PIYNSLWTLC QVNLDFRKLG
IHPSQFMFRV EEQPVHGQLR LDLRIEEKDR TFSMLDLWQG RVMYVHSGSE DQSDFFTFSV
FSSNKKELPV FLKGNRLHQF EISISAVNDA PVLSLPEGNY FTLLEKALDP DSSPEELVFS
SLGNLNTEAG YLEHQDYPGR PINLFSLNDL EEGKISFIHT GVSTSRLALR VSDGQKVSKI
AAFLRIITVP LEDKLVNNTG LEVNQGEASV ITTNHLAVQV NVADPTVEIW YNVTEFPQYG
ELQRLHSSGE WKPATSFSQK LLEKERIRYL STYRGLQMQN NITDHFKFKV SIGSLAKQEA
VFPIAVRWIH FKITRSKMEL NGVQAAVLTP EDLHVISKGV KLSESDLHFR MVTVPKKGQL
LLNNKALQRN STFSQRNISD GLVRYELINR QHDDTRDTFS FQVFSAHSNS TTYDFRINIK
AESTTVTMIS KGLSVMEGGS KVITRDILFT HTASNREVLY NIIESPRHGH IRRINLSNST
SINDNIMAFT NQDITEERIM YVHDDSETKQ DSFTFQIVVY KAHKHMNKKE DGNGEQHTFN
ISVQLVNDQR PLRVVDKVFH VAREGQRLVT LSDLRYRDDD SDFEDSWLVY TRRGIPMGEL
VLASDPSHKL YEFTQRDLEQ KKVLFVHKGV SFGRFVLFVS DGKHYVSTLL EVMAQDPYLQ
VENNTGIMVE QGGITTLTSA NLSVFSNLDI RDPHEVTFEV FIPPKHGVIC FIDRESGIIT
DADAISIFTQ RDLIAGRLVY RHDGSHKLSD SFNVTARARE RSTERQVERG KREVHLDIGV
SVKIYLESHQ RPPTVRNNRP VVVEEGQNIS ISRDNLEVSK TKILTYAGKF SDYKICLVSY
SQSTPTFTQE DLNQGLVIYQ QQTTGSTNDS VLLEATNGVT KVGPIRLEID IIPILLPLQV
KRSNGGQLTK PESIASHHFS GMNFLYQYIS YIHDGSDTLR DNFTIVANQT ETRKHSLPCA
VHINITPVND ETPVITTNHG LKVWVGSVTE ITTGDLSAED TDTPPEGLEF IVTPPSNGHL
ALKSAPSRHI LNFTQSHIES KQLVFVHNGA PSGGFHFQVN DGLNFAPRQI FSTTAHSLIL
TLITFFVYPE GSVTPISDKE LQAVTNIADG NIRRNQSVVF AVTSPPKLGR LVRRMPDNST
RNVSTFTQSM LDDGVILYDQ NKPESVGWSA ADTFSFTASL PPASLPPHTF TILISYQANE
HHDSSQHKTR LINNAGAVVA EGGRVTIDRS KLDASNLLGK IPESHRKDHH IMYRVISLPR
YGILSIRGHN LTRNQPDFSQ VTLNKFGITY FHDDSETTSD SFTFRAWVAP LDLPSSSSSS
SLSAFSSDSS SSSSSFSSLY SASSSPFSSL DTVSHHHVKD TTGVTEMFNI TVTPVNDQPP
LIRSRLFSLE VIKPSVSIVN NTGLSLIQGR TAVVLTTNQL AAQTNGQSRA TVSYTVTTPP
RHGRIAINDQ EVTTFRHEDL QFGRVVYHMT DLSESEDTFQ MSVSASSPGI NYGNVTATVK
VTVRPLIYLR EPVRVPSGIA VKLGKAMIDA SELVRISRAN PVFEVLSPPK HGKLVKVKPF
VVMSRASEVL KSFTFRDVVQ GRVAIEENLS DSDNQLHNNE STLTTAQVHA PPAKGELHFT
ILPHHQMRHG PGGFNKIDKI GREHTTARLP AHNRTTAGRA GREGGRGGST AHGEVTVGLH
PHILSHHNRT HHKLRPHNRL GNHTRNGSPG GRSKSSVEGA GGGHSHAPQP HTPSIPEKHG
PENPPDTQLI HVEVLPRPAS DPLLIILPLL ACLLLIIILI VLILAFRHRK EKQARLRLVQ
ALAAVPVPNE DSPYLGRPER SVAMPSVMVT PLGSASCPTS PRVPTSPRRS LAPGMTFWGP
FEADLAGGDR NIRGCNSNER DNKTSSNPLP RTTGFRTSVR SRSPTPTLKE GQYWV
//