ID A0A096M0Z0_POEFO Unreviewed; 2288 AA.
AC A0A096M0Z0;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 24-JAN-2024, entry version 43.
DE SubName: Full=Chondroitin sulfate proteoglycan 4 {ECO:0000313|Ensembl:ENSPFOP00000025081.1};
GN Name=CSPG4 {ECO:0000313|Ensembl:ENSPFOP00000025081.1};
OS Poecilia formosa (Amazon molly) (Limia formosa).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=48698 {ECO:0000313|Ensembl:ENSPFOP00000025081.1, ECO:0000313|Proteomes:UP000028760};
RN [1] {ECO:0000313|Proteomes:UP000028760}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000028760};
RA Schartl M., Warren W.;
RL Submitted (OCT-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPFOP00000025081.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00122}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AYCK01002338; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AYCK01002339; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR Ensembl; ENSPFOT00000028171.1; ENSPFOP00000025081.1; ENSPFOG00000014126.2.
DR GeneTree; ENSGT00940000154091; -.
DR Proteomes; UP000028760; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR039005; CSPG_rpt.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45739:SF8; TNFR-CYS DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF16184; Cadherin_3; 10.
DR Pfam; PF02210; Laminin_G_2; 2.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR PROSITE; PS51854; CSPG; 10.
DR PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000028760};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2194..2219
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 25..194
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 204..385
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REPEAT 555..650
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 784..877
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 897..990
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1015..1107
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1123..1214
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1236..1338
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1352..1441
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1465..1555
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1693..1792
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REPEAT 1825..1916
FT /note="CSPG"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT REGION 2169..2188
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2169..2183
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2288 AA; 254958 MW; 88E5276BA5736565 CRC64;
LSIFIFRACA LVYILSSAPK SRNRESYFGD SHCNIVAVQD VSTFQLSLNF KTSRRSGLLL
LAAGMEDYLF LELLNGKLQA RMKVEAREII LSSSQEVQLN NLRDHQAFLT LQDGKLTLTI
DGLFSTYVPV DVGGERLNID LGLWLGGTGD LDAPYLSNAI PPFRGCITDV KFESHQFNIL
NSAFKQCQDT KESCSSEFEH TDGEAISLST PDSFVSFPTW SGSTRAPRSL EFLMKTTIDD
ALLVFHHGQT SDFIAIGVVN GYLKGVLDLG YGLKSLENTQ VQLDDDQWHR VKIQVGPDSF
VLTVDSQTSS IPLNSSQKLD LVGNLYFGGI QRKMKEVFSE SGSLNRFEDY ITSESFIGCF
GEIKVNQRDK GLQDALVTKD VHIKCEGDDY DYSTYGEPDG ITTSSPIVDF VTQFAEPQLN
EQECHPTDFE PEVFRNVTKL LNITTLRVPD GEEAYFDINN LRPTFDLRAA GLSQSQIIFT
LIHDPWYGLV DVNVNRNTRK FSLLDVVDKK IKYMHDGNEG TSDNISLYVH VQSNSYLPEC
LKGTQRYSIP VEIIPARASP GSTVELPITK KGRSHLSPSL LKISQSASNC DQLVITVTSA
PSFDFGYLVN SQQPGRRISE FTCRELKDGN IFFIHKAGWT SEMTLKVSDG GSVTQSSTFM
FLAIQPNAII VGTNGFSLSV VQGSNASIGI RNLGVIPHPR NGDILYNITQ PPVLGELRVK
GSDGMYTQVT SFHQFHLDRE LIRYFSTDTD SHENTEVDNI VFDIHLGEFV RPNNLFRVEI
QPPKVKVSYL EPLEAKPGEK QTITKLQAEV QGKTPNPQTI KYILVKSPAH GSLQISDKEL
NEGDMFTQKD ILDNALSYVV QMQGFVDTND QFQFRVSAED QYSPLYSFPI SILAGMDDYL
LTNKHLVVLQ GGELALNKNH LWLQSSVSQD ILYRVTQEPK HGRLIRDTPE LPRFDGAVRY
FSNEDLEHGR LIYKHDGSKT TSDEFHFSAT TQDSDSPKVV MGIFKIQIQF KNEHVPVRIV
DKVFNVVRRG QRLLTTDVIQ FKDDDSNFND TQIVYAREGI LSGNIMSASN PSQPVFRFTQ
ADLRDKNILF VHHGADQERF PLQVSDGFHK TTAMLQIQAG EPYLQLVNNT MIVIDHGSTK
TLDTSLLSAD SNMDIRDDSE IKFEVTSPPS DGRIIVSGIE ASQFTQEDLK KGVVSYEHSY
ESLKAKDSFG FTVRAKELSE KGIFKIKIFK QGYLSEPEVI TNEVIISYEG EHTIISQDHL
KVEQTDILPT DMIFTLKRLP KLGHVVMLKN SSEVAASPVL DYIHSFSQED IDLSRVFYVS
ASLQGSDSFT LDVSNGFSTV EDLTIKVDIM PRFIPIQASN FSIKEGLSRV INKEILNISN
YFYSLANIDF SLEELPHHGE IRNLKGDELS YFTWEEVKFG QVVYMHDSSE TTEDSFTLSA
SSYEIERRSL PVTISVTIIP VNDEPPMVAR NAGLEVLPGE QVTITASVLS TEDADTPAEE
LVYHIDIPTN GVMALKQEPG ESIQNFSQAH INRGEIIFTH EGEESGEFSF TVTDGEHTSP
LYRFVIKARP HTITLETGEE LVVFPGTRQV IKTTNLKVGT SEDGNEASFL LVRAPRLGRL
ILANERNQFV ETSLFSQSQL ESGAVYYEHQ LPTEPFWVVR DSMEFTASSQ ASPDVRHSLP
ITVSYYAAHS NISSQMWRNK GLEIVQGQRK AIDNSILDAS NLLASLPEEK KTDADVVIEI
KRFPDHGRIT LLGEDLPRDN PLFTQSDVSQ GKLEYLHDDS GAAFDSLAFK AYLKLRSGDR
TSPSESVILD EIFHISVKRR GSNPPELVTT DMLLEVLQGS KTSLTQKHLN TQDEDNPPDE
VLFKVTKAPR NGYLFNSVTS EPISKFTQEM INRVEVAFQS DGTLKGGFVE FTISDGEHET
GPHTLHIGIL ARTLLLELVP EIKVRQGDDQ ILVTEEMLRA STGGPVEEEI LYKITSTPKY
AAVMVDRQPT SAFTQKQIKE GRVSVRFVKS TSPRDSVSFT ARSRAANVSS ILNITVQPLA
KIAQNPLLPQ GSLVQLDRKL LDASPLANKT RASPTFTVIQ EPRGARIVAF KDPDAGQPIQ
TFTQKDLDEG RVAMEILDTA SRSKAKGNQD EARFLLKAHG VPPAECVLSF QTSPYNTSAV
YPATLLKTPG DSVTPSSPGS GTHGKPHVSR RGNFWSVFIP ILVVLLLLLL AAVLAYYLIH
KNKTGKHNVQ TVASKPKNGE VATTETFRKT DPANNIPMSN VDSKGADPEL LQHCRTTNPA
LKKNQYWV
//