GenomeNet

Database: UniProt
Entry: A0A3P8P7E1_ASTCA
LinkDB: A0A3P8P7E1_ASTCA
Original site: A0A3P8P7E1_ASTCA 
ID   A0A3P8P7E1_ASTCA        Unreviewed;      2401 AA.
AC   A0A3P8P7E1;
DT   13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT   13-FEB-2019, sequence version 1.
DT   27-MAR-2024, entry version 20.
DE   SubName: Full=Chondroitin sulfate proteoglycan 4-like {ECO:0000313|Ensembl:ENSACLP00000012926.1};
OS   Astatotilapia calliptera (Eastern happy) (Chromis callipterus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC   Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC   Pseudocrenilabrinae; Haplochromini; Astatotilapia.
OX   NCBI_TaxID=8154 {ECO:0000313|Ensembl:ENSACLP00000012926.1, ECO:0000313|Proteomes:UP000265100};
RN   [1] {ECO:0000313|Ensembl:ENSACLP00000012926.1, ECO:0000313|Proteomes:UP000265100}
RP   NUCLEOTIDE SEQUENCE.
RA   Datahose.;
RL   Submitted (MAY-2018) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSACLP00000012926.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (SEP-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00122}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 8154.ENSACLP00000012926; -.
DR   Ensembl; ENSACLT00000013245.1; ENSACLP00000012926.1; ENSACLG00000008822.1.
DR   GeneTree; ENSGT00940000154091; -.
DR   OMA; EELHFMV; -.
DR   OrthoDB; 4072625at2759; -.
DR   Proteomes; UP000265100; Chromosome 12.
DR   Bgee; ENSACLG00000008822; Expressed in muscle tissue and 4 other cell types or tissues.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   CDD; cd00110; LamG; 2.
DR   Gene3D; 2.60.120.200; -; 2.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR039005; CSPG_rpt.
DR   InterPro; IPR001791; Laminin_G.
DR   PANTHER; PTHR45739:SF12; CHONDROITIN SULFATE PROTEOGLYCAN 4-LIKE ISOFORM X2; 1.
DR   PANTHER; PTHR45739; MATRIX PROTEIN, PUTATIVE-RELATED; 1.
DR   Pfam; PF16184; Cadherin_3; 12.
DR   Pfam; PF02210; Laminin_G_2; 2.
DR   SMART; SM00282; LamG; 2.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR   PROSITE; PS51854; CSPG; 12.
DR   PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE   4: Predicted;
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Membrane {ECO:0000256|SAM:Phobius}; Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   SIGNAL          1..30
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           31..2401
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5018069290"
FT   TRANSMEM        2273..2296
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          28..202
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   DOMAIN          212..391
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   REPEAT          436..531
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          562..655
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          672..770
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          904..998
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1030..1122
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1138..1228
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1250..1349
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1367..1458
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1482..1572
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1589..1689
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1714..1813
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REPEAT          1847..1939
FT                   /note="CSPG"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU01201"
FT   REGION          2303..2333
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2306..2333
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2401 AA;  269283 MW;  97A612AD95308076 CRC64;
     MDSVGKNTQK KKLWLCGLLW WSLLVHLASG ASFYGDGFVQ LKARQSSDRN VLRIRFRTSS
     TNGLLFLAAG QTHYFLLELH TGRLQLKLDL GSGEQMLESD RGTHLNDLAW HHVEVHHVQL
     NITLTVDSNS HSSVKVPGPH YSLNTDGLYV GGSGGLDRPY LPRDPVCFRG CMDDVLFNER
     DLLSSLRPYT GYKNVHEVSL GCSPQFFATE EDPISFFSSR AYISLPTWIT EQEAAFECVM
     HTSAKEGIIL YNAAREEKFV ALEIQDGLLV AVVGTGETKT ELRSLTLIND RKWHSIKLYF
     SSKSLELTID GEMVKSGIIP HPKALQLKGY LFVGGIDDGT RSEVKKMGLV SISGKRARGG
     SFKGCLKNIA VNRVKMGLSN AVVTKDISVG CEPEKESDPP TALNPTNTPG SLFHLITPTP
     SIDMSTLAWG LDRRYGFNFV QLKNLIVQEG GRASLEAKHI KVLLDFRKLG IRQSQIMFRI
     EEQPLRGQIR LDVDQDQEEN TFSMLDLWHE RVMYIHGGSE EPDDFFMFSI FSSSRKQVPD
     YLKGTKLYRY NISVTSTNDA PELSLPQGNL FVLLENSKKR LTTDVLMATD IDSNYTDLFF
     SVLGNLNADA GYLEVESNPG KAVTSFSYID LDELRVYYVH TGVRNSRIVL RVSDGEKVSN
     TVVLRIMAVA LEYKIANNTG LDIIQGETAI IGSKQLALQT NAVNQVVDIR YDVIEPPQYG
     ELQRLHSSGE WRHTDSFSQR LLEKERLRYV STFQDIQTSN PTDYFKCKVN VAARETTEIL
     FPITVKWVKY HLVRNTMIDL DKVRKVTLNS ECLFATTEGV TLSEDDLYFR VLTTPKKGKL
     LLNTIVLKKN STFSQRDVTD LKVHYELVDR PYQDTTDRFK FHLVSKHAQS QNYDFQFSIK
     ADVNSVFIRN AGLSLQEGES KLITKDELFA ETLSTKDMFY TVINSPKHGK LVRISQSNSN
     ASYDNILTFS NKDILEERII YVHDDSETTQ DEFTFIASTT QGFKPFIAED EPGSKESVFN
     ISIQLLNDQK PVRVIDKVFH VVRDGQKLLT TEDLQYQDAD SDFDDGQLVY TRRGIPMGDL
     VLVNDTTYRL FQFRQKDLEE KRVLFIHKGV SSGRFVLFVS DGKHFVSGLL HISAHDPFLK
     VDNNTGLLVQ KGHSVVFSTS NYSVMSNLDI RDDKEVIFKL DDGPKHGSLY RNETTVVTFT
     QADLKAGLIR YQHNDSKYLT DYFNITVKAK SLQLTSRVNV KVYLESHQRP PIVQHHDTLL
     VEEGKPAKID ETKLEVTHED NLPSEIVFTV KVAPSYGFLR RFVEAEERYI GTKQSPVNTF
     TQNDINSGNI QYVQVEPNKV NDTFILDATN GVTDVTNIKM FVDIIPLLIP LQVSNITLNE
     GAAKALTQDV LKVTNRHFSG INFFYNLTQP PQHGHIEHSR HPGVAITTFT RRQVEHEFIY
     YVHDSSETLA DNFTLVANDT SLRKQSAAQM VHIQVIPAND EPPVIITNRV LRVWVSSVTE
     ITLDDLSVQD QDTPPEELHF MVTPPSNGHL ALKSAPMKAV LNFTQAHIDQ GQLVFVHKGA
     MSGGFNFQAN DGVNFTPRQI FSITAKALAL SLEKSQPLKV FPGSSRPITN EYLQAVTNDM
     SNTSNRVITF SVTRHPKLGR LVMRQPNNST ADISTFTQDM VDRKEVFYIQ TPVESVGWEA
     MDSMTFSVAS PPASVDSLTF RFDISYENTG PEHNTILLAN TGAEVTEGES VIIDESKLDA
     TNLMSKLPTP QRSSYEVWFQ VTSLPQHGVI VVGERNLTKE KPNFSQFIIK KYGITYKHDN
     SETTQDSFIF SAWLNPKGKT AQRPEDDSDV VVEHFNITVI PVNDQPPLLK TKAPSLKVVQ
     GDTIALGPEN LKIEDLDNPP DDIKFSVISK PSNGYLALKG SLNESIVAFT QAQINNRSVC
     FVHDGSSVSG AFYFSVTDGY HKPIYKLFNL EVSEITITLV KNTGLELQQG RTLVSLTEDN
     LAAETNGKNV TVYYQITRPL RFGKLLRDNR EVTLFDQEDL QTGRLSYHMI SLSSVEDSFE
     FTAFTSEANL TDQVLNITVK PLLQTGKGLR IPNGITVKFN VNFLNASELA NISDSDPVFE
     VLSHPRYGKV VRPKLKMTIK AIPVESFTFQ EIMQAKVALE LNANMTGVEE LNDSVVFVLK
     ADSVQPAKGE LHFTVVPYDP AFFPPTKSPV SATSSVPQAS IKTAFPVLST VFLSSQQPSI
     NQQKFRGRNR WGNSNRTSVF GTTLGKPSQT EEFLFKNTPV RVESYPQKTS NPLLIILPLL
     ALLLLVIIFV VLVVFLRHHR QRKQSTATAQ EQTSTGLPNS SSYQGQTQRS PAVPTVTVTP
     LNHNCPGSPV LDRLLTPNQG TTYNAIDSNM VVSSWSNGSP VTSCQMIRTA TPTLQKNQYW
     V
//
DBGET integrated database retrieval system