ID W5LIR7_ASTMX Unreviewed; 2609 AA.
AC W5LIR7;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 55.
DE SubName: Full=Si:ch211-186j3.6 {ECO:0000313|Ensembl:ENSAMXP00000019729.2};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000019729.2, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000019729.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 7994.ENSAMXP00000019729; -.
DR Ensembl; ENSAMXT00000019729.2; ENSAMXP00000019729.2; ENSAMXG00000019158.2.
DR eggNOG; KOG3594; Eukaryota.
DR GeneTree; ENSGT00940000164020; -.
DR HOGENOM; CLU_243436_0_0_1; -.
DR InParanoid; W5LIR7; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000019158; Expressed in brain and 2 other cell types or tissues.
DR GO; GO:0005886; C:plasma membrane; IEA:InterPro.
DR GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR CDD; cd11304; Cadherin_repeat; 17.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00110; LamG; 2.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 2.60.40.60; Cadherins; 17.
DR Gene3D; 2.10.25.10; Laminin; 1.
DR InterPro; IPR002126; Cadherin-like_dom.
DR InterPro; IPR015919; Cadherin-like_sf.
DR InterPro; IPR020894; Cadherin_CS.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR24025:SF23; CADHERIN-4; 1.
DR PANTHER; PTHR24025; DESMOGLEIN FAMILY MEMBER; 1.
DR Pfam; PF00028; Cadherin; 13.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF02210; Laminin_G_2; 2.
DR PRINTS; PR00205; CADHERIN.
DR SMART; SM00112; CA; 16.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00282; LamG; 2.
DR SUPFAM; SSF49313; Cadherin-like; 17.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00232; CADHERIN_1; 8.
DR PROSITE; PS50268; CADHERIN_2; 16.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE 4: Predicted;
KW Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW ProRule:PRU00043};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..2609
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017325128"
FT DOMAIN 33..131
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 251..352
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 355..453
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 454..554
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 564..656
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 657..763
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 767..872
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 873..972
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 973..1084
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1085..1199
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1200..1301
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1302..1408
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1411..1524
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1525..1632
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1642..1738
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 1738..1855
FT /note="Cadherin"
FT /evidence="ECO:0000259|PROSITE:PS50268"
FT DOMAIN 2100..2139
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2140..2341
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 2344..2383
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2386..2549
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REGION 214..258
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2556..2590
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 2129..2138
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2373..2382
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 2609 AA; 286049 MW; 033B21A83B10A148 CRC64;
MFWTLLLLSC CATLALGNGT RDAERVLFFH GHVFENSPVG SRVNGLSIPA RRVGAEPGAR
LRLLGDGSEA FRAFAHHKRG HVLLKTAAVL DRERRSGYVL GLNSGAAGAA ASPVASVRVD
VLDTNDHKPT FRHRAVTLAL DDATALRSVV HRVAAEDTDS GKNAELTYFA LPRNGSFYVV
PKTGDVLLVD SILGLATPVR FTVFARDRGW PSRTSPGVEI EVRPRQARPS VPVTKNTRNP
TKSRRSVHEP PEPPALVSVS EDAAVGSVIM SLSPARFPAA TFELLLQEEA DRDPPVAVNR
DSGELVISRS LDRETEPLVE VTVRVQDKRG PDWYLVRVEL TVLDVNDNAP EWTMVPVPYL
AVVSPSSPAS TLVYKLQARD GDEGINGEVE YFLSDGGDGR FDVDRKTGHV RTTGLPLQRD
REYLLSVVAA DRLGSRSPPA VVSVVAGPRA PQFTNASYTI SIPENTPEGQ AFMVTPALSF
QKQPISYSLL INPSSLFSIQ QETGEISLTR DIDYETDQHR YLLLVRASES KDSLSSAAEV
RVIITDENDC VPEFLQSIYS KDGVPETVTT ATSLLQVSAS DCDSEQNADI TYYTLSSDFI
ISPHGTIFPA GPLDYERPNH LYEFVVMAVD KGEVPRTGTA TVRLRMANVN DEPPEFSQPV
YRTFVSEDAG PNTLVATVLA KDPDGDGIMY KISSGNEEGN FVIDSQKGLI RLRSSPPPKL
QGVEYVLNVT ATDDNASGGP QSLSTTAQVI VGVDDVNNNK PIFEKCHQYK ERASVAENKP
AGTFVLQVHA VDADEGANGK VTYGFMHKDS TVPAFNIDPE TGAIVTARKF DRERQREYAV
TVTATDQAAD PLIGICQLNI LILDENDNSP KFENLRYEYF LREDTMIGTS FLRVAAHDDD
YSTNAAITYS MSKEQPEYLR VNPVTGWVYV NQPISQRAYI TREIIATDGG NQSSSVELSV
TITNVKNQPP QWEKDSYEVV IPENTVRDTP VVTVKATSPL GDPRVTYNLE DGMVPETNMP
VRFYLKPNRE DGSASILVAE PLDYETTRNF LLRVRAQNVA AVPLGAFTTV YVNLTDVNDN
VPFFTSSIYE ASVTEGAEIG TLVLQVSAND LDLGLNGKIS YSLLNDRSGD YQYFRIDPEL
GSIYTEAVFD RETKGSYLLE VKSTDSWESA RPGRHGQPNS DTAYVRIFIS DVNDNKPVFS
QTLYEVDVDE DADVGSTILT VSANDEDEGA NAKLRYQITS GNTGGVFDVE PEVGTIFIAQ
PLDYEQTKRY KLHILASDGK WEDYTAVVVN VVNKNDEAPV FSVNEYYGSV TEELDGSPVF
VLQVTATDPD KDADQEALRY SLHGQGAESE FIIDEVTGKI YAQRTLDREA RAVWRFVVLA
TDEGGEGLTG FTDVIISVWD INDNAPIFAC APDSCHSEVA ENSASGTSVM EMTATDLDDA
AVGQNAVLAY RVLSNLALNG GNNGAEMFTI NPATGTVSVA MSGLDREHIE SYVLVVEARD
GGGMSGTATA TIHVKDVNDH APRFLDRSCS ARIPESSEQN AAVLELAAED ADAGENGQLT
FSIVAGDPEQ KFYMVSHRQE QRGTLRLKKR LDYEKPGEQS FNLTIKVEDL DYSSLLHCTL
EIKDCNDHAP VFIPHFLQLP ALREDIPVGT SVAMVVASDS DSGLNREITY TIAPESDPFD
LFLVDQSGLV TVAGQLDREQ ASQHHLVVLA TDHGSPPLTG TATIQLSLLD VNDNGPEFES
TYSPVVWENV AGPQVVRLNA SSTLLRVIDR DSVENGSPFS FSVPPEYRYS NDFHLQDNEN
DTATVTALRA FDRERQKQFL LPVIMTDSGK PPKTVTSTLT ITIGDKNDHA HLPGEKKIYI
NSHRGRMPTT VLGKVYAPDP DDWDNKTYAF EGHVPNYFIL NKRTGFLVIK ENAPPGMYEF
QVRVSDEEWP DAVSTVIVRV RELRDDIIYN SASLRIADIT AKEFMERRGG LRSRYELLGD
FLSEMLSVGP DDINIFSLVE VRERTVDVRF SVHSALFLRA ERIHGYLAAH KQKLQSFLQV
NVTQVHVDEC AAADCGGGGG CSTRLSVSDR PTVVDSGSMA LVSVTLEAAA VCSCSAREHL
HQGCSTYPWN PCHNGGVCVD TQSGYRCQCP AQFEGPECQQ TKHSFHGNGY AWFPPIRPCF
ESHLSLEFIT EVADGLLLYS GPLAQLQPWE PEDFMAIELI DGTPTLKINH GSGTLVLQLP
GNVNVADRRW HRLDVRSNSK DVRFTLDRCA GATVMEMEGV GSWLTTEDHT SCEVTGVTPN
LDRHLNVTQV LQLGGVNENL PYIYPQLQHK HFTGCIRNLV VDSKLYDLGS PADASGSSPG
CLMTDSSCVN MGFPSCGTRG RCHGEWGSFS CQCIAGYSGH QCEQEVPEYS FDGRSHVHYQ
ISGPLPPRHM QVQVLIRTRK HSSSILSLLS SQQSEYLRLE IFQGLLAVFY NLGDGDFNLT
IPSHRLDNGE WHELYLDRHD NEMTLRVDGG GGRREVTGSP GRSREIVIDP AMVMLGSSFP
IAHNKSFQGV PRVTWSRPTP RASSACTRCV PAAPATVAPA WPSRPPNSPA TARRATVGGT
ARSRWPSTAT TWASASARSS PYASASWLC
//