GenomeNet

Database: UniProt
Entry: W5LIR7_ASTMX
LinkDB: W5LIR7_ASTMX
Original site: W5LIR7_ASTMX 
ID   W5LIR7_ASTMX            Unreviewed;      2609 AA.
AC   W5LIR7;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 2.
DT   27-MAR-2024, entry version 55.
DE   SubName: Full=Si:ch211-186j3.6 {ECO:0000313|Ensembl:ENSAMXP00000019729.2};
OS   Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC   Characoidei; Characidae; Astyanax.
OX   NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000019729.2, ECO:0000313|Proteomes:UP000018467};
RN   [1] {ECO:0000313|Proteomes:UP000018467}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA   Jeffery W., Warren W., Wilson R.K.;
RL   Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000018467}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX   PubMed=25329095; DOI=10.1038/ncomms6307;
RA   McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA   Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA   Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA   Yoshizawa M., Warren W.C.;
RT   "The cavefish genome reveals candidate genes for eye loss.";
RL   Nat. Commun. 5:5307-5307(2014).
RN   [3] {ECO:0000313|Ensembl:ENSAMXP00000019729.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 7994.ENSAMXP00000019729; -.
DR   Ensembl; ENSAMXT00000019729.2; ENSAMXP00000019729.2; ENSAMXG00000019158.2.
DR   eggNOG; KOG3594; Eukaryota.
DR   GeneTree; ENSGT00940000164020; -.
DR   HOGENOM; CLU_243436_0_0_1; -.
DR   InParanoid; W5LIR7; -.
DR   Proteomes; UP000018467; Unassembled WGS sequence.
DR   Bgee; ENSAMXG00000019158; Expressed in brain and 2 other cell types or tissues.
DR   GO; GO:0005886; C:plasma membrane; IEA:InterPro.
DR   GO; GO:0005509; F:calcium ion binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   GO; GO:0007156; P:homophilic cell adhesion via plasma membrane adhesion molecules; IEA:InterPro.
DR   CDD; cd11304; Cadherin_repeat; 17.
DR   CDD; cd00054; EGF_CA; 2.
DR   CDD; cd00110; LamG; 2.
DR   Gene3D; 2.60.120.200; -; 2.
DR   Gene3D; 2.60.40.60; Cadherins; 17.
DR   Gene3D; 2.10.25.10; Laminin; 1.
DR   InterPro; IPR002126; Cadherin-like_dom.
DR   InterPro; IPR015919; Cadherin-like_sf.
DR   InterPro; IPR020894; Cadherin_CS.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR   InterPro; IPR001791; Laminin_G.
DR   PANTHER; PTHR24025:SF23; CADHERIN-4; 1.
DR   PANTHER; PTHR24025; DESMOGLEIN FAMILY MEMBER; 1.
DR   Pfam; PF00028; Cadherin; 13.
DR   Pfam; PF00008; EGF; 1.
DR   Pfam; PF02210; Laminin_G_2; 2.
DR   PRINTS; PR00205; CADHERIN.
DR   SMART; SM00112; CA; 16.
DR   SMART; SM00181; EGF; 2.
DR   SMART; SM00179; EGF_CA; 2.
DR   SMART; SM00282; LamG; 2.
DR   SUPFAM; SSF49313; Cadherin-like; 17.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR   PROSITE; PS00010; ASX_HYDROXYL; 1.
DR   PROSITE; PS00232; CADHERIN_1; 8.
DR   PROSITE; PS50268; CADHERIN_2; 16.
DR   PROSITE; PS00022; EGF_1; 2.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS50026; EGF_3; 2.
DR   PROSITE; PS50025; LAM_G_DOMAIN; 2.
PE   4: Predicted;
KW   Calcium {ECO:0000256|ARBA:ARBA00022837, ECO:0000256|PROSITE-
KW   ProRule:PRU00043};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..17
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           18..2609
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5017325128"
FT   DOMAIN          33..131
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          251..352
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          355..453
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          454..554
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          564..656
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          657..763
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          767..872
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          873..972
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          973..1084
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          1085..1199
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          1200..1301
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          1302..1408
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          1411..1524
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          1525..1632
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          1642..1738
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          1738..1855
FT                   /note="Cadherin"
FT                   /evidence="ECO:0000259|PROSITE:PS50268"
FT   DOMAIN          2100..2139
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          2140..2341
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   DOMAIN          2344..2383
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          2386..2549
FT                   /note="Laminin G"
FT                   /evidence="ECO:0000259|PROSITE:PS50025"
FT   REGION          214..258
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2556..2590
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        2129..2138
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        2373..2382
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   2609 AA;  286049 MW;  033B21A83B10A148 CRC64;
     MFWTLLLLSC CATLALGNGT RDAERVLFFH GHVFENSPVG SRVNGLSIPA RRVGAEPGAR
     LRLLGDGSEA FRAFAHHKRG HVLLKTAAVL DRERRSGYVL GLNSGAAGAA ASPVASVRVD
     VLDTNDHKPT FRHRAVTLAL DDATALRSVV HRVAAEDTDS GKNAELTYFA LPRNGSFYVV
     PKTGDVLLVD SILGLATPVR FTVFARDRGW PSRTSPGVEI EVRPRQARPS VPVTKNTRNP
     TKSRRSVHEP PEPPALVSVS EDAAVGSVIM SLSPARFPAA TFELLLQEEA DRDPPVAVNR
     DSGELVISRS LDRETEPLVE VTVRVQDKRG PDWYLVRVEL TVLDVNDNAP EWTMVPVPYL
     AVVSPSSPAS TLVYKLQARD GDEGINGEVE YFLSDGGDGR FDVDRKTGHV RTTGLPLQRD
     REYLLSVVAA DRLGSRSPPA VVSVVAGPRA PQFTNASYTI SIPENTPEGQ AFMVTPALSF
     QKQPISYSLL INPSSLFSIQ QETGEISLTR DIDYETDQHR YLLLVRASES KDSLSSAAEV
     RVIITDENDC VPEFLQSIYS KDGVPETVTT ATSLLQVSAS DCDSEQNADI TYYTLSSDFI
     ISPHGTIFPA GPLDYERPNH LYEFVVMAVD KGEVPRTGTA TVRLRMANVN DEPPEFSQPV
     YRTFVSEDAG PNTLVATVLA KDPDGDGIMY KISSGNEEGN FVIDSQKGLI RLRSSPPPKL
     QGVEYVLNVT ATDDNASGGP QSLSTTAQVI VGVDDVNNNK PIFEKCHQYK ERASVAENKP
     AGTFVLQVHA VDADEGANGK VTYGFMHKDS TVPAFNIDPE TGAIVTARKF DRERQREYAV
     TVTATDQAAD PLIGICQLNI LILDENDNSP KFENLRYEYF LREDTMIGTS FLRVAAHDDD
     YSTNAAITYS MSKEQPEYLR VNPVTGWVYV NQPISQRAYI TREIIATDGG NQSSSVELSV
     TITNVKNQPP QWEKDSYEVV IPENTVRDTP VVTVKATSPL GDPRVTYNLE DGMVPETNMP
     VRFYLKPNRE DGSASILVAE PLDYETTRNF LLRVRAQNVA AVPLGAFTTV YVNLTDVNDN
     VPFFTSSIYE ASVTEGAEIG TLVLQVSAND LDLGLNGKIS YSLLNDRSGD YQYFRIDPEL
     GSIYTEAVFD RETKGSYLLE VKSTDSWESA RPGRHGQPNS DTAYVRIFIS DVNDNKPVFS
     QTLYEVDVDE DADVGSTILT VSANDEDEGA NAKLRYQITS GNTGGVFDVE PEVGTIFIAQ
     PLDYEQTKRY KLHILASDGK WEDYTAVVVN VVNKNDEAPV FSVNEYYGSV TEELDGSPVF
     VLQVTATDPD KDADQEALRY SLHGQGAESE FIIDEVTGKI YAQRTLDREA RAVWRFVVLA
     TDEGGEGLTG FTDVIISVWD INDNAPIFAC APDSCHSEVA ENSASGTSVM EMTATDLDDA
     AVGQNAVLAY RVLSNLALNG GNNGAEMFTI NPATGTVSVA MSGLDREHIE SYVLVVEARD
     GGGMSGTATA TIHVKDVNDH APRFLDRSCS ARIPESSEQN AAVLELAAED ADAGENGQLT
     FSIVAGDPEQ KFYMVSHRQE QRGTLRLKKR LDYEKPGEQS FNLTIKVEDL DYSSLLHCTL
     EIKDCNDHAP VFIPHFLQLP ALREDIPVGT SVAMVVASDS DSGLNREITY TIAPESDPFD
     LFLVDQSGLV TVAGQLDREQ ASQHHLVVLA TDHGSPPLTG TATIQLSLLD VNDNGPEFES
     TYSPVVWENV AGPQVVRLNA SSTLLRVIDR DSVENGSPFS FSVPPEYRYS NDFHLQDNEN
     DTATVTALRA FDRERQKQFL LPVIMTDSGK PPKTVTSTLT ITIGDKNDHA HLPGEKKIYI
     NSHRGRMPTT VLGKVYAPDP DDWDNKTYAF EGHVPNYFIL NKRTGFLVIK ENAPPGMYEF
     QVRVSDEEWP DAVSTVIVRV RELRDDIIYN SASLRIADIT AKEFMERRGG LRSRYELLGD
     FLSEMLSVGP DDINIFSLVE VRERTVDVRF SVHSALFLRA ERIHGYLAAH KQKLQSFLQV
     NVTQVHVDEC AAADCGGGGG CSTRLSVSDR PTVVDSGSMA LVSVTLEAAA VCSCSAREHL
     HQGCSTYPWN PCHNGGVCVD TQSGYRCQCP AQFEGPECQQ TKHSFHGNGY AWFPPIRPCF
     ESHLSLEFIT EVADGLLLYS GPLAQLQPWE PEDFMAIELI DGTPTLKINH GSGTLVLQLP
     GNVNVADRRW HRLDVRSNSK DVRFTLDRCA GATVMEMEGV GSWLTTEDHT SCEVTGVTPN
     LDRHLNVTQV LQLGGVNENL PYIYPQLQHK HFTGCIRNLV VDSKLYDLGS PADASGSSPG
     CLMTDSSCVN MGFPSCGTRG RCHGEWGSFS CQCIAGYSGH QCEQEVPEYS FDGRSHVHYQ
     ISGPLPPRHM QVQVLIRTRK HSSSILSLLS SQQSEYLRLE IFQGLLAVFY NLGDGDFNLT
     IPSHRLDNGE WHELYLDRHD NEMTLRVDGG GGRREVTGSP GRSREIVIDP AMVMLGSSFP
     IAHNKSFQGV PRVTWSRPTP RASSACTRCV PAAPATVAPA WPSRPPNSPA TARRATVGGT
     ARSRWPSTAT TWASASARSS PYASASWLC
//
DBGET integrated database retrieval system