Database: UniProt
Entry: W5L4F2_ASTMX
ID   W5L4F2_ASTMX            Unreviewed;      1803 AA.
AC   W5L4F2;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   05-DEC-2018, sequence version 2.
DT   03-JUL-2019, entry version 44.
DE   SubName: Full=Uncharacterized protein {ECO:0000313|Ensembl:ENSAMXP00000014714};
OS   Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC   Characoidei; Characidae; Astyanax.
OX   NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000014714, ECO:0000313|Proteomes:UP000018467};
RN   [1] {ECO:0000313|Ensembl:ENSAMXP00000014714, ECO:0000313|Proteomes:UP000018467}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=female {ECO:0000313|Ensembl:ENSAMXP00000014714};
RA   Jeffery W., Warren W., Wilson R.K.;
RL   Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Ensembl:ENSAMXP00000014714}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=female {ECO:0000313|Ensembl:ENSAMXP00000014714};
RX   PubMed=25329095; DOI=10.1038/ncomms6307;
RA   McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA   Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D.,
RA   O'Quin K.E., Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C.,
RA   Volff J.N., Yoshizawa M., Warren W.C.;
RT   "The cavefish genome reveals candidate genes for eye loss.";
RL   Nat. Commun. 5:5307-5307(2014).
RN   [3] {ECO:0000313|Ensembl:ENSAMXP00000014714}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (FEB-2014) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation
CC       of feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00122}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   -----------------------------------------------------------------------
DR   STRING; 7994.ENSAMXP00000014714; -.
DR   Ensembl; ENSAMXT00000014714; ENSAMXP00000014714; ENSAMXG00000014261.
DR   GeneTree; ENSGT00940000165424; -.
DR   OMA; GMDCPTI; -.
DR   Proteomes; UP000018467; Unassembled WGS sequence.
DR   GO; GO:0048514; P:blood vessel morphogenesis; IEA:Ensembl.
DR   GO; GO:0007155; P:cell adhesion; IEA:InterPro.
DR   GO; GO:0001885; P:endothelial cell development; IEA:Ensembl.
DR   GO; GO:0016203; P:muscle attachment; IEA:Ensembl.
DR   GO; GO:0030903; P:notochord development; IEA:Ensembl.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR010307; Laminin_dom_II.
DR   InterPro; IPR002049; Laminin_EGF.
DR   InterPro; IPR001791; Laminin_G.
DR   Pfam; PF00053; Laminin_EGF; 5.
DR   Pfam; PF02210; Laminin_G_2; 5.
DR   Pfam; PF06009; Laminin_II; 1.
DR   SMART; SM00181; EGF; 5.
DR   SMART; SM00180; EGF_Lam; 5.
DR   SMART; SM00282; LamG; 5.
DR   SUPFAM; SSF49899; SSF49899; 5.
DR   PROSITE; PS01248; EGF_LAM_1; 2.
DR   PROSITE; PS50027; EGF_LAM_2; 5.
DR   PROSITE; PS50025; LAM_G_DOMAIN; 5.
PE   4: Predicted;
KW   Coiled coil {ECO:0000256|SAM:Coils};
KW   Complete proteome {ECO:0000313|Proteomes:UP000018467};
KW   Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00460,
KW   ECO:0000256|SAAS:SAAS00814887};
KW   Laminin EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00460,
KW   ECO:0000256|SAAS:SAAS00580772};
KW   Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW   Repeat {ECO:0000256|SAAS:SAAS00814929}.
FT   DOMAIN       19     68       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN       69    124       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      125    178       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      179    225       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      226    272       Laminin EGF-like. {ECO:0000259|PROSITE:
FT                                PS50027}.
FT   DOMAIN      827   1026       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   DOMAIN     1038   1215       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   DOMAIN     1222   1387       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   DOMAIN     1451   1618       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   DOMAIN     1625   1800       LAM_G_DOMAIN. {ECO:0000259|PROSITE:
FT                                PS50025}.
FT   COILED      384    404       {ECO:0000256|SAM:Coils}.
FT   COILED      589    616       {ECO:0000256|SAM:Coils}.
FT   DISULFID     38     47       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID     95    104       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    150    159       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    162    176       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    198    207       {ECO:0000256|PROSITE-ProRule:PRU00460}.
FT   DISULFID    246    255       {ECO:0000256|PROSITE-ProRule:PRU00460}.
SQ   SEQUENCE   1803 AA;  199292 MW;  961B0A1854F029EC CRC64;
     MQVCARGFFM SEQLTCLPCN CKGHAQTCED ITGICIDCQD HSVGNFCEDC EDGYVLEVFP
     DGRHECRPCA CPISLESNNF AAYCDKKGVV LRCVCQEGYA GHYCERCAPG YYGNPMRVGS
     SCKKCDCNGN TDPNLIFNEC HNVTGQCLHC WDNTAGHNCE RCAPGYYGDA IGAKDCRECQ
     CNKCGTSSCD DRTGVCHCKP GVTGRLCDQC EEGYSGFSSC QGCRRCECAP AGLRATCHPL
     THTCQCQPGA GGRYCERCLP GHWDYTPNGC KKCDCPEGSC DMHTGECLPE TSQVSECNTE
     CDECIWHLIG DVRQSNKTVD QLRNTVLNIS TGAAANDRIK YYNYTALRLQ AQFVGWKNKS
     TVLRRQTGQL EESADTLLTD MELLKQQEED VASLGRRADE ETLQSSTLAE TLMANLTAVN
     VLIEGTAAKT IQLLITHLYI TFCTVYRSGF GSSGAPIFRQ VRQMEKRVVG TEGRVPAVRE
     LMNRFSSKIS NAQLLISKAE DTLQNSYTTH STNQLRLQRL QFQQQRLLEN YGAVNQTMRT
     AKEVYDEADA GVEELDIMNI TAYHAEVDGA QASLQNKIDQ LSEVDYELLE RATDHAEELE
     RHADELKHNL KGSDANGFVQ KALDASNVYD NIVKYINDAN ITSLTTLNQS QRAEDVSDGV
     SLQPHPNWDS FFFFIIPEME STVADTLNYI EETKDMRQSS SNKLEAIVED IKTIQEARQP
     LQLQVTLEAT EVCLNRSTEV LDTVTPIKER VEEWNRNMRN NQYSTHAYEQ AVDSAQDTVA
     DLSEIVPTLL TKLRSVEEAK PINNVTTNIM RIRELIAQAR SVARKVQVSM KFNGQSAVEV
     QPHSNLDELK AVTSISLYVR VDPDKDPIED RFLLYLGDKN GGKDYMGLAI KNDNLVFVYN
     LGGEDVEISL TSKPVSSWPP VFNLVRVERL GRHGKVFLTV PSQETTAEQK FIQKGEADGT
     DSLFHLDPKN TVFIVGGVPP DIKLPPALSL APFVGCIELA TLNNDVISLY NFKKLHMMNV
     VTSVPCPRYK LAFSQGRVSS YLFDGMGYAL VNNIERRGRI GVVTRFDIEV RTVANNGILF
     LMVNQSNFFV LELKNGFLRL VYDFGFANGP IVMESNLAKL QINDARYHEV SVIYHFSKKV
     VLLVDRGFVK SMENEKKPLP FSDIYIGGAP SNLLLSRPEL SSLVGLRGCV KGFQFQKKDF
     NLLEEPGTIG ISSGCPEESF MSRQAYFTGE GYLSSTAKIS PFSSFEGGMN FRTLQSSGLL
     FYYNEGPNEF TLSLENGAVV LHSKGTKVKS QEKNYSDGRP HFLVVSVTKQ KYQIVIDDGD
     KQKQNNLDSS QTENTLKTFF YGGSPYNKIQ NFSGCISYTY ISRQDRDIEA EDFQRYTENG
     NVSLQDCPVE RPPAALMSAD RAQSTAACKV GRDKTGVLQG ILGLQSDPIA APEAEAKPCY
     MIPQSQANTH AHQFSGSAHS RQEYDGVAEG IRERSRFSMS VKTQSAAGVV LYVSDESEEN
     FLSLYLSHGK LFFTFGSGQQ KLRLRTTDTY NDGEWHDISV TRDGAVVKLI VDRRPVVENR
     NFPILLQDPL YVGGVSPGRA LKHIPKTSVS SLLGCVKDLQ LNGRRISSVS CSFGVTPCLE
     GPVESGAYFS EEGGYVVLGN SWGLKFEVVV EVRPRVVSGV LLHVFSSSEE YLTVYLHQGQ
     VTVNTVNSGV GVFSTHVTPQ EAICDGNWHK ITVIRDSNVV QLIVDSEVTH VVGPVSPITS
     NTETTRAPVF IGGAPDSLLP VGVVSRRGFS GCMRNLFVGE SAVDLSKAAL VSGAVSLSSC
     PAA
//