ID W5KHS8_ASTMX Unreviewed; 970 AA.
AC W5KHS8;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 53.
DE SubName: Full=EGF like, fibronectin type III and laminin G domains {ECO:0000313|Ensembl:ENSAMXP00000007140.2};
GN Name=EGFLAM {ECO:0000313|Ensembl:ENSAMXP00000007140.2};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000007140.2, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000007140.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; W5KHS8; -.
DR STRING; 7994.ENSAMXP00000007140; -.
DR Ensembl; ENSAMXT00000007140.2; ENSAMXP00000007140.2; ENSAMXG00000006956.2.
DR eggNOG; KOG0613; Eukaryota.
DR eggNOG; KOG3509; Eukaryota.
DR GeneTree; ENSGT00940000158504; -.
DR HOGENOM; CLU_013380_1_0_1; -.
DR InParanoid; W5KHS8; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000006956; Expressed in camera-type eye and 1 other cell type or tissue.
DR GO; GO:0005604; C:basement membrane; IEA:UniProt.
DR GO; GO:0042995; C:cell projection; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00063; FN3; 2.
DR CDD; cd00110; LamG; 3.
DR Gene3D; 2.60.120.200; -; 3.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR001791; Laminin_G.
DR PANTHER; PTHR15036:SF88; PIKACHURIN; 1.
DR PANTHER; PTHR15036; PIKACHURIN-LIKE PROTEIN; 1.
DR Pfam; PF00041; fn3; 1.
DR Pfam; PF00054; Laminin_G_1; 1.
DR Pfam; PF02210; Laminin_G_2; 2.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00179; EGF_CA; 2.
DR SMART; SM00060; FN3; 2.
DR SMART; SM00282; LamG; 3.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 3.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR PROSITE; PS00022; EGF_1; 3.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 3.
DR PROSITE; PS50853; FN3; 2.
DR PROSITE; PS50025; LAM_G_DOMAIN; 3.
PE 4: Predicted;
KW Cell projection {ECO:0000256|ARBA:ARBA00023273};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Reference proteome {ECO:0000313|Proteomes:UP000018467}.
FT DOMAIN 18..98
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 106..201
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 307..345
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 350..528
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 529..566
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 573..752
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 748..784
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 791..968
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT REGION 240..287
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 242..287
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 316..333
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 335..344
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 556..565
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 774..783
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 941..968
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00122"
SQ SEQUENCE 970 AA; 106318 MW; 1328F0C7C60EC8D8 CRC64;
VKVLLTILYH FNYKYLNPPL DVQLESVNCT AISVRWRMPW RHVGTITGYK VNFTCGPTPS
TVIGNLEVSA EYSVTVGAYG WAGQGRPSMP RSISTIPDDH CMLPVPPPEP RVTAVSDTEF
ELSWKPGTGE GSPPVLYFLV YYIRPEMDTE WTSVKVPTHT RSMVLRGMSP DTQYQFMMRA
ANMYGESHPS PVTGPIWTLS IEEDSSGQQP FMDPQLNDDH SSVYDYDIDV FTGELMHDLP
GNQEIRSSSG KSASQPNSSS GFTTTMPTAP SSPEPSSSSS SEPASTTAAP IAYVPLNHWT
GPVRRLHDLP CEDTACPPNS VCIDDYGNGG SRCHCALGRG GDTCSEAVAV RFPRFTGFSH
IAFEPLKNSY YSFELILEFR ADSEDGLLLY CGENDQGEGD FASLALIRGK LHFRYNCGTG
AAQIAAQSPV KLGVWYTVTV YREGLSGWLR VDNDTPVTGR SQGEFTKITF RTPLYLGGSP
AVYWLARAAG TSRGFQGCVQ SLAVNGRMID MRPWPLGRAL SGADVGECSD GVCTDVSCEN
GGVCYANRAD GYICLCPLGY RGPLCQETFS LFLPHFNAAL MPYLSAPWPR PAQYYLSFTE
FEMTFLPEAF DGTLLYSEDL DSRDFLSVVM VEGHVEFRFD CGSGTATIKS EEPVSLFRWH
ELRVSRTARR GILQLDNLRP VEGMAEGAFT QIRCSSPLYI GGVPNYNLVK NSASVLLPFT
GSIQKVTVND RVVRLTAASV KGVNVGNAVH PCADSLCANA GVCRPKHDGY ECDCPLGYRG
KHCQSADSGA VEIPQFTGRS YLMYDSKDIL KRISGPRTHV QLRFRSSAQD GLLMWIGDTN
MRHNSDYMFL ALHGGTVVFS YNLGSGTNTL RVNGTFIDGR WHTVRAVRDG QMGKLSVGSS
TTRVGKSPGR MRQLNTSGAL YIGGIKEASF HIPYLRGLVG CMSHLTLSPD HHLRLIEDAS
DGKNINTCLN
//