ID F6RAH7_HORSE Unreviewed; 3137 AA.
AC F6RAH7;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2019, sequence version 3.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=Eyes shut homolog {ECO:0000313|Ensembl:ENSECAP00000003472.3};
GN Name=EYS {ECO:0000313|Ensembl:ENSECAP00000003472.3};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000003472.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000003472.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000003472.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000003472.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000003472.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00196}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9796.ENSECAP00000003472; -.
DR PaxDb; 9796-ENSECAP00000003472; -.
DR Ensembl; ENSECAT00000004966.4; ENSECAP00000003472.3; ENSECAG00000005001.4.
DR GeneTree; ENSGT00940000163729; -.
DR HOGENOM; CLU_055944_0_0_1; -.
DR InParanoid; F6RAH7; -.
DR TreeFam; TF343620; -.
DR Proteomes; UP000002281; Chromosome 20.
DR Bgee; ENSECAG00000005001; Expressed in retina and 5 other cell types or tissues.
DR GO; GO:0005886; C:plasma membrane; IBA:GO_Central.
DR GO; GO:0032991; C:protein-containing complex; IBA:GO_Central.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0050908; P:detection of light stimulus involved in visual perception; IEA:Ensembl.
DR GO; GO:0045197; P:establishment or maintenance of epithelial cell apical/basal polarity; IBA:GO_Central.
DR GO; GO:0007157; P:heterophilic cell-cell adhesion via plasma membrane cell adhesion molecules; IBA:GO_Central.
DR GO; GO:0043403; P:skeletal muscle tissue regeneration; IEA:Ensembl.
DR CDD; cd00054; EGF_CA; 14.
DR CDD; cd00110; LamG; 5.
DR Gene3D; 2.60.120.200; -; 5.
DR Gene3D; 2.10.25.10; Laminin; 22.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR InterPro; IPR001791; Laminin_G.
DR InterPro; IPR001190; SRCR.
DR PANTHER; PTHR24033; EGF-LIKE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24033:SF151; PROTEIN EYES SHUT; 1.
DR Pfam; PF00008; EGF; 7.
DR Pfam; PF12661; hEGF; 3.
DR Pfam; PF02210; Laminin_G_2; 5.
DR SMART; SM00181; EGF; 26.
DR SMART; SM00179; EGF_CA; 20.
DR SMART; SM00282; LamG; 5.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 5.
DR SUPFAM; SSF57196; EGF/Laminin; 16.
DR PROSITE; PS00010; ASX_HYDROXYL; 5.
DR PROSITE; PS00022; EGF_1; 21.
DR PROSITE; PS01186; EGF_2; 16.
DR PROSITE; PS50026; EGF_3; 23.
DR PROSITE; PS01187; EGF_CA; 3.
DR PROSITE; PS50025; LAM_G_DOMAIN; 5.
DR PROSITE; PS50287; SRCR_2; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..3137
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5023862519"
FT DOMAIN 171..213
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 214..255
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 257..293
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 333..368
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 370..406
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 514..652
FT /note="SRCR"
FT /evidence="ECO:0000259|PROSITE:PS50287"
FT DOMAIN 642..678
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 769..805
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 807..845
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 847..886
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 888..924
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 926..962
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 964..1000
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1002..1038
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1040..1075
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1077..1113
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1120..1157
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1159..1195
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 1877..2057
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 2093..2134
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2139..2333
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 2365..2402
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2413..2603
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 2604..2640
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2642..2683
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2690..2868
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DOMAIN 2869..2905
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2906..2943
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 2950..3137
FT /note="Laminin G"
FT /evidence="ECO:0000259|PROSITE:PS50025"
FT DISULFID 184..201
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 203..212
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 245..254
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 283..292
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 337..347
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 358..367
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 396..405
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 795..804
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 816..833
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 835..844
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 876..885
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 914..923
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 990..999
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1028..1037
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1044..1054
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1065..1074
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1103..1112
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1147..1156
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 1185..1194
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2124..2133
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2392..2401
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2630..2639
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2673..2682
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2841..2868
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00122"
FT DISULFID 2895..2904
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 2933..2942
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 3137 AA; 345709 MW; 9DB154829B247869 CRC64;
MANKSVMLLS LVVLHSTVIN GKMTCKWQLV EEWHTQPSSY VVNWTLSENI CTDFYGDCWF
GDVNTKMSPL GNQVVPQICP LQIQLGDILV ISSEPSLQSP EINLMNVSEA SFIDCMQNAT
TEDQLLFGCK LKGMHTVNSQ WLSVGTHYFI TVMANGPSLC QLGLRLNVTV KEQFCQEPLH
SEFCSGHGKC LSEVWSKVYS CHCQPPFSGK YCQEVDACSH KPCENNGSCI NKRGKWNKQG
YECICHPPFT GINCSEIIDQ CQPYVCFHGN YSNITTNSFI CECDEPFSGP FCEESKKYCI
SQSFWKEGIC QNKSLACIFE CPEGFLNQSC ETDVNECSVR CQDGADCIDI PNEIMCICST
AFTGKLCKRL QTPREQFPCK NNATCVKYEK DYHCSCLPGF TGKNCEKVID HCRLLCVNCQ
NEGWCFNIIG RFRNACSPEC MRNSCWFLKN VCLNHLYPCY CRAASHDICP AKVPHPQFKH
VWQLGLTGSE CEKCEVAVDP CVFLAANCTE DAVYRNKSED VGYEQWFLCE GTMEICANGS
LTEEDNKTYW CLCMPRWPGK MFMENTTDYE ESGCQHETTH KDEINRSRCS YYLGRIDRFC
ILDVEDCLGN ESISVHGLCL VRLHNCNCSC LQRYERNICE IETEDCKSVP CKNGTTGIHS
SGYFFCKCVP GFTGTRSETD TDERASHPFK NGATCADQPG NYFCQRVAPL KVAVGFSCLC
SAACVGLRCE QDIDDCNLNA CEHTSACKDL YLSYQCVYLS SWKGNFYEES NDCKTNPCKN
NSTCTDLYNS YRCECTSGWT GQNCSEEINE CDSDPCLNGA LCHESTIPGQ FVCLCPPFFT
GKFCQEYRSS CDPLNDPCRN NATCLTLVDG QRYCVCREGF EGEHCEINTN ECFSLPCQNY
GDCEDGVNSF RCVCRPGFSG PLCEIETNEC SSKPCKNNGT CVDLTNRFLC NGEPGYSGSF
CELDMNECET SPCPDGENCV NRTGGYKCLC APGYTGIDCA VSVGDCLSKP CLHDGAWTDG
VNHYTCDCQS GFLGTHCETN ANDCLSNPCL HGRCVDLINE YQCLCEAGWT GSRCETKIND
CTSVSCLNEG ICQKSVHGVT CICPGGYTGV YCEMHVDGSA EPEPNLVLCL NGGICVDGAG
RTLYCRCLPG FSGQFCEINI NECSSSPCLN GANCEDHING YICKCQRGWS GDHCEKELDA
CIPSSCAHGI CVGNEPSFGS TCLCIPGFVT CSIGLLCGDE RRRITCLSPI SARTDTISTQ
THTVPAPATS VHNFPRTGAP RLWTTMDTYP VDQGPKQTDI FKHDVLPTTG LAALGTGISF
ERYLLKHVIA AKELLAKHSL PSSTDVSSSR FLNFGVPGPA QVVWGKTSVP HLPIQASAAT
PRFFFLDRGE RTPFIISSMT DFIFPTQSLL FESDRSVASS ATTMSSVISG ILGADVELNR
HSLLSHGFLL KTASTGAPPV VSMGAQEGIE EYSAVSLISR REYWRLLSSS MPPISPAKVI
ISKQVAIVNS SSLHRFTTQD SIPSEYQVIT EASSNQRLTN IKSQSADSLS ELSQTCATCS
MTEIKSSHEF SDQVLHSKQS HFYETFWMNS EILASWYALM RTQTITSGHS FSSATEIMPS
VAFMEVSSSF PSKKSTKRRI STPSVEDSIA LSTNLDANLC LDKTRLSIVP SQTVSSDLLN
SDLTSELTED LSVSENILKL LKIGQYGITM GPTEVLNQDN LLAVHESKGS HKQLKLHTSD
RSLDFELNLP SHPETRHSSE LKNNLPPYMD SRSDLSEVTS NVAFYTVSAT QSLPVQTSFL
TSVLAPDWTY YTDYLTLTSD LKQEVRTSSE WSKWELQPSV HDWESPAASQ TPAITRSLTL
PSLESIPAPR QLMISDFTCV CYYGDSYLEF QDVFLNPQNN ISLEFQTSSS YGLLLYMQQD
SNSIDGFVTQ LFIENGTLKY HFFCAGEAKL KNINTTIKVD DGQKYALLIR QELDPCEAEL
TILGRTMKAS ESISHVSGRS LPESGSIFIG GFPDLHGVSQ ISGPVENFTG CIEVIELNNW
RSFIPSKAVK KIHVENCRSQ DSPLSAASAF VAPSGVTEGV ASTWTSLSAP PAAPSVCQGA
VCHNGGTCHP VFLSSGAFSF QCDCPLHFTG RFCEQDAGLC FPSFNGNSYL ELPFLKSVLE
KEHNRIVTIY LTIKTNTLNG TILYSSEKNF GQQFLHLFLL EGRPTVKYGC GNSQNILTLS
ANYSINTNVF IPITIRYTIP VGSPGVACMI EMTADGKPPI QKKDTETPHA SQAYFESMFL
GHVPTNVKIH KKAGPIYGFR GCIRELQVND KEFFIIDEAL RGRNIENCHV PWCAHHLCHN
NGTCISDSEN WFCECPRLSS GKLCQFATCE NNPCGNGATC VPKSGTEIVC LCPYGRSGVL
CTDAINITQP SFSGTDAFGY TSFLAYSRVP DIGFDYEFHV TFQLANNHSA LQNNLIFFTG
QKGHGRNGDD FLAVGLRDGR VVYSYNLGSG IASVSSDPLD RSLGIHAVRL GRFLQMGWLK
VDDHKNKSIV APGRLVGLNV FSQFYVGGYS EYTPELLPNG SEFKNGFQGC IFSLRVRTGK
NGRFRSLGDP EGRPAAGRSV GQCGASACGS GRCGHGGACA ESGGAVHCDC PSGWKGAFCT
EMVSTCDPEH DPPHNCSKGA TCVPLPHGYT CRCPLGTTGI YCERALSVSD PSFRSHELSW
MSFSSFRIRK RTHIQLQFRP LSADGILFYV AQNLKAQSGD FLCISLVNGS VQLRYNLGDR
TIILETLQKV NMNGSTWHVI KAGRVGAEGY LDLDGKTVTE KAKAEMNSLD TNTDFYIGGV
SSLNLVNPMA IANEPVGFQG CIREVIINNQ ELQLTELGAK GGSNVGDCDG TACGYNVCRN
RGECVVNGTT FSCQCSPPWA GNTCEQSAYC LNNLCLHQSL CVPDQSSSYR CLCTLGWEGR
YCENKISFST AKFMGNSYIK YIDPDYRMRN HHFTTVSLNF STTETEGLIV WIGKAQNEEN
DFLAIGLHNQ SLKIAVNLGE SISVPVIYSN GTFCCNKWHH VIVSQNQTLI KAYLDDNLIL
SEDIDPHKKF VALNYDGISY LGGFEYGRKV NTVTQEIFKR DFVGKIKDVF FQDSKKIELI
KSEGYNVYNG DEQNVTR
//