ID W5K6J5_ASTMX Unreviewed; 1114 AA.
AC W5K6J5;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 2.
DT 27-MAR-2024, entry version 57.
DE SubName: Full=Multiple EGF like domains 11 {ECO:0000313|Ensembl:ENSAMXP00000003206.2};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000003206.2, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000003206.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_015461263.1; XM_015605777.1.
DR AlphaFoldDB; W5K6J5; -.
DR STRING; 7994.ENSAMXP00000003206; -.
DR Ensembl; ENSAMXT00000003206.2; ENSAMXP00000003206.2; ENSAMXG00000003031.2.
DR GeneID; 103029924; -.
DR KEGG; amex:103029924; -.
DR CTD; 84465; -.
DR eggNOG; KOG1218; Eukaryota.
DR GeneTree; ENSGT00940000155333; -.
DR HOGENOM; CLU_008281_1_0_1; -.
DR InParanoid; W5K6J5; -.
DR OrthoDB; 2540323at2759; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000003031; Expressed in brain and 3 other cell types or tissues.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.10.25.140; -; 1.
DR Gene3D; 2.170.300.10; Tie2 ligand-binding domain superfamily; 6.
DR InterPro; IPR013032; EGF-like_CS.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR011489; EMI_domain.
DR InterPro; IPR002049; LE_dom.
DR InterPro; IPR007527; Znf_SWIM.
DR PANTHER; PTHR24035; MULTIPLE EPIDERMAL GROWTH FACTOR-LIKE DOMAINS PROTEIN; 1.
DR PANTHER; PTHR24035:SF127; MULTIPLE EPIDERMAL GROWTH FACTOR-LIKE DOMAINS PROTEIN 11; 1.
DR Pfam; PF12661; hEGF; 6.
DR Pfam; PF00053; Laminin_EGF; 6.
DR PRINTS; PR00011; EGFLAMININ.
DR SMART; SM00181; EGF; 17.
DR SMART; SM00180; EGF_Lam; 15.
DR PROSITE; PS00022; EGF_1; 13.
DR PROSITE; PS01186; EGF_2; 2.
DR PROSITE; PS50026; EGF_3; 10.
DR PROSITE; PS51041; EMI; 1.
DR PROSITE; PS50966; ZF_SWIM; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00325};
KW Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00325};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00325}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..1114
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017376894"
FT TRANSMEM 855..878
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 30..107
FT /note="EMI"
FT /evidence="ECO:0000259|PROSITE:PS51041"
FT DOMAIN 149..179
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 187..222
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 230..265
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 273..300
FT /note="SWIM-type"
FT /evidence="ECO:0000259|PROSITE:PS50966"
FT DOMAIN 278..308
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 316..351
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 405..440
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 577..612
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 665..700
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 713..743
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 794..829
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 1085..1114
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1097..1114
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 169..178
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 212..221
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 255..264
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 298..307
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 341..350
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 430..439
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 602..611
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 690..699
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 733..742
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 819..828
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1114 AA; 119589 MW; 19DD9634FE3EBF3A CRC64;
MGPVRASPLA QPVLAVLGLV IGVWSLNPDD PNVCSHWESY AVTVQESYSH PFDQIYYTRC
TDILNWFKCT RHRISYKTAY RRGVRTMYRR RSQCCPGYFE SGELCVPLCT EECAHGRCVS
PDTCQCEPGW GGLDCSSGCE SGYWGPHCSN RCQCQNGALC NPITGACVCT DGYQGWRCED
HCEPGYYGKG CQLQCQCLNG ATCHHETGEC ICAPGYMGAV CGERCPSGSH GPQCEQRCPC
QNGGTCHHIT GECSCPAGWT GSVCAQPCPL GKYGINCSRD CSCRNGGLCD HITGQCQCVA
GYTGRRCQEE CPVGTYGPQC ALHCDCQNGA KCYHINGACL CDTGFKGPHC QDRFCPPGLY
GLICDKYCPC NATNTLSCHP LSGECSCAAG WTGLYCNETC PPGYYGEGCS VQCHCANGAD
CHGTTGACVC APGFTGDDCS HTCSPGFYGT NCSSTCHCHN QASCSPSDGS CICKEGWQGV
DCSILCSSGT WGVSCNQTCL CANGAACDPM DGSCTCAPGW RDKHCQLSCP DGTYGLDCRE
HCDCNHADGC DPVSGYCRCL AGWTGIHCDN VCPQGFWGPN CSMTCSCQNG GSCSPEDGTC
VCAPGYRGTS CKRICSPGFY GHRCSQTCPQ CVHSTGPCHH ITGHCECLSG FFGPLCNQVC
PSGRYGKACA EVCFCTNNGT CNPIDGSCQC FPGWIGDDCS RACPIGQWGP DCLNSCNCHN
GAQCSLYDGE CRCSPGWTGL YCTQRCPSGF YGRDCAEVCR CQNGADCDHM TGQCACRTGF
IGASCEQKCP PGTFGYGCQQ LCECMNNATC DYVTGTCYCS PGYKGIRCDQ AALMMEELNP
YTKISPALTS ERQSAGAVMG IIFLLLIIMA MLSLFVWYRQ RQRDKGQDMP SVSYTPALRI
TNTDYSLSDT SQSSSSGQCF SNPSYHTVAQ CTGPSSNTNN LDGTLTLKKG ERNSSEWRAY
CNLNDLGVSR EDTLTLRDNK TTRVAKEYIK TSMCSSSSCS LNSENPYATI RDPPGLACKH
TESSYVEMKS PAHRDLSFCS STSTTLTSAS RNVYDVEPTV SVLQGPNGLA TGFSQNPYDL
PRNSHIPSHY DILPMRHSPP HTPPPPDSPP SSLL
//