ID W5PNP1_SHEEP Unreviewed; 432 AA.
AC W5PNP1;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 24-JAN-2024, entry version 52.
DE SubName: Full=Milk fat globule EGF and factor V/VIII domain containing {ECO:0000313|Ensembl:ENSOARP00000012067.1};
GN Name=MFGE8 {ECO:0000313|Ensembl:ENSOARP00000012067.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000012067.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000012067.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000012067.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000012067.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01041411; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01041412; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5PNP1; -.
DR SMR; W5PNP1; -.
DR STRING; 9940.ENSOARP00000012067; -.
DR PaxDb; 9940-ENSOARP00000012067; -.
DR Ensembl; ENSOART00000012243.1; ENSOARP00000012067.1; ENSOARG00000011260.1.
DR eggNOG; ENOG502QVK3; Eukaryota.
DR HOGENOM; CLU_030066_0_1_1; -.
DR OMA; YVKTFKV; -.
DR Proteomes; UP000002356; Chromosome 18.
DR Bgee; ENSOARG00000011260; Expressed in vas deferens and 52 other cell types or tissues.
DR CDD; cd00054; EGF_CA; 2.
DR CDD; cd00057; FA58C; 2.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR PANTHER; PTHR24543:SF317; LACTADHERIN; 1.
DR PANTHER; PTHR24543; MULTICOPPER OXIDASE-RELATED; 1.
DR Pfam; PF00008; EGF; 2.
DR Pfam; PF00754; F5_F8_type_C; 2.
DR SMART; SM00181; EGF; 2.
DR SMART; SM00231; FA58C; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR PROSITE; PS00022; EGF_1; 2.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01285; FA58C_1; 2.
DR PROSITE; PS01286; FA58C_2; 2.
DR PROSITE; PS50022; FA58C_3; 2.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|PROSITE-ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|PROSITE-ProRule:PRU00076};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..432
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004869336"
FT DOMAIN 28..65
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 68..111
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 114..270
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 275..432
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DISULFID 55..64
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 101..110
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 432 AA; 48360 MW; 7B36A4986FBB6911 CRC64;
MPSPRLLAAL WAALCCLSGH HLFPLTCDLC DSSQCLHGGT CLLNEEKTPP FYCLCPEGFT
GLLCNETEYG PCFPNPCHND AECQETDSLR GDVFTHYTCK CSLGYVGTHC ENTCTSPLGM
QTGAIADSQI SASSMHLGFM GLQRWAPELA RLYQTGIVNA WTSSNYDKTP WIQVNLLRKM
WVTGVVTQGA SRAGSAEYVK TFKVAYSNDG RQFQFIQVAG QLGDKIFVGN RNNSGLKINL
FDSPLEVQYV RLVPIICRRG CTLRFELLGC ELDGCTEPLG LKHNTIPDKQ ITASSYYKTW
GLSAFSWFPY YARLDNWGKF NAWTAQTNSA SEWLQIDLGS QKRVTGIITQ GARDFGHIQY
VAAYRVAYSD DGVTWTEYKD PETSKSKIFP GNMDNNSHKK NIFEVPFQAR FVRIQPVAWH
NRITLRVELL GC
//