ID F6W674_HORSE Unreviewed; 1249 AA.
AC F6W674;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 68.
DE SubName: Full=Heart development protein with EGF like domains 1 {ECO:0000313|Ensembl:ENSECAP00000001417.3};
GN Name=HEG1 {ECO:0000313|Ensembl:ENSECAP00000001417.3};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000001417.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000001417.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000001417.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000001417.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000001417.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 9796.ENSECAP00000001417; -.
DR PaxDb; 9796-ENSECAP00000001417; -.
DR Ensembl; ENSECAT00000001970.4; ENSECAP00000001417.3; ENSECAG00000001870.4.
DR GeneTree; ENSGT00710000106813; -.
DR HOGENOM; CLU_010549_0_0_1; -.
DR InParanoid; F6W674; -.
DR OMA; MGTERAM; -.
DR OrthoDB; 5358383at2759; -.
DR TreeFam; TF335941; -.
DR Proteomes; UP000002281; Chromosome 19.
DR Bgee; ENSECAG00000001870; Expressed in synovial membrane of synovial joint and 23 other cell types or tissues.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR GO; GO:0007507; P:heart development; IBA:GO_Central.
DR CDD; cd00054; EGF_CA; 2.
DR Gene3D; 2.10.25.10; Laminin; 2.
DR InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR InterPro; IPR000742; EGF-like_dom.
DR InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR InterPro; IPR018097; EGF_Ca-bd_CS.
DR PANTHER; PTHR24037; HEART DEVELOPMENT PROTEIN WITH EGF-LIKE DOMAINS 1; 1.
DR PANTHER; PTHR24037:SF3; PROTEIN HEG HOMOLOG 1; 1.
DR Pfam; PF00008; EGF; 1.
DR Pfam; PF07645; EGF_CA; 1.
DR SMART; SM00181; EGF; 3.
DR SMART; SM00179; EGF_CA; 2.
DR SUPFAM; SSF57196; EGF/Laminin; 2.
DR PROSITE; PS00010; ASX_HYDROXYL; 1.
DR PROSITE; PS00022; EGF_1; 1.
DR PROSITE; PS01186; EGF_2; 1.
DR PROSITE; PS50026; EGF_3; 2.
DR PROSITE; PS01187; EGF_CA; 1.
PE 4: Predicted;
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW ProRule:PRU00076};
KW EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1116..1141
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 853..891
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT DOMAIN 893..931
FT /note="EGF-like"
FT /evidence="ECO:0000259|PROSITE:PS50026"
FT REGION 1..171
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 183..221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 262..422
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 447..491
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 525..590
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 637..769
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 803..854
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..98
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 106..144
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 183..220
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 282..304
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 330..361
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 370..387
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 404..422
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 525..568
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT DISULFID 862..879
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT DISULFID 881..890
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ SEQUENCE 1249 AA; 132255 MW; E8DBB91AD5E054EB CRC64;
MNFSTAASSE GVETQTPRDS HASSDAPGNS TAQPKTARAE GRNNSSRRAG FTVSSVGPEI
ATALTSQSGS LASESMEEVN SSTVLQSPSV SQTKRVPVAT TFPDGVPRML QSSTVSLGPL
NETESFPEDS EIATTSASVH SSPSEAESKR NDEVMGNPGH GEFTEPFTEN GFGLTSSKVS
VGLWQNDSPT SRGHQLASSS EAENRSPMSP TQTVSRSLPL VTDGEGTARW FSTDSKTFTD
VTGSSTFYPD VVNASDLTQF SASAPQSRGS DTALGDRGYS EPATETLSSP ASKNQNSSSP
RGERSTTEDG PELRVISEAQ EGTSEGATGA RAPSPHTSAT FTGSGERTLR SLTNGSTTPG
DVGHSVAAPR ETESATQQGN VTMTDDAHLV SGSPAASPAL GVTGIGSPWN QVSGTDVEER
TSSDYTGHTY VSSTFPKGEW ALLSITDNSS SSDVRESSTS SIKISNSSYS DYPSSSQAQT
ERSNVSSYEG EYAQPSTHSL LLRTANLPSY TPTVNMSDPL VLLDSDTGSL GDSSSSSSGS
PLPLPSVSQS HQLFSSTLPS TRASAHPLQS TPGAPIPLSS SPPPVPISLT ASTPASKSIF
QTTLPPSSST LVLPRARDTP VTSVWMSTMT SVTILPSSRT ADPKNQSNSH HEQIITESEL
PSLESLPTEA TEAVTMRSTS GISMSPASTE SSTEQTLPAT STSVAQTSPA LTTTYLETSS
PPVTTPSPAS STAALTAGPT VQTTTRKQLL TTSPEIPAPP ISTEGSVTTE RNQVRIDATI
RLVPLTSTPT SAEELTTGVG ITEEDTPTSH FLRTSPSPQT TDVSTAKMLP PKSTASTARS
STQSPTALSS PASANSCATN PCLHDGKCVV DPASRGYQCV CSPSWQGDDC SVDVNECLLN
PCPPLATCNN TQGSFTCKCP VGYQLEKEIC NLVRTFVTDF KLKKTFLNTT VEKYSDLREA
EKEITRTLNL CFSTLPGYTR STVHASRESS AVAMSLQTTF SLASNVTLFD LADRMQKCVN
SCRSSAEVCQ LLGSQKRLFR AGSLCKRKTP ECDKETSICT DLDGVALCQC KSGYFQFNKM
DHSCRACEDG YRLENETCMS CPFGLGGLNC GNPYQLITVV IAAAGGGLLL ILGIALIVTC
CRKNKNDISK LIFKSGDFQM SPYAEYPKNP RSQEWGREAI EMHENGSTKN LLQMTDVYYS
PTSVRNPELE RNGLYPAYTG LPGSRHSCIF PGQYNPSFIS DESRRRDYF
//