Database: UniProt
Entry: F6W674_HORSE
ID   F6W674_HORSE            Unreviewed;      1249 AA.
AC   F6W674;
DT   27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2023, sequence version 3.
DT   27-MAR-2024, entry version 68.
DE   SubName: Full=Heart development protein with EGF like domains 1 {ECO:0000313|Ensembl:ENSECAP00000001417.3};
GN   Name=HEG1 {ECO:0000313|Ensembl:ENSECAP00000001417.3};
OS   Equus caballus (Horse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX   NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000001417.3, ECO:0000313|Proteomes:UP000002281};
RN   [1] {ECO:0000313|Ensembl:ENSECAP00000001417.3, ECO:0000313|Proteomes:UP000002281}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000001417.3,
RC   ECO:0000313|Proteomes:UP000002281};
RX   PubMed=19892987; DOI=10.1126/science.1178158;
RG   Broad Institute Genome Sequencing Platform;
RG   Broad Institute Whole Genome Assembly Team;
RA   Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA   Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA   Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA   Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA   Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA   Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA   Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA   Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA   Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA   Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT   "Genome sequence, comparative analysis, and population genetics of the
RT   domestic horse.";
RL   Science 326:865-867(2009).
RN   [2] {ECO:0000313|Ensembl:ENSECAP00000001417.3}
RP   IDENTIFICATION.
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000001417.3};
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CAUTION: Lacks conserved residue(s) required for the propagation of
CC       feature annotation. {ECO:0000256|PROSITE-ProRule:PRU00076}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   STRING; 9796.ENSECAP00000001417; -.
DR   PaxDb; 9796-ENSECAP00000001417; -.
DR   Ensembl; ENSECAT00000001970.4; ENSECAP00000001417.3; ENSECAG00000001870.4.
DR   GeneTree; ENSGT00710000106813; -.
DR   HOGENOM; CLU_010549_0_0_1; -.
DR   InParanoid; F6W674; -.
DR   OMA; MGTERAM; -.
DR   OrthoDB; 5358383at2759; -.
DR   TreeFam; TF335941; -.
DR   Proteomes; UP000002281; Chromosome 19.
DR   Bgee; ENSECAG00000001870; Expressed in synovial membrane of synovial joint and 23 other cell types or tissues.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR   GO; GO:0005509; F:calcium ion binding; IEA:InterPro.
DR   GO; GO:0007507; P:heart development; IBA:GO_Central.
DR   CDD; cd00054; EGF_CA; 2.
DR   Gene3D; 2.10.25.10; Laminin; 2.
DR   InterPro; IPR001881; EGF-like_Ca-bd_dom.
DR   InterPro; IPR000742; EGF-like_dom.
DR   InterPro; IPR000152; EGF-type_Asp/Asn_hydroxyl_site.
DR   InterPro; IPR018097; EGF_Ca-bd_CS.
DR   PANTHER; PTHR24037; HEART DEVELOPMENT PROTEIN WITH EGF-LIKE DOMAINS 1; 1.
DR   PANTHER; PTHR24037:SF3; PROTEIN HEG HOMOLOG 1; 1.
DR   Pfam; PF00008; EGF; 1.
DR   Pfam; PF07645; EGF_CA; 1.
DR   SMART; SM00181; EGF; 3.
DR   SMART; SM00179; EGF_CA; 2.
DR   SUPFAM; SSF57196; EGF/Laminin; 2.
DR   PROSITE; PS00010; ASX_HYDROXYL; 1.
DR   PROSITE; PS00022; EGF_1; 1.
DR   PROSITE; PS01186; EGF_2; 1.
DR   PROSITE; PS50026; EGF_3; 2.
DR   PROSITE; PS01187; EGF_CA; 1.
PE   4: Predicted;
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157, ECO:0000256|PROSITE-
KW   ProRule:PRU00076};
KW   EGF-like domain {ECO:0000256|ARBA:ARBA00022536, ECO:0000256|PROSITE-
KW   ProRule:PRU00076}; Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Membrane {ECO:0000256|SAM:Phobius};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW   Signal {ECO:0000256|ARBA:ARBA00022729};
KW   Transmembrane {ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT   TRANSMEM        1116..1141
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          853..891
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   DOMAIN          893..931
FT                   /note="EGF-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50026"
FT   REGION          1..171
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          183..221
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          262..422
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          447..491
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          525..590
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          637..769
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          803..854
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1..98
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        106..144
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        183..220
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        282..304
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        330..361
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        370..387
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        404..422
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        525..568
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   DISULFID        862..879
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
FT   DISULFID        881..890
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00076"
SQ   SEQUENCE   1249 AA;  132255 MW;  E8DBB91AD5E054EB CRC64;
     MNFSTAASSE GVETQTPRDS HASSDAPGNS TAQPKTARAE GRNNSSRRAG FTVSSVGPEI
     ATALTSQSGS LASESMEEVN SSTVLQSPSV SQTKRVPVAT TFPDGVPRML QSSTVSLGPL
     NETESFPEDS EIATTSASVH SSPSEAESKR NDEVMGNPGH GEFTEPFTEN GFGLTSSKVS
     VGLWQNDSPT SRGHQLASSS EAENRSPMSP TQTVSRSLPL VTDGEGTARW FSTDSKTFTD
     VTGSSTFYPD VVNASDLTQF SASAPQSRGS DTALGDRGYS EPATETLSSP ASKNQNSSSP
     RGERSTTEDG PELRVISEAQ EGTSEGATGA RAPSPHTSAT FTGSGERTLR SLTNGSTTPG
     DVGHSVAAPR ETESATQQGN VTMTDDAHLV SGSPAASPAL GVTGIGSPWN QVSGTDVEER
     TSSDYTGHTY VSSTFPKGEW ALLSITDNSS SSDVRESSTS SIKISNSSYS DYPSSSQAQT
     ERSNVSSYEG EYAQPSTHSL LLRTANLPSY TPTVNMSDPL VLLDSDTGSL GDSSSSSSGS
     PLPLPSVSQS HQLFSSTLPS TRASAHPLQS TPGAPIPLSS SPPPVPISLT ASTPASKSIF
     QTTLPPSSST LVLPRARDTP VTSVWMSTMT SVTILPSSRT ADPKNQSNSH HEQIITESEL
     PSLESLPTEA TEAVTMRSTS GISMSPASTE SSTEQTLPAT STSVAQTSPA LTTTYLETSS
     PPVTTPSPAS STAALTAGPT VQTTTRKQLL TTSPEIPAPP ISTEGSVTTE RNQVRIDATI
     RLVPLTSTPT SAEELTTGVG ITEEDTPTSH FLRTSPSPQT TDVSTAKMLP PKSTASTARS
     STQSPTALSS PASANSCATN PCLHDGKCVV DPASRGYQCV CSPSWQGDDC SVDVNECLLN
     PCPPLATCNN TQGSFTCKCP VGYQLEKEIC NLVRTFVTDF KLKKTFLNTT VEKYSDLREA
     EKEITRTLNL CFSTLPGYTR STVHASRESS AVAMSLQTTF SLASNVTLFD LADRMQKCVN
     SCRSSAEVCQ LLGSQKRLFR AGSLCKRKTP ECDKETSICT DLDGVALCQC KSGYFQFNKM
     DHSCRACEDG YRLENETCMS CPFGLGGLNC GNPYQLITVV IAAAGGGLLL ILGIALIVTC
     CRKNKNDISK LIFKSGDFQM SPYAEYPKNP RSQEWGREAI EMHENGSTKN LLQMTDVYYS
     PTSVRNPELE RNGLYPAYTG LPGSRHSCIF PGQYNPSFIS DESRRRDYF
//