ID F6WGE3_HORSE Unreviewed; 1116 AA.
AC F6WGE3;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2019, sequence version 3.
DT 27-MAR-2024, entry version 78.
DE SubName: Full=MDS1 and EVI1 complex locus {ECO:0000313|Ensembl:ENSECAP00000018770.3};
GN Name=MECOM {ECO:0000313|Ensembl:ENSECAP00000018770.3,
GN ECO:0000313|VGNC:VGNC:20065};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000018770.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000018770.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000018770.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000018770.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000018770.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F6WGE3; -.
DR Ensembl; ENSECAT00000022694.4; ENSECAP00000018770.3; ENSECAG00000020788.4.
DR VGNC; VGNC:20065; MECOM.
DR GeneTree; ENSGT00940000157208; -.
DR HOGENOM; CLU_006627_0_0_1; -.
DR TreeFam; TF315309; -.
DR Proteomes; UP000002281; Chromosome 19.
DR Bgee; ENSECAG00000020788; Expressed in oviduct epithelium and 17 other cell types or tissues.
DR ExpressionAtlas; F6WGE3; baseline.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 8.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR036236; Znf_C2H2_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR24393:SF15; IP01201P-RELATED; 1.
DR PANTHER; PTHR24393; ZINC FINGER PROTEIN; 1.
DR Pfam; PF21549; PRDM2_PR; 1.
DR Pfam; PF00096; zf-C2H2; 8.
DR Pfam; PF13912; zf-C2H2_6; 1.
DR SMART; SM00355; ZnF_C2H2; 10.
DR SUPFAM; SSF57667; beta-beta-alpha zinc fingers; 5.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 7.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 10.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 85..113
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 139..166
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 167..194
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 195..224
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 225..252
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 253..280
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 282..309
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 798..825
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 826..854
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 855..882
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT REGION 387..407
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 423..495
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 601..694
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 876..896
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 917..998
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 388..407
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 436..456
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 457..477
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 479..495
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 638..662
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 663..683
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 880..895
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 917..931
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 952..971
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1116 AA; 125731 MW; 5DB01DD8E458DCF4 CRC64;
VILDEFCNVK FCIDASQPDV GSWLKYIRFA GCYDQHNLVA CQINDQIFYR VVADIAPGEE
LLLFMKSEDY PHETMAPDIH EERQYRCEDC DQLFESKAEL ADHQKFPCST PHSAFSMVEE
DFQQKLEGEN DLREIHAIQE CKECDQVFPD LQSLEKHMLS HTEEREYKCD QCPKAFNWKS
NLIRHQMSHD SGKHYECENC AKQVFTDPSN LQRHIRSQHV GARAHACPEC GKTFATSSGL
KQHKHIHSSV KPFICEVCHK SYTQFSNLCR HKRMHADCRT QIKCKDCGQM FSTTSSLNKH
RRFCEGKNHF AAGGFFGQGI SLPGTPAMDK TSMVNMSHAN PGLADYFGAN RHPAGLTFPT
APGFSFSFPG LFPSGLYHRP PLIPASSPVK GLSSTEQTNK SQSPLMTHPQ ILPATQDILK
ALSKHPPVGD NKPVELQPER SSEERPLEKI SDQSESSDLD DVSTPSGSDL ETTSGSDLES
DIESDKEKFK ENGKMFKDKV SPLQNLASIH NKKEYSNHSI FSPSLEEQTA VSGAVNDSIK
AIASIAEKYF GSTGLVGLQD KKVGALPYPS MFPLPFFPAF SQSMYPFPDR DLRALPLKME
PQSPSEVKKL QKGSSESPFD LTTKRKDEKP LTPVPSKAPV TPVTSQDQPL DLSMGSRSRA
SGTKLTEPRK NHVFGEKKGS NVEPRLASDG SLQHARPTPF FMDPIYRVEK RKLTDPLEAL
KEKYLRPSPG FLFHPQFQLP DQRTWMSAIE NMAEKLESFS ALKPEASELL QSVPSMFNFR
APPSALPETL LRKGKERYTC RYCGKIFPRS ANLTRHLRTH TGEQPYRCKY CDRSFSISSN
LQRHVRNIHN KEKPFKCHLC DRCFGQQTNL DRHLKKHENG NMSGTATSSP HSELESTGAI
LDDKEDAYFT EIRNFIGNSN HSSQSPRNTE ERMNGSHFKD EKALVASQNS DLLDDEEAED
EVLLDEEDED NDITGKTGKE PVTGNLHEGS PEDDYEETSA LEMSCKTSPV RYKEEEYKTG
LSALDHIRHF TDSLKMRKME DNQYTEAELS SFSTSHVPEE LKQPLHRKSK SQAYAMMLSL
SDKESLHSTS HSSSNVWHSM ARTAAESSAI QSISHV
//