ID A0A452F5E4_CAPHI Unreviewed; 550 AA.
AC A0A452F5E4;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 24.
DE SubName: Full=MIER family member 3 {ECO:0000313|Ensembl:ENSCHIP00000019602.1};
GN Name=MIER3 {ECO:0000313|Ensembl:ENSCHIP00000019602.1};
OS Capra hircus (Goat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Capra.
OX NCBI_TaxID=9925 {ECO:0000313|Ensembl:ENSCHIP00000019602.1, ECO:0000313|Proteomes:UP000291000};
RN [1] {ECO:0000313|Ensembl:ENSCHIP00000019602.1, ECO:0000313|Proteomes:UP000291000}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Bickhart D.M., Koren S., Rosen B., Hastie A., Liachko I., Sullivan S.T.,
RA Burton J., Sayre B.L., Huson H.J., Lee J., Lam E., Kelley C.M.,
RA Hutchison J.L., Zhou Y., Sun J., Crisa A., Schwartz J.C., Hammond J.A.,
RA Schroeder S.G., Liu G.E., Dunham M., Shendure J., Sonstegard T.S.,
RA Phillippy A.M., Van Tassell C.P., Smith T.P.;
RT "Polished mammalian reference genomes with single-molecule sequencing and
RT chromosome conformation capture applied to the Capra hircus genome.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSCHIP00000019602.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LWLT01000017; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_017921184.1; XM_018065695.1.
DR AlphaFoldDB; A0A452F5E4; -.
DR STRING; 9925.ENSCHIP00000019602; -.
DR Ensembl; ENSCHIT00000027417.1; ENSCHIP00000019602.1; ENSCHIG00000018553.1.
DR GeneID; 102186803; -.
DR KEGG; chx:102186803; -.
DR CTD; 166968; -.
DR GeneTree; ENSGT01030000234573; -.
DR OMA; PMNICSE; -.
DR OrthoDB; 3059114at2759; -.
DR Proteomes; UP000291000; Chromosome 20.
DR Bgee; ENSCHIG00000018553; Expressed in thymus and 16 other cell types or tissues.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0032991; C:protein-containing complex; IEA:Ensembl.
DR CDD; cd11661; SANT_MTA3_like; 1.
DR Gene3D; 4.10.1240.50; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR000949; ELM2_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR040138; MIER/MTA.
DR InterPro; IPR045787; MIER1/3_C.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR PANTHER; PTHR10865:SF22; MESODERM INDUCTION EARLY RESPONSE PROTEIN 3; 1.
DR PANTHER; PTHR10865; METASTASIS-ASSOCIATED PROTEIN AND MESODERM INDUCTION EARLY RESPONSE PROTEIN; 1.
DR Pfam; PF01448; ELM2; 1.
DR Pfam; PF19426; MIER1_3_C; 1.
DR Pfam; PF00249; Myb_DNA-binding; 1.
DR SMART; SM01189; ELM2; 1.
DR SMART; SM00717; SANT; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51156; ELM2; 1.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000291000}.
FT DOMAIN 174..272
FT /note="ELM2"
FT /evidence="ECO:0000259|PROSITE:PS51156"
FT DOMAIN 277..329
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
FT REGION 1..28
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 40..62
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 113..170
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..16
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 43..62
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 124..144
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 550 AA; 61400 MW; B02B4DD371AC4FE5 CRC64;
MAEASFGSSS PVGSLSSEDH DFDPTAEMLV HDYDDERTLE EEEMMDEGKH FSSEIEDLEK
EGNMPLEDLL AFYGYEPTIP AVANSSANSS PSELADELPD MTLDKEEIAK DLLSGDDEET
QSSADDLTPS VTSHETSDFF PRPLRSNTTC DGDKESEVED VETDSGNSPE DLRKEIMIGL
QYQAEIPPYL GEYAGNEKVY ENEDQLLWRP GVVLESKVKE YLVETSLRTG NEKIMDRISA
GTHTRDNEQA LYELLKCNHN IKEAIERYCC NGKASQEGMT AWTEEECRSF EHALMLFGKD
FHLIQKNKVR TRTVAECVAF YYMWKKSERY DYFAQQTKFG KKRYNHHPGV TDYMDRLVDE
TEALGGTVNS SALTSNRPEP VPDQQLSILN SFTASDLTAL TSSVATVCNP TDVNCLDDSF
PPLGNTPRGQ VNHVPVVTEE LLTLPSNGES DCFNLFETGF YHSELNPMNM CSEESERPAK
RLKMGIAVPE SFMNEVSVNN LGVDFENHTH HITSAKMAVS VADFGSLSAN ETNGFISAHA
LHQHAALHSE
//