ID F6Q6Z8_HORSE Unreviewed; 636 AA.
AC F6Q6Z8;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 66.
DE SubName: Full=Highly divergent homeobox {ECO:0000313|Ensembl:ENSECAP00000022404.3};
GN Name=HDX {ECO:0000313|VGNC:VGNC:18725};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000022404.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000022404.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022404.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000022404.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000022404.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F6Q6Z8; -.
DR Ensembl; ENSECAT00000026812.4; ENSECAP00000022404.3; ENSECAG00000024862.4.
DR VGNC; VGNC:18725; HDX.
DR GeneTree; ENSGT00390000008591; -.
DR HOGENOM; CLU_025064_0_0_1; -.
DR TreeFam; TF330998; -.
DR Proteomes; UP000002281; Chromosome X.
DR Bgee; ENSECAG00000024862; Expressed in brainstem and 9 other cell types or tissues.
DR ExpressionAtlas; F6Q6Z8; baseline.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR PANTHER; PTHR11636:SF80; HIGHLY DIVERGENT HOMEOBOX; 1.
DR PANTHER; PTHR11636; POU DOMAIN; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT DOMAIN 379..443
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 381..444
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 58..89
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 449..485
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 593..636
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 616..636
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 636 AA; 70478 MW; B86A45EBACA8A471 CRC64;
MSSKNSESGA ATTGTVSSTS LAAPDITVRN VVNIARPSSQ QSSWTSANND VIVTGIYSPA
SSSSRQGTTK HTNTQITEAH KIPVQKTASK NDTEFQLHIP VQRQVAHCKN ASLLLGEKTI
ILSRQTSVLN AGNSVYNHTK KNYGGSSMQA SEMTVPQKPS VCHRPCKIEP VGIQRLYKPE
HTGLASHNLC GQKPPIRDPY CRTQNLEIRE VFSLAVSDYP QRILRGNVPQ KPSSAEGTCL
SIAMETGDAD DEYAREEELA LMGAQTPSYS RFYESGSSLR AENQSTALPG PGRSMPNSQM
VNIRDLSDNV LYQNRDYHLT PRTSLHTAST TMYRNTNPSR SNFSSHFASS NQLRLSQNQN
NYQISGNLTV PWITGCSRKR ALQDRTQFSD RDLATLKKYW DNGMTSLGSV CREKIEAVAT
ELNVDCEIVR TWIGNRRRKY RLMGIEVPPP RGGPADFSEQ PESGSLSAFT PGEEAGPEVG
EDNDRNDEVS ICLSEGSSQE ESNEVVPNEA RAHKEEDHHA VSTDNVKIEI IDDEESDMIS
NSEVEQVNSV LDYKNEEVRF IENELEIQKQ KYFKLQTFVR SLILAMKADD KEQQKALLSD
LPPELEEMDF NHASPEPDDT SFSVSSLSEK NASDSL
//