Database: UniProt
Entry: A0A9L0SZQ3_HORSE
ID   A0A9L0SZQ3_HORSE        Unreviewed;       200 AA.
AC   A0A9L0SZQ3;
DT   13-SEP-2023, integrated into UniProtKB/TrEMBL.
DT   13-SEP-2023, sequence version 1.
DT   18-JUN-2025, entry version 11.
DE   RecName: Full=HTH CENPB-type domain-containing protein {ECO:0008006|Google:ProtNLM};
OS   Equus caballus (Horse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX   NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000079735.1, ECO:0000313|Proteomes:UP000002281};
RN   [1] {ECO:0000313|Ensembl:ENSECAP00000079735.1, ECO:0000313|Proteomes:UP000002281}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000079735.1,
RC   ECO:0000313|Proteomes:UP000002281};
RX   PubMed=19892987; DOI=10.1126/science.1178158;
RG   Broad Institute Genome Sequencing Platform;
RG   Broad Institute Whole Genome Assembly Team;
RA   Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA   Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA   Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA   Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA   Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA   Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA   Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA   Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA   Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA   Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT   "Genome sequence, comparative analysis, and population genetics of the
RT   domestic horse.";
RL   Science 326:865-867(2009).
RN   [2] {ECO:0000313|Ensembl:ENSECAP00000079735.1}
RP   IDENTIFICATION.
RC   STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000079735.1};
RG   Ensembl;
RL   Submitted (MAR-2025) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00320}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   Ensembl; ENSECAT00000097036.1; ENSECAP00000079735.1; ENSECAG00000059901.1.
DR   GeneTree; ENSGT00940000163154; -.
DR   Proteomes; UP000002281; Chromosome 3.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR   InterPro; IPR050863; CenT-Element_Derived.
DR   InterPro; IPR009057; Homeodomain-like_sf.
DR   InterPro; IPR006600; HTH_CenpB_DNA-bd_dom.
DR   InterPro; IPR007889; HTH_Psq.
DR   PANTHER; PTHR19303:SF59; HTH CENPB-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR19303; TRANSPOSON; 1.
DR   Pfam; PF04218; CENP-B_N; 1.
DR   Pfam; PF03221; HTH_Tnp_Tc5; 1.
DR   SMART; SM00674; CENPB; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 2.
DR   PROSITE; PS51253; HTH_CENPB; 1.
DR   PROSITE; PS50960; HTH_PSQ; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00320};
KW   Nucleus {ECO:0000256|PROSITE-ProRule:PRU00320};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002281}.
FT   DOMAIN          13..64
FT                   /note="HTH psq-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50960"
FT   DOMAIN          78..157
FT                   /note="HTH CENPB-type"
FT                   /evidence="ECO:0000259|PROSITE:PS51253"
FT   DNA_BIND        40..60
FT                   /note="H-T-H motif"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00320"
SQ   SEQUENCE   200 AA;  22467 MW;  18FF29C9E2ABBA1B CRC64;
     MPGKRRLSAA VIPSAKRERK AITLDVKLQV LRRFEAGEKL SEIAKALGLA VSTVATIRDN
     KEKIKASSRV ATPLRASRLT RHRSAVMETM ERLLHVWLED QTQRNVPLSV TIIQEKAKSL
     FDDLQRQKGE SSRTETFSAS KGWFVRFKER HCLPHFKMNG TAPGHKDGYP EVLKSIIQEG
     SLVSGPQLGE HKRACNTLLC
//