ID A0A1B0AQ76_9MUSC Unreviewed; 556 AA.
AC A0A1B0AQ76;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE RecName: Full=HTH CENPB-type domain-containing protein {ECO:0008006|Google:ProtNLM};
OS Glossina palpalis gambiensis.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Hippoboscoidea;
OC Glossinidae; Glossina.
OX NCBI_TaxID=67801 {ECO:0000313|EnsemblMetazoa:GPPI004651-PA, ECO:0000313|Proteomes:UP000092460};
RN [1] {ECO:0000313|Proteomes:UP000092460}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IAEA {ECO:0000313|Proteomes:UP000092460};
RA Aksoy S., Warren W., Wilson R.K.;
RL Submitted (JAN-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:GPPI004651-PA}
RP IDENTIFICATION.
RC STRAIN=IAEA {ECO:0000313|EnsemblMetazoa:GPPI004651-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JXJN01001729; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A1B0AQ76; -.
DR EnsemblMetazoa; GPPI004651-RA; GPPI004651-PA; GPPI004651.
DR VEuPathDB; VectorBase:GPPI004651; -.
DR Proteomes; UP000092460; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR006600; HTH_CenpB_DNA-bd_dom.
DR InterPro; IPR006612; THAP_Znf.
DR Pfam; PF03221; HTH_Tnp_Tc5; 1.
DR Pfam; PF05485; THAP; 1.
DR SMART; SM00674; CENPB; 1.
DR SMART; SM00692; DM3; 1.
DR SMART; SM00980; THAP; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51253; HTH_CENPB; 1.
DR PROSITE; PS50950; ZF_THAP; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00309};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00309}.
FT DOMAIN 1..81
FT /note="THAP-type"
FT /evidence="ECO:0000259|PROSITE:PS50950"
FT DOMAIN 97..171
FT /note="HTH CENPB-type"
FT /evidence="ECO:0000259|PROSITE:PS51253"
FT REGION 227..503
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 241..269
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 299..313
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 314..373
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 382..398
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 409..503
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 556 AA; 63040 MW; 0212371ADDC05AE7 CRC64;
MPTIRRCCIV GCMSNSRQNP QLQFFQFPKP DNPFYKMWTH ACHASLSRIL PFKKPVVCAL
HFHPHCIGGR RLTGSAVPTL KLEVPSNLKA VEQQAMVEEI ERSRKCAYIN AVVYEWLVRA
NLNPQLRGSI THGMIKEKAE TAKQIIGCDT FTADNRWLNR FRESHLTGFA QKLASNQLKP
MGPSLWIPDI VQDLQHLFPP TSAERIESFE NVPEQYINYM KQYGEYEEEE EVEGEQKYNE
HEQSYQQQQQ HQHQQTSHNY YNQPSGYDPY GQQHHHHPGP PTGYNAYHEY YSHQQQQQHH
HHQQQHHQHA HHLNNSYGHS YHHTQTSNVP TSHTPPQPQP TLLSHSQTPP LNETSRRASP
YPTQSNNVSN PPKRRRTNDT EPNEQSNGAN SPPSLNGVRN DQEIEDLTDG PENNGNNDPN
HSNNTNTVTS ATNSNSNNYQ NEDSQNYKVK LPLPATNGSS TPKTTLTNGS NSGSNSPKST
TNEATAHTNG HSSASPTPST STQIKELETY AQALEYLKPL EDFALFKENF RAIGLISQLE
VILRKGDRSG LPNLAE
//