ID A0A182XDJ4_ANOQN Unreviewed; 712 AA.
AC A0A182XDJ4;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE RecName: Full=BHLH domain-containing protein {ECO:0008006|Google:ProtNLM};
OS Anopheles quadriannulatus (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=34691 {ECO:0000313|EnsemblMetazoa:AQUA007895-PA, ECO:0000313|Proteomes:UP000076407};
RN [1] {ECO:0000313|Proteomes:UP000076407}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SANGQUA {ECO:0000313|Proteomes:UP000076407};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles quadriannulatus QUAD4_A.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AQUA007895-PA}
RP IDENTIFICATION.
RC STRAIN=SANGQUA {ECO:0000313|EnsemblMetazoa:AQUA007895-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182XDJ4; -.
DR STRING; 34691.A0A182XDJ4; -.
DR EnsemblMetazoa; AQUA007895-RA; AQUA007895-PA; AQUA007895.
DR VEuPathDB; VectorBase:AQUA007895; -.
DR Proteomes; UP000076407; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd11410; bHLH_O_HES; 1.
DR Gene3D; 6.10.250.980; -; 1.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR003650; Orange_dom.
DR PANTHER; PTHR10985; BASIC HELIX-LOOP-HELIX TRANSCRIPTION FACTOR, HES-RELATED; 1.
DR PANTHER; PTHR10985:SF77; TRANSCRIPTION FACTOR HES-1; 1.
DR Pfam; PF07527; Hairy_orange; 1.
DR Pfam; PF00010; HLH; 1.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00511; ORANGE; 1.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF158457; Orange domain-like; 1.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS51054; ORANGE; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125}.
FT DOMAIN 50..112
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT DOMAIN 123..165
FT /note="Orange"
FT /evidence="ECO:0000259|PROSITE:PS51054"
FT REGION 33..60
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 240..264
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 388..416
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 548..712
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 241..255
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 584..598
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 661..676
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 677..704
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 712 AA; 75296 MW; D45C5B7C8569133B CRC64;
MTAKKDKDYK NHKHGSELSA TAISSKLDSH LASQTGTTAP GLIHAAQDPL KRTNKPLMEK
RRRARINQSL AILKALILES TVKSKSGDGQ TKHSKLEKAD ILELTVRHFQ RHRNLDSPGI
DKYRAGYTDC AREVARYLAT PEPPPLPSVP TLTDAGSKAR LLRHLDNCIA EIDTEICPKT
VATGANGLPP VAASPGVGQD GKMLVDGVGA GGAAMYDGLK KTKDLLVEYG GAGGVVPQDT
NPLDFSKTNR EIGRSPYSYR GSPTDLALLG PKSTSLLHQD ENNNRGTSGG TIGVHSAKPT
GHHHPQATVS VANLEDMIHG SEIATTSAAA SSHSAAALNK MMLGHHQQHG KLSIEMASLK
NCRIDPSAIK HEPGTAAALL ANGYSTASSA VSSPTPSPAG GILESDKDVS NHSEHSAVVP
QPVAAGYPPQ SQVALLLPDH YIQLATALGL GSTPPMLDHH SAAAAAAAAA AAASLTSTDF
ETLIEQNRRQ AAVLAHEKMT AEMLGSLHAL PDNVDEGYWE LYKKSLGIPD SISKLSLTDV
RALMAKGMPP KVECGPPSSS VTPVPTVPPV TKMEIDDQQQ QQHHHHHQQH HHQQQHHAVH
HTAGSVKSEP PSSGSPGLTS EPDVADGRAS ANGSRSPHSV KKESSSDGDV PMPAARPDPS
HTPYLEERRS LSPRSVEDRR SPTSSNASGG SSERCNSTST VDGLHAPTWR PW
//