ID A0A182K2R4_9DIPT Unreviewed; 598 AA.
AC A0A182K2R4;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE RecName: Full=Transcriptional repressors of the hairy/espl family {ECO:0008006|Google:ProtNLM};
OS Anopheles christyi.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43041 {ECO:0000313|EnsemblMetazoa:ACHR005048-PA, ECO:0000313|Proteomes:UP000075881};
RN [1] {ECO:0000313|Proteomes:UP000075881}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ACHKN1017 {ECO:0000313|Proteomes:UP000075881};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles christyi ACHKN1017.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ACHR005048-PA}
RP IDENTIFICATION.
RC STRAIN=ACHKN1017 {ECO:0000313|EnsemblMetazoa:ACHR005048-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182K2R4; -.
DR STRING; 43041.A0A182K2R4; -.
DR EnsemblMetazoa; ACHR005048-RA; ACHR005048-PA; ACHR005048.
DR VEuPathDB; VectorBase:ACHR005048; -.
DR OrthoDB; 2968390at2759; -.
DR Proteomes; UP000075881; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd11440; bHLH-O_Cwo_like; 1.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR003650; Orange_dom.
DR PANTHER; PTHR10985; BASIC HELIX-LOOP-HELIX TRANSCRIPTION FACTOR, HES-RELATED; 1.
DR PANTHER; PTHR10985:SF141; TRANSCRIPTION FACTOR CWO; 1.
DR Pfam; PF07527; Hairy_orange; 1.
DR Pfam; PF00010; HLH; 1.
DR SMART; SM00353; HLH; 1.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF158457; Orange domain-like; 1.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS51054; ORANGE; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125}.
FT DOMAIN 42..97
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT DOMAIN 112..143
FT /note="Orange"
FT /evidence="ECO:0000259|PROSITE:PS51054"
FT REGION 20..44
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 337..424
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 482..506
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 29..44
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 349..383
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 486..505
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 598 AA; 65419 MW; AF67A0BD10A72828 CRC64;
EAGVPGYTYC GEPGLNFSAN NTTYSEDDAD FPPGRRGKTS RQDPLSHRII EKRRRDRMNS
CLADLSRLIP QQYMRKGRGR VEKTEIIEMA IRHLKNLQSQ ECGRESSCAE QYRHGYNECL
AEAAKFMMRE RGEEMCFRMV AHLKEHCNEI MKGELAKTRC GAELANGGSP IYLAGGQLGH
LREMLTCPSD LEHSSNDHHD VKDLSFRSAT STSSSTHSNN PPQAAVITST APSIVQHLDT
SSNHSTQDYD APSPPARLCP NGSVASLQQD TNNNINQHES VLRTIRMRKF SEHASPEHEH
SHNSYKFKNY IQQRFSQDTH ENGHSTDFDR SPVSLHEDQH LHHAHSLRAS PLTNGSMSGA
DSKSSSTSTL TGKASTVASS TDEPLSLKRK LPSAPARNGA DSPAAGPMEN GTHHQHQQHG
ASECAPHEKK LMLSKNGHTM VATSSSSSSS ATPLPAADIK HELLSSLGGS GGTMVALGPT
VTPSFAHHHH HQQQQQQQQQ HHHHSSYHPV PIFACHTQGF YIPLNVDYET LLPYLNGIDL
LSKNFLQMPP LHPISISVNY TPGTLAGSGG SLLLKASTVN GLNSQTKAKL VEGIINGC
//