ID A0A182K4F1_9DIPT Unreviewed; 1239 AA.
AC A0A182K4F1;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 22-FEB-2023, entry version 34.
DE RecName: Full=BHLH domain-containing protein {ECO:0000259|PROSITE:PS50888};
OS Anopheles christyi.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43041 {ECO:0000313|EnsemblMetazoa:ACHR005636-PA, ECO:0000313|Proteomes:UP000075881};
RN [1] {ECO:0000313|Proteomes:UP000075881}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ACHKN1017 {ECO:0000313|Proteomes:UP000075881};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Besansky N., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles christyi ACHKN1017.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:ACHR005636-PA}
RP IDENTIFICATION.
RC STRAIN=ACHKN1017 {ECO:0000313|EnsemblMetazoa:ACHR005636-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A182K4F1; -.
DR STRING; 43041.A0A182K4F1; -.
DR EnsemblMetazoa; ACHR005636-RA; ACHR005636-PA; ACHR005636.
DR VEuPathDB; VectorBase:ACHR005636; -.
DR OrthoDB; 4230728at2759; -.
DR Proteomes; UP000075881; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0030374; F:nuclear receptor coactivator activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:UniProt.
DR CDD; cd00130; PAS; 1.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR Gene3D; 3.30.450.20; PAS domain; 2.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR017426; Nuclear_rcpt_coactivator.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR PANTHER; PTHR10684; NUCLEAR RECEPTOR COACTIVATOR; 1.
DR PANTHER; PTHR10684:SF4; TAIMAN, ISOFORM G; 1.
DR Pfam; PF14598; PAS_11; 1.
DR SUPFAM; SSF81995; beta-sandwich domain of Sec23/24; 1.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR PROSITE; PS50888; BHLH; 1.
PE 4: Predicted;
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1239
FT /note="BHLH domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5008125077"
FT DOMAIN 150..204
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT REGION 63..115
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 135..160
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 235..254
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 556..596
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 635..684
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 698..721
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 755..780
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 804..1029
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1087..1115
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1159..1178
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1189..1239
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 635..680
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 707..721
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 758..775
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 804..852
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 865..934
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 954..981
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 982..1029
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1090..1115
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1189..1232
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1239 AA; 134045 MW; 054AC4B0452774C6 CRC64;
MDHRFFVFLL ITHFFRLGPC ELPLPDQWLV HTQLSHSTQS SQASFYSASQ SVGQSLGGLG
GLSASTLPIQ QHSSSSSAQQ HQHQQQQSPS QLQNQINGPQ TTAGAVTPGA APASQLQSQS
KIMNVVVPSV ACPPTASKKI RRKPDAKQPQ SQINKCNNEK RRRELENEYI EQLGEFLQIK
RDMTACKPDK AAILSEVVRT FRHMLEQGNP NLTGSRCSKC SPDCSASCKL HPVQQGEVSS
TEPPLPEPSV NGHSPEKSAY FEAVQHYISN VGWALLEINS EGVIECATEN VRDVLHYSRT
ELHGQSIYSY LHTGDHSKLS PILNKNSFEL NWDQNEMFYQ TPKRTIRTKI RWLLRAPEGA
NETIEQKQQR LEKYKDLLII SAPVKDDTEE SSSVLCLITL PEDDQATIET TTMPQTLDEQ
LTLKLDTSGN IIDYNSSTLR KQFAGNLTKE TIRSIYEICH YQDRQRLNEH LGNVRSSNGT
PHELTYRLRL GGPDVYVHVK TQSKFFRCTK PNETDFIMAI CTVLTENEVA MLSADSGAMV
MGGGGGGSSG GLSAAMSGSN INNNNGNSSS SNPTSSSSSN LGLLMPSTSS TSSITGGGLN
GANDVVARHM AQITQGVSNM GGPLMSSVLN GGGSATGAGS GGVSNTASSM LGSSMTGGSM
SVSGVGMTGP GGSSTPGSGA SGLVGGVGGM GTTASAGMGP AAASMGSIVS PRSNPNSSSL
LCAPSSDNSN FFNTEFELDF PHSTFDMEAV GVGWDSRPDS RTSVTPVSTP RPPSVSAYSP
AAAPMCASPM TPYYSGSTMG GMPSPSNNNG VPGGGGSSNS GGSLMNPSLS LNNNNNNTNN
NNTGSPFGTN AFQFPFEDSK DKLQDMQSQQ QQQQQQQQHH HPHMSPMQQH QSAMSAQQMV
QRQQHHQQQQ QQQHLQQHLH QQSQQPSNTH DSERLRNLLT TKRPHSNASS SSGLDMDHDH
RNPNRILKGL LNSEEDKDTS GNKLSTASLS QRIPQSVRAS TQGSNNGGGV SGLGLATRAG
NDTNKSGSNN MLLQLLNDKS DDDESDARNR QGPSELLKQL QKVKDEPKEH NPPPLNNEEL
IQMLRVQSND RKRPSTEPDE GAAVKRTDNR PSKLRERNKM LASLLANPAK APMPMQTAMP
FNRIIPDIPN SGLARQLANV SSSTAPNQTL NNNNLTTSNN NLKQVQQLNQ MRLQQQQQQQ
QQQQQQHQMR KSAMPSQSQQ PPTSSDIYLN HPQQQHHHQ
//