ID Q7QE88_ANOGA Unreviewed; 1045 AA.
AC Q7QE88;
DT 15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT 27-JUL-2011, sequence version 5.
DT 27-MAR-2024, entry version 147.
DE SubName: Full=AGAP000773-PA {ECO:0000313|EMBL:EAA06894.5};
GN ORFNames=AgaP_AGAP000773 {ECO:0000313|EMBL:EAA06894.5};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA06894.5};
RN [1] {ECO:0000313|EMBL:EAA06894.5}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EAA06894.5};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EAA06894.5}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA06894.5};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EAA06894.5}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA06894.5};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EAA06894.5}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA06894.5};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EAA06894.5}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA06894.5};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAA06894.5}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008847; EAA06894.5; -; Genomic_DNA.
DR RefSeq; XP_311311.5; XM_311311.5.
DR AlphaFoldDB; Q7QE88; -.
DR STRING; 7165.Q7QE88; -.
DR PaxDb; 7165-AGAP000773-PA; -.
DR GeneID; 1272364; -.
DR KEGG; aga:AgaP_AGAP000773; -.
DR VEuPathDB; VectorBase:AGAP000773; -.
DR eggNOG; KOG3559; Eukaryota.
DR HOGENOM; CLU_291905_0_0_1; -.
DR InParanoid; Q7QE88; -.
DR OMA; FLEPYYD; -.
DR OrthoDB; 5396877at2759; -.
DR GO; GO:0005737; C:cytoplasm; IEA:InterPro.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0005667; C:transcription regulator complex; IEA:InterPro.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR GO; GO:0000977; F:RNA polymerase II transcription regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR GO; GO:0045944; P:positive regulation of transcription by RNA polymerase II; IEA:UniProt.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd00130; PAS; 2.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR Gene3D; 3.30.450.20; PAS domain; 2.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR001067; Nuc_translocat.
DR InterPro; IPR001610; PAC.
DR InterPro; IPR000014; PAS.
DR InterPro; IPR035965; PAS-like_dom_sf.
DR InterPro; IPR013767; PAS_fold.
DR InterPro; IPR013655; PAS_fold_3.
DR PANTHER; PTHR23043; HYPOXIA-INDUCIBLE FACTOR 1 ALPHA; 1.
DR PANTHER; PTHR23043:SF36; SIM BHLH TRANSCRIPTION FACTOR 1B-RELATED; 1.
DR Pfam; PF00010; HLH; 1.
DR Pfam; PF00989; PAS; 1.
DR Pfam; PF08447; PAS_3; 1.
DR PRINTS; PR00785; NCTRNSLOCATR.
DR SMART; SM00353; HLH; 1.
DR SMART; SM00086; PAC; 1.
DR SMART; SM00091; PAS; 2.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR SUPFAM; SSF55785; PYP-like sensor domain (PAS domain); 2.
DR PROSITE; PS50888; BHLH; 1.
DR PROSITE; PS50112; PAS; 2.
PE 4: Predicted;
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 1..53
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT DOMAIN 78..141
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT DOMAIN 276..331
FT /note="PAS"
FT /evidence="ECO:0000259|PROSITE:PS50112"
FT REGION 162..185
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 390..423
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 559..578
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 622..681
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 693..762
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 791..864
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 916..1000
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1016..1045
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 170..185
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 390..404
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 559..577
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 623..646
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 728..761
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 791..810
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 843..864
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 916..972
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1045 AA; 112184 MW; A7DE62BF251E2BE9 CRC64;
MKEKSKNAAR SRREKENAEF LELAKLLPLP SAITSQLDKA SIIRLTTSYL KMRAVFPEGL
GDAWGTQHIP NNPRDLAIKE LGSHLLQTLD GFIFVVAPDG KIMYISETAS VHLGLSQVEL
TGNSIYEYIH NYDQDEMTSV LSLQPNMYVG PPAAVYGSGP GGYGGMPQPP HHHHNHHHGH
YHHHHHHSYG QHQTIEIERT FFLRMKCVLA KRNAGLTTSG YKVIHCSGYL KARIYPGDAT
YGEGHSCIQN LGLVAVGHSL PPSAITEIKL YQNMFMFRAS MDLKLIFLDA KVAQLTGYEP
QDLIEKTLYQ YVHAADILHM RYSHQVLMYK GQVTTKYYRF LTKGGGWAWV QSYATVVHNT
RSSRPHCIVS VNYVLSDQEA QDLLLNEVQQ PQHSAPTQHQ PSPSVKPEAG GGGGGGVAAS
NKERSLSEPA TALAATMTPV SGLASGVPAH MSPIGGTVVV GGNNGTARRG ASVQPPHQAS
MSYGLLGKEL QHQDHLQQQQ QQQQQHQHNT FLQLANLDDD ASLGPIPGAP SACLVTHPGS
TGLPPSTGQD EYELQMQYGP GTTHQQEPNT GSTGYCMNAA PTGGLLEGHG AGPHPGTDGS
DAFLEPYYDQ FYGGYDSRQD PISMRPYSAS SNSCSSSEAE GVLGHQQQHH QQHHHQQQQQ
HHQQHPLQQQ PPPPSYGSMS MAGAGVLRQH VTTGDLGNTY GDHHQLDFVP SNGHQQQQHH
HHQLQQLDQH TGHTNGGNSS FLYDASPATM DGSTSSASPF DGIAQPFELH HLPTTHGHFR
QQQHADLQQQ LGESPLVSST SSSSSSALLG GSPARRPAPD RDESVVSGAE PGASKQPKLA
NHHQHQHTLA NGTVSSNGTT TGSATALQAT NGAASSNHLR GAGNGSIIIS TTDTNQGADT
NGVVTVSINL NHLHSSTGGV SSSSGSTSGG PSSISNSHLL STSANDHNGA INNNNNNNNN
NTNSNNNNLH SLEKKGGPVP NRNSQPVRSQ PPHYTADSHY DLPHYTSVIV ESTANGSTAA
PNNGANSVTG KLSSNEYISQ QQPCT
//