ID Q7PWX5_ANOGA Unreviewed; 1676 AA.
AC Q7PWX5;
DT 15-DEC-2003, integrated into UniProtKB/TrEMBL.
DT 27-JUL-2011, sequence version 4.
DT 24-JAN-2024, entry version 140.
DE SubName: Full=AGAP001099-PA {ECO:0000313|EMBL:EAA01211.4};
GN ORFNames=AgaP_AGAP001099 {ECO:0000313|EMBL:EAA01211.4};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EAA01211.4};
RN [1] {ECO:0000313|EMBL:EAA01211.4}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EAA01211.4};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EAA01211.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA01211.4};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EAA01211.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA01211.4};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EAA01211.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA01211.4};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EAA01211.4}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EAA01211.4};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EAA01211.4}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008987; EAA01211.4; -; Genomic_DNA.
DR RefSeq; XP_322068.4; XM_322068.5.
DR STRING; 7165.Q7PWX5; -.
DR PaxDb; 7165-AGAP001099-PA; -.
DR GeneID; 1282065; -.
DR KEGG; aga:AgaP_AGAP001099; -.
DR VEuPathDB; VectorBase:AGAP001099; -.
DR eggNOG; KOG1827; Eukaryota.
DR HOGENOM; CLU_001483_2_0_1; -.
DR OMA; WQFYETL; -.
DR OrthoDB; 2878065at2759; -.
DR PhylomeDB; Q7PWX5; -.
DR GO; GO:0016586; C:RSC-type complex; IEA:InterPro.
DR GO; GO:0003682; F:chromatin binding; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006338; P:chromatin remodeling; IEA:InterPro.
DR CDD; cd04717; BAH_polybromo; 1.
DR CDD; cd05524; Bromo_polybromo_I; 1.
DR CDD; cd05517; Bromo_polybromo_II; 1.
DR CDD; cd05515; Bromo_polybromo_V; 1.
DR CDD; cd05526; Bromo_polybromo_VI; 1.
DR Gene3D; 2.30.30.490; -; 2.
DR Gene3D; 1.20.920.10; Bromodomain-like; 6.
DR InterPro; IPR001025; BAH_dom.
DR InterPro; IPR043151; BAH_sf.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR036427; Bromodomain-like_sf.
DR InterPro; IPR018359; Bromodomain_CS.
DR InterPro; IPR009071; HMG_box_dom.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR037968; PBRM1_BD5.
DR InterPro; IPR037382; Rsc/polybromo.
DR PANTHER; PTHR16062:SF19; PROTEIN POLYBROMO-1; 1.
DR PANTHER; PTHR16062; SWI/SNF-RELATED; 1.
DR Pfam; PF01426; BAH; 2.
DR Pfam; PF00439; Bromodomain; 5.
DR Pfam; PF00505; HMG_box; 1.
DR PRINTS; PR00503; BROMODOMAIN.
DR SMART; SM00439; BAH; 2.
DR SMART; SM00297; BROMO; 6.
DR SMART; SM00398; HMG; 1.
DR SUPFAM; SSF47370; Bromodomain; 6.
DR SUPFAM; SSF47095; HMG-box; 1.
DR PROSITE; PS51038; BAH; 2.
DR PROSITE; PS00633; BROMODOMAIN_1; 1.
DR PROSITE; PS50014; BROMODOMAIN_2; 5.
DR PROSITE; PS50118; HMG_BOX_2; 1.
PE 4: Predicted;
KW Bromodomain {ECO:0000256|PROSITE-ProRule:PRU00035};
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00267};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00267}.
FT DOMAIN 57..127
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 199..269
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 352..422
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 514..584
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 645..715
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 904..1021
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 1101..1217
FT /note="BAH"
FT /evidence="ECO:0000259|PROSITE:PS51038"
FT DOMAIN 1298..1370
FT /note="HMG box"
FT /evidence="ECO:0000259|PROSITE:PS50118"
FT DNA_BIND 1298..1370
FT /note="HMG box"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00267"
FT REGION 1..35
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 438..485
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1273..1296
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 441..455
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1676 AA; 192602 MW; 1906C29183CF390A CRC64;
MSKRRRMSSL QDDENSEEDA SPEHSPVPTT TRKKKRLDPM ELCQHLYESI RTFKKEDGST
LCDTFIRAPK RRQEPSYYEV VVNPIDLLKV QQKLKTDSYE DVEDLAADIE LIVNNAKAFY
KPDSTEYQDA CQLLDLFNTN KKRILEHHIE EGWSESKTRK ITRPRKSLTT EEDEYEEWSD
FDPYEELFAT VMTATDPLDN HDLYHMFQLL PSKKLYPGYY DIIDHPIDLK LIATKIQTSA
YSSLNEMEKD LLQMTKNACT FNEPGSQIYK DAKMLKKIFM AKKTEIESGR YRKPPLKRPR
QASSAMVAAL KEEIDTSDDD FDDSIETEGD GPLWQLFDQL YNTANTNDPN ALGAPLGEAL
WKLPNKRFHP EYYNQIKKPI SMAQIRNKLK KGIYTHITDM TADLYLMLDN AKKANAPNSK
IHKDALKMQR ILNQKLIDSG DLEESDEDED EEDTDSSTHA STRKKGRIPK SVGSPSHGVS
SNTVVSRTNR IAPAVSLKKK LLSLHEFLVG FTYDDHQPMA LFMEKPSKKL YPDYYQVIQH
PIDMTTIENN IKADRYSTID DIVGDYRLMF SNCRKYNEEG SMIYEDANIL EKALNEKLKE
FSGISKKLNI IGKIPKPARK SNSTPLENKL KQMYDTIREY REPKQNRQLS YIFMKLPSKN
EYPDYYDIIK DPIDIEKIEK KLRQQIYETV DDMAADFMLM FENACKYNEP DSQIYKDALC
LQQLLIQTKQ ALRSEETVPN VQQAVQELLL SLFTTFYNYQ DEEGRCYSDS LAELSEYDEC
DGNRIRAISL DLIKRRLDKG LYKRLDIFQE DIFSCLERAR RLSRTDSQVF EDSIELQSFF
IKKRDELCQN VNVLESPALS YNTMHLSAAV ESLRQSKLLQ EEEADTDSDA VQPSQGESMT
IDQKVFSPGD FVYIDLPENK IPGIMYIERL WTTSDNIKMM NGLLLLRPHE TFHVQTRKFM
EQELFKSDQR IEIPLSKALN KCFVMHIRDY VKLKPENFSS KDVFVCESRY SSKARSFKKL
KTWNLVRAND PVKLTARETP IEVKRVMSVF KERVEKHKEE LSELQLQEAI PEKEKPNVVI
YMNGVEDGNV YYEQYNTVCG GVVKTGDYVY VATESGKQSI SQITSIWETK DGKVLFRGPW
LLTPPEVPGT VNRLFYRQEV LLSTIQETTS AVAIVGRCAV LDQSEYITRR PTEIAESDVY
MCESIFDEFK KQIRKIVAPS GLRKFTHSQM VTTDEIYHFR RPINPPKDMK DDFGIIDDSI
DGPPSIGSDT IMTASPHPTH SINTSTPSMT TKKTSKPGKK LVTGYILYSS EHRKTICASN
PEATFGDVSR IVGNEWRNLS DQEKTIWEQR AIKMNEESAA KHAAEMGESN CPSPANLKTE
SPIIQDIITN HCCWDKCDFQ FEDPAEFSEH CIAEGVGCVY KSFVAPTEQE FICIWRNCVR
LRRNMPPFPS VTRLVKHVKE VHLTKGVSKL IQPQDRSKNY VHSKRFTSGI QIGNNAGMSG
SNTNTNNIGM ITLQPMGNVG NHSSHSTSGN LMALSSQNPG VNQIQSMHAQ QPPSMPNANG
INAQQIGSTI ATPYFSYISS QPAEPLFVTV PPRPQRVLHS DAYIKYIERL QQKSTYITPW
QKTFTARKES IPNTDVNRLP SQWLGKRGQD RPEVVIDALW DLRNFMMKDV IQFDRF
//