ID F5HKL1_ANOGA Unreviewed; 1778 AA.
AC F5HKL1;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 27-JUL-2011, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=AGAP002807-PB {ECO:0000313|EMBL:EGK96763.1};
GN ORFNames=AgaP_AGAP002807 {ECO:0000313|EMBL:EGK96763.1};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EGK96763.1};
RN [1] {ECO:0000313|EMBL:EGK96763.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EGK96763.1};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EGK96763.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK96763.1};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EGK96763.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK96763.1};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EGK96763.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK96763.1};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EGK96763.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK96763.1};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGK96763.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008859; EGK96763.1; -; Genomic_DNA.
DR RefSeq; XP_003436250.1; XM_003436202.1.
DR STRING; 7165.F5HKL1; -.
DR PaxDb; 7165-AGAP002807-PB; -.
DR GeneID; 1273155; -.
DR KEGG; aga:AgaP_AGAP002807; -.
DR VEuPathDB; VectorBase:AGAP002807; -.
DR eggNOG; KOG1474; Eukaryota.
DR HOGENOM; CLU_238653_0_0_1; -.
DR OMA; MHDRSID; -.
DR OrthoDB; 168195at2759; -.
DR CDD; cd05497; Bromo_Brdt_I_like; 1.
DR CDD; cd05498; Bromo_Brdt_II_like; 1.
DR Gene3D; 1.20.1270.220; -; 1.
DR Gene3D; 1.20.920.10; Bromodomain-like; 2.
DR InterPro; IPR031354; BRD4_CDT.
DR InterPro; IPR043508; Bromo_Brdt_I.
DR InterPro; IPR043509; Bromo_Brdt_II.
DR InterPro; IPR001487; Bromodomain.
DR InterPro; IPR036427; Bromodomain-like_sf.
DR InterPro; IPR018359; Bromodomain_CS.
DR InterPro; IPR027353; NET_dom.
DR InterPro; IPR038336; NET_sf.
DR PANTHER; PTHR22880:SF225; BROMODOMAIN-CONTAINING PROTEIN BET-1; 1.
DR PANTHER; PTHR22880; FALZ-RELATED BROMODOMAIN-CONTAINING PROTEINS; 1.
DR Pfam; PF17035; BET; 1.
DR Pfam; PF17105; BRD4_CDT; 1.
DR Pfam; PF00439; Bromodomain; 2.
DR PRINTS; PR00503; BROMODOMAIN.
DR SMART; SM00297; BROMO; 2.
DR SUPFAM; SSF47370; Bromodomain; 2.
DR PROSITE; PS00633; BROMODOMAIN_1; 2.
DR PROSITE; PS50014; BROMODOMAIN_2; 2.
DR PROSITE; PS51525; NET; 1.
PE 4: Predicted;
KW Bromodomain {ECO:0000256|PROSITE-ProRule:PRU00035};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 49..121
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 422..494
FT /note="Bromo"
FT /evidence="ECO:0000259|PROSITE:PS50014"
FT DOMAIN 708..790
FT /note="NET"
FT /evidence="ECO:0000259|PROSITE:PS51525"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 142..164
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 326..355
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 385..404
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 514..549
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 577..727
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 784..1032
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1101..1152
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1262..1317
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1531..1560
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1615..1778
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 385..400
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 704..727
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 792..811
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 836..1032
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1101..1131
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1266..1317
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1531..1546
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1617..1649
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1654..1699
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1715..1731
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1737..1759
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1778 AA; 184917 MW; 78B4654EA6B4CEBE CRC64;
MDEPPPRNEP VVEPVNGIVQ PPVMPPPERP GRLTNQLHFL LRTVMKAVWK HQFSWPFQQP
VDAKKLNLPD YHKIIKQPMD LGTIKKRLEN NYYWTSKECI QDFNTMFTNC YVYNKPGEDV
VVMAQTLEKL FLTKVSLMPK DETEMEVQQP KGGKKKPRSL APPGTLVGNT SVTGAAAAAG
VAPNAAIVAG ANAVAVAAAA AAAARARPGS ALGAAVSSSV PPLSAVPPGM GAASTVVAPS
VPPIGTMPPQ TVPGSTNTTT TAGPNAAVAA LTAAAAAVAA VNTPHNAMGN PIGAPVVSKP
SAPTNPPYMV NSSQPGMDTV LPPQQPAKVK KGVKRKADTT TPTATVFDPH YGTPMDSNAA
KIATRRESGR QDIAPYQTSA YPMSPMAHQG SSSSQYPPKN KEKLSDALKS CNEILKELFS
KKHSGYAWPF YKPVDAELLG LHDYHDIIKK PMDLGTVKRK MDNREYKSAP EFAADVRLIF
TNCYKYNPPD HDVVAMGRKL QDVFEMRLAN IPDEPVNNVA PHHQCKESEP SSSSDSNDSD
EESESDEECA QKLKLLEKQL FEMQERMRKL TEEALLKKKQ KKKSKDKKKS SVAGPGGAAG
GGASDIKGMM DPHAGLSMPH GPGKAMHQMA GVMGQPPMGG AGAAPVMPAA ATKAKAKGQR
APKAGGVAGG AASNAPAKRA KNAGAAGGTS APRAPNKKKA SQNVSNFDSE EEDTAKPMSY
DEKRQLSLDI NKLPGDKLGR VVHIIQSREP SLRDSNPDEI EIDFETLKPS TLRELESYVA
SCLRKKTHKK VSGKSKEEQM TEKKQELEKR LQDVTGQLGT GKKNAKKDEA NKQDVAPSGG
NMSSSSSSSD SSSSSSSDSS SSDSSDSEAA TASTLSATAA VSQLLPSAGA NNTNTTSNTT
TTTTNNNSNT ITPLPNAAAQ QPQQQQQQQQ HPPANGSSAL NPTAVAGPPQ HSQVHLAAHQ
QQQQHRQTPA NQLQQSLIPP PPHSSQSTST LVSSTSSSIT TSSSTSSSGA GGAPSSMGVA
ATTGSHNGNN PPAATTSIAA AVSSGYGGMM GLSQTPATTV VDSVPALLQI PSLAQQQTMA
AGSHTLANAL MMSASGTANS NSNGMGGGVS GTNSNSGNPN PIGSSNSNQL SLPPPIPSPN
TTTTTTSTML PVNNNALTTT TTTSTSNNLG NSAILNNSIT VTQHQQQNAA LGANGIAPPV
PVTAHAPTSG ITTSSSNVIN TNGSDGALNA SNGLINSMLG LPPTSQLATS LAAMTNLASM
AGGLPQPPLQ MQTQSQQVSK QQQQQQLVHG GATQSQQQPQ QQLQHAQQQQ QQQQQLNQQF
RLGGAAGNNG HGGLGGGFID PLEHTLNSFE QSIKAEPLGL NLLNDMPAAL KQDLTTSMLS
ANMDLCMAHL QNQLHSGNNG FASEFNGNGP LNGGASAGMG LGAGMNQMVM NSMASSAAAA
AAAAANSALM GGGMPSIFDP LQSFMRSGAG VAAAAAAAGG YHQSNAVGAG GAPAGMAGLM
TGASNSSNGA GLGSGASAGN VGAMHDRSID LTQAGSNNSN SSGGSLGGAV PPGMMPTGNG
GGNMMAIKKE EPGVGGGKFM LTPKPIEDLL INPNEKKMGS TPPPADVKGG NVANAFNKGQ
DPKSAWSSLA AAVSPQNTPT SNKPKPSMDS FQAFRNKAKE KFDRMQQQEL KRSQKEQAEK
ELKRQQEQQK IKQEEINNGR KLPIEPVQPR VVEEIKTSPQ GSPSPGGTPT TPHALDRSAA
KRAELRRLEQ ERRRREAMAG QIDMNMQSDL MAAFEESL
//