ID F5HMP9_ANOGA Unreviewed; 2114 AA.
AC F5HMP9;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 27-JUL-2011, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE SubName: Full=AGAP001451-PC {ECO:0000313|EMBL:EGK97572.1};
GN Name=1281722 {ECO:0000313|EnsemblMetazoa:AGAP001451-PC};
GN ORFNames=AgaP_AGAP001451 {ECO:0000313|EMBL:EGK97572.1};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EMBL:EGK97572.1};
RN [1] {ECO:0000313|EMBL:EGK97572.1, ECO:0000313|Proteomes:UP000007062}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97572.1,
RC ECO:0000313|Proteomes:UP000007062};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M.C., Wides R.,
RA Salzberg S.L., Loftus B.J., Yandell M.D., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A.M., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C.D., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A.A., Liang Y., Lin J.-J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J.R., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z.J., Unger M.F., Walenz B.,
RA Wang A.H., Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M.,
RA Yao A., Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EMBL:EGK97572.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97572.1};
RG The Anopheles Genome Sequencing Consortium;
RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:EGK97572.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97572.1};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [4] {ECO:0000313|EMBL:EGK97572.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97572.1};
RX PubMed=17210077; DOI=10.1186/gb-2007-8-1-r5;
RA Sharakhova M.V., Hammond M.P., Lobo N.F., Krzywinski J., Unger M.F.,
RA Hillenmeyer M.E., Bruggner R.V., Birney E., Collins F.H.;
RT "Update of the Anopheles gambiae PEST genome assembly.";
RL Genome Biol. 8:R5.1-R5.13(2007).
RN [5] {ECO:0000313|EMBL:EGK97572.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=PEST {ECO:0000313|EMBL:EGK97572.1};
RG VectorBase;
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [6] {ECO:0000313|EnsemblMetazoa:AGAP001451-PC}
RP IDENTIFICATION.
RC STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP001451-PC};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008987; EGK97572.1; -; Genomic_DNA.
DR RefSeq; XP_003435841.1; XM_003435793.1.
DR STRING; 7165.F5HMP9; -.
DR PaxDb; 7165-AGAP001451-PC; -.
DR EnsemblMetazoa; AGAP001451-RC; AGAP001451-PC; AGAP001451.
DR GeneID; 1281722; -.
DR KEGG; aga:AgaP_AGAP001451; -.
DR VEuPathDB; VectorBase:AGAP001451; -.
DR eggNOG; KOG0956; Eukaryota.
DR InParanoid; F5HMP9; -.
DR OrthoDB; 163389at2759; -.
DR Proteomes; UP000007062; Chromosome 2R.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0042393; F:histone binding; IBA:GO_Central.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0031491; F:nucleosome binding; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd20901; CC_AF10; 1.
DR CDD; cd15672; ePHD_AF10_like; 1.
DR CDD; cd15574; PHD_AF10_AF17; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 2.
DR InterPro; IPR049773; AF10-like_CC.
DR InterPro; IPR049781; AF10/AF17_PHD.
DR InterPro; IPR034732; EPHD.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR13793:SF162; ALHAMBRA, ISOFORM P; 1.
DR PANTHER; PTHR13793; PHD FINGER PROTEINS; 1.
DR Pfam; PF13831; PHD_2; 1.
DR Pfam; PF13832; zf-HC5HC2H_2; 1.
DR SMART; SM00249; PHD; 2.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR PROSITE; PS51805; EPHD; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 5..57
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 62..181
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS51805"
FT REGION 188..592
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 605..932
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 946..1225
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1273..1519
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1582..1617
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1729..1750
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1903..2114
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 248..320
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 328..405
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 406..420
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 421..515
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 526..549
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 605..621
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 638..663
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 672..754
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 764..840
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 851..883
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 894..924
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 948..962
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 963..982
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 999..1064
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1084..1149
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1150..1184
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1198..1225
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1273..1293
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1310..1360
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1373..1486
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1504..1519
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1730..1750
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1903..1925
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1951..2073
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2087..2114
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2114 AA; 207612 MW; D8F16BA7BB6B1D5D CRC64;
MKEMVGGCCV CSDDRGWSEN PLVYCDGQSC AVAVHQACYG IVTVPSGPWY CRKCESQERP
ARVRCELCPS RDGALKRTDN QGWAHVVCAL YIPEVRFGNV TTMEPIILQL IPQERYNKTC
YICQDMGKGS RANVGACMQC NKSGCKQQFH VTCAQQLGLL CEEAGNYLDN VKYCGYCQHH
YSKLKKGGNV KTIPPYKPIS HEANSSDGPS SPEKEMEPPP PQQHSSSSAA GSGGTGGGGG
GGGGGGSGAG GSSSSSSGLK SSSRLSGEPS VGGSSSTSSS SSSSSSKQRK SSSASKSSSG
SGSGASLSSS SSSSVLASGG GGGSGGSGAS GMSGNSSMST SSSSSSSSGV SSMSGSGGSG
SVSNSSGSGI GGSGIAGSVG GSGSSKTSSG SSGGTGGGGS SGSSSKEKDK YSKSRDKSSK
SSKSSSSSST SGGSGSNFNN STSTSSSAGG SSNQSKDVDM GGTAGSSSGS MGALGANASS
QSSSQSGYSS TAGGGGSGPG GSSSSSSGSS SNSSSKHHSD KPDKGSGGGG SSSQSGGPLS
TNSGVGSGRA ASLSPAAIPT TLIIKPPQDH TGGSGKEPLS KEAIAKMSTS SNFTETIVVN
SESVYNHAGS GSGNAGNTSV IESGKLAMGG SGAGSGGSGG SYGGSTGPTG GSSTGSSSSS
SGGKKRKADA RSTPTSSPTP HSPQQPASSQ HTPSLVVSVP LSTATVPGVN LPASNNSNSS
SHTGSSSSSS SSSSNVNHSA SSNSNSTPNI VSPGHHQGSD RGGGGGSHIN NSNNSNSNSG
GNSNLYQQIS HRAGDQLMNA STNRSSPVIQ QQSTAQNIIQ SSSSSSSTAG SSSSSTVGPS
IGHHHLERQS PSMRSSPAGV TTGGANVITS LHHSQSQQPD LLSPAGGGAT VLQSVSPVPM
STIQRTSSPS TLHAGDNNSS SGGGGGLKFG YEKQAPTTNA RIAALQEEEA STGRRSRYGS
HSRERSGGNG GNNSTGGAGN TGVGGGVSSG GGGGGGKTRT SKKRSQQQQQ QQQQQQQQQQ
QSQLPPDRGS PSIGIESSAS GGSHRRGSSP SPSRRSSHGG GGGSGQSYSY GPAVDRNVDD
HSPSRYHPSS SGGSNAPAAG GNNASNNGSV LSMAAGTGSA TASVVVGNSS SQNSHHHQHL
TPHQQQQQQQ QHSHHHHHHQ PQQQQQQQHH HQQHHQHHQH SHHQQHSAAM GDKLSSGGGT
SAAPSSLHYS TGSSHSASSQ HSSSAIANSN ASNVTSTISS VASPAGPGAT AAAAAAAASV
TTSSIASMSA SSSTFSTAHS NNSSTSAAAR NDGGSSSAGV HPVIEMAGSH GGSQSYQQHH
HHHTSGGIIS GYSQNASSNS SASTKSHQNT NNGSNMESFH QQQQHHHHHQ QQQYHGGGSG
SHSVLSGSGN SSISISSNSN NLSSNNNTST ITTTTTTNTT NITTTSNSSS NSGNNSNSKN
TTIISSNSGN LSGTAGNSNT NNSTGSNHNL SNSSSSSSSG GTIVTAAANP NKKHRGASHH
PSIVELSPSP STSRTPEMIV SSGGVVPYSM SSTMTAASSS SYGGSSGGGG GGLKFSYEAQ
PTNPLSAVST ASMMGSSGSV IAAPQVKDSP PSSPGSDAGG SAAGTAIVSG RGTKRNRKMS
SNAAAVNSGA GAVPTTSVAG QTGGPSIVPA SIVAGGSADA KDGKLFQNGG SSSAAAGGSV
VSATHMLGNQ LNPSSSVAQK MSDQLSMEIE AHAYVPGPID AVPTLMGPQF PGKNRTNNSQ
SMPVGAGGGG NSLSSMLTGG ATATANGNTP QSLEQLLERQ WEQGSQFLME QAQHFDIASL
LSCLHQLRSE NIRLEEHVNN LVARRDHLLA VNARLAIPLN PTAALGGVGM IGGSGGSGPG
AGGAATGGAG VGGAGIGSLP GQFNNIHGNG PIDANVITNA SAVSNRSSRG QHGPNQQQQP
GQQGPAGHFG SGAGSNAAGG GIGGLPQENG IDFRHTNSSH PATNSASIRR NSPSSQPFPS
ATTATGGGGG GGTTRGAPST NSSSANNSTV QAQATGGTGS GEPVLPSTTG TTGRSSTLRP
SSGSANVSST GSNSSTGSSS SNNSNSISTG NGPTPAIGGG AGLNSSTVGP PAGTYHQATR
EQQQTIYNTA HQVE
//