ID A0A084VSH7_ANOSI Unreviewed; 2587 AA.
AC A0A084VSH7;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 43.
DE RecName: Full=histone acetyltransferase {ECO:0000256|ARBA:ARBA00013184};
DE EC=2.3.1.48 {ECO:0000256|ARBA:ARBA00013184};
GN ORFNames=ZHAS_00008461 {ECO:0000313|EMBL:KFB40921.1};
OS Anopheles sinensis (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=74873 {ECO:0000313|EMBL:KFB40921.1};
RN [1] {ECO:0000313|EMBL:KFB40921.1, ECO:0000313|Proteomes:UP000030765}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24438588; DOI=10.1186/1471-2164-15-42;
RA Zhou D., Zhang D., Ding G., Shi L., Hou Q., Ye Y., Xu Y., Zhou H.,
RA Xiong C., Li S., Yu J., Hong S., Yu X., Zou P., Chen C., Chang X., Wang W.,
RA Lv Y., Sun Y., Ma L., Shen B., Zhu C.;
RT "Genome sequence of Anopheles sinensis provides insight into genetics basis
RT of mosquito competence for malaria parasites.";
RL BMC Genomics 15:42-42(2014).
RN [2] {ECO:0000313|EnsemblMetazoa:ASIC008461-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SIMILARITY: Belongs to the MYST (SAS/MOZ) family.
CC {ECO:0000256|ARBA:ARBA00010107}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ATLV01016022; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ATLV01016023; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KE525051; KFB40921.1; -; Genomic_DNA.
DR STRING; 74873.A0A084VSH7; -.
DR EnsemblMetazoa; ASIC008461-RA; ASIC008461-PA; ASIC008461.
DR VEuPathDB; VectorBase:ASIC008461; -.
DR VEuPathDB; VectorBase:ASIS006930; -.
DR VEuPathDB; VectorBase:ASIS020763; -.
DR VEuPathDB; VectorBase:ASIS024470; -.
DR OMA; TTNVDIC; -.
DR Proteomes; UP000030765; Unassembled WGS sequence.
DR GO; GO:0000786; C:nucleosome; IEA:InterPro.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0004402; F:histone acetyltransferase activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006334; P:nucleosome assembly; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR Gene3D; 3.40.630.30; -; 1.
DR Gene3D; 3.30.60.60; N-acetyl transferase-like; 1.
DR Gene3D; 1.10.10.10; Winged helix-like DNA-binding domain superfamily/Winged helix DNA-binding domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR016181; Acyl_CoA_acyltransferase.
DR InterPro; IPR002717; HAT_MYST-type.
DR InterPro; IPR005818; Histone_H1/H5_H15.
DR InterPro; IPR048589; SAMD1-like_WH.
DR InterPro; IPR036388; WH-like_DNA-bd_sf.
DR InterPro; IPR036390; WH_DNA-bd_sf.
DR InterPro; IPR040706; Zf-MYST.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR10615; HISTONE ACETYLTRANSFERASE; 1.
DR PANTHER; PTHR10615:SF102; HISTONE ACETYLTRANSFERASE; 1.
DR Pfam; PF01853; MOZ_SAS; 1.
DR Pfam; PF21524; SAMD1_WH; 1.
DR Pfam; PF17772; zf-MYST; 1.
DR SUPFAM; SSF55729; Acyl-CoA N-acyltransferases (Nat); 1.
DR SUPFAM; SSF46785; Winged helix' DNA-binding domain; 1.
DR PROSITE; PS51504; H15; 1.
DR PROSITE; PS51726; MYST_HAT; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
PE 3: Inferred from homology;
KW Acetylation {ECO:0000256|ARBA:ARBA00022990};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000030765};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00146}.
FT DOMAIN 90..162
FT /note="H15"
FT /evidence="ECO:0000259|PROSITE:PS51504"
FT DOMAIN 189..260
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 429..746
FT /note="MYST-type HAT"
FT /evidence="ECO:0000259|PROSITE:PS51726"
FT REGION 157..185
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 314..350
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 372..424
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 583..1177
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1192..1248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1267..1605
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1618..1712
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1945..2038
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2114..2140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2260..2280
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2293..2363
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2378..2413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 384..402
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 404..424
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 599..615
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 637..651
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 680..695
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 696..718
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 734..762
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 794..827
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 848..897
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 898..912
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 943..1062
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1076..1105
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1133..1177
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1208..1224
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1302..1320
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1327..1345
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1388..1410
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1419..1453
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1462..1538
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1561..1575
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1587..1605
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1618..1635
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1650..1692
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1945..1978
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1997..2022
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2293..2314
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2316..2336
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2587 AA; 270628 MW; D6275556BBC92988 CRC64;
MRENPDDVSP QVWKEWILEA IRRIRFQKQR PSIQRICQAI GSHHKFHEDI VAEKLEEAVE
AGAVLKVYNK GLHSYKAPTS TQRRVTNVTN ESNLSRLVAK AVRDLGEFDG SSQKSIENYV
QQTNNLHISA DTDFKSVIRN SLRIAISEET VLQEGRLYKP GPVFKPQTKR KSSSPKKRGG
KHLDRSADGS ICLVCQDGDG NGSDDEGEAL TSCNTCGAGL HDTCATGAGH SGSSRCPPTV
SLSRLLEKGN VWHCEECKTC DGCSSQDEEE DGVKGICLLD CWSCKKHFHL SCLSPPLSES
KKCKFPWRLL TGDKDPKGST DAINVPVPYK NPYDSPSEDT LEKQTLPDGV TPQDAELYKY
VREQSTKIVV GVSKASSKQQ HNNNNSDRAR HFSPDRPSQH GHHHQQQQHA PATSSTSIVQ
QSPSKLMAAQ DRCPAAIEFG KYEIETWYSS PFPQEYARLP KLFLCEFCLK YTKSKAVLQR
HQDKCSWRNP PGTEIYRHEG VSVFEVDGNA NKIYCQNLCL LAKLFLDHKT LYYDVEPFLF
YVLTRYDRKG YHLVGYFSKE KHCQQKYNVS CIMTMPQYQR QGYGRELSPV PEDVHRSSTP
SEPVASTSGH GISHATRSKT PLEEPSDDDE PMEASATASA VVNSVPSRST TVGPGRKRGR
PSEKDAAVVK SSSSTATPSF ERLRKRRRLD SQEELGEKSG ATSSYALKDS SPVSTVRGIS
NLRRKRLQLR LSDSEQEGEQ VSLQKPTDQS SQYHNHNSHP HQTVSPRAKE DSGRLTPIAP
VSGNKRLGMS MRNSKRIASV SGSSPANNSE PSADTPTPPT INTELPPSEK LSKPFRATAG
GTAAVRQKHH IESTSSSSPV GSATRATSSS PGKKEPTVVA PTTTTSSNSS SKLRHKPSPP
STGRHKKRSQ AQGGKKSIVP AVADASEDSS GEADDEMEEE TRSSVPPPSS SSIPSVGTKK
SPNKTSSPTG NSKYNTTPPK ISPRLRDQSS SPEIKSSKTI GGSNNRASSS DRRSKQPSSD
AVEPPTTRVK QSVSPQPATS QEAAPSNATV SSEVTNGADA AEGQNTRLVV EEVEPPISST
NEQKPTEGGT TTSPNRATSL EGGSVISRQV AKANLPADQD ETRVAESNGE QHHAVTGVKQ
NDGNEQQAVQ TDKMMQSSDK GEQSSVVCNN STPPVNGVEL NEKISVITES KNCFTNNSSD
SERKGKEAYS TNSNGASNGA SYVSQHHHHD ARRPSGGDDL ANIPTGTAAN NTASVLKINE
NYEKHHGEIL SRNNRPKVIV DGPTNAVGDG KAKEEQSGHA KDGTTESGGS SVTGTEKAEQ
KHAASASEAT GIIQSPPSSG SGDGPSSAHE LAAMHKKKFM KSTQPQCDNT GDHGSPTKAA
NDLLVPMGDQ QQPKQQQQSS SDQSVIKQND EAAGTKLIGS TFGSFSNNTS STNIPTSQSS
TTNCTSVSQA PQVDASEIKK EKSSTTSTSS STPSGTTTTT LASVAGQKVT KNDATSERQT
TTTKTSNSTS NTKASESTAT SCATAVVTSA SSISSLPPML PQPAVDCKRN SAEAGGGSYH
PAEPLPPPPM AASVTPKVEP PTKRGKDKSS YSNTTTSIGA GSSTVTASIN SVVNAAMGNA
SDDSMSTSGS KTGRVPSSIA GGGDQQVAAK RGGSGSSSAS SSLPNAVSSK SEHTSPSVAT
MTSTVVGSTS RRTDGMQRKD STKSAASYGS GSGAGVVSSV TGSLNHCKVQ PPDQYHMSAK
TSNMTLMDGV GGVGVQGSSA IVTGVSTHGA SGSAMGGAGV GGNNIVSSIA GTMSGSGSVG
GGGGGVVSAP PGTGGSMMMS GTSGGMVDGS NKPPVPSEYG GGGGGGSALM GMNNASCRAD
KTTSTGKHSS SIHDAKATMD LNKVSSPFPG INQLPSYAPS QYWQLDPYYH QGYNLSHLDA
SSQKSPNKFH IELTSMAYSG FPSNLYPPFQ HQEQQYQQVP PAASTTPTPY QTGPPSAKET
RAGNVKADRK GAAGMEQSSA GHNNTTTKHG KSSKSASNSG ADKFKGAESK HDTANKLSDS
SCMVAATKQQ QTSAVTDTSA CHATVVQQQQ QQSMHLMAAA SAVVQQKAAG GFQTDPDGMH
VDGTTGGAPN DMKQSVVPGG GGGGGGGPGG GTAAPTNDVH SIGVYTPEST TNSVQSLHQY
GQCDIDVSQL GLESPASIAS DVTSQNSVDA IRPPSVVSQH VGPYSDSLTG QTLNHHTQNM
HQVAAGGGYL GAQLGLAGQA PPYPQSPTSY GSVIQHRMSN NHTPASLHSP HQRLGASPVS
SCAVSSANNF YVQQQQQQQQ QQQQQANSIP HPIGSHTPIP QAPTPAPTPT PTPTPQLEPQ
SCQGGGGAGG AGQQLGQQQG GGSLSCLSKL QQLTTNVDIC NSPVTPPPSA HHGGGGGGGA
GGAGGAGGGA AGAAGANASM VAAQNVVRNI STPPVSMHSA QMSSISNYHK YYAGNMNMPP
GIGGPNSATA AAVRRNAAAA AVAASSAQIQ HMTNTSSRVS PNVAISTNLM SPYGTLNGYR
MTTAQSAGGA GGYIGNPAAG FIQNPAQLGP VQMMNMQSQY QDPSAIQRAQ QNSMYSSYPA
YLPAMRR
//