ID W6V2F6_ECHGR Unreviewed; 771 AA.
AC W6V2F6;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE SubName: Full=Histone acetyltransferase MYST4 {ECO:0000313|EMBL:EUB60109.1};
GN ORFNames=EGR_04962 {ECO:0000313|EMBL:EUB60109.1};
OS Echinococcus granulosus (Hydatid tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC Echinococcus granulosus group.
OX NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB60109.1, ECO:0000313|Proteomes:UP000019149};
RN [1] {ECO:0000313|EMBL:EUB60109.1, ECO:0000313|Proteomes:UP000019149}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24013640; DOI=10.1038/ng.2757;
RA Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL Nat. Genet. 45:1168-1175(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EUB60109.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APAU02000034; EUB60109.1; -; Genomic_DNA.
DR AlphaFoldDB; W6V2F6; -.
DR STRING; 6210.W6V2F6; -.
DR EnsemblMetazoa; XM_024494211.1; XP_024351305.1; GeneID_36340677.
DR OMA; MEYELHY; -.
DR OrthoDB; 3921326at2759; -.
DR Proteomes; UP000019149; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0016740; F:transferase activity; IEA:UniProtKB-KW.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR021893; DUF3504.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR12628:SF10; C. ELEGANS HOMEOBOX; 1.
DR PANTHER; PTHR12628; POLYCOMB-LIKE TRANSCRIPTION FACTOR; 1.
DR Pfam; PF12012; DUF3504; 1.
DR Pfam; PF00628; PHD; 1.
DR SMART; SM00249; PHD; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR PROSITE; PS50105; SAM_DOMAIN; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000019149};
KW Transferase {ECO:0000313|EMBL:EUB60109.1};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 467..520
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 681..731
FT /note="SAM"
FT /evidence="ECO:0000259|PROSITE:PS50105"
FT REGION 1..26
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 101..130
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 555..672
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 9..23
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..121
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 555..579
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 592..619
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 771 AA; 82617 MW; 4A43BB8051282F27 CRC64;
MVSATPMPTL SLAPPPPPPA SSGGATGAPA AMMMYPGIYN PTVFAVAPDT MVTTTFQPDT
STVSAAAGTV TMPTPAVALT TTVGTTGGVA ISTDVVTETS VTSTTPANAG AANSSSTSAP
ETPKKERVEE ATSAETDVCC RCHQKASENG NDKFLVCGDC GLKAHLLSQG RRKLQAPGPS
TDGLVGYFHP ESSPAEECLW SANFFGHSSV SQLTYTLVYL LGKILCIRSG AELRAMHLWN
TLKLFPMHKL PQHNQKVVME YELHYTLPPD PPIPPPALYE NSKRLLSLSE RAGGNRFLVI
LDHILVKTQK GFVIKHSVKN NHQRCLVCLH ALLLQKRSCS KRSLRVDNYF LSWMDSSPSA
RFLTDPLSDV ALSTIVYDIR VVLTSLQQYL QRGPVPLETN LWSIFAQGQQ FAGSPSQQSR
PVLAGHGEEC SVPLDLSVRG RAHPKCLDFW PEVTRRARHG VWQCADCKSC SVCKNKEAEN
VILICEACDK GFHGNCHSPV VPEKSANQTT PWVCSGCQAE GYCVHVGNVS SSAPVSTPSS
ADLGVSSLAI STSEGVMSTS SMNSKPNQPS TTAMMTTTAG SVDPLLAEPT GVTVPENSNV
PIPIKESETK ASAKPVSTLT TADLPGPLPM EEGEKNGEER EDAEEDDSMP PNLGTPHHDP
PASEMPLLQT DSGRPEDVRL WTVDHVADWL REQGGFEKEA EAFRHQDIDG TSLLLLKSMS
LLTAPSGSSG VEECWMTDVN DNDDDPGVSN LTCHAVRAVQ PAKNPPQCPY A
//