ID E0VU86_PEDHC Unreviewed; 1120 AA.
AC E0VU86;
DT 02-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2010, sequence version 1.
DT 24-JAN-2024, entry version 79.
DE SubName: Full=Histone-lysine N-methyltransferase, H3 lysine-9 specific, putative {ECO:0000313|EMBL:EEB16942.1};
DE EC=2.1.1.43 {ECO:0000313|EMBL:EEB16942.1};
DE EC=3.1.1.4 {ECO:0000313|EMBL:EEB16942.1};
GN Name=8231218 {ECO:0000313|EnsemblMetazoa:PHUM447810-PA};
GN ORFNames=Phum_PHUM447810 {ECO:0000313|EMBL:EEB16942.1};
OS Pediculus humanus subsp. corporis (Body louse).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Paraneoptera; Psocodea; Phthiraptera; Anoplura; Pediculidae;
OC Pediculus.
OX NCBI_TaxID=121224;
RN [1] {ECO:0000313|EMBL:EEB16942.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=USDA {ECO:0000313|EMBL:EEB16942.1};
RA Kirkness E., Hannick L., Hass B., Bruggner R., Lawson D., Bidwell S.,
RA Joardar V., Caler E., Walenz B., Inman J., Schobel S., Galinsky K.,
RA Amedeo P., Strausberg R.;
RT "Annotation of Pediculus humanus corporis strain USDA.";
RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EEB16942.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=USDA {ECO:0000313|EMBL:EEB16942.1};
RG The Human Body Louse Genome Consortium;
RA Kirkness E., Walenz B., Hass B., Bruggner R., Strausberg R.;
RT "The genome of the human body louse.";
RL Submitted (APR-2007) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EnsemblMetazoa:PHUM447810-PA}
RP IDENTIFICATION.
RC STRAIN=USDA {ECO:0000313|EnsemblMetazoa:PHUM447810-PA};
RG EnsemblMetazoa;
RL Submitted (FEB-2021) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAZO01005466; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; DS235783; EEB16942.1; -; Genomic_DNA.
DR RefSeq; XP_002429680.1; XM_002429635.1.
DR AlphaFoldDB; E0VU86; -.
DR STRING; 121224.E0VU86; -.
DR EnsemblMetazoa; PHUM447810-RA; PHUM447810-PA; PHUM447810.
DR GeneID; 8231218; -.
DR KEGG; phu:Phum_PHUM447810; -.
DR CTD; 8231218; -.
DR VEuPathDB; VectorBase:PHUM447810; -.
DR eggNOG; KOG1082; Eukaryota.
DR HOGENOM; CLU_005790_2_0_1; -.
DR InParanoid; E0VU86; -.
DR OMA; DSWLEDA; -.
DR OrthoDB; 5481936at2759; -.
DR Proteomes; UP000009046; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0002039; F:p53 binding; IEA:InterPro.
DR GO; GO:0004623; F:phospholipase A2 activity; IEA:UniProtKB-EC.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd20905; EHMT_ZBD; 1.
DR CDD; cd10543; SET_EHMT; 1.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR043550; EHMT1/EHMT2.
DR InterPro; IPR047762; EHMT_CRR.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR46307; G9A, ISOFORM B; 1.
DR PANTHER; PTHR46307:SF4; G9A, ISOFORM B; 1.
DR Pfam; PF00023; Ank; 1.
DR Pfam; PF12796; Ank_2; 2.
DR Pfam; PF13606; Ank_3; 1.
DR Pfam; PF21533; EHMT1-2_CRR; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR PRINTS; PR01415; ANKYRIN.
DR SMART; SM00248; ANK; 7.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 5.
DR PROSITE; PS50088; ANK_REPEAT; 5.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|PROSITE-ProRule:PRU00023};
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Hydrolase {ECO:0000313|EMBL:EEB16942.1};
KW Methyltransferase {ECO:0000313|EMBL:EEB16942.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000009046};
KW Transferase {ECO:0000313|EMBL:EEB16942.1}.
FT REPEAT 597..629
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 630..662
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 700..732
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 733..765
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 766..798
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT DOMAIN 889..953
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 956..1073
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 73..98
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 243..263
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 243..257
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1120 AA; 126004 MW; 1FA95407027E4CA5 CRC64;
MKEEKEKSLV ICDIDYKVKY LNSDETFDIK KEENNISKTP LLKENEENFI IKTEDVTDES
SSSKEMATEV FFDSDAHDVK ESKPSKANSG DQNSIHLDNK MPSQTKLILR PRKTETNTGS
DCESKIKKQK LIVCKAENKS AIISEDNFLF NDTSNSISDP TVDVSQNLEN DNFQVPVSKI
IENSEMNNAG TTTDCCEKII ENETNNFESP SNLNPVFETG FEVVDSHVPN SNITSVSRIK
KLKNTKRKKK RRDNGWTKKR KRTDNHQVVK NLCTVLESKE IQIKNDESIT QPISKSVNLK
KKDVEGCEIV QKKPREFAIP SPSLADISVK AAIPESTATS LEASKPKLCL CRKKPNLFVS
GNNNTGDLYC QALDCIDSRI VGCCNTIPSK DVGLYRASER ASYQMMCIVH QQRLLRHNCC
PGCGLFCTQG KYLMCKSWHH FHKSCFSDSK DGMFTCPHCG DSSTPKTIIV NIHSPKDPVF
LPQQKPIRNM KSAKMTISRG NEQKVEESAD PNFNLPASLF KINGEEILPV NGITQILQRD
KLAHLLNIAC RSFSKICCSN RYSFKSLYNA AQNGDVEKLI KVIASGLNPN HVFDEHNNQT
ALHFAAGNGH LPAVHILLQA KAQINIFDSE QNTPLTAAIN AKHNDVVKYL IKCGADLILK
GEDGMTPLHI AAKCGNVGAC FHLLNGTHLP NRYIDGLDDG GWTPMVWASE FNHIDVVKFL
ISKGADSLIK DSEQNIALHW AAFGGSVDIV EIFLNEGSDI NSVNVHGDTP LHIAARQQKY
SCVLLLLARG ARSDVKNKNG ELPRDCSHSP SSDIYKGITL NMEISALLTK FQDRTPKIVS
NDISRGKERN QIQCINEVDD EGEPGNFVYV NESCFTSKIT VHRTITSLQS CKCQNVCSSE
GCNCAAISVK CWYDTDGRLK PDFNYVNPPS IFECNQACHC NRITCRNRVV QNGVTCRFQL
FKTEKRGWGI RTLNSIPKGT FVCEYVGEII SDWEADHRED DSYLFDLENR DGETYCIDAR
YYGNFARFIN HMCVPNLMPV HIFVDHQDLR FPRIAFFANK DILPNEELGY NYGDKFWVIK
WKSFTCVCDS EKCLYSENTI QTTLENYQKK LTEENSQESK
//