ID W6V3V6_ECHGR Unreviewed; 1430 AA.
AC W6V3V6;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE SubName: Full=Histone-lysine N-methyltransferase SETD2 {ECO:0000313|EMBL:EUB60744.1};
GN ORFNames=EGR_04370 {ECO:0000313|EMBL:EUB60744.1};
OS Echinococcus granulosus (Hydatid tapeworm).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Cestoda;
OC Eucestoda; Cyclophyllidea; Taeniidae; Echinococcus;
OC Echinococcus granulosus group.
OX NCBI_TaxID=6210 {ECO:0000313|EMBL:EUB60744.1, ECO:0000313|Proteomes:UP000019149};
RN [1] {ECO:0000313|EMBL:EUB60744.1, ECO:0000313|Proteomes:UP000019149}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24013640; DOI=10.1038/ng.2757;
RA Zheng H., Zhang W., Zhang L., Zhang Z., Li J., Lu G., Zhu Y., Wang Y.,
RA Huang Y., Liu J., Kang H., Chen J., Wang L., Chen A., Yu S., Gao Z.,
RA Jin L., Gu W., Wang Z., Zhao L., Shi B., Wen H., Lin R., Jones M.K.,
RA Brejova B., Vinar T., Zhao G., McManus D.P., Chen Z., Zhou Y., Wang S.;
RT "The genome of the hydatid tapeworm Echinococcus granulosus.";
RL Nat. Genet. 45:1168-1175(2013).
CC -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EUB60744.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; APAU02000027; EUB60744.1; -; Genomic_DNA.
DR STRING; 6210.W6V3V6; -.
DR EnsemblMetazoa; XM_024493619.1; XP_024351940.1; GeneID_36340085.
DR OMA; GCINRAV; -.
DR OrthoDB; 950362at2759; -.
DR Proteomes; UP000019149; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046975; F:histone H3K36 methyltransferase activity; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd19172; SET_SETD2; 1.
DR CDD; cd00201; WW; 1.
DR Gene3D; 2.20.70.10; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 1.10.1740.100; Set2, Rpb1 interacting domain; 1.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR044437; SETD2/Set2_SET.
DR InterPro; IPR042294; SETD2_animal.
DR InterPro; IPR038190; SRI_sf.
DR InterPro; IPR001202; WW_dom.
DR InterPro; IPR036020; WW_dom_sf.
DR PANTHER; PTHR46711; HISTONE-LYSINE N-METHYLTRANSFERASE SETD2; 1.
DR PANTHER; PTHR46711:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE SETD2; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF00397; WW; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00317; SET; 1.
DR SMART; SM00456; WW; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR SUPFAM; SSF51045; WW domain; 1.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS50020; WW_DOMAIN_2; 1.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603,
KW ECO:0000313|EMBL:EUB60744.1}; Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000019149};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000313|EMBL:EUB60744.1}.
FT DOMAIN 699..753
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 755..873
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 880..896
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT DOMAIN 1275..1309
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT REGION 47..78
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 184..511
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 539..583
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1034..1053
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1407..1430
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 58..72
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 184..221
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 224..254
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 255..345
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 346..369
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 380..408
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 416..454
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 543..581
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1430 AA; 161257 MW; 9992CFA813E493A8 CRC64;
MWARYLCRVC ISALFIPYNL DNVMQDEIAI FSSTAIYNIP LPPEKVTPVE DLPPHRKRFS
PPPPPPPPID NPTDVSNKSD PSILSFRIKA VSKRSRPGTL RTGVLPTKAK HPEIEAKTEL
FKPTKEDELE IMAELNNESV AELQNSAQTS LTAISEAILA RINNAAQGER QLSAVLREIN
KVNETSSSTT DNKVSELTIG ASTQPDSSEE PSKSQLADPQ AASTVDKEKS ENPKVRSRSR
SRTNVMDLSN RRKRRASSEK RKSRSKSLRR SRSPKKSTKR RRSKSPEKTS KRSRSRSRSA
RRRSRSRSRS TRRRSRSRSL HRSRVRSPVR RRRSRSSRRR KSPLSPRRRD RDSSRSKYKR
SDVKRSSGSS KRRSSKSRKR ETTVRITKKE SGKLPSGKKD EKPRSKNNTS TKQSKSKTND
AEKSKSSEQK SKKTAKATRS LSSNKDERSK GVKSSSKLFE AFSTESHSED GVVQSKRPQK
EENGSVLGDR VGPHTPESPS HAVPTTKAPT DVVTQKCTAL SEMVLNEGSA AQLRHDAGTD
KTEMHPSYSS TSSTTCHTPS SSTSSSSSTV SDCTASPKKC TTGKLGTRKR LTSAVYESDD
SSPNARYRLR RRKTVHQARG DAGTPLQDYS CSKPSPEVAE LLRPFQVKPI EHFARTAGVP
TPDYIHVLEN DYALLEDRLR REGINLASEV LSLQRRGGGG DWVCDCAVPS ADELLSGKLA
CGQGCINRAV YIECGSRCPA HAVCSNRQFQ LRLYASTEPF YCGPEKGWGL RALQTISKGT
FIVEYAGEVI DFPEFRRRIR QYEKAKRVHH YFMSLGPDHF IDAGAKGNWA RFVNHSCEPN
AETQKWMVNG RIKIGFFAIC DIPAGEEITI DYQFVQFGVT EQKCYCGTPS CSGIMGATSK
QLQDKVRLKD TRAVERRIIQ LLTGKTLQTA MDVTFLIQVM VQEYLTRYTR LELLKLLVHT
ETESYLKLFR QYNGLELLAS YMCDTAPTDW ELKHQILLCL DHIPITEQKQ VQTDSSLMEF
VRQWTMDPRY CRSRNTETVS STEPKASADD SKCLDESQFA ELPKEIDEIN EQREQQKQQP
DANTVKVFSP EEEKAIIEDI RQLAERILSR WLKLPVETYR IPRLEREETE QSILQSSCSS
LISWDVSSTQ DGRRNHQDCQ SEVSVDQNLS RLERRAQFEA AVAAAAATNP ATTTGTTEIT
SSTCSSSSSK DLRSLLLKTL SRQAFSSEKQ SEGFCAALQQ ATSEISAKIC EFNAKNDTNG
LIAYLQKLAG EDPETKLEFP WRSAVDSKSG LTYYYNSVTR EVRWDAPVPK KEPSEDRVKK
HFIAEIYAVN LKILKPFRLP NCITGRLESD EDMKYFAFAF ANRELARRRS DEKPRLNRQT
VDRLRRKITR YLTSKGEVYV RSNMSSVSNP TADGCDMEID SEDEGNNIYV
//