ID A8BBT2_GIAIC Unreviewed; 692 AA.
AC A8BBT2;
DT 13-NOV-2007, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2007, sequence version 1.
DT 27-MAR-2024, entry version 84.
DE SubName: Full=Histone-lysine N-methyltransferase MLL1 {ECO:0000313|EMBL:KAE8305242.1};
DE SubName: Full=Set-2, putative {ECO:0000313|EMBL:EDO80384.1};
GN ORFNames=GL50803_008921 {ECO:0000313|EMBL:KAE8305242.1}, GL50803_8921
GN {ECO:0000313|EMBL:EDO80384.1};
OS Giardia intestinalis (strain ATCC 50803 / WB clone C6) (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=184922 {ECO:0000313|EMBL:EDO80384.1};
RN [1] {ECO:0000313|EMBL:EDO80384.1, ECO:0000313|Proteomes:UP000001548}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50803 / WB clone C6 {ECO:0000313|Proteomes:UP000001548},
RC and WB C6 {ECO:0000313|EMBL:EDO80384.1};
RX PubMed=17901334; DOI=10.1126/science.1143837;
RA Morrison H.G., McArthur A.G., Gillin F.D., Aley S.B., Adam R.D.,
RA Olsen G.J., Best A.A., Cande W.Z., Chen F., Cipriano M.J., Davids B.J.,
RA Dawson S.C., Elmendorf H.G., Hehl A.B., Holder M.E., Huse S.M., Kim U.U.,
RA Lasek-Nesselquist E., Manning G., Nigam A., Nixon J.E., Palm D.,
RA Passamaneck N.E., Prabhu A., Reich C.I., Reiner D.S., Samuelson J.,
RA Svard S.G., Sogin M.L.;
RT "Genomic minimalism in the early diverging intestinal parasite Giardia
RT lamblia.";
RL Science 317:1921-1926(2007).
RN [2] {ECO:0000313|EMBL:KAE8305242.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=WB C6 {ECO:0000313|EMBL:KAE8305242.1};
RA Xu F., Jex A., Svard S.G.;
RT "New Giardia intestinalis WB genome in near-complete chromosomes.";
RL Submitted (JUL-2019) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDO80384.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACB02000010; EDO80384.1; -; Genomic_DNA.
DR EMBL; AACB03000001; KAE8305242.1; -; Genomic_DNA.
DR RefSeq; XP_001708058.1; XM_001708006.1.
DR AlphaFoldDB; A8BBT2; -.
DR STRING; 184922.A8BBT2; -.
DR EnsemblProtists; EDO80384; EDO80384; GL50803_8921.
DR GeneID; 5700967; -.
DR KEGG; gla:GL50803_008921; -.
DR VEuPathDB; GiardiaDB:GL50803_8921; -.
DR HOGENOM; CLU_398195_0_0_1; -.
DR OMA; REVLIMV; -.
DR Proteomes; UP000001548; Chromosome 5.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR044570; Set1-like.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR PANTHER; PTHR45814:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Reference proteome {ECO:0000313|Proteomes:UP000001548};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 538..668
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 676..692
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
SQ SEQUENCE 692 AA; 78447 MW; C980E83861C41A02 CRC64;
MSADYRLRSN PVRLQREERA KIAMLHEQSR DFPSSLVGKD LSTLPGSFPA AQSVDECSLL
PVFNFPPLPT VTWSREEGIS NSPSKLLGEL PVIHRVLTTA EREQLAANPR SSQHLRRELH
LGWLQYEPYA PGTDQIGLSE AAQGDGDTIT NCTLPMHSFR SSFNLSKMEA MQERVVVFNK
VRSIAFSMHE EAAKAVLLYT ISLPEGEHLL MHVRMPAVPR LYTILREVLI MVVEDPPPDK
FLQSLIGQLD TLKGYRQRSS HFLLIKATLF IMDSLQFYPL SSWADKMQDP SFYECCRRIV
SELAVSLHTS PNSSHSVPPS ESSTYMVMSN QSADTLGSVD VNINDRGTSG SFLSPNLLNH
LAEACRLSAL LDSLFQTFKV EDGIDACILK EERYANLISQ QQKRHRLTQL SYLNFYIMQT
RIHRIYASPE KEKAYHCVPT KQLQTGFPHY RQLIKKRNID ALVQAIHKRA LSSTKQNSNP
KAALMKHRMH RLRGRQSVSD DQMYHYISDI RLPPCYMRCL KTDPVLSSVF RPYPQRTREL
FYYRSSIHGF GLFLAEPAQR KELITEYCGD VISSLVADIR ERLYAESGLK SVYMFSIKNN
YVIDATLKGN FARFLNHSCA PTAESVHARF SASNNALEPV SLISQSQGIA ISMLSNLGEG
AEVTQNYYLS KESADNKLFC QCGSTRCRLY MN
//