ID V6TBA5_GIAIN Unreviewed; 692 AA.
AC V6TBA5;
DT 19-FEB-2014, integrated into UniProtKB/TrEMBL.
DT 19-FEB-2014, sequence version 1.
DT 24-JAN-2024, entry version 47.
DE SubName: Full=Putative SET domain protein {ECO:0000313|EMBL:ESU36153.1};
GN ORFNames=DHA2_8921 {ECO:0000313|EMBL:ESU36153.1};
OS Giardia intestinalis (Giardia lamblia).
OC Eukaryota; Metamonada; Diplomonadida; Hexamitidae; Giardiinae; Giardia.
OX NCBI_TaxID=5741 {ECO:0000313|EMBL:ESU36153.1, ECO:0000313|Proteomes:UP000018320};
RN [1] {ECO:0000313|Proteomes:UP000018320}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=DH {ECO:0000313|Proteomes:UP000018320};
RA Adam R., Dahlstrom E., Martens C., Bruno D., Barbian K., Porcella S.F.,
RA Nash T.;
RT "Genome sequencing of Giardia lamblia Genotypes A2 and B isolates (DH and
RT GS) and comparative analysis with the genomes of Genotypes A1 and E (WB and
RT Pig).";
RL Submitted (FEB-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ESU36153.1, ECO:0000313|Proteomes:UP000018320}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DH {ECO:0000313|EMBL:ESU36153.1,
RC ECO:0000313|Proteomes:UP000018320};
RX PubMed=24307482; DOI=10.1093/gbe/evt197;
RA Adam R.D., Dahlstrom E.W., Martens C.A., Bruno D.P., Barbian K.D.,
RA Ricklefs S.M., Hernandez M.M., Narla N.P., Patel R.B., Porcella S.F.,
RA Nash T.E.;
RT "Genome sequencing of Giardia lamblia genotypes A2 and B isolates (DH and
RT GS) and comparative analysis with the genomes of genotypes A1 and E (WB and
RT Pig).";
RL Genome Biol. Evol. 5:2498-2511(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ESU36153.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AHGT01000055; ESU36153.1; -; Genomic_DNA.
DR AlphaFoldDB; V6TBA5; -.
DR EnsemblProtists; ESU36153; ESU36153; DHA2_8921.
DR VEuPathDB; GiardiaDB:DHA2_8921; -.
DR VEuPathDB; GiardiaDB:GL50581_420; -.
DR VEuPathDB; GiardiaDB:GL50803_008921; -.
DR VEuPathDB; GiardiaDB:QR46_2105; -.
DR Proteomes; UP000018320; Unassembled WGS sequence.
DR GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR044570; Set1-like.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR PANTHER; PTHR45814:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 538..668
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 676..692
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
SQ SEQUENCE 692 AA; 78346 MW; A683A1BFB661C0C2 CRC64;
MSADYRLRSN PARLQREERA KIAMLHEQSR DFPSSLVGKD LSTLPGSFPA AQSVDECSLL
PVFNFPPLPT VTWSPEEGIS NSPSKLLGEL PVIHRVLTTA EREQLAANPR SSQHLRRELH
LGWLQYEPYA PGTDQIGLSE AAQRDGDTIT NCTLPMHSFR SSFNLSKMEA MQERVVVFNK
VRSIAFSMHE EAAKAVLLYT ISLPEGEHLL MHVRMPAVPR LYTILREVLI MVVEDPPPDK
FLQSLIGQLD TLKGYRQRSS HFLLIKATLF IMDSLQFYPL SSWADKMQDP SFYECCRRIV
SELAASLHTS PTSSHSVPPS ESSTYMVMSN QSADTLGSVD VNINDRGTSG SFLSPNLLNH
LAEACRLSAL LDSLFQTFKI EDGIDACILK EERYANLISQ QQKKHRLTQL SYLNFYIMQT
RIHRIYASPE KEKAYHCVPT KQLQTGFPHY RQLIKKRNID ALVQVIHKRA LSSAKQSPNP
KAALMKPRMH RLRGRQSVSD DQMYHYISDI RLPPCYMRCL KTDPVLSSVF RPYPQRTREL
FYYRSSIHGF GLFLAEPAQR KELITEYCGD VISSLVADIR ERLYAESGLK SVYMFSIKNN
YVIDATLKGN FARFLNHSCA PTAESVHARF SASNDALEPV SLISQSQGIA ISMLSNLGEG
AEVTQNYYLS KESADNKLFC QCGSTRCRLY MN
//