ID W1PRV8_AMBTC Unreviewed; 2308 AA.
AC W1PRV8;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 24-JAN-2024, entry version 49.
DE RecName: Full=SET domain-containing protein {ECO:0000259|PROSITE:PS50280};
GN ORFNames=AMTR_s00043p00149000 {ECO:0000313|EMBL:ERN12742.1};
OS Amborella trichopoda.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Amborellales; Amborellaceae; Amborella.
OX NCBI_TaxID=13333 {ECO:0000313|EMBL:ERN12742.1, ECO:0000313|Proteomes:UP000017836};
RN [1] {ECO:0000313|Proteomes:UP000017836}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24357323;
RG Amborella Genome Project;
RT "The Amborella genome and the evolution of flowering plants.";
RL Science 342:1241089-1241089(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI392605; ERN12742.1; -; Genomic_DNA.
DR RefSeq; XP_006851161.1; XM_006851098.2.
DR STRING; 13333.W1PRV8; -.
DR EnsemblPlants; ERN12742; ERN12742; AMTR_s00043p00149000.
DR GeneID; 18440968; -.
DR Gramene; ERN12742; ERN12742; AMTR_s00043p00149000.
DR KEGG; atr:18440968; -.
DR eggNOG; KOG1080; Eukaryota.
DR HOGENOM; CLU_000704_0_0_1; -.
DR OMA; PHISYVH; -.
DR OrthoDB; 180394at2759; -.
DR Proteomes; UP000017836; Unassembled WGS sequence.
DR GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:EnsemblPlants.
DR GO; GO:0048440; P:carpel development; IEA:EnsemblPlants.
DR GO; GO:0007623; P:circadian rhythm; IEA:EnsemblPlants.
DR GO; GO:0040029; P:epigenetic regulation of gene expression; IEA:EnsemblPlants.
DR GO; GO:0048443; P:stamen development; IEA:EnsemblPlants.
DR GO; GO:0010228; P:vegetative to reproductive phase transition of meristem; IEA:EnsemblPlants.
DR CDD; cd10531; SET_SETD2-like; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR036047; F-box-like_dom_sf.
DR InterPro; IPR035445; GYF-like_dom_sf.
DR InterPro; IPR045606; SDG2_C.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR46655; HISTONE-LYSINE N-METHYLTRANSFERASE ATXR3; 1.
DR PANTHER; PTHR46655:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE ATXR3; 1.
DR Pfam; PF19633; SDG2_C; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF81383; F-box domain; 1.
DR SUPFAM; SSF55277; GYF domain; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017836}.
FT DOMAIN 1736..1878
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 25..400
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 857..880
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1508..1558
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 27..49
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 62..91
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 99..120
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 121..163
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 165..179
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 185..222
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 223..268
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 269..340
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 348..388
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1516..1536
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1541..1558
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2308 AA; 263530 MW; F8D469E82B84508F CRC64;
MGDGGVALVA SQYVMERFPT CHGLHSYDKS PKCHLQRKPE REMEVEQGEL GFENSHNSER
SPGIRSQEDT KGGRAHFAEE VSREESPRPV EFPKLRISSV IENPSNGEYQ RNSELSRTGT
KGSDDFAEEA GECDYSRSRG SSSERHKRNS RWDTSRWEPS RWEPLQERVS NSKGNLGNWG
KREPQSCARR SRDDYGDYSV DESRQRSKSR RRSDEYYHSE KSLSHSRNSN SNSRQYRDSS
YSSSRDLSNR YSRRYDSSSS SRFSYDKHNR SPSYLDRHSN RSPSYSDRSP RDRGRHHDYR
DGNRKSGTDK RDGHYSRDSR EEERFSRKES NGRDSRRYSS SSRHLYGSRS DKAIDDQHPS
SHSDRVAEDH LVSKSGEKKV EDQMENSNLD SEVRPNAEPP KVDGFVEELQ SMEEDMDLCS
SPEYAVNVRN QSGTIGLVDQ GKPFIRGWYY LDHVGIEQGP SKLCELKQLV EDGFLSSDHL
IKHSDSDRWV TVENAASPLV VVNSHSVVPE TVTQLVNPPE APGNAMIEVG DFLKSVNRVS
QELGASSAAL ESPSQLEDLR IDERVDALLN GLAVVPGKEL ESIAEALQTT FEYADWEKRN
PSEGFMRFRD SYMETSRHYR DEENNRAYES LHKESPLLRF GYRSGVLGEK EFALPKIDSS
QWFTGRWSCK GGDWRRCDEM AHDRNLKRKI VLNEGFPLCQ MPKSGYQDPR YHRHDDLYHP
MNYKKLELPP WAYCWLEEKF DHPQPMDAAS DSHVAATMGH GSIVSIQAQS VKTSNGDLNK
QMVFNRVTQY KAVVAKGARG LTQSVVRNNT LVVKIHGSFV SELHARAHNN EFHSLKPEME
QPSSVLDGKS MLCGDASRPK DWQHDLKGSH GPTSSDNAPP AHVLTKDELK LHLGEWHYLD
GAGHESHPVS FKMLQDLVAN GTIQRFSSVY RKRDNIWVPI TGPAPPDPSI EVSGAPMSVL
EARKPSLDND ACDSGDLRRE QVVTEKKSPL SYFHALHPQF IGYTRGKLHE LVMKSYKNRE
FAAAVNEVLD PWMNARQPKK APEKLMSCNS STWVSLSMKS AIASSLNSGY VDPQSEYGTP
TNKMESDLAR TSKDFNRFGK RSRLLIDESE EEDDTAMDLG KLQSANYSFD DLCGETAFPQ
ETCANIATGS DGWGLLNGHI LARIFHFLRA DFKSLAVSAV TCKQWNMAVK FYKDLCVQVD
LSSMGLNCTD SIFQYIMSGY NKENITSVIL MGCIKITART LEEVLQSFPS IEFIDNRGCD
QFRELTTTYL KVKWKKSRGL HYGSETKISD DSHHKIRSLK QINEKSHNYL GDTLRHSQYK
RMKLIGTRKA SLLDELRMKG LYSRKSPSLP GGRYKKMEHY LSLRLKEIMN ENTFAFFIPK
VAEIEDRMQS GYYVGRGLKL LKDDIGRMCR DAMKANNRSD DGEDMTHIIK LFMKLVTYLE
DNSKSFYGGS RTGFFPAASN HKKKHKKTMK IGKLSKRNIA SYDNGFFDSG EYASDRDSRR
RLSKLNRRSF DSDTETSDEA ELSEEVSGDE EGTSSDSDVD TGVHSDNEDR ELSGDRYQMV
DEVLESVTED REWGARMTKA SLVPPITRKY EVIDEYVVIA DEEYVQRKMR VALPEDYEEK
LNQQKSGEED MEIPEVKDYK PRKKLGDEVL EQEVYGIDPY THNLLLDTMP EELDWPLLER
HSFIEEVILC VLNKQVRHFT GTGCTPMEFP LPPVVDEILK NAEKDGDTRI MKMAEELLKA
MKNRPDDNYV AYRKGLGVVC NKEEGFGEDD FVVEFLGEVY PAWKWFEKQD GIRSLQKNNE
DPAPEFYNIY LERPKGDRDG YDLVVVDAMH KANYASRICH SCRPNCEAKV TAVDGQYQIG
IYTVRPIGYG EEITFDYNSV TESKEEYEAS VCLCGSQVCR GSYLNLTGEG AYQKVLKECH
GLLNRHQLML EACEMNFVSQ EDYDDLGKAG LGSCLLSGLP DWLIAYSARL VRFINYERTM
LPEEILNYNL EEKRKFFSDI CLEVEKSDAE VQAEGVYNQR LQNLAVTLDK VRYVIRCIFE
DPKRAPPPLE RLSPQALVCF LWKGEGSLVE ELLQCVAPHV DSDLLNSLKS KIHARDPSGS
DDVGRELRKS LLWLRDEIRS LPSSCKFRHD AAADLIHMYA YTRCFFTVRD YRTVTSAPVY
ISPLDLGPKY VGKLGSGFEE YRKTYGEGYC LGQLIYWHVQ TNADPDSTLA KARRGCLALP
DISSFYAKPQ KPPPKFSYGP KTVRLMMARM EKQPQKPWRK DQIWKFKSTP RVFGSPMLDA
IMNRSALDKE MMQWLKTRPI ACHAPWDQ
//