ID D7LG19_ARALL Unreviewed; 1367 AA.
AC D7LG19;
DT 10-AUG-2010, integrated into UniProtKB/TrEMBL.
DT 10-AUG-2010, sequence version 1.
DT 24-JAN-2024, entry version 70.
DE SubName: Full=Nucleic acid binding protein {ECO:0000313|EMBL:EFH54958.1};
GN ORFNames=ARALYDRAFT_481235 {ECO:0000313|EMBL:EFH54958.1};
OS Arabidopsis lyrata subsp. lyrata (Lyre-leaved rock-cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=81972 {ECO:0000313|Proteomes:UP000008694};
RN [1] {ECO:0000313|Proteomes:UP000008694}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. MN47 {ECO:0000313|Proteomes:UP000008694};
RX PubMed=21478890; DOI=10.1038/ng.807;
RA Hu T.T., Pattyn P., Bakker E.G., Cao J., Cheng J.-F., Clark R.M.,
RA Fahlgren N., Fawcett J.A., Grimwood J., Gundlach H., Haberer G.,
RA Hollister J.D., Ossowski S., Ottilar R.P., Salamov A.A., Schneeberger K.,
RA Spannagl M., Wang X., Yang L., Nasrallah M.E., Bergelson J.,
RA Carrington J.C., Gaut B.S., Schmutz J., Mayer K.F.X., Van de Peer Y.,
RA Grigoriev I.V., Nordborg M., Weigel D., Guo Y.-L.;
RT "The Arabidopsis lyrata genome sequence and the basis of rapid genome size
RT change.";
RL Nat. Genet. 43:476-481(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL348716; EFH54958.1; -; Genomic_DNA.
DR RefSeq; XP_002878699.1; XM_002878653.1.
DR STRING; 81972.D7LG19; -.
DR EnsemblPlants; fgenesh2_kg.4__295__AT2G23740.1; fgenesh2_kg.4__295__AT2G23740.1; fgenesh2_kg.4__295__AT2G23740.1.
DR Gramene; fgenesh2_kg.4__295__AT2G23740.1; fgenesh2_kg.4__295__AT2G23740.1; fgenesh2_kg.4__295__AT2G23740.1.
DR eggNOG; KOG1082; Eukaryota.
DR eggNOG; KOG1721; Eukaryota.
DR HOGENOM; CLU_004911_0_0_1; -.
DR Proteomes; UP000008694; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:EnsemblPlants.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:EnsemblPlants.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0045814; P:negative regulation of gene expression, epigenetic; IEA:EnsemblPlants.
DR GO; GO:1900109; P:regulation of histone H3-K9 dimethylation; IEA:EnsemblPlants.
DR Gene3D; 3.30.160.60; Classic Zinc Finger; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR040689; SUVR5_Znf-C2H2_3rpt.
DR InterPro; IPR013087; Znf_C2H2_type.
DR PANTHER; PTHR47325; HISTONE-LYSINE N-METHYLTRANSFERASE SUVR5; 1.
DR PANTHER; PTHR47325:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE SUVR5; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF18868; zf-C2H2_3rep; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SMART; SM00355; ZnF_C2H2; 3.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 2.
DR PROSITE; PS50157; ZINC_FINGER_C2H2_2; 2.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Reference proteome {ECO:0000313|Proteomes:UP000008694};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00042};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00042}.
FT DOMAIN 756..784
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 825..853
FT /note="C2H2-type"
FT /evidence="ECO:0000259|PROSITE:PS50157"
FT DOMAIN 1130..1206
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 1209..1341
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1351..1367
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 902..927
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 905..922
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1367 AA; 153889 MW; 022294263E03F22C CRC64;
MDELVLDVDV EEATGSELLV KPEPGDDLNE VNRSTDLVTV ITGPIGNNGK GESSPSEPKW
LQQDEPIALW VKWRGKWQAG IRCAKADWPL TTLRGKPTHD RKKYCVIFFP HTKNYSWADM
QLVRSINEFP DPIAYKSHKI GIKLVKDLTA ARRYIMRKLT VGIFNIVDQF PSEVVSEAAR
DIIIWREFAM EATRSTSYHD LGIMLVKLHS MILQRYMDPI WLENSFPLWV QKCNNAVNAE
SIELLNEWSE VKSLSESPMQ PMLFSEWKTW KHDIAKWFSI SRRGVGEIAQ PNSKSVFNSD
VQASRKRPKL EIRRAETTNA SQMESDTSPQ GLTAIDSEFF SSRGNTNTPE ALKDENPIMN
TPENGLDLWD GIVVEAGGSQ IMKTKETNGL SHPHINESVL KKPFGSGNKS QQCIAFIESK
GRQCVRWANE GDVYCCVHLA SRFTTKSAKN EGSPAVEAPM CGGVTVLGTK CKHRSLPGFL
YCKKHRPHTE MEKPDDSSSL LVKRKVAEIM STLETNQCQD LVPFGEPEGL SFEKQEPHGA
TSFTEMFEHC SQEDNLCIGS CSENSYIPCS EFSTKHSLYC EQHLPNWLKR ARNGKSRIIS
KEVFVDLLRG CLSREEKLAL HQACDIFYKL FKSVLSLRNS VPMEVQIDWA KAEASRNADV
GVGEFLMKLV SNERERLTRI WGFATGADEE DVSLSEYPNR LLAITNAWAN DEDKEKWSFS
GFACAICLDS FVKRKLLEIH VEERHHVQFA EKCMLLQCIP CGSHFGDKEQ LLLHVQAVHP
SECKSITVAP ECNLTNGESS QKPDAGSSQI VVSQNNENTS GVHKFVCKFC GLKFNLLPDL
GRHHQAEHMG PSLVGSRGPK KGIRFNTYRM KSGRLSRPNK FKKSLGAVSY RIRNRAGVNM
KRRMQGSKPL STEGNTGVSP PPPGDSRNFD GTDAHCSVVS NILLSKVQKA KHRPNNFDIL
SAARSACCRV SLETSLEAKF GDLPDRIYLK AAKLCGEQGV QVQWHQEGYI CSNGCKPVKD
PNLLRPLIPR QENDRFGISM DPVQHSNIEL EVDECHCIME AHHFSKRPFG NTAVLCKDIS
FGKESVPICV VDDDLLNSGK PYERPWESFT YVTNSILHPS MELVKENLQL RCGCRSSVCS
PVTCDHVYLF GNDFEDARDI YGKSMRFRFP YDGKQRIILE EGYPVYECNK FCGCSRTCQN
RVLQNGIRVK LEVFRTESKG WGLRACEHIL RGTFVCEYIG EVLDQQEANK RRNQYGKEGC
SYILDIDANI NDIGRLMEEE PDYAIDATTH GNISRFINHS CSPNLVNHQV IVESMESPLA
HIGLYASMDV AAGEEITRDY GCRPVPSGQE NEHPCHCKAT NCRGLLS
//