ID F4JJC1_ARATH Unreviewed; 768 AA.
AC F4JJC1;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2011, sequence version 1.
DT 27-MAR-2024, entry version 94.
DE SubName: Full=HAT transposon superfamily {ECO:0000313|EMBL:AEE83540.1};
GN Name=DL3551W {ECO:0000313|EMBL:AEE83540.1};
GN OrderedLocusNames=At4g15020 {ECO:0000313|Araport:AT4G15020,
GN ECO:0000313|EMBL:AEE83540.1};
GN ORFNames=FCAALL.174 {ECO:0000313|EMBL:AEE83540.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702 {ECO:0000313|EMBL:AEE83540.1, ECO:0000313|Proteomes:UP000006548};
RN [1] {ECO:0000313|EMBL:AEE83540.1, ECO:0000313|Proteomes:UP000006548}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548};
RX PubMed=10617198; DOI=10.1038/47134;
RG EU;
RG CSHL and WU Arabidopsis Sequencing Project;
RA Mayer K., Schuller C., Wambutt R., Murphy G., Volckaert G., Pohl T.,
RA Dusterhoft A., Stiekema W., Entian K.D., Terryn N., Harris B., Ansorge W.,
RA Brandt P., Grivell L., Rieger M., Weichselgartner M., de Simone V.,
RA Obermaier B., Mache R., Muller M., Kreis M., Delseny M., Puigdomenech P.,
RA Watson M., Schmidtheini T., Reichert B., Portatelle D., Perez-Alonso M.,
RA Boutry M., Bancroft I., Vos P., Hoheisel J., Zimmermann W., Wedler H.,
RA Ridley P., Langham S.A., McCullagh B., Bilham L., Robben J.,
RA Van der Schueren J., Grymonprez B., Chuang Y.J., Vandenbussche F.,
RA Braeken M., Weltjens I., Voet M., Bastiaens I., Aert R., Defoor E.,
RA Weitzenegger T., Bothe G., Ramsperger U., Hilbert H., Braun M., Holzer E.,
RA Brandt A., Peters S., van Staveren M., Dirske W., Mooijman P.,
RA Klein Lankhorst R., Rose M., Hauf J., Kotter P., Berneiser S., Hempel S.,
RA Feldpausch M., Lamberth S., Van den Daele H., De Keyser A., Buysshaert C.,
RA Gielen J., Villarroel R., De Clercq R., Van Montagu M., Rogers J.,
RA Cronin A., Quail M., Bray-Allen S., Clark L., Doggett J., Hall S., Kay M.,
RA Lennard N., McLay K., Mayes R., Pettett A., Rajandream M.A., Lyne M.,
RA Benes V., Rechmann S., Borkova D., Blocker H., Scharfe M., Grimm M.,
RA Lohnert T.H., Dose S., de Haan M., Maarse A., Schafer M., Muller-Auer S.,
RA Gabel C., Fuchs M., Fartmann B., Granderath K., Dauner D., Herzl A.,
RA Neumann S., Argiriou A., Vitale D., Liguori R., Piravandi E., Massenet O.,
RA Quigley F., Clabauld G., Mundlein A., Felber R., Schnabl S., Hiller R.,
RA Schmidt W., Lecharny A., Aubourg S., Chefdor F., Cooke R., Berger C.,
RA Montfort A., Casacuberta E., Gibbons T., Weber N., Vandenbol M.,
RA Bargues M., Terol J., Torres A., Perez-Perez A., Purnelle B., Bent E.,
RA Johnson S., Tacon D., Jesse T., Heijnen L., Schwarz S., Scholler P.,
RA Heber S., Francs P., Bielke C., Frishman D., Haase D., Lemcke K.,
RA Mewes H.W., Stocker S., Zaccaria P., Bevan M., Wilson R.K.,
RA de la Bastide M., Habermann K., Parnell L., Dedhia N., Gnoj L., Schutz K.,
RA Huang E., Spiegel L., Sehkon M., Murray J., Sheet P., Cordes M.,
RA Abu-Threideh J., Stoneking T., Kalicki J., Graves T., Harmon G.,
RA Edwards J., Latreille P., Courtney L., Cloud J., Abbott A., Scott K.,
RA Johnson D., Minx P., Bentley D., Fulton B., Miller N., Greco T., Kemp K.,
RA Kramer J., Fulton L., Mardis E., Dante M., Pepin K., Hillier L., Nelson J.,
RA Spieth J., Ryan E., Andrews S., Geisel C., Layman D., Du H., Ali J.,
RA Berghoff A., Jones K., Drone K., Cotton M., Joshu C., Antonoiu B.,
RA Zidanic M., Strong C., Sun H., Lamar B., Yordan C., Ma P., Zhong J.,
RA Preston R., Vil D., Shekher M., Matero A., Shah R., Swaby I.K.,
RA O'Shaughnessy A., Rodriguez M., Hoffmann J., Till S., Granat S., Shohdy N.,
RA Hasegawa A., Hameed A., Lodhi M., Johnson A., Chen E., Marra M.,
RA Martienssen R., McCombie W.R.;
RT "Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana.";
RL Nature 402:769-777(1999).
RN [2] {ECO:0000313|EMBL:AEE83540.1}
RP NUCLEOTIDE SEQUENCE.
RG TAIR;
RA Swarbreck D., Lamesch P., Wilks C., Huala E.;
RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0007829|PubMed:22223895}
RP IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX PubMed=22223895; DOI=10.1074/mcp.M111.015131;
RA Bienvenut W.V., Sumpton D., Martinez A., Lilla S., Espagne C., Meinnel T.,
RA Giglione C.;
RT "Comparative large-scale characterisation of plant vs. mammal proteins
RT reveals similar and idiosyncratic N-alpha acetylation features.";
RL Mol. Cell. Proteomics 11:M111.015131-M111.015131(2012).
RN [4] {ECO:0000313|EMBL:AEE83540.1}
RP NUCLEOTIDE SEQUENCE.
RA Krishnakumar V., Cheng C.-Y., Chan A.P., Schobel S., Kim M., Ferlanti E.S.,
RA Belyaeva I., Rosen B.D., Micklem G., Miller J.R., Vaughn M., Town C.D.;
RL Submitted (MAY-2016) to the EMBL/GenBank/DDBJ databases.
RN [5] {ECO:0000313|Proteomes:UP000006548}
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548};
RX PubMed=27862469; DOI=10.1111/tpj.13415;
RA Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA Town C.D.;
RT "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT genome.";
RL Plant J. 89:789-804(2017).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP002687; AEE83540.1; -; Genomic_DNA.
DR EMBL; CP002687; AEE83541.1; -; Genomic_DNA.
DR RefSeq; NP_001154234.1; NM_001160762.2.
DR RefSeq; NP_193238.5; NM_117589.6.
DR AlphaFoldDB; F4JJC1; -.
DR SMR; F4JJC1; -.
DR STRING; 3702.F4JJC1; -.
DR PaxDb; 3702-AT4G15020-1; -.
DR ProteomicsDB; 187740; -.
DR EnsemblPlants; AT4G15020.1; AT4G15020.1; AT4G15020.
DR EnsemblPlants; AT4G15020.2; AT4G15020.2; AT4G15020.
DR GeneID; 827161; -.
DR Gramene; AT4G15020.1; AT4G15020.1; AT4G15020.
DR Gramene; AT4G15020.2; AT4G15020.2; AT4G15020.
DR KEGG; ath:AT4G15020; -.
DR Araport; AT4G15020; -.
DR TAIR; AT4G15020; -.
DR eggNOG; ENOG502QR6K; Eukaryota.
DR HOGENOM; CLU_016471_3_1_1; -.
DR OMA; YKAKEFD; -.
DR OrthoDB; 78055at2759; -.
DR Proteomes; UP000006548; Chromosome 4.
DR ExpressionAtlas; F4JJC1; baseline and differential.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR InterPro; IPR007021; DUF659.
DR InterPro; IPR008906; HATC_C_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR003656; Znf_BED.
DR PANTHER; PTHR32166:SF88; HAT TRANSPOSON SUPERFAMILY; 1.
DR PANTHER; PTHR32166; OSJNBA0013A04.12 PROTEIN; 1.
DR Pfam; PF05699; Dimer_Tnp_hAT; 1.
DR Pfam; PF04937; DUF659; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50808; ZF_BED; 1.
PE 1: Evidence at protein level;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Proteomics identification {ECO:0007829|PeptideAtlas:F4JJC1,
KW ECO:0007829|ProteomicsDB:F4JJC1};
KW Reference proteome {ECO:0000313|Proteomes:UP000006548};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00027}.
FT DOMAIN 13..71
FT /note="BED-type"
FT /evidence="ECO:0000259|PROSITE:PS50808"
SQ SEQUENCE 768 AA; 86644 MW; B76A04CEFBD23359 CRC64;
MDAELEPVAL TPQKQDNAWK HCEIYKYGDR LQMRCLYCRK MFKGGGITRV KEHLAGKKGQ
GTICDQVPED VRLFLQQCID GTVRRQRKRH KSSSEPLSVA SLPPIEGDMM VVQPDVNDGF
KSPGSSDVVV QNESLLSGRT KQRTYRSKKN AFENGSASNN VDLIGRDMDN LIPVAISSVK
NIVHPSFRDR ENTIHMAIGR FLFGIGADFD AVNSVNFQPM IDAIASGGFG VSAPTHDDLR
GWILKNCVEE MAKEIDECKA MWKRTGCSIL VEELNSDKGF KVLNFLVYCP EKVVFLKSVD
ASEVLSSADK LFELLSELVE EVGSTNVVQV ITKCDDYYVD AGKRLMLVYP SLYWVPCAAH
CIDQMLEEFG KLGWISETIE QAQAITRFVY NHSGVLNLMW KFTSGNDILL PAFSSSATNF
ATLGRIAELK SNLQAMVTSA EWNECSYSEE PSGLVMNALT DEAFWKAVAL VNHLTSPLLR
ALRIVCSEKR PAMGYVYAAL YRAKDAIKTH LVNREDYIIY WKIIDRWWEQ QQHIPLLAAG
FFLNPKLFYN TNEEIRSELI LSVLDCIERL VPDDKIQDKI IKELTSYKTA GGVFGRNLAI
RARDTMLPAE WWSTYGESCL NLSRFAIRIL SQTCSSSVSC RRNQIPVEHI YQSKNSIEQK
RLSDLVFVQY NMRLRQLGPG SGDDTLDPLS HNRIDVLKEW VSGDQACVEG NGSADWKSLE
SIHRNQVAPI IDDTEDLGSG FDDIEIFKVE KEVRDEGYYT NTSEKLFT
//