ID   F4HQA2_ARATH            Unreviewed;       651 AA.
AC   F4HQA2;
DT   28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT   28-JUN-2011, sequence version 1.
DT   24-JAN-2024, entry version 80.
DE   SubName: Full=HAT transposon superfamily {ECO:0000313|EMBL:AEE36293.1};
GN   OrderedLocusNames=At1g79740 {ECO:0000313|Araport:AT1G79740,
GN   ECO:0000313|EMBL:AEE36293.1};
GN   ORFNames=F19K16.28 {ECO:0000313|EMBL:AEE36293.1}, F19K16_28
GN   {ECO:0000313|EMBL:AEE36293.1};
OS   Arabidopsis thaliana (Mouse-ear cress).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX   NCBI_TaxID=3702 {ECO:0000313|EMBL:AEE36293.1, ECO:0000313|Proteomes:UP000006548};
RN   [1] {ECO:0000313|EMBL:AEE36293.1, ECO:0000313|Proteomes:UP000006548}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548};
RX   PubMed=11130712; DOI=10.1038/35048500;
RA   Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S., White O.,
RA   Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y., Buehler E.,
RA   Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W., Chung M.K., Conn L.,
RA   Conway A.B., Conway A.R., Creasy T.H., Dewar K., Dunn P., Etgu P.,
RA   Feldblyum T.V., Feng J., Fong B., Fujii C.Y., Gill J.E., Goldsmith A.D.,
RA   Haas B., Hansen N.F., Hughes B., Huizar L., Hunter J.L., Jenkins J.,
RA   Johnson-Hopson C., Khan S., Khaykin E., Kim C.J., Koo H.L.,
RA   Kremenetskaia I., Kurtz D.B., Kwan A., Lam B., Langin-Hooper S., Lee A.,
RA   Lee J.M., Lenz C.A., Li J.H., Li Y., Lin X., Liu S.X., Liu Z.A.,
RA   Luros J.S., Maiti R., Marziali A., Militscher J., Miranda M., Nguyen M.,
RA   Nierman W.C., Osborne B.I., Pai G., Peterson J., Pham P.K., Rizzo M.,
RA   Rooney T., Rowley D., Sakano H., Salzberg S.L., Schwartz J.R., Shinn P.,
RA   Southwick A.M., Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA   Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M., Wu D.,
RA   Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT   "Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana.";
RL   Nature 408:816-820(2000).
RN   [2] {ECO:0000313|EMBL:AEE36293.1}
RP   NUCLEOTIDE SEQUENCE.
RG   TAIR;
RA   Swarbreck D., Lamesch P., Wilks C., Huala E.;
RL   Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:AEE36293.1}
RP   NUCLEOTIDE SEQUENCE.
RA   Krishnakumar V., Cheng C.-Y., Chan A.P., Schobel S., Kim M., Ferlanti E.S.,
RA   Belyaeva I., Rosen B.D., Micklem G., Miller J.R., Vaughn M., Town C.D.;
RL   Submitted (MAY-2016) to the EMBL/GenBank/DDBJ databases.
RN   [4] {ECO:0000313|Proteomes:UP000006548}
RP   GENOME REANNOTATION.
RC   STRAIN=cv. Columbia {ECO:0000313|Proteomes:UP000006548};
RX   PubMed=27862469; DOI=10.1111/tpj.13415;
RA   Cheng C.Y., Krishnakumar V., Chan A.P., Thibaud-Nissen F., Schobel S.,
RA   Town C.D.;
RT   "Araport11: a complete reannotation of the Arabidopsis thaliana reference
RT   genome.";
RL   Plant J. 89:789-804(2017).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CP002684; AEE36293.1; -; Genomic_DNA.
DR   EMBL; CP002684; ANM57803.1; -; Genomic_DNA.
DR   RefSeq; NP_001320286.1; NM_001334919.1.
DR   RefSeq; NP_178092.4; NM_106623.5.
DR   AlphaFoldDB; F4HQA2; -.
DR   SMR; F4HQA2; -.
DR   STRING; 3702.F4HQA2; -.
DR   PaxDb; 3702-AT1G79740-1; -.
DR   ProteomicsDB; 191399; -.
DR   EnsemblPlants; AT1G79740.1; AT1G79740.1; AT1G79740.
DR   EnsemblPlants; AT1G79740.4; AT1G79740.4; AT1G79740.
DR   GeneID; 844313; -.
DR   Gramene; AT1G79740.1; AT1G79740.1; AT1G79740.
DR   Gramene; AT1G79740.4; AT1G79740.4; AT1G79740.
DR   Araport; AT1G79740; -.
DR   TAIR; AT1G79740; -.
DR   eggNOG; ENOG502R0D2; Eukaryota.
DR   HOGENOM; CLU_016471_3_1_1; -.
DR   OMA; HPAMYTA; -.
DR   OrthoDB; 449576at2759; -.
DR   Proteomes; UP000006548; Chromosome 1.
DR   ExpressionAtlas; F4HQA2; baseline and differential.
DR   GO; GO:0005829; C:cytosol; HDA:TAIR.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR   InterPro; IPR007021; DUF659.
DR   InterPro; IPR008906; HATC_C_dom.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR003656; Znf_BED.
DR   PANTHER; PTHR32166:SF67; HAT TRANSPOSON SUPERFAMILY; 1.
DR   PANTHER; PTHR32166; OSJNBA0013A04.12 PROTEIN; 1.
DR   Pfam; PF05699; Dimer_Tnp_hAT; 1.
DR   Pfam; PF04937; DUF659; 1.
DR   Pfam; PF02892; zf-BED; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50808; ZF_BED; 1.
PE   1: Evidence at protein level;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Proteomics identification {ECO:0007829|PeptideAtlas:F4HQA2,
KW   ECO:0007829|ProteomicsDB:F4HQA2};
KW   Reference proteome {ECO:0000313|Proteomes:UP000006548};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00027}.
FT   DOMAIN          3..58
FT                   /note="BED-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50808"
SQ   SEQUENCE   651 AA;  73759 MW;  9ABA7C0672859E28 CRC64;
     MVREKDICWE YAEKLDGNKV KCKFCSRVLN GGISRLKHHL SRLPSKGVNP CAKVRDDVTD
     RVRSILSAKD DPPITNKYKP PPPLSPPFDA PASKLVFPSS PPNAQDIAER SISLFFFENK
     IDFAVARSPS YHHMLDAVAK CGPGFVAPSP KTEWLDRVKS DISLQLKDTE KEWVTTGCTI
     IAEAWTDNKS RALINFSVSS PSRIFFHKSV DASSYFKNSK CLADLFDSVI QDIGQEHIVQ
     IIMDNSFCYT GISNHLLQNY ATIFVSPCAS QCLNIILEEF SKVDWVNQCI SQAQVISKFV
     YNNSPVLDLL RKLTGGQDII RSGVTRSVSN FLSLQSMMKQ KARLKHMFNC PEYTTNTNKP
     QSISCVNILE DNDFWRAVEE SVAISEPILK VLREVSTGKP AVGSIYELMS KAKESIRTYY
     IMDENKHKVF SDIVDTNWCE HLHSPLHAAA AFLNPSIQYN PEIKFLTSLK EDFFKVLEKL
     LPTSDLRRDI TNQIFTFTRA KGMFGCNLAM EARDSVSPGL WWEQFGDSAP VLQRVAIRIL
     SQVCSGYNLE RQWSTFQQMH WERRNKIDRE ILNKLAYVNQ NLKLGRMITL ETDPIALEDI
     DMMSEWVEEA ENPSPAQWLD RFGTALDGGD LNTRQFGGAI FSANDHNIFG L
//