ID Q9ZUK1_ARATH Unreviewed; 1329 AA.
AC Q9ZUK1;
DT 01-MAY-1999, integrated into UniProtKB/TrEMBL.
DT 01-MAY-1999, sequence version 1.
DT 27-MAR-2024, entry version 93.
DE SubName: Full=Putative retroelement pol polyprotein {ECO:0000313|EMBL:AAD03367.1};
GN OrderedLocusNames=At2g15100 {ECO:0000313|EMBL:AAD03367.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702 {ECO:0000313|EMBL:AAD03367.1};
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=10617197; DOI=10.1038/45471;
RA Lin X., Kaul S., Rounsley S., Shea T.P., Benito M.I., Town C.D.,
RA Fujii C.Y., Mason T., Bowman C.L., Barnstead M., Feldblyum T.V.,
RA Buell C.R., Ketchum K.A., Lee J., Ronning C.M., Koo H.L., Moffat K.S.,
RA Cronin L.A., Shen M., Pai G., Van Aken S., Umayam L., Tallon L.J.,
RA Gill J.E., Adams M.D., Carrera A.J., Creasy T.H., Goodman H.M.,
RA Somerville C.R., Copenhaver G.P., Preuss D., Nierman W.C., White O.,
RA Eisen J.A., Salzberg S.L., Fraser C.M., Venter J.C.;
RT "Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana.";
RL Nature 402:761-768(1999).
RN [2] {ECO:0000313|EMBL:AAD03367.1}
RP NUCLEOTIDE SEQUENCE.
RA Rounsley S.D., Lin X., Kaul S., Shea T.P., Fujii C.Y., Mason T.M., Shen M.,
RA Ronning C.M., Fraser C.M., Somerville C.R., Venter J.C.;
RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:AAD03367.1}
RP NUCLEOTIDE SEQUENCE.
RA Town C.D., Kaul S.;
RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC005957; AAD03367.1; -; Genomic_DNA.
DR PIR; A84525; A84525.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
FT DOMAIN 1036..1198
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 316..335
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1329 AA; 151805 MW; DECE82480DD2E0E3 CRC64;
MSSNDGQSSG DQASTLDDIK QLLQQLSEKT DRQQLAVTSL SNKFVTFQEQ CNGQHAALAT
NHSEIRSAHN AYAELERDSI FRAPSARASK IPRKHLASRL HRHRLKSERH RLRILSPLNS
ELQNLQDQIW AMNAKVHQAT TSAPEVEKVI EATRRTPFTP RISKLRIREF RDFKLPVYNG
KGDLKEHLTS FQVIAGRVPL EPHEEDAGLC KLFSENLFGL ALTWFTQLEE GSIDNFKQLS
TAFIKQYEYF INSDITEAHL WNFSQSADEP LRTYIYRVQG NHVNRPETIQ DALHRATNWI
NAEEERAFLA KKFSASNAAP KAPQPATTKK PTELRKPAAG TCTTMLCETS MSLGTVVLPV
TAQGVVKMVE FTVFDRPAAY NVILGTPWLY EMKVVPSTYH QCVKFPTPVG KMTGISTEVI
SHELNVDPTF KPVKQKRRKL GPDRAQAVNI EVVRLLEVGR IREVKYPEWL ANPVVVKKKN
GKWRVCVDFT DLNKACSKDF FPLPHIDRLV ESTTGHEMLS FMDAFSGYNQ ILMNPEDQEK
TSFITECGTY YYKVMPFGLK NAVYFRCHRT RNRSEFEADK RFPVNDLTSQ YKGSPTIDMS
SGSTEQIHIP IYRQVFPFLH FTAKISKGFI WNESCEEAFK QLKRYLSEPS VLAKREFGEQ
LFLYIAVSES AVTGVQVRVE RSDKRPIFYV KTRYPMMEKL ALAVVTAARK LRPYFQSHPI
VVLTSLPLRT ILHSPTQSGR LGKWAIELSE FDLEFRARTS LNSTMDTSRQ WGFIKTWVRS
RNLSDAPTAN RYSGEYEAKD ACMEAYLNLV REVSGRFEQF ELTRIPRAEN SAANALAALA
STFEVTLPRV IPVETISQPS IRLDEISFVT TRAMRRRLDA QSAENGLHQL GDDEEISDAV
HPTEIVENQS LPDNHNAPLP DQPPHDWGAD WREPIRDYIL NGTLPAEKWA ARKLKATCAR
FCIANDILYR RIFSAPDAVC IFGEQTRTVM KEVHDGTCGN HTGGRSLAFK VRKYGYYWPT
LVADCEAYAR KCEQCQKHAP LILQPAELLT TVSAPYPFMK WLMDIVGPLH VSTRGVEAAA
YSNITHVQVW NFIWKDIICR HGLPYEIVTD NGSQFISEQF EVFCEEWQIR LSHSTPRYPQ
GNGQAEAMNK TIISNLKKKL NAYKGAWFGE LQNVLWAVRT TPRRATDETP FSLIYGMEAV
IPAEIKVPSA RRIRNPQNET ENNEMIIDVI DTIDERRNRA LARMQNYHNA GARYYNSNVR
NRSFEVGTLV LRRVQQNKAE KGAGKLGISW EGPYKITHVV RNGVYRLINM EGKTVRRAWN
SMHLKRFYI
//