ID A0A087GQI2_ARAAL Unreviewed; 849 AA.
AC A0A087GQI2;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE RecName: Full=Integrase catalytic domain-containing protein {ECO:0008006|Google:ProtNLM};
GN OrderedLocusNames=AALP_Aa6g202700 {ECO:0000313|EMBL:KFK32134.1};
OS Arabis alpina (Alpine rock-cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Arabideae; Arabis.
OX NCBI_TaxID=50452 {ECO:0000313|EMBL:KFK32134.1, ECO:0000313|Proteomes:UP000029120};
RN [1] {ECO:0000313|Proteomes:UP000029120}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Pajares {ECO:0000313|Proteomes:UP000029120};
RX PubMed=27246759; DOI=10.1038/nplants.2014.23;
RA Willing E.M., Rawat V., Mandakova T., Maumus F., James G.V.,
RA Nordstroem K.J., Becker C., Warthmann N., Chica C., Szarzynska B.,
RA Zytnicki M., Albani M.C., Kiefer C., Bergonzi S., Castaings L.,
RA Mateos J.L., Berns M.C., Bujdoso N., Piofczyk T., de Lorenzo L.,
RA Barrero-Sicilia C., Mateos I., Piednoel M., Hagmann J., Chen-Min-Tao R.,
RA Iglesias-Fernandez R., Schuster S.C., Alonso-Blanco C., Roudier F.,
RA Carbonero P., Paz-Ares J., Davis S.J., Pecinka A., Quesneville H.,
RA Colot V., Lysak M.A., Weigel D., Coupland G., Schneeberger K.;
RT "Genome expansion of Arabis alpina linked with retrotransposition and
RT reduced symmetric DNA methylation.";
RL Nat. Plants 1:14023-14023(2015).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM002874; KFK32134.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A087GQI2; -.
DR EnsemblPlants; KFK32134; KFK32134; AALP_AA6G202700.
DR Gramene; KFK32134; KFK32134; AALP_AA6G202700.
DR eggNOG; KOG0017; Eukaryota.
DR OMA; DNIAHAR; -.
DR OrthoDB; 5403705at2759; -.
DR Proteomes; UP000029120; Chromosome 6.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR24559:SF438; RT_RNASEH_2 DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000029120}.
FT DOMAIN 217..396
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 511..682
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 786..820
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
SQ SEQUENCE 849 AA; 96839 MW; D3524686EBEAD0BB CRC64;
MLGSTVAPSP PNALTSITAQ AGMTFDGTVI PHREGKGILG ETPPLFGNAS RELGYSPASQ
PPWTYPDDRR ASPMGRLELP VFNREQAQDW IARVEQFFEL GEMTDGQKLP EDDDSEIEWV
EEEEEENAKG YNGWGDKEVE LRGDPSLYCP PITMKGLWKA LDREGQEFAS VFEEPQGLPP
CRGKEHAIAG ASPVSVRPFR YPQVQREELE KQVVAMLAAD IIKESPSPFS SPVLLVRKKD
GSWRFCVDDR TLNNVTIGDS YPIPMIDQLL DELHGAIVFS KLDLRTGYHQ IRVKAENVPK
TAFRTHDGHY EFFRIRVRVT NAPATFQSLM NDVFRQFLRR FVLVFFDDIL IHSKTETDHK
AHVRLVLQAL ADHQPYANAK KCEFGKSEVE YLGHVISVRG VVADPTKVKA MVDWPAPTIV
KALRDFLGLT GYYKKFVQEP EASRIGLGAV MMQQQQPIAY FSQALTDRQK LKSVYERELM
AVFAIQMTEF ESEVDKDEEL SQLKEAVLAN PGDHPDYSIV QGRLLRKGRS VSVTSTQYYH
QLAFYNLCPF QMDRLTKYAH FIKMSYPYEA AEVALIFIKE VVRLHGYPRT IVSDRDKTFT
GKFWGKLFRL TGTHLCFSIA YHPQSDGQTE VTNRGMETYL MCFCNEKPKK WRSYLAWAEL
SYITSYHTAI RMTPFKAVYG RDPPTLLQFE RGSTDNAGQL IEKDEMINIL QQQLLKTQQL
MKQQADNHRR EEVEARVRPV ACKLKLPPTV KIHHTFHVSQ LKAALGSALV PVTIPHHLNE
EGILEAEPEF VLENRVNRET GQEEVLIQWR GLPEEDCTWE WKKVIKGQFP HFNLEDKVNL
KGAGKVNQG
//