ID J7MFH4_ARATH Unreviewed; 1475 AA.
AC J7MFH4;
DT 31-OCT-2012, integrated into UniProtKB/TrEMBL.
DT 31-OCT-2012, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Polyprotein {ECO:0000313|EMBL:BAM42649.1};
GN Name=AtRE2 {ECO:0000313|EMBL:BAM42649.1};
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702 {ECO:0000313|EMBL:BAM42649.1};
RN [1] {ECO:0000313|EMBL:BAM42649.1}
RP NUCLEOTIDE SEQUENCE.
RA Yamada M., Akaoka M., Kato A.;
RT "Genomic localization of AtRE2, a copia-type retrotransposon, in natural
RT variants of Arabidopsis thaliana.";
RL Submitted (MAR-2012) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB701746; BAM42649.1; -; Genomic_DNA.
DR ExpressionAtlas; J7MFH4; baseline and differential.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR PANTHER; PTHR42648:SF24; RETROTRANSPOSON, UNCLASSIFIED-LIKE PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
FT DOMAIN 517..680
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 226..271
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 295..314
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 757..915
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 226..268
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 757..779
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 780..804
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 805..858
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 859..873
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 874..901
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1475 AA; 164782 MW; 8711745C6B44970F CRC64;
MATHAEEIVL VNTNILNVNM SNVTKLTSTN YLMWSRQVHA LFDGYELAGF LDGSTPMPPA
TIGTDAVPRV NPDYTRWRRQ DKLIYSAILG AISMSVQPAV SRATTAAQIW ETLRKIYANP
SYGHVTQLRT QLKQWTKGAK TIDDYMQGFI TRFDQLALLG KPMDHDEQVE RVLENLPDDY
KPVIDQIAAK DTPPSLTEIH ERLINQESKL LALNSAEVVP ITANVVTHRN TNTNRNQNNR
GDNRNYNNNN NRSNSWQPSS SGSRSDNRQP KPYLGRCQIC SVQGHSAKRC PQLHQFQSTT
NQQQSTSPFT PWQPRANLAV NSPYNANNWL LDSGATHHIT SDFNNLSFHQ PYTGGDDVMI
ADGSTIPITH TGSASLPTSS RSLDLNKVLY VPNINKNLIS VYRLCNTNRV SVEFFPASFQ
VKDLNTGVPL LQGKTKDELY EWPIASSQAV SMFASPCSKA THSSWHSRLG HPSLAILNSV
ISNHSLPVLN PSHKLLSCSD CFINKSHKVP FSNSTITSSK PLEYIYSDVW SSPILSIDNY
RYYVIFVDHF TRYTWLYPLK QKSQVKDTFI IFKSLVENRF QTRIGTLYSD NGGEFVVLRD
YLSQHGISHF TSPPHTPEHN GLSERKHRHI VEMGLTLLSH ASVPKTYWPY AFSVAVYLIN
RLPTPLLQLQ SPFQKLFGQP PNYEKLKVFG CACYPWLRPY NRHKLEDKSK QCAFMGYSLT
QSAYLCLHIP TGRLYTSRHV QFDERCFPFS TTNFGVSTSQ EQRSDSAPNW PSHTTLPTTP
LVLPAPPCLG PHLDTSPRPP SLPSPLCTTQ VSSSNLPSSS ISSPSSSEPT APSHNGPQPT
AQPHQTQNSN SNSPILNNPN PNSPSPNSPN QNSPLPQSPI SSPHIPTPST SISEPNSPSS
SSTSTPPLPP VLPAPPIIQV NAQAPVNTHS MATRAKDGIR KPNQKYSYAT SLAANSEPRT
AIQAMKDDRW RQAMGSEINA QIGNHTWDLV PPPPPSVTIV GCRWIFTKKF NSDGSLNRYK
ARLVAKGYNQ RPGLDYAETF SPVIKSTSIR IVLGVAVDRS WPIRQLDVNN AFLQGTLTDE
VYMSQPPGFV DKDRPDYVCR LRKAIYGLKQ APRAWYVELR TYLLTVGFVN SISDTSLFVL
QRGRSIIYML VYVDDILITG NDTVLLKHTL DALSQRFSVK EHEDLHYFLG IEAKRVPQGL
HLSQRRYTLD LLARTNMLTA KPVATPMATS PKLTLHSGTK LPDPTEYRGI VGSLQYLAFT
RPDLSYAVNR LSQYMHMPTD DNWNALKRVL RYLAGTPDHG IFLKKGNTLS LHAYSDADWA
GDTDDYVSTN GYIVYLGHHP ISWSSKKQKG VVRSSTEAEY RSVANTSSEL QWICSLLTEL
GIQLSHPPVI YCDNVGATYL CANPVFHSRM KHIALDYHFI RNQVQSGALR VVHVSTHDQL
ADTLTKPLSR VAFQNFSRKI GVIKVPPSCG GVLRI
//