ID A0A2K3PM23_TRIPR Unreviewed; 950 AA.
AC A0A2K3PM23;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 28-MAR-2018, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon TNT 1-94 {ECO:0000313|EMBL:PNY16332.1};
DE Flags: Fragment;
GN ORFNames=L195_g013051 {ECO:0000313|EMBL:PNY16332.1};
OS Trifolium pratense (Red clover).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; Hologalegina; IRL clade; Trifolieae; Trifolium.
OX NCBI_TaxID=57577 {ECO:0000313|EMBL:PNY16332.1, ECO:0000313|Proteomes:UP000236291};
RN [1] {ECO:0000313|EMBL:PNY16332.1, ECO:0000313|Proteomes:UP000236291}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Tatra {ECO:0000313|Proteomes:UP000236291};
RC TISSUE=Young leaves {ECO:0000313|EMBL:PNY16332.1};
RX PubMed=24500806; DOI=10.3732/ajb.1300340;
RA Istvanek J., Jaros M., Krenek A., Repkova J.;
RT "Genome assembly and annotation for red clover (Trifolium pratense;
RT Fabaceae).";
RL Am. J. Bot. 101:327-337(2014).
RN [2] {ECO:0000313|EMBL:PNY16332.1, ECO:0000313|Proteomes:UP000236291}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Tatra {ECO:0000313|Proteomes:UP000236291};
RC TISSUE=Young leaves {ECO:0000313|EMBL:PNY16332.1};
RX PubMed=28382043; DOI=10.3389/fpls.2017.00367;
RA Istvanek J., Dluhosova J., Dluhos P., Patkova L., Nedelnik J., Repkova J.;
RT "Gene Classification and Mining of Molecular Markers Useful in Red Clover
RT (Trifolium pratense) Breeding.";
RL Front. Plant Sci. 8:367-367(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PNY16332.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASHM01008435; PNY16332.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2K3PM23; -.
DR Proteomes; UP000236291; Unassembled WGS sequence.
DR ExpressionAtlas; A0A2K3PM23; baseline.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.565.10; Histidine kinase-like ATPase, C-terminal domain; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR036890; HATPase_C_sf.
DR InterPro; IPR045261; MORC_ATPase.
DR InterPro; IPR013103; RVT_2.
DR PANTHER; PTHR23336:SF44; PROTEIN MICRORCHIDIA 6; 1.
DR PANTHER; PTHR23336; ZINC FINGER CW-TYPE COILED-COIL DOMAIN PROTEIN 3; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF13589; HATPase_c_3; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF55874; ATPase domain of HSP90 chaperone/DNA topoisomerase II/histidine kinase; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000236291}.
FT DOMAIN 263..327
FT /note="GAG-pre-integrase"
FT /evidence="ECO:0000259|Pfam:PF13976"
FT DOMAIN 516..611
FT /note="Reverse transcriptase Ty1/copia-type"
FT /evidence="ECO:0000259|Pfam:PF07727"
FT REGION 425..452
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 950
FT /evidence="ECO:0000313|EMBL:PNY16332.1"
SQ SEQUENCE 950 AA; 105977 MW; 8089FB8A281EC0D8 CRC64;
MLSSVEKIEA VMKFFVNEMG WDSLVLAKKC CLHENAVDVW KDLKERCLQG DRVRVATLYQ
EISNFKQGNS RVSDYFTEMC AMWEELDQFR PIPQCTCPYM SHVLLMEPLP NINKVLSMVL
QDERQQNYGV NVSIDSKHEE TEVLANAVEN LGARRGFGRG RGNGGNQFGN NQYGRGRGNP
YKEKVCTYCG KNGHIVDICY KKHGYPPNWG YGRGNQGNAY ANNVEDENNE GYNDGGNMQM
KASDEGNEKD TMRRIGLAKQ LDGLYYWKLE QVSSIVRGNS VSVNSGRLWH LRLGHLSAER
MKCLNKKFSY IPVLDHDPFD ECHMAKQKKL PFPGTSFDPR ASKCVFLGDK QGMKGYVILD
THSRSYSVSR NVEFYELEFP YKPTKCTPQS LPSVSHKESI SSIVPYTDPD VDTFESVADI
PSCTSIPPTL TPSAPSELDT QQPSDPPNHD TQLLLRSSTR ARNPSSYLTD YIDALDSIHR
WHVHQLDVNN GFLHGDLNEA VYMKVPQGVV SPKPGQLYHT LFIESSGTTF LALLVYVDDI
LLAGPDIAEF DSIKSALHST FCIKNLGQLK FFLGLDVAHS SHGISICQRK YCLELLEDSG
LTHCKPATTP LDPAAKLSTD DRPLFADISA YRRLVGRLLY LTTTRPDISH ATQQLSQFMN
SDVQVLGFSN ADWGGCLETR RSISGYFFFI GHSLISWKSK KQSTVSCSSA EAEYRALAAA
TRELQWISFL LKDLAQSCAR QAVLYYDSQS AMHIAANPVF HQPTKHLDID CHIVRLKIQE
GLMRLLPVPS ASQVADIFTK ASHPRPFHAL VTSSASYSSS NSGKNYLHVH PMFLHSNATS
HKRAFGAVAE LLDNAVDEIQ NGATFVSIDK TSNPRDGSPA LLIHGMLLLE LFDIDDGGGM
DPEAMRRCMS FGFSDKNSKL SIGQYGNGFK TSSMRLGADA IVFSRHLNNG
//