ID A0A2X0MQ76_9BASI Unreviewed; 2646 AA.
AC A0A2X0MQ76;
DT 12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT 12-SEP-2018, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN Name=BQ5605_C024g09932 {ECO:0000313|EMBL:SGZ26524.1};
GN ORFNames=BQ5605_C024G09932 {ECO:0000313|EMBL:SGZ26524.1};
OS Microbotryum silenes-dioicae.
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina;
OC Microbotryomycetes; Microbotryales; Microbotryaceae; Microbotryum.
OX NCBI_TaxID=796604 {ECO:0000313|EMBL:SGZ26524.1, ECO:0000313|Proteomes:UP000249464};
RN [1] {ECO:0000313|EMBL:SGZ26524.1, ECO:0000313|Proteomes:UP000249464}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Jaros S., Januszkiewicz K., Wedrychowicz H.;
RL Submitted (NOV-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FQNC01000086; SGZ26524.1; -; Genomic_DNA.
DR Proteomes; UP000249464; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR032549; DUF4939.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR24559:SF437; RIBONUCLEASE H; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF16297; DUF4939; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000249464};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Transposable element {ECO:0000256|ARBA:ARBA00022464}.
FT DOMAIN 1588..1767
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 2137..2296
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 2438..2499
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 34..203
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 302..326
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 363..384
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1218..1263
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1969..1991
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 961..995
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 45..74
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 97..125
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 179..200
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 366..384
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1218..1246
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2646 AA; 299299 MW; 38BCD85BBDA48AD8 CRC64;
MHTAVRHDYE REAACARCHF VGHVETCALE QERRQKEANN NTGRHNKHHN NPTNTTTNND
DTTNTNNTDN DASEVVAPSR ALGTDQLESR NRFAGLEVEG GQEDHVEEED AGKQGKEKEQ
VEEGDEDTGA KADDEDSDSD GNAGAGIDKA GGDGKDDDKD EQDDSSEGSE RRGDDVMKDG
AEEEDEGMET EVEEEVEQEQ GVQDDRIMII DDGQTTITTV ARESTPRSPS WPPLSPETLA
KSLREGAEWD RANAEAAARV QAEKEAIINA QLDAEEPLVR HQFAEWGQED GQLRFINIRG
TGSRSNKKMK RSQTVADLSD PSDDPVDVKR VLNKVKGRAG KEVAGQGKRV TRSMSAAAAM
QVEGAKAKAE KGKGVEEEKR WSDHEPKDDI LPEFPLFEDD TTAKSTWNVR RCMGIPEKRR
NIYTYLSTFK ASAILLQEHF IKPELWQCIK NEYEGKVFIS KHCLTLIPAD SPLIDAEILR
THSALDGRIL VTSFRLRGDI RILEINNLYA PVDTKQRLTF FDKLHFHRTQ STHLRLLGGD
LNDCPLPAID RRNQGRHGHH WPILIGKLDS PYTDCIRFKH PLTPSFTRPN IVRKRPKSFS
RLDYFLLQRT HQKRLVQAST IYDHPKDLSD HRPVSIVLVL SDERGDAAQP NNSLPTTSNQ
LHRINAATFK TAAFQGMLEG WLEGASGEEP VGELEVVLGN CEKEGRELAR REHRERVERM
AVLVGRVQDL EALPVMGDEE TAEWTRTTEE LRRAVNERAR QLRIRAHVPE IATEERLSRP
VHAKLAARSS DSKITALRLP NGELTHDIDV ALDHTQAHFQ RLYNLEPRDR DHVERLRDDF
LAPIRAARTC DDPSSDPLFL RRLSEAHIEL LQQPITEDEV VAAIATTHPG RSPGPSGVPY
ELYHTAPRAW AKAVTNQQSE YSYKNNYINP ASRPVILAIA CCLRLSNLNP DSRSRFPIDL
VDLTQASLDS LIDQLRTLQA RNQALEAECR SHLSDKENYQ ITVTALANQQ QTLSDFAKAV
VDQQKESIDV IKSGLVNLNV TAPANTTITP VKSQLAKPDK YDGKEKVKFK TFITQIKFYI
FGNPSSFPTD ESKIAFIISH LTGDAFQHFE HAINAKDDSK PEWLTNYQKF LDQAELVLGD
PDYRNNLTHI VTVGLESGRT HCALPQGSQA QLALHDDPQD LQSLIELAIK VDNKLHLARH
RTNASTQFRQ TNWQYSHSVN SPLRTSPVSQ LQANPSTANT SGPAPMDLDA TRSRRGPLSE
EEKLRRRTNR LCMWCASDQH LRDQCPTAPP MIPRHTALNA SPRMSCTGTL FPQVSSPPSP
RYSQQNSTRF PKTTLFCPSY SISSPHPDLA TSFVHDNHLQ LIAHPTPIPL YVIDGRPIQS
GNITHFVHLE VQFNGHTQSL RADVTQLGTY PLVLGMPWLR LHNPIIDWKR NTLVYSCQSC
ALGHTQPINV SIEGAPLVPL DHKHLDISFA SSFAFERLVN NSDNHHGLLF YDPTSHQLSS
SSPAPSQSLS DDNQDSLEYL ESLKNLVPSE YYHLLAAFSK VKADQLPLHR KFDLSIDLED
NTTPPFGPLY PLSETELQTL SSWLKENLSK NFIRASTSPA GAPVLFVRKK DGSLRLCVDY
RGLNKITRKN RYPLPLIPEA LDRIRGAKIY TKLDLRSGYN LVRIKEGDEW KTAFRTRYGH
FECLVMPFGL TNAPAAFQHL MNSIFRDLLD VSVLVYLDDI LIFSGDECQH TRHVQEVLQR
LINNKLYCNP KKCEFNRTST EYLGFIISPS GVSMSQDKVK AITSWPTPTS LKELQQFLGF
CNFYRRFIEG YSRVIAPLTR LLKKNTPFLL DSAALSSLDR LKQIFTSGAI LCHFNPLLPS
IIETDASDFA ISGILSQVTD GHLRPVAFMS RKMLPAEQNY EIHDKELLAI VECIKIWRHY
LEGSQHPFKI YTDHAALQYF QTKRVLTRRQ ARWSETVNHH KYTIEYRSGS KNNKADTLSR
RPDFSEGGKA SEQPGQILLR PYTLAASLVK FSPPSDIVDL IKFHLSQDPV SNQIVNDLNH
DSTLHPYVKL QDNLLLHHDK IYIPNAEPLK VKLLAQAHDS LLSGHPGQVK TFELLDRNYT
WPGMRQFVNN YVKTCDSCQR NKPTHHRKHG HLQPLPIPSK PWSSLSMDHI VDLPPSSGFD
CVLVVVDRLT KEAHFIPTHK TDSSRDLART FLTHVFKLHG LPTDIVSDRG ATFTSNWWSE
FLAMLKIKPN LSTAFHPESD GQTERTNQTL EHYLRHFCDY LQTNWSELLP LAEFAYNNSF
HSSIGASPFY VTRGYHPRLE VSLRDSFVTD VPKYLQHLRS VQETARKQIL QAQETQARFA
NLKRKPSPPF KIGDQVLLNR KNIQTSRPSS KLDSHKLGPF RIQRIISPVA FKLELPASMK
IHPVFHVSLL EPYQANSLAS RCSNPPPPPE IINGEEEYQV EQILDSRNNR RSRRLEYFVD
WTGYGPQDRQ WVSAADFDDD DSLVIEFHTR YPHKPGFERI QGLNGARASS YLYDLAPRSL
PLASTCFHEF LSLSLGFPAT INRASYSYKN NYINPASPTS TLTKALSRAF NAMTERGSLS
PKQGQGLVRL LFKHHKIGAD RAELSSYRPI TLRECDYKLF TKVYVARLNH VLPDLLPPQQ
HGRTSK
//