ID R9IQ43_9FIRM Unreviewed; 634 AA.
AC R9IQ43;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0000259|PROSITE:PS50878};
GN ORFNames=C804_05304 {ECO:0000313|EMBL:EOS23926.1};
OS Lachnospiraceae bacterium A4.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=397291 {ECO:0000313|EMBL:EOS23926.1, ECO:0000313|Proteomes:UP000014118};
RN [1] {ECO:0000313|EMBL:EOS23926.1, ECO:0000313|Proteomes:UP000014118}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=A4 {ECO:0000313|EMBL:EOS23926.1,
RC ECO:0000313|Proteomes:UP000014118};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium A4.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS23926.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASSR01000130; EOS23926.1; -; Genomic_DNA.
DR AlphaFoldDB; R9IQ43; -.
DR STRING; 397291.C804_05304; -.
DR PATRIC; fig|397291.3.peg.5488; -.
DR eggNOG; COG3344; Bacteria.
DR HOGENOM; CLU_013584_14_3_9; -.
DR OrthoDB; 9788687at2; -.
DR Proteomes; UP000014118; Unassembled WGS sequence.
DR GO; GO:0004519; F:endonuclease activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00085; HNHc; 1.
DR CDD; cd01651; RT_G2_intron; 1.
DR Gene3D; 1.10.30.50; -; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR030931; Group_II_RT_mat.
DR InterPro; IPR002711; HNH.
DR InterPro; IPR003615; HNH_nuc.
DR InterPro; IPR000477; RT_dom.
DR NCBIfam; TIGR04416; group_II_RT_mat; 1.
DR PANTHER; PTHR34047:SF2; NUCLEAR INTRON MATURASE 1, MITOCHONDRIAL; 1.
DR PANTHER; PTHR34047; NUCLEAR INTRON MATURASE 1, MITOCHONDRIAL-RELATED; 1.
DR Pfam; PF01844; HNH; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00507; HNHc; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000014118}.
FT DOMAIN 86..350
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
SQ SEQUENCE 634 AA; 73307 MW; E110AB2904A7041E CRC64;
MATGKAQKKD KLRHAEYYDF QEIQDKLYAD SSKGKVFKHL VEIIALPENI RLAYRNIKKN
HGSMTPGTDG KTITELATLS DEQLISSVQH KLDWYVPQSV RRVEIPKGND PTKKRPLGIP
TIMDRLIQQC VLQVMEPICE AKFHDHSYGF RPNRNQQHAI AQVHKDMQRS NLHYVVDIDI
KGFFDNVNHG KLLKQLWTMG IHDKKLLCIL SAMLKAEVAG IGFPEVGTPQ GGIISPLLSN
VVLNELDWWL ASQWEEIPTE FPYKETVNPN GSVSKSHKFS ALRRTKLKEV TCVRYADDFK
IFTNSYQNAV KLFHATKSWL HERLALEISP EKSKVINLKE QYSEFLGFKL KVIPRGKRSD
GRTKYVVESH IREKSIAKIK PNLKRLIYDI EFPSQGKRTE YQAICRYNVY VMGIHDYYQL
ATKVCDDLTP FALSVHKSLR ARLKERVKTA KQVRKRKLPC EVPKVIRERY GKSKQLRYLA
GHAVVPVGYI RHKPPKSMNR KINSYTPEGR AEIHKNLDRI DMTVLHYLMR NPVLYRTIEY
NDNRLSLYSA QMGKCAVSGK VLSIGDIHCH HKVPRYLGGK DNYQNLVLVC EDVHHLIHAT
NPDIIRKYME ILGLDQKQKE KLNKLRSLVH VESY
//