ID A0A023AVZ6_GRENI Unreviewed; 924 AA.
AC A0A023AVZ6;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
DE Flags: Fragment;
GN ORFNames=GNI_211010 {ECO:0000313|EMBL:EZG42896.1};
OS Gregarina niphandrodes (Septate eugregarine).
OC Eukaryota; Sar; Alveolata; Apicomplexa; Conoidasida; Gregarinasina;
OC Eugregarinorida; Gregarinidae; Gregarina.
OX NCBI_TaxID=110365 {ECO:0000313|EMBL:EZG42896.1, ECO:0000313|Proteomes:UP000019763};
RN [1] {ECO:0000313|EMBL:EZG42896.1, ECO:0000313|Proteomes:UP000019763}
RP NUCLEOTIDE SEQUENCE.
RA Omoto C.K., Sibley D., Venepally P., Hadjithomas M., Karamycheva S.,
RA Brunk B., Roos D., Caler E., Lorenzi H.;
RL Submitted (DEC-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EZG42896.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AFNH02001651; EZG42896.1; -; Genomic_DNA.
DR RefSeq; XP_011133828.1; XM_011135526.1.
DR AlphaFoldDB; A0A023AVZ6; -.
DR EnsemblProtists; EZG42896; EZG42896; GNI_211010.
DR GeneID; 22916500; -.
DR VEuPathDB; CryptoDB:GNI_211010; -.
DR eggNOG; KOG0017; Eukaryota.
DR OMA; WEYISVD; -.
DR OrthoDB; 4271330at2759; -.
DR Proteomes; UP000019763; Unassembled WGS sequence.
DR GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000019763}.
FT DOMAIN 110..337
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 697..856
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT COILED 875..902
FT /evidence="ECO:0000256|SAM:Coils"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EZG42896.1"
SQ SEQUENCE 924 AA; 106390 MW; CFCA0B5C2A041163 CRC64;
RNLPLIRTDL GVALLDTGSD INLARLTSGL RILKDRTPPP NIRDANGQQI TIVGYVNLNA
TLPDNTTSRF TCWVTPQLPV PILVGLPQLD QWGITWDFKT PAGRKDDHIP LPKQTIPAAN
VKPVDLELLD ANTKPIVTKP YVFPAGIRDK ADEAIQDMLK QGIITQSSSP WAFRPRLVFH
PDKPVRICGN YIPLNTLLKG NSYPLPNMEE MLQLLAQGKY FAKVDLTKSF WQLPLTAQSR
QLTAFYGVRG LYEYTRLPFG LKVAPALFQQ TIDQVLKDLP WAFPYVDDIA TIGATEEECA
DRIRAIVETL TQKRFAINYD KSVLEPQKEL DFLGHKIAYK SIVLHPRHVE AMQAQRPPLS
KADLHSFLGL ANCFRRFIPR YAEIAQPLYT TLHQPEWGFG EKELKAWSHL KEVLTQLPNL
HPIDSDSPIV LDTDASQRGI GACLYLQKDN DLFPVAYASH AFTKQEAKWP IRELEAFAIV
WALRHFRQLL LGRNITIRTD HESLRWMRNC DKGRIARWNS SLDEFDLQII YRKGKENQVA
DYLSRHVELD DTTKSIEEDT HPYLSLPALE HPPITLHTGT RIHKYSSLDL ETIKEHSNED
VEAEQLRIQG VLEKHDGCYW EGKRCFVPRD LRNNTLDDFH SPNGLHLGTT KTYRSLSGAY
YWPKMHEQVS QFVKTCPECI RTKARHDSRQ GLPLHVRATE PLSMLMIDVY GPVRRPGRNP
SYILNLLDVA SRYWQISIIT QPFTSSLLWE ALLQKWFLPF TVPERIISDN ATIFHSRYTR
DMTSRLKISW THNAPGFPQA RAPVEIINRH LNNFFRTPEE LPRTWPFQLK LLVQQYNLTH
HESLGFSPAA LLFGRQTPQQ TGITVDLLKD NEQHVMETNE IRRRADQRMK QLQSELDEKM
AHINFNPVAW IILEMRFYCN KGIR
//