ID A0A0C2NBS9_THEKT Unreviewed; 1324 AA.
AC A0A0C2NBS9;
DT 01-APR-2015, integrated into UniProtKB/TrEMBL.
DT 01-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=RF11_02953 {ECO:0000313|EMBL:KII71437.1};
OS Thelohanellus kitauei (Myxosporean).
OC Eukaryota; Metazoa; Cnidaria; Myxozoa; Myxosporea; Bivalvulida;
OC Platysporina; Myxobolidae; Thelohanellus.
OX NCBI_TaxID=669202 {ECO:0000313|EMBL:KII71437.1, ECO:0000313|Proteomes:UP000031668};
RN [1] {ECO:0000313|EMBL:KII71437.1, ECO:0000313|Proteomes:UP000031668}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wuqing {ECO:0000313|EMBL:KII71437.1};
RX PubMed=25381665; DOI=10.1093/gbe/evu247;
RA Yang Y., Xiong J., Zhou Z., Huo F., Miao W., Ran C., Liu Y., Zhang J.,
RA Feng J., Wang M., Wang M., Wang L., Yao B.;
RT "The genome of the myxosporean Thelohanellus kitauei shows adaptations to
RT nutrient acquisition within its fish host.";
RL Genome Biol. Evol. 6:3182-3198(2014).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KII71437.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JWZT01001764; KII71437.1; -; Genomic_DNA.
DR EnsemblMetazoa; KII71437; KII71437; RF11_02953.
DR OMA; HYWESIS; -.
DR Proteomes; UP000031668; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 2.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000031668};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 236..249
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 255..270
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 481..660
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1008..1167
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
SQ SEQUENCE 1324 AA; 150990 MW; C3B30BD2C55A26CB CRC64;
MDLKPEFLCK NTDIEEWLYN FGLIAVANQW DESRQSAIIP AYLRDDALQA FKDSELPSLT
PSVERYKRLI TMLKIYRHST DKMSAYYRSF EEVVLSPGMD PGDFVREISR LLSLARPGIS
KEDLDFFVRQ KLLSALPENI AAIIRICDFG STSELVDKTR IILSANQGNH LQPFTSVIKA
AESLKTEAPK EVMLSKKVTD CDKNDLGNLL HRIENLEIQA RTNTGSPFNP NSRKECKKCG
SFGHLTSDCL GLVICKNCKK RGHTVRKCPE YSFDNYLTSS LNRDHQINST QQIYVPCLRG
NFNALIDTGA TVSLVSHDIV KNLPILSKIT INLNTANKSH IKSLGSVNLP FELSNLNTSW
DFHVIENLAF DFIVGLDILA KHNGNISINS KNFVTFEQSV NSITVEKLNV SDRLPYSSKN
ELVDLLREFE DVFAANSTDL GHTNKVVHDI VTKSDQPIKV RPYRISIHQE KEMIKLIDDM
LKSNVIRHSS SAWSAPAIIV KKKDGSNRLC VDYRKLNEIT VKDEYSLPNI ESMFDKFSNA
QYISTLDLQS GYWQVALSDE SKPKTAFSPG PGLGLYEYNV VPFGLCNAPA TFQRLMHKLL
KGLDNCMAYL DDIVIFSESF EGHISDIKKV MERLRMFNLK LKPSKCEFAK KELKFLGFIV
SGNGLRPDMS NIEPILSWPT PSCKKDIKSF LGACNYYSKF IKNFANLAYP LYRLLRTDRV
WKWDDECQKS FSSLKSHLKH IPSIGLPDPS RPFQIYTDAS DVALGAVLTQ RLGGVERPIG
FASQLLNTTQ RRYSTIDREC LAILWGIRKF RHYLYGSHFT VMTDHNPLKY LKSMKDPHGR
RARWIMELEE YSFTINHIPG RKNVVADALS RNIASTFISS KTSLSEEQAK DPDIVKTIDY
ISKKNRNPDD SPKIGMSNLL IIDGILVHQS RSGFRPFIPC HQRHEIFDIA HKTSLSHLGV
KKTLHLLRET SFWPNMREDV DKWIDECHSC AINKHKNYTP RAPLNHIVAS KPFSAWEVDF
TGPLPVSKKG NKYLIVFIDI FTKWIEAVPV PAITAETASK ALISNIVSRF GIPDSIHSDQ
GPQFESKLFA QMCSHLNIKK TRTTPYHPMC NGSVERANKT IKQQLRHLVN EFQNDWDECI
DLVLLSLRSS FNESTKFSPS ELVYGKRIRL PIDLSLERDH PISIPPDYHV HVQNLNQKLN
KLHKTAFINN SNANQRNKRY YDTKVKGLLF NVGEKVFLKK VQTNKLCPLF DGPYIVEKAD
HPTYLIRHSL NQQITKRTHF NNLYSGIRVY GGNPKTEEGL ALSNDPPLRR SERLKLKPRV
FYPK
//