ID A0A0C2N457_THEKT Unreviewed; 1358 AA.
AC A0A0C2N457;
DT 01-APR-2015, integrated into UniProtKB/TrEMBL.
DT 01-APR-2015, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=RF11_04767 {ECO:0000313|EMBL:KII68637.1};
OS Thelohanellus kitauei (Myxosporean).
OC Eukaryota; Metazoa; Cnidaria; Myxozoa; Myxosporea; Bivalvulida;
OC Platysporina; Myxobolidae; Thelohanellus.
OX NCBI_TaxID=669202 {ECO:0000313|EMBL:KII68637.1, ECO:0000313|Proteomes:UP000031668};
RN [1] {ECO:0000313|EMBL:KII68637.1, ECO:0000313|Proteomes:UP000031668}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Wuqing {ECO:0000313|EMBL:KII68637.1};
RX PubMed=25381665; DOI=10.1093/gbe/evu247;
RA Yang Y., Xiong J., Zhou Z., Huo F., Miao W., Ran C., Liu Y., Zhang J.,
RA Feng J., Wang M., Wang M., Wang L., Yao B.;
RT "The genome of the myxosporean Thelohanellus kitauei shows adaptations to
RT nutrient acquisition within its fish host.";
RL Genome Biol. Evol. 6:3182-3198(2014).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KII68637.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JWZT01002745; KII68637.1; -; Genomic_DNA.
DR EnsemblMetazoa; KII68637; KII68637; RF11_04767.
DR OMA; KETHPEW; -.
DR Proteomes; UP000031668; Unassembled WGS sequence.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 2.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000031668};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 263..276
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 282..296
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 515..694
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1042..1201
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
SQ SEQUENCE 1358 AA; 154823 MW; E9A4A6A7B35502CA CRC64;
MSSNHNYSSI VVIGSYLRSL HLVPGVQMDL KPEFLCKNTD IEEWLYNFGL IAVANQWDES
RQSAIIPAYL RDDALQAFKD SELPSLTPSV ERYKRLITML KIYRHSTDKM SAYYRSFEEV
VLSPGMDPGD FVREISRLLS LARPGISKED LDFFVRQKLL SALPENIAAI IRICDFGSTS
ELVDKTRIIL SANQGNHLQP FTSVIKAAES LKTEAPKEVM LSKKVTDCDK NDLGNLLHRI
ENLEIQARTN TGSPFNPNSR KECKKCGSFG HLTSDCLGLV ICKNCKKRGH TVRKCPYINN
KCREYSFDNY LTSSLNRDHQ INSTQQIYVP CLRGNFNALI DTGATVSLVS HDIVKNLPIL
SKITINLNTA NKSHIKSLGS VNLPFELSNL NTSWDFHVIE NLAFDFIVGL DILAKHNGNI
SINSKNFVTF EQSVNSITVE KLNVSDRLPY SSKNELVDLL REFEDVFAAN STDLGHTNKV
VHDIVTKSDQ PIKVRPYRIS IHQEKEMIKL IDDMLKSNVI RHSSSAWSAP AIIVKKKDGS
NRLCVDYRKL NEITVKDEYS LPNIESMFDK FSNAQYISTL DLQSGYWQVA LSDESKPKTA
FSPGPGLGLY EYNVVPFGLC NAPATFQRLM HKLLKGLDNC MAYLDDIVIF SESFEGHISD
IKKVMERLRM FNLKLKPSKC EFAKKELKFL GFIVSGNGLR PDMSNIEPIL SWPTPSCKKD
IKSFLGACNY YSKFIKNFAN LAYPLYRLLR TDRVWKWDDE CQKSFSSLKS HLKHIPSIGL
PDPSRPFQIY TDASDVALGA VLTQRLGGVE RPIGFASQLL NTTQRRYSTI DRECLAILWG
IRKFRHYLYG SHFTVMTDHN PLKYLKSMKD PHGRRARWIM ELEEYSFTIN HIPGRKNVVA
DALSRNIAST FISSKTSLSE EQAKDPDIVK TIDYISKKNR NPDDSPKIGM SNLLIIDGIL
VHQSRSGFRP FIPCHQRHEI FDIAHKTSLS HLGVKKTLHL LRETSFWPNM REDVDKWIDE
CHSCAINKHK NYTPRAPLNH IVASKPFSAW EVDFTGPLPV SKKGNKYLIV FIDIFTKWIE
AVPVPAITAE TASKALISNI VSRFGIPDSI HSDQGPQFES KLFAQMCSHL NIKKTRTTPY
HPMCNGSVER ANKTIKQQLR HLVNEFQNDW DECIDLVLLS LRSSFNESTK FSPSELVYGK
RIRLPIDLSL ERDHPISIPP DYHVHVQNLN QKLNKLHKTA FINNSNANQR NKRYYDTKVK
GLLFNVGEKV FLKKVQTNKL CPLFDGPYIV EKADHPTYLI RHSLNQQITK RTHFNNLYSG
IRVYGGNPKT EEGLALSNDP PLRRSERLKL KPRVFYPK
//