GenomeNet

Database: UniProt
Entry: A0A0C2NBS9_THEKT
LinkDB: A0A0C2NBS9_THEKT
Original site: A0A0C2NBS9_THEKT 
ID   A0A0C2NBS9_THEKT        Unreviewed;      1324 AA.
AC   A0A0C2NBS9;
DT   01-APR-2015, integrated into UniProtKB/TrEMBL.
DT   01-APR-2015, sequence version 1.
DT   27-MAR-2024, entry version 38.
DE   RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE            EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN   ORFNames=RF11_02953 {ECO:0000313|EMBL:KII71437.1};
OS   Thelohanellus kitauei (Myxosporean).
OC   Eukaryota; Metazoa; Cnidaria; Myxozoa; Myxosporea; Bivalvulida;
OC   Platysporina; Myxobolidae; Thelohanellus.
OX   NCBI_TaxID=669202 {ECO:0000313|EMBL:KII71437.1, ECO:0000313|Proteomes:UP000031668};
RN   [1] {ECO:0000313|EMBL:KII71437.1, ECO:0000313|Proteomes:UP000031668}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Wuqing {ECO:0000313|EMBL:KII71437.1};
RX   PubMed=25381665; DOI=10.1093/gbe/evu247;
RA   Yang Y., Xiong J., Zhou Z., Huo F., Miao W., Ran C., Liu Y., Zhang J.,
RA   Feng J., Wang M., Wang M., Wang L., Yao B.;
RT   "The genome of the myxosporean Thelohanellus kitauei shows adaptations to
RT   nutrient acquisition within its fish host.";
RL   Genome Biol. Evol. 6:3182-3198(2014).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KII71437.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; JWZT01001764; KII71437.1; -; Genomic_DNA.
DR   EnsemblMetazoa; KII71437; KII71437; RF11_02953.
DR   OMA; HYWESIS; -.
DR   Proteomes; UP000031668; Unassembled WGS sequence.
DR   GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   CDD; cd00303; retropepsin_like; 1.
DR   CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR   CDD; cd01647; RT_LTR; 1.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 2.40.70.10; Acid Proteases; 1.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR   InterPro; IPR001969; Aspartic_peptidase_AS.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041373; RT_RNaseH.
DR   InterPro; IPR001878; Znf_CCHC.
DR   InterPro; IPR036875; Znf_CCHC_sf.
DR   PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF17917; RT_RNaseH; 1.
DR   Pfam; PF00665; rve; 1.
DR   Pfam; PF08284; RVP_2; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SMART; SM00343; ZnF_C2HC; 2.
DR   SUPFAM; SSF50630; Acid proteases; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS00141; ASP_PROTEASE; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
DR   PROSITE; PS50158; ZF_CCHC; 2.
PE   4: Predicted;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW   Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW   Reference proteome {ECO:0000313|Proteomes:UP000031668};
KW   RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679};
KW   Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW   Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT   DOMAIN          236..249
FT                   /note="CCHC-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50158"
FT   DOMAIN          255..270
FT                   /note="CCHC-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50158"
FT   DOMAIN          481..660
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          1008..1167
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
SQ   SEQUENCE   1324 AA;  150990 MW;  C3B30BD2C55A26CB CRC64;
     MDLKPEFLCK NTDIEEWLYN FGLIAVANQW DESRQSAIIP AYLRDDALQA FKDSELPSLT
     PSVERYKRLI TMLKIYRHST DKMSAYYRSF EEVVLSPGMD PGDFVREISR LLSLARPGIS
     KEDLDFFVRQ KLLSALPENI AAIIRICDFG STSELVDKTR IILSANQGNH LQPFTSVIKA
     AESLKTEAPK EVMLSKKVTD CDKNDLGNLL HRIENLEIQA RTNTGSPFNP NSRKECKKCG
     SFGHLTSDCL GLVICKNCKK RGHTVRKCPE YSFDNYLTSS LNRDHQINST QQIYVPCLRG
     NFNALIDTGA TVSLVSHDIV KNLPILSKIT INLNTANKSH IKSLGSVNLP FELSNLNTSW
     DFHVIENLAF DFIVGLDILA KHNGNISINS KNFVTFEQSV NSITVEKLNV SDRLPYSSKN
     ELVDLLREFE DVFAANSTDL GHTNKVVHDI VTKSDQPIKV RPYRISIHQE KEMIKLIDDM
     LKSNVIRHSS SAWSAPAIIV KKKDGSNRLC VDYRKLNEIT VKDEYSLPNI ESMFDKFSNA
     QYISTLDLQS GYWQVALSDE SKPKTAFSPG PGLGLYEYNV VPFGLCNAPA TFQRLMHKLL
     KGLDNCMAYL DDIVIFSESF EGHISDIKKV MERLRMFNLK LKPSKCEFAK KELKFLGFIV
     SGNGLRPDMS NIEPILSWPT PSCKKDIKSF LGACNYYSKF IKNFANLAYP LYRLLRTDRV
     WKWDDECQKS FSSLKSHLKH IPSIGLPDPS RPFQIYTDAS DVALGAVLTQ RLGGVERPIG
     FASQLLNTTQ RRYSTIDREC LAILWGIRKF RHYLYGSHFT VMTDHNPLKY LKSMKDPHGR
     RARWIMELEE YSFTINHIPG RKNVVADALS RNIASTFISS KTSLSEEQAK DPDIVKTIDY
     ISKKNRNPDD SPKIGMSNLL IIDGILVHQS RSGFRPFIPC HQRHEIFDIA HKTSLSHLGV
     KKTLHLLRET SFWPNMREDV DKWIDECHSC AINKHKNYTP RAPLNHIVAS KPFSAWEVDF
     TGPLPVSKKG NKYLIVFIDI FTKWIEAVPV PAITAETASK ALISNIVSRF GIPDSIHSDQ
     GPQFESKLFA QMCSHLNIKK TRTTPYHPMC NGSVERANKT IKQQLRHLVN EFQNDWDECI
     DLVLLSLRSS FNESTKFSPS ELVYGKRIRL PIDLSLERDH PISIPPDYHV HVQNLNQKLN
     KLHKTAFINN SNANQRNKRY YDTKVKGLLF NVGEKVFLKK VQTNKLCPLF DGPYIVEKAD
     HPTYLIRHSL NQQITKRTHF NNLYSGIRVY GGNPKTEEGL ALSNDPPLRR SERLKLKPRV
     FYPK
//
DBGET integrated database retrieval system