ID A0A2B4SX38_STYPI Unreviewed; 2142 AA.
AC A0A2B4SX38;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 03-MAY-2023, entry version 19.
DE SubName: Full=RNA-directed DNA polymerase from mobile element jockey {ECO:0000313|EMBL:PFX32997.1};
GN Name=pol {ECO:0000313|EMBL:PFX32997.1};
GN ORFNames=AWC38_SpisGene2095 {ECO:0000313|EMBL:PFX32997.1};
OS Stylophora pistillata (Smooth cauliflower coral).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Scleractinia;
OC Astrocoeniina; Pocilloporidae; Stylophora.
OX NCBI_TaxID=50429 {ECO:0000313|EMBL:PFX32997.1, ECO:0000313|Proteomes:UP000225706};
RN [1] {ECO:0000313|Proteomes:UP000225706}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Voolstra C.R., Li Y., Liew Y.J., Baumgarten S., Zoccola D., Flot J.-F.,
RA Tambutte S., Allemand D., Aranda M.;
RT "Comparative analysis of the genomes of Stylophora pistillata and Acropora
RT digitifera provides evidence for extensive differences between species of
RT corals.";
RL bioRxiv 0:0-0(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PFX32997.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSMT01000016; PFX32997.1; -; Genomic_DNA.
DR EnsemblMetazoa; XM_022950598.1; XP_022806333.1; LOC111343422.
DR Proteomes; UP000225706; Unassembled WGS sequence.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0006281; P:DNA repair; IEA:UniProt.
DR CDD; cd22343; PDDEXK_lambda_exonuclease-like; 1.
DR CDD; cd15505; PHD_ING; 1.
DR CDD; cd01650; RT_nLTR_like; 1.
DR Gene3D; 3.90.320.10; -; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR011604; PDDEXK-like_dom_sf.
DR InterPro; IPR029526; PGBD.
DR InterPro; IPR011335; Restrct_endonuc-II-like.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR019080; YqaJ_viral_recombinase.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR InterPro; IPR007527; Znf_SWIM.
DR PANTHER; PTHR47526; ATP-DEPENDENT DNA HELICASE; 1.
DR PANTHER; PTHR47526:SF1; SWIM-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF13843; DDE_Tnp_1_7; 2.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF09588; YqaJ; 1.
DR SMART; SM00249; PHD; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF52980; Restriction endonuclease-like; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
DR PROSITE; PS50966; ZF_SWIM; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleotidyltransferase {ECO:0000313|EMBL:PFX32997.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000225706};
KW RNA-directed DNA polymerase {ECO:0000313|EMBL:PFX32997.1};
KW Transferase {ECO:0000313|EMBL:PFX32997.1};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00325}.
FT DOMAIN 1129..1388
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1665..1701
FT /note="SWIM-type"
FT /evidence="ECO:0000259|PROSITE:PS50966"
FT DOMAIN 2091..2141
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT REGION 31..60
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 553..577
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 36..60
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2142 AA; 244317 MW; CBD54B316B26A2BA CRC64;
MAAAELTGLT FVDEDDLPVA FLRIAHVHSD NEAPSDIDSS EGESADESGE ESDECEDTEE
RDYSNLMWSS VIRPPQDRNF NEEVGMRVEM ENNSSCLNYF ELLFTDNVYQ LILNETARFE
RQKSHLDPNS RGHLHNLTVP ELKAWLGLTL AMGLVKKPNF KSYWCNKSVI KTPLFPNKMS
RDRYLHILRF MHFVDNNNAP DPADPNRDKL WKIRPFLNAL LPRFTTVYSP SQNLSVDETF
IKFKGHSATG YVLDTMIYTR KEGPAVSRDL AMRVVLKLVE PYVDKGYLLF VDNWYTSVPL
FLELERRGIL ACGTVRGNRK FLPKDIVDQT KEQVKRLKKG ESLFRQNNNL VCVTWKDKKL
VHLLSTIPEG LEIGQVERKV RSKGRWQKQN FAQPKVIKMY NSHMGGVDLG DKRIATSSRL
MKGNIWYYKI FFLMLEVSAL NAHIMHKRAG HGKVTLAAFK EKLVEQLIAG NSFRRDTTNN
LSAIAAQLPD IHFNRVQFHH PVKTDTHKKC KVHIQRVETV YECAVCQVRT CPAPCFERYH
TLQEYLFDDP KRNNNANRLK DVTGRPRAGP GRPPQRSAHV KKLRSCLLXP GPTQTSAHFS
SCDSSFSSCF SNESFVSDSS AASDNEDSVL STYYDLGLGD RGLRLGHWNV NYLTMAKFEE
IKLCLLNADG KAQLDILFLS ETFLKASDPD TLYSATGFNT LRRDRMTNGG GILALVNNEL
EFKRRMDLEQ QGIESIWLEV SPYKSNRSLI IGCVYRPPNQ KKQLDIDIEE HIERIHLLNK
ETIFLTDINI DYKNRPKYDN HRLIKGLRIM HFKQLVDFIT RPVSKTCLDH VYSNQPQRIS
SVSCHNIGLA DHVPVFVVRK YARDNHKAHN STRITYRNMK RFDEEAFKQS LQEAPWDTAF
VFDDIDDIVH SWEDIFNSIL DSHCPWRVKR VKQDTQAPWM TKKVLKQLHT RDHLLKVARL
SDDSDDWSKY RAARNYAVSM IRSAKRDFYA TSFQDNKNNP RAIWKSIKTL TGANRNTDAI
KKLEVDGRVI EESSEMSEQF NCYFSSIADK LRNQLCHVNY DLSKLINFVA SRKDPDVSFM
VPAITSAQVS AIMMKISSHK ATGIDGISAR LLRIGMPAIA PCIARLINLS MSTGKFPTRW
KTAKVTPLFK GGALSDPSNY RPISVLPVLS KIIERHMYNS LYAFLTEQNL IYSRQSGFRK
HHSTETALIK IVDELLFNLD RNKVSGLVLV DYAKAFDMVD HELLLKKLEV YGVKNQELNW
CQSYLSDRKQ VVCLDGNKSS EAFMRHGVPQ GSILGPLFFI LFINDLPLHV SGTIDLYADD
TTISASADVN NIPSLQSSLK TSFGEIQQWA MANKLPLNES KTKVLTVTGK RLAPRIQQDA
LVILGTSLKA LANVDCVSLL GLNIDSALSF NAHADKVCKK LASRIAVLRK IRTYLPLPQR
IQYYNSIISP VMSYVSAIWS NCDKELLYRV FKLQKRAARV ILYAERMAPS VELFNRLKWI
PFYEKCKIDK ASIMFKRIHG ALPSYLNEHI SINNSRHSRT TRYSNFNVLC PRYNRETEDG
PTRERYKQKA NLVGFDPFDL RKSDLSEDLG LIPGVEYPDI VNYLILQTSW ATNSEMKAYK
SLDAFNFFIS GWVNTLMMKE VTETTVVVLT RVNHSQRASE KPLKAWILAE YSGKVITAHC
DCMSGKSECC SHVAAILFAS EAACRMHSST TCTQTKSQWL MPGYVKEIPY AAIEEIDFTS
AKKKHSQLLE DKKVPTALRN SDEVVQRPLT ITSYEEQLNF FQKISLSDSK PAILSLISPY
NTKYIPKEAA LPQPLTHLYD ANATELNYDE LLSKCIDRSL QISITDEEIK AIEEGTREQA
STNVWYKQRA GRITASNFKS ACHTNISRPS PSLIKRICYP ESTKFSSAAT AWGCKNETNA
RKAYLEKTNR CHDDFSICDS GLQINKQWPH IGASPDGLVN CKCCGAGVCE IKCPYSAKDL
SPTDPHVISD KNYCLQNDSD SIYLNRTHAN YYQVQCQMFI CNVEYCDFIV WTPHGLYVER
IMPDVEFWSA SVAKVTEFFK IGILPEIVGK LYTRPSLPST LVAPSPDDDS AEWCICQRYI
EDSTLVGCDN DDCKVKWFHL QCLRLKNPPK GRWLCMDCQK LD
//