ID A0A2B4R8I4_STYPI Unreviewed; 835 AA.
AC A0A2B4R8I4;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE SubName: Full=Retrovirus-related Pol polyprotein from transposon 17.6 {ECO:0000313|EMBL:PFX13456.1};
GN Name=pol {ECO:0000313|EMBL:PFX13456.1};
GN ORFNames=AWC38_SpisGene22456 {ECO:0000313|EMBL:PFX13456.1};
OS Stylophora pistillata (Smooth cauliflower coral).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Scleractinia;
OC Astrocoeniina; Pocilloporidae; Stylophora.
OX NCBI_TaxID=50429 {ECO:0000313|EMBL:PFX13456.1, ECO:0000313|Proteomes:UP000225706};
RN [1] {ECO:0000313|Proteomes:UP000225706}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Voolstra C.R., Li Y., Liew Y.J., Baumgarten S., Zoccola D., Flot J.-F.,
RA Tambutte S., Allemand D., Aranda M.;
RT "Comparative analysis of the genomes of Stylophora pistillata and Acropora
RT digitifera provides evidence for extensive differences between species of
RT corals.";
RL bioRxiv 0:0-0(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PFX13456.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSMT01000973; PFX13456.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2B4R8I4; -.
DR STRING; 50429.A0A2B4R8I4; -.
DR Proteomes; UP000225706; Unassembled WGS sequence.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR InterPro; IPR024983; CHAT_dom.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF12770; CHAT; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000225706}.
FT DOMAIN 19..200
FT /note="CHAT"
FT /evidence="ECO:0000259|Pfam:PF12770"
FT DOMAIN 375..478
FT /note="Reverse transcriptase RNase H-like"
FT /evidence="ECO:0000259|Pfam:PF17917"
FT DOMAIN 503..561
FT /note="Integrase zinc-binding"
FT /evidence="ECO:0000259|Pfam:PF17921"
FT REGION 754..808
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 646..673
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 786..808
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 835 AA; 95135 MW; 479665042CF029B1 CRC64;
MSFSDPTKDI TSTKLLLCYK IIISPVVHLL KEPEIIIVLD PCMYQVPFAA LTDQEGKCLS
ETKRICTVPS LTTLKVIQDS PPDYHSQTGA LILGDPKVGV VLYKDRRKDP SPLPCAKREA
DMIVELFGVT PLVGEHATKQ AVLHAITSAS LIHLAANGSD ERGEIFLSPK SANSCVPPRE
ESYLLAMREI SLIQLRAKLA IHFPLKNFDK YTQFMAVSIL SPDRAEAVKQ QGKCYNYLPP
ATDIESVDEH LVHLEEVFKR LREANIKLNP KECDFVKQRV EYLSHIVTPE GVSPNSEKIR
VVQEFPTPIN LKELRNFLGL ANHYRRFVKG FSHIANPLNA LTKKGFSFNW TEECAVAFDK
LKRALVSAPI LAYPNFKEPF LLFVDASSTG IGFTLAQVQN GKEIAIAYNG RGLNSAERYY
STTEREALAF IEGIKKFQPY LQNRKFTVVT DHSSLRWLMN VKDVSGRLAR WALLLQQYDF
EIIHSPGNNR KRNSWDSFPQ LVVPPALRFE ILSIMHDHIS GAHFGVHKTF NKVKQRYWWK
GMYKDVEHWC KSCTECSMSK SPRNTKKAPL LPIPVENAFD RVTVDVLGPF PPSNKGNRTS
ILEAISHSPL YVLYGREPPL PMDVKYLPPL DDDVTASVFG HRKRIVENIE LAQNMARENL
QRAQQKIKDY YDQNAKEPVF EVGQRVWVNT PRTKKGLSRK LLHNWSGPYR IVEQSSPVHF
RLRTDTNKKV TFAVHANRMK PLIDPSLRLI EPPLVDDPRE PYLDESDIPD DNFQSELPVD
RKVNSRPPVS DTTDSSSHSD NQVQPCSIPR RKTTIELFKR NEFLNLVERK AKWNI
//