ID A0A2B4RJL5_STYPI Unreviewed; 1018 AA.
AC A0A2B4RJL5;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 22-FEB-2023, entry version 18.
DE SubName: Full=Retrovirus-related Pol polyprotein {ECO:0000313|EMBL:PFX16680.1};
GN Name=pol {ECO:0000313|EMBL:PFX16680.1};
GN ORFNames=AWC38_SpisGene19031 {ECO:0000313|EMBL:PFX16680.1};
OS Stylophora pistillata (Smooth cauliflower coral).
OC Eukaryota; Metazoa; Cnidaria; Anthozoa; Hexacorallia; Scleractinia;
OC Astrocoeniina; Pocilloporidae; Stylophora.
OX NCBI_TaxID=50429 {ECO:0000313|EMBL:PFX16680.1, ECO:0000313|Proteomes:UP000225706};
RN [1] {ECO:0000313|Proteomes:UP000225706}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Voolstra C.R., Li Y., Liew Y.J., Baumgarten S., Zoccola D., Flot J.-F.,
RA Tambutte S., Allemand D., Aranda M.;
RT "Comparative analysis of the genomes of Stylophora pistillata and Acropora
RT digitifera provides evidence for extensive differences between species of
RT corals.";
RL bioRxiv 0:0-0(2017).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PFX16680.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSMT01000526; PFX16680.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2B4RJL5; -.
DR Proteomes; UP000225706; Unassembled WGS sequence.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR023779; Chromodomain_CS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR46389; POLYCOMB GROUP PROTEIN PC; 1.
DR PANTHER; PTHR46389:SF3; POLYCOMB GROUP PROTEIN PC; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR SMART; SM00298; CHROMO; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR PROSITE; PS00598; CHROMO_1; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000225706}.
FT DOMAIN 967..1018
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT REGION 219..238
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 373..396
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 898..966
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 907..922
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 935..954
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1018 AA; 114405 MW; EB02FC0D41CD6A97 CRC64;
MSRQPLLHPI SNLEYHSFVH FLNQLIDRRG LTHNQMRSVG FLREARNDSV VNNRHNSLDS
NQLSLVPAYA EMEAQVSQLC QAVERMQASV NSQNRSGGVK SSVTFPAFRG DESEDAHEFV
RNYKRAGRLN GWDSNSLALG LPLYLKGYGS AWFRTLPRAD EMSFEELSEK LITHFASGAS
EWRVFVQGLI PEIRDKAKSA DISVDERINL AQHVIVEGSN RTSPEDSSSR PPQSSRQEQG
ACLESSIVII LSIILAVVIK VLELFEVVIT STVEVLKRPP VSSKGHATLF SGASTPKFSA
LSIQDKTPEC LCEVSKGEPQ SSRAVVEENS QAFEQADEIY DDIIISEVIP HSVKITDSNK
TLCSDYSPLT NSNIKNSDQG STSLSVSRTL NNPSSRARAE KEYSGAAITA VSASVWRKHL
CYAYPKLSVP ASENVTTVNG SLLTTIGKTS MEFVIDSRIF NFEVCVIEDL SFDIILGRDF
LQRFCFKVDF ENGLVSFPSE PSPFPFEGLR VDDDDDLIDK AFISSVHASR TFVIPPQSEI
LISGELEDSS NKYGIGGMIV PKPDLSHRDS IFRASEIVSV AEDGTVPVRL VNPSFEPVKI
YRRTRRANFE EVDRNKATSE LNASEKLRES HCSLNSDNQP KECDYSQLPD LSDSILSADD
KIKFRDLFKK YRDVFAFSDA ELGRTLLVQR VIDTDDATPI KQMPYRTSPE EECAVAFDKL
KRVFVSAPVL AYPNFKEPFL QFVDASSTGI RFTLAQVQNG KEVAIAYNGR GLNSAERNYS
TTEREALALI EGIKKFQPYP QNRQFTVVTD HSSLRWLMNV KDAFGRLARW ALLLEQYDFE
IIIALGLSHL LTKEIGPYRI VEQSSPVHFR LRTDTNKKVT FAVHANRLKP FIDPSLRPIE
PPLVDDPSEP YLDESDTPDD NFESELPVDK KVNSRPPVSD TTDSSSQSDN QVRPCSDSSE
KDDDRIFQAD RILKSRKKKG KIEYLVKWHN LPRSQSTWEP EQNILDKRLI DNFNNSRK
//