ID C5KZZ7_PERM5 Unreviewed; 1863 AA.
AC C5KZZ7;
DT 28-JUL-2009, integrated into UniProtKB/TrEMBL.
DT 28-JUL-2009, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE SubName: Full=Gag/pol/env polyprotein, putative {ECO:0000313|EMBL:EER09855.1};
GN ORFNames=Pmar_PMAR018497 {ECO:0000313|EMBL:EER09855.1};
OS Perkinsus marinus (strain ATCC 50983 / TXsc).
OC Eukaryota; Sar; Alveolata; Perkinsozoa; Perkinsea; Perkinsida; Perkinsidae;
OC Perkinsus.
OX NCBI_TaxID=423536 {ECO:0000313|Proteomes:UP000007800};
RN [1] {ECO:0000313|EMBL:EER09855.1, ECO:0000313|Proteomes:UP000007800}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50983 / TXsc {ECO:0000313|Proteomes:UP000007800};
RA El-Sayed N., Caler E., Inman J., Amedeo P., Hass B., Wortman J.;
RL Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG677981; EER09855.1; -; Genomic_DNA.
DR RefSeq; XP_002778060.1; XM_002778014.1.
DR EnsemblProtists; EER09855; EER09855; Pmar_PMAR018497.
DR GeneID; 9052794; -.
DR InParanoid; C5KZZ7; -.
DR OrthoDB; 1707090at2759; -.
DR Proteomes; UP000007800; Unassembled WGS sequence.
DR GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR33064; POL PROTEIN; 1.
DR PANTHER; PTHR33064:SF36; RT_RNASEH_2 DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007800}.
FT DOMAIN 721..919
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1435..1616
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 395..425
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1863 AA; 206191 MW; 6A1ACA4BF535BF5D CRC64;
MTNNGSDASN GEEPSVTEFV SPSGSTLICG GAVPGTGLEI STEARALAQS VSEARLAQEV
LTLREELNKI KQQVCQVTKC NGEATLSSLQ TAGNVVANVI RGNGMVGQNS PIYYIYESVN
ISKEISDKYW VASIPQLYEE VCRNEPTPFR ALFKGAKKDT YEDLGRALLE TVTMLEKSWN
FPLDIIVALL LHHVYQYGNG IHKKAAETVV LRLLAAEGDR EVEDILDEIK VQHGHLLRPS
LPVDVDYATL SGAARFKEKS GWERLLSLLT QVLVRIGSSQ TCHLEARYRW EQLSQKSKEW
LVDFLSREDE AWSDMVAAFA FKRVPPPSDY DRVVKVVGAV SKPIRERFSE ALRRNRKAPE
ELLFKDITKE LLSIEAEAYA VLPWEYGSVP KHVSRSKDSA ARSRTHSSPA RVSDHVKEKS
EGRPVRPDRW CTICNKYTRH TAPFCWNNPD CKNAPEWFKN KLEKDTKKSS DDDITNREDA
PTYMVTSLMA EPLAVPAVLA RLRSSAYPDR DVIVAVDTLC ELNLIHEDLI HYCEIVDCPR
DELPTLAGFK SGRVVPRCKV RLSLSFEGKR CFLYALAVDL RSTCVEMDVV LGAQSLRRMG
ADVCLTDDRL QVKDLGIAIP LVRLDRKPQP VCSVPVCSMV EAVNYEELSA IWSEERLTTS
MKERLADIGY VAPVLDFDIP ESYRKTPYQC QAPYNIPAKL VPGVHDKIRK EVLAGHWEEV
APTASMWISP CFAKAKGRLI EEGPAKGEEA VRLLVDLRRL NTMVDIPDYF RDDGLSAAEF
VRQVDQSARY FTTVDVEAAF ESIPAAPRAQ DLMCFAIGGR VYRSKVALQG LGLSPLLWQY
HIRHGLQTLL GESYREYCAI FMDDILIWGD TADQCERRRK VVVAAINALG KRVSSKCGPS
IGASAVCIGL EFSARGIRLS DESIARLKSA LAQTPTSGSH LRRILGSINY ARSAFQFDRL
AGFGETLKVL TPLVNKKPFK ISPEAEDALK RLSESIVNAP LGLHSYKDLI CGEASSWVVT
VDASDLAIGA ALFRYNGPSQ EVSMEDLKDP EKAVLVGALS RALSGDEVKL MIYEKEMLAL
MAAMQKWGKL FIATTRPRSP RESQSEAKIL VLTDNTISLS RWTTYRLPLS PLISAKSRRF
LSWIEEASEW RYLPYTVRHC TGTSNDVADL LSRWHEQLLC QPEEDPRCQG DEAGKDLLPF
CMVIPDLLRP ADDDMLGEQD YPVMAAAPVT SGAPPEGESP PEECYTAPQV AHVEAPLAAL
DIIDLNHNQA AVVEKALAED DSVFQGVRVR EIFAVGRKVD HSGLSPLVVE KVSRWFANGI
FAIRQPFAGS EVGLLFCQGS LRTSDGPKEV MVIPSDCDVD LQPMVPTLLE DGSDRDMRKS
LLMRLHDNLL TAHNSRDRLL GMVLQVAWWP GVSADVKAYR ARCPICCPGR YRQPPGVLPD
CRTRFDTYSV DLKMIPTGLK ERLGLPPSAC VVSAIDLATG EVSFELINDQ SAANVARFLW
HRIIVRHGEF SRLCSDQGSV FVGSVLGAMS NLFGWVVKTS AARNPTGNSR IERAHRAVSQ
TFRWVESLGD ADGPDDLRLY LGSAEAKVNL AANEEGLSPH LAVYGSEPLM PLLKEGVTCE
LDDLNKDIDK RTIADIREAA VWATEALADS RDVKAFYNRA SLAAGSAVDS NTLKLSEHDE
VLYEGRREVV KKLFRAPGSD IPIVAVLESG KKVPIGALAL LLRSEQLTCR AFCELQDDAL
VDKFVAFYLP SQDNLVMVGK VIEARCEKVV IWVYDGGAKT TVFLPLWRSD EATKRSKESP
GRNWMVLTTE VPASCILARV EIHPNGRVTA ASKDRIEALG ISRPSDTLLV TNPSQASGPS
GAL
//