Database: UniProt
Entry: C5KZZ7_PERM5
ID   C5KZZ7_PERM5            Unreviewed;      1863 AA.
AC   C5KZZ7;
DT   28-JUL-2009, integrated into UniProtKB/TrEMBL.
DT   28-JUL-2009, sequence version 1.
DT   27-MAR-2024, entry version 53.
DE   SubName: Full=Gag/pol/env polyprotein, putative {ECO:0000313|EMBL:EER09855.1};
GN   ORFNames=Pmar_PMAR018497 {ECO:0000313|EMBL:EER09855.1};
OS   Perkinsus marinus (strain ATCC 50983 / TXsc).
OC   Eukaryota; Sar; Alveolata; Perkinsozoa; Perkinsea; Perkinsida; Perkinsidae;
OC   Perkinsus.
OX   NCBI_TaxID=423536 {ECO:0000313|Proteomes:UP000007800};
RN   [1] {ECO:0000313|EMBL:EER09855.1, ECO:0000313|Proteomes:UP000007800}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 50983 / TXsc {ECO:0000313|Proteomes:UP000007800};
RA   El-Sayed N., Caler E., Inman J., Amedeo P., Hass B., Wortman J.;
RL   Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; GG677981; EER09855.1; -; Genomic_DNA.
DR   RefSeq; XP_002778060.1; XM_002778014.1.
DR   EnsemblProtists; EER09855; EER09855; Pmar_PMAR018497.
DR   GeneID; 9052794; -.
DR   InParanoid; C5KZZ7; -.
DR   OrthoDB; 1707090at2759; -.
DR   Proteomes; UP000007800; Unassembled WGS sequence.
DR   GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR   Gene3D; 1.10.340.70; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR   Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR001584; Integrase_cat-core.
DR   InterPro; IPR041588; Integrase_H2C2.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR012337; RNaseH-like_sf.
DR   InterPro; IPR036397; RNaseH_sf.
DR   InterPro; IPR000477; RT_dom.
DR   InterPro; IPR041373; RT_RNaseH.
DR   PANTHER; PTHR33064; POL PROTEIN; 1.
DR   PANTHER; PTHR33064:SF36; RT_RNASEH_2 DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   Pfam; PF17921; Integrase_H2C2; 1.
DR   Pfam; PF17917; RT_RNaseH; 1.
DR   Pfam; PF00078; RVT_1; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR   PROSITE; PS50994; INTEGRASE; 1.
DR   PROSITE; PS50878; RT_POL; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000007800}.
FT   DOMAIN          721..919
FT                   /note="Reverse transcriptase"
FT                   /evidence="ECO:0000259|PROSITE:PS50878"
FT   DOMAIN          1435..1616
FT                   /note="Integrase catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50994"
FT   REGION          1..22
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          395..425
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1863 AA;  206191 MW;  6A1ACA4BF535BF5D CRC64;
     MTNNGSDASN GEEPSVTEFV SPSGSTLICG GAVPGTGLEI STEARALAQS VSEARLAQEV
     LTLREELNKI KQQVCQVTKC NGEATLSSLQ TAGNVVANVI RGNGMVGQNS PIYYIYESVN
     ISKEISDKYW VASIPQLYEE VCRNEPTPFR ALFKGAKKDT YEDLGRALLE TVTMLEKSWN
     FPLDIIVALL LHHVYQYGNG IHKKAAETVV LRLLAAEGDR EVEDILDEIK VQHGHLLRPS
     LPVDVDYATL SGAARFKEKS GWERLLSLLT QVLVRIGSSQ TCHLEARYRW EQLSQKSKEW
     LVDFLSREDE AWSDMVAAFA FKRVPPPSDY DRVVKVVGAV SKPIRERFSE ALRRNRKAPE
     ELLFKDITKE LLSIEAEAYA VLPWEYGSVP KHVSRSKDSA ARSRTHSSPA RVSDHVKEKS
     EGRPVRPDRW CTICNKYTRH TAPFCWNNPD CKNAPEWFKN KLEKDTKKSS DDDITNREDA
     PTYMVTSLMA EPLAVPAVLA RLRSSAYPDR DVIVAVDTLC ELNLIHEDLI HYCEIVDCPR
     DELPTLAGFK SGRVVPRCKV RLSLSFEGKR CFLYALAVDL RSTCVEMDVV LGAQSLRRMG
     ADVCLTDDRL QVKDLGIAIP LVRLDRKPQP VCSVPVCSMV EAVNYEELSA IWSEERLTTS
     MKERLADIGY VAPVLDFDIP ESYRKTPYQC QAPYNIPAKL VPGVHDKIRK EVLAGHWEEV
     APTASMWISP CFAKAKGRLI EEGPAKGEEA VRLLVDLRRL NTMVDIPDYF RDDGLSAAEF
     VRQVDQSARY FTTVDVEAAF ESIPAAPRAQ DLMCFAIGGR VYRSKVALQG LGLSPLLWQY
     HIRHGLQTLL GESYREYCAI FMDDILIWGD TADQCERRRK VVVAAINALG KRVSSKCGPS
     IGASAVCIGL EFSARGIRLS DESIARLKSA LAQTPTSGSH LRRILGSINY ARSAFQFDRL
     AGFGETLKVL TPLVNKKPFK ISPEAEDALK RLSESIVNAP LGLHSYKDLI CGEASSWVVT
     VDASDLAIGA ALFRYNGPSQ EVSMEDLKDP EKAVLVGALS RALSGDEVKL MIYEKEMLAL
     MAAMQKWGKL FIATTRPRSP RESQSEAKIL VLTDNTISLS RWTTYRLPLS PLISAKSRRF
     LSWIEEASEW RYLPYTVRHC TGTSNDVADL LSRWHEQLLC QPEEDPRCQG DEAGKDLLPF
     CMVIPDLLRP ADDDMLGEQD YPVMAAAPVT SGAPPEGESP PEECYTAPQV AHVEAPLAAL
     DIIDLNHNQA AVVEKALAED DSVFQGVRVR EIFAVGRKVD HSGLSPLVVE KVSRWFANGI
     FAIRQPFAGS EVGLLFCQGS LRTSDGPKEV MVIPSDCDVD LQPMVPTLLE DGSDRDMRKS
     LLMRLHDNLL TAHNSRDRLL GMVLQVAWWP GVSADVKAYR ARCPICCPGR YRQPPGVLPD
     CRTRFDTYSV DLKMIPTGLK ERLGLPPSAC VVSAIDLATG EVSFELINDQ SAANVARFLW
     HRIIVRHGEF SRLCSDQGSV FVGSVLGAMS NLFGWVVKTS AARNPTGNSR IERAHRAVSQ
     TFRWVESLGD ADGPDDLRLY LGSAEAKVNL AANEEGLSPH LAVYGSEPLM PLLKEGVTCE
     LDDLNKDIDK RTIADIREAA VWATEALADS RDVKAFYNRA SLAAGSAVDS NTLKLSEHDE
     VLYEGRREVV KKLFRAPGSD IPIVAVLESG KKVPIGALAL LLRSEQLTCR AFCELQDDAL
     VDKFVAFYLP SQDNLVMVGK VIEARCEKVV IWVYDGGAKT TVFLPLWRSD EATKRSKESP
     GRNWMVLTTE VPASCILARV EIHPNGRVTA ASKDRIEALG ISRPSDTLLV TNPSQASGPS
     GAL
//