ID C5KCA4_PERM5 Unreviewed; 1123 AA.
AC C5KCA4;
DT 28-JUL-2009, integrated into UniProtKB/TrEMBL.
DT 28-JUL-2009, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=Reverse transcriptase domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=Pmar_PMAR027633 {ECO:0000313|EMBL:EER17917.1};
OS Perkinsus marinus (strain ATCC 50983 / TXsc).
OC Eukaryota; Sar; Alveolata; Perkinsozoa; Perkinsea; Perkinsida; Perkinsidae;
OC Perkinsus.
OX NCBI_TaxID=423536 {ECO:0000313|Proteomes:UP000007800};
RN [1] {ECO:0000313|EMBL:EER17917.1, ECO:0000313|Proteomes:UP000007800}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50983 / TXsc {ECO:0000313|Proteomes:UP000007800};
RA El-Sayed N., Caler E., Inman J., Amedeo P., Hass B., Wortman J.;
RL Submitted (JUL-2008) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG671975; EER17917.1; -; Genomic_DNA.
DR RefSeq; XP_002786121.1; XM_002786075.1.
DR AlphaFoldDB; C5KCA4; -.
DR EnsemblProtists; EER17917; EER17917; Pmar_PMAR027633.
DR GeneID; 9063178; -.
DR InParanoid; C5KCA4; -.
DR OrthoDB; 2967382at2759; -.
DR Proteomes; UP000007800; Unassembled WGS sequence.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd00590; RRM_SF; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR008042; Retrotrans_Pao.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR000477; RT_dom.
DR PANTHER; PTHR22955; RETROTRANSPOSON; 1.
DR Pfam; PF05380; Peptidase_A17; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SMART; SM00360; RRM; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000007800};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00176}.
FT DOMAIN 358..433
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 726..962
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 46..78
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 310..343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 310..331
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1123 AA; 124765 MW; BCCA719FFF128434 CRC64;
MVSPKASDPV LTDDNKKSST EQSPPVWAWQ LKKELDSLTE SIAGIKSQLE SNADHKTRHD
ANGTTGATGD DSKDGPANTV HGIVEEVKQQ LDSTKSDRSA KSSLVIEPLV IMKISKQAKS
AGVIEGGRFS GVTDKRGYMI FKKTIMNQPE IQFSGTGELS SAYQYFYIIN NVSTPLQQLI
NEQCHVKGNT LFSQRVVTAW RVLRLYSDLG SEMELHRKWE NLKCDGPTTL EVYIAAIQTM
KDQIGEVREC PLTDSELRAR LYSGLPTEAR LYLDRMPLTS VSTMESMISE TRRWAQLRVR
YGASLVGTPT TATSNPTIST TRGASSAKVK SATPSKKDKK KTDKEANVIE FSTSPDAKRV
FLSGVPLAWN YQDVKDFVSR LLGHDTDVYV RLLPRRGFVT VKFNDHKPAQ AFIEASNGAD
IGGKRIRARF DKFESQDATI NPTTTAAPTT EGNAVQVPYD EVFFQGGSQP CEEEEGESPD
CLIAEFHDGS CEQPVPTATV SDSFQDHSAC MVSMVECNEG SSDVQVLEKA SNGLPLIPVW
FEGDEHRYYA LLDSGATDVF ITRRVFNSMK ESIPELSTTT TATSSMLTMI NGTSCAIVDK
VIGVKIHLAS HQVRQCDCYV IEHSKYDVIV PKSLLGPCTW VVSAESHCPD RIIFRPMGIF
DDDSCIPDRW LQDNSLAIQC NHAEVVADRL SVSIPWKSSA RPRYNFKDVW ARDRKVLTRL
KHDPVKYEAY LSAVHELLES GVVVEKPPDQ SPHDFAKFYT AVVPVFNLNR TSTKCRLCLD
ARPLNTYTTT TGDGTGSTSM DLFGTLMLWR SFERASCDDL SKAFWQVLVR GGKVDDEDTD
YLALVVAGHL VKFSRVPFGT NWSPWALGSS LEKIFSSLPA KIRDGVGYYV DDVLIGASSI
SEVEASRISV VQALKRRGFD INEKKRFHNL GTARSSIGHD VTVSDVTGMD GLQMASWLGY
RWFISPNCDN VDIKLPDFRL PREVRMSSLR SITARLFDPL GLVAEASLEI KALLRQLHNA
GFGEVHHGLV KNDAITGDYL LRSQECLQQC EKYFTCSTRM LPPRYIPIEG LVVFVDASEF
AMAVDVRSRA NGRRLLSRIS THPGGSIPRR ELESLRMGVV RIR
//