ID A0A087SEG0_AUXPR Unreviewed; 641 AA.
AC A0A087SEG0;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE RecName: Full=mRNA guanylyltransferase {ECO:0000256|ARBA:ARBA00012475};
DE EC=2.7.7.50 {ECO:0000256|ARBA:ARBA00012475};
GN ORFNames=F751_1379 {ECO:0000313|EMBL:KFM24114.1};
OS Auxenochlorella protothecoides (Green microalga) (Chlorella
OS protothecoides).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Auxenochlorella.
OX NCBI_TaxID=3075 {ECO:0000313|EMBL:KFM24114.1, ECO:0000313|Proteomes:UP000028924};
RN [1] {ECO:0000313|EMBL:KFM24114.1, ECO:0000313|Proteomes:UP000028924}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=0710 {ECO:0000313|EMBL:KFM24114.1,
RC ECO:0000313|Proteomes:UP000028924};
RX PubMed=25012212; DOI=10.1186/1471-2164-15-582;
RA Gao C., Wang Y., Shen Y., Yan D., He X., Dai J., Wu Q.;
RT "Oil accumulation mechanisms of the oleaginous microalga Chlorella
RT protothecoides revealed through its genome, transcriptomes, and
RT proteomes.";
RL BMC Genomics 15:582-582(2014).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a 5'-end diphospho-ribonucleoside in mRNA + GTP + H(+) = a 5'-
CC end (5'-triphosphoguanosine)-(ribonucleoside) in mRNA + diphosphate;
CC Xref=Rhea:RHEA:67012, Rhea:RHEA-COMP:17165, Rhea:RHEA-COMP:17166,
CC ChEBI:CHEBI:15378, ChEBI:CHEBI:33019, ChEBI:CHEBI:37565,
CC ChEBI:CHEBI:167616, ChEBI:CHEBI:167617; EC=2.7.7.50;
CC Evidence={ECO:0000256|ARBA:ARBA00024520};
CC PhysiologicalDirection=left-to-right; Xref=Rhea:RHEA:67013;
CC Evidence={ECO:0000256|ARBA:ARBA00024520};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KL662105; KFM24114.1; -; Genomic_DNA.
DR RefSeq; XP_011397000.1; XM_011398698.1.
DR AlphaFoldDB; A0A087SEG0; -.
DR STRING; 3075.A0A087SEG0; -.
DR GeneID; 23612770; -.
DR KEGG; apro:F751_1379; -.
DR eggNOG; KOG2386; Eukaryota.
DR OrthoDB; 49440at2759; -.
DR Proteomes; UP000028924; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:InterPro.
DR GO; GO:0005525; F:GTP binding; IEA:UniProtKB-KW.
DR GO; GO:0140818; F:mRNA 5'-phosphatase activity; IEA:InterPro.
DR GO; GO:0004484; F:mRNA guanylyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0008138; F:protein tyrosine/serine/threonine phosphatase activity; IEA:InterPro.
DR GO; GO:0006370; P:7-methylguanosine mRNA capping; IEA:UniProtKB-KW.
DR GO; GO:0016311; P:dephosphorylation; IEA:InterPro.
DR CDD; cd07895; Adenylation_mRNA_capping; 1.
DR CDD; cd14502; RNA_5'-triphosphatase; 1.
DR Gene3D; 3.30.470.30; DNA ligase/mRNA capping enzyme; 1.
DR Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR Gene3D; 3.90.190.10; Protein tyrosine phosphatase superfamily; 1.
DR InterPro; IPR000340; Dual-sp_phosphatase_cat-dom.
DR InterPro; IPR017074; mRNA_cap_enz_bifunc.
DR InterPro; IPR001339; mRNA_cap_enzyme_adenylation.
DR InterPro; IPR013846; mRNA_cap_enzyme_C.
DR InterPro; IPR012340; NA-bd_OB-fold.
DR InterPro; IPR029021; Prot-tyrosine_phosphatase-like.
DR InterPro; IPR016130; Tyr_Pase_AS.
DR InterPro; IPR000387; Tyr_Pase_dom.
DR PANTHER; PTHR10367; MRNA-CAPPING ENZYME; 1.
DR PANTHER; PTHR10367:SF17; MRNA-CAPPING ENZYME; 1.
DR Pfam; PF00782; DSPc; 1.
DR Pfam; PF03919; mRNA_cap_C; 1.
DR Pfam; PF01331; mRNA_cap_enzyme; 1.
DR PIRSF; PIRSF036958; mRNA_capping_HCE; 1.
DR SUPFAM; SSF52799; (Phosphotyrosine protein) phosphatases II; 1.
DR SUPFAM; SSF56091; DNA ligase/mRNA capping enzyme, catalytic domain; 1.
DR SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR PROSITE; PS00383; TYR_PHOSPHATASE_1; 1.
DR PROSITE; PS50056; TYR_PHOSPHATASE_2; 1.
PE 4: Predicted;
KW GTP-binding {ECO:0000256|ARBA:ARBA00023134, ECO:0000256|PIRSR:PIRSR036958-
KW 3}; mRNA capping {ECO:0000256|ARBA:ARBA00023042};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741,
KW ECO:0000256|PIRSR:PIRSR036958-3};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Reference proteome {ECO:0000313|Proteomes:UP000028924};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 144..172
FT /note="Tyrosine specific protein phosphatases"
FT /evidence="ECO:0000259|PROSITE:PS50056"
FT REGION 1..40
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 235..264
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 15..35
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 164
FT /note="Phosphocysteine intermediate"
FT /evidence="ECO:0000256|PIRSR:PIRSR036958-1"
FT ACT_SITE 324
FT /note="N6-GMP-lysine intermediate"
FT /evidence="ECO:0000256|PIRSR:PIRSR036958-2"
FT BINDING 329
FT /ligand="GTP"
FT /ligand_id="ChEBI:CHEBI:37565"
FT /evidence="ECO:0000256|PIRSR:PIRSR036958-3"
FT BINDING 344
FT /ligand="GTP"
FT /ligand_id="ChEBI:CHEBI:37565"
FT /evidence="ECO:0000256|PIRSR:PIRSR036958-3"
FT BINDING 381..383
FT /ligand="GTP"
FT /ligand_id="ChEBI:CHEBI:37565"
FT /evidence="ECO:0000256|PIRSR:PIRSR036958-3"
FT BINDING 506..508
FT /ligand="GTP"
FT /ligand_id="ChEBI:CHEBI:37565"
FT /evidence="ECO:0000256|PIRSR:PIRSR036958-3"
FT BINDING 584..589
FT /ligand="GTP"
FT /ligand_id="ChEBI:CHEBI:37565"
FT /evidence="ECO:0000256|PIRSR:PIRSR036958-3"
SQ SEQUENCE 641 AA; 72480 MW; 491BB7A57494E047 CRC64;
MKRGLDGPAG PSKLARPEEV EDSAEARRQP DDIADEELGD YGIPSGWKQC PAQGQPISRF
IPSKVPLGAH FDDYIPAAQR YNVDQAQERA DAKGAALQAR CATVIDLTRS TRYYDVRRWQ
ELGVRYIKIP CRGRGQVPQP EAVNDFVWEV ASQRAGQSTV LVHCTHGFNR TGEEGYMIVS
YCVRMHHWTV EKALHAFALS RPPGIYKHYY IRQLYRYYHE AVPAAAVMPA LPDWKAGDAP
DSGDEGGEGE NGAEAPPEHM EHDDVLGEEV SPAEAATVCG SIVEMIAGPA QDHRRAWFPG
SQPVSLDRSN LELLRQRRYW VTWKADGTRY MLVILPSGTY LADRKLATRR VQMRWPLPAK
PQAGVPQKAP VGPPHSLTIL DGEMIVDDDL LSDRKQRRFL AYDLMALNGK SVVGLPWKER
YDMIKRYVMD PRKLERHVIE TKQWSFPYQY GEELFHVRRK EFWPLTQADK LLHDFIPNQV
SHESDGLILQ GYEDAYVPGT CQELLKWKFA HLNSVDFRLR WDPKTSTAGL ELLETRAGGA
RPRGYHLLPG ATIAFPGGED PAALHQRIVE AAYDAEAGSW VYMRERRDKP TPNAYHVYES
VARSIQDNIQ EEELLESIAV ALQLPLYERD RARQAAGVRA A
//