KEGG icon

KEGG MODULE Database

Functional units for annotating and interpreting genomes

KEGG2 PATHWAY BRITE MODULE DISEASE DRUG KO GENOME GENES LIGAND DBGET

KEGG Modules

KEGG MODULE is a collection of manually defined functional units, termed KEGG modules, used for annotation and biological interpretation of sequenced genomes. Each module is identified by the M number that corresponds to a set of K numbers (KO identifiers). There are four types of KEGG modules:
  • pathway modules – representing tight functional units in KEGG metabolic pathway maps, such as M00002 (Glycolysis, core module involving three-carbon compounds)
  • structural complexes – often forming molecular machineries, such as M00072 (Oligosaccharyltransferase)
  • functinal sets – for other types of essential sets, such as M00360 (Aminoacyl-tRNA synthases, prokaryotes)
  • signature modules – as markers of phenotypes, such as M00363 (EHEC pathogenicity signature, Shiga toxin)
The entire list of KEGG modules can be viewed from the BRITE hierarchy file:
KEGG modules
Each M number entry is defined by the Boolean-like expression for the combination of K numbers. Notations for this definition include:
  • space delimited items for pathway elements
  • comma separated items in parentheses for alternatives
  • plus sign to define a complex
  • minus sign for an optional item

Phenotype Inference

A single M number or a combination of M numbers can be used for characterizing phenotypic features encoded in the genome. For example, the BTX (benzen, toluene, and xylene) degradation capacity can be seen from the following diagram where M numbers are linked to the ortholog tables indicating in which KEGG organisms modules are complete.
benzene M00548 catechol
toluene M00538 benzoate M00551 catechol M00569 meta-cleavage
M00568 ortho-cleavage
xylene M00537 methyl-
benzoate
M00551 methyl-
catechol
M00569 meta-cleavage
M00568 ortho-cleavage
It is interesting to note that KEGG pathway modules often correspond to reaction modules extracted from purely chemical properties as summarized in the following BRITE hierarchy file.
KEGG reaction modules
The above example can be rewritten in terms of the reaction modules.
benzene RM006 catechol
toluene RM003 benzoate RM005 catechol RM009 meta-cleavage
RM008 ortho-cleavage
xylene RM003 methyl-
benzoate
RM005 methyl-
catechol
RM009 meta-cleavage
RM008 ortho-cleavage

Module Maps

KEGG modules are associated with graphical diagrams called module maps. For example, M00002 represents glycolysis core module involving three-carbon compounds and its organism specific module can be selected from the pop-up menu or directly specified in the form of hsa_M00002. While KEGG pathway maps are all manually drawn, KEGG module maps are computationally generated from the text definition describe above. There is also a difference of how organism-specific versions are generated. Organism specific modules are generated only when consituent components are present in the genome.

Module Mapping

The KEGG Mapper interface allows KEGG MODULE mapping in a similar way to KEGG PATHWAY mapping.

Ortholog Table

The ortholog table is a useful tool to check completeness and consistency of genome annotations. It shows currently annotated genes in individual genomes for a given set of K numbers, together with coloring of adjacent genes (operon-like structures) on the chromosome. Each KEGG module contains a link to the corresponding ortholog table.

Taxonomy Mapping

Each KEGG module also contains a link to KEGG taxonomy mapping, which indicates the presence or absence of the module in all KEGG organisms under a taxonomic classification.



Last updated: April 1, 2013
KEGG GenomeNet Kanehisa Laboratories