KEGG Annotation

Genome Annotation (KO Assignment)

The KEGG database contains three main components for genome/metagenome annotation:

the collection of internally annotated gene catalogs for sequenced genomes, both for cellular organisms (called KEGG organisms) and viruses, and additional publication-based protein sequences in the GENES database,
the knowledge base of high-level functions represented as molecular interaction, reaction and relation networks in the PATHWAY, BRITE and MODULE databases, and
the knowledge base of molecular-level functions associated with ortholog groups in the KO database, where most KO entries are defined in a context-dependent manner as nodes of the KEGG molecular networks.

In general, KO entries (identified by K numbers) also represent sequence similarity groups. Thus, the sequence similarity search of a query genome against KEGG GENES is a search for most appropriate K numbers (see KO assignment tools). The assigned set of K numbers can then be used to reconstruct KEGG pathway maps, BRITE hierarchies and KEGG modules, enabling interpretation of high-level functions.

Automatic KO assignment can be done by the following web servers

and the result of KO assignment is linked to KEGG Mapper Reconstruct tool.

Ortholog table

The ortholog table (OT) has existed from the beginning of the KEGG project. For a given set of K numbers it displays the current assignment of genes in KEGG organisms and viruses.

The ortholog table additionally displays positional correlations of genes in the chromosome, such as operon structures, by coloring. The same color means that the genes are adjacent.

Annotation guide

The KEGG Annotation Guide is a collection of HTML tables, called BRITE tables, showing summary views of the current annotation of the GENES database, such as how K numbers are defined and assigned for distinguishing related genes and for comparing different subunit structures.

Comparing subunit structures or gene sets

Distinguishing related genes

01601 Enzymes of 2-oxocarboxylic acid metabolism
01602 Dioxygenases

Signature KOs and modules

Another set of BRITE tables contains signature KOs and/or signature modules, which can be used to infer phenotypic features of organisms.

Metabolic capacity

01620 Photosynthetic and chemosynthetic capacities

Pathogenicity and drug resistance

01600 Antimicrobial resistance: KEGG signatures

Last updated: September 23, 2024

Enter K numbers (Example) K22014 K21512 K26964 K26963 K21511 K26962 K26961 K26965 K26966

KEGG Annotation

KO-based annotation for linking genomes to phenotypes

Genome Annotation (KO Assignment)

Ortholog table

Annotation guide

Signature KOs and modules