KEGG Taxonomy Files

The KEGG database uses the NCBI taxonomy for classification of cellular organisms and viruses. For cellulcar organisms, the three- or four-letter KEGG organism codes are classified somewhat differently in the following Brite hierarchy files.
The 08601 file is manually created to define the order of organism codes with hsa (Homo sapiens) at the top. The 08610 file is computationally generated using the abbreviated lineage of the NCBI taxonomy keeping the order of organism codes defined in 08601. In addition to organism codes, the 08610 file contains taxonomy IDs that are linked to GENES Addendum (ag) entries. The 08611 file is also computationally generated with fixed levels of species, genus and other organism groups.

For viruses, the taxonomy IDs of GENES Virus (vg) entries are classified according to the NCBI taxonomy, which is based on the ICTV taxonomy, with the Baltimore classification at the top level added by KEGG.
Both of these Brite hierarchy files are computationally generated. In the 08620 file the taxonomy IDs are shown in the full lineage of NCBI virus taxonomy and linked to GENES vg entries.


Taxonomy Mapping

Taxonomy mapping is a powerful method to integrate various biological data, especially for integrating genomic features and organism-level features. The following tool displays the taxonomic distributions of KOs (K numbers) and modules (M numbers), which may be combined with user-defined data such as for phenotypic features. The mapping result is shown in the Brite hierarchy file 08611, which can be viewed with somewhat different tree manipulation functions. The pruning includes non-matching siblings and the degree of classification can be changed by zooming.
Enter K/M nubers

Example:
   M00595 K16952 M00596
Enter user-defined data (KEGG organism codes and attributes)

Or upload file:


Last updated: August 1, 2021