|
Entry
|
The KEGG MGENES database contains gene catalogs of metagenomes generated from GenBank and other public resources and annotated with the KEGG Orthology (KO) system. Each entry is identified by the sequential entry id or the accession number given by the original database. This is followed by 'CDS' indicating a protein-coding gene, and the organism identifier (T number) linking to the KEGG MGENOME entry.
|
|
Gene name
|
Gene name(s) where the primary gene name shown first can be used as an alternative identifier (in place of the accession number) to retrieve the entry.
|
|
Definition
|
Description of the gene or its product given by the original database. The annotation given by KEGG is shown in the next "Orthology" field. Note that the original annotaion is not modified even if it contradicts to the KEGG annotation.
|
|
Orthology
|
The KO (KEGG Orthology) system is the basis for cross-species annotation in KEGG, which assigns the KO identifier called the K number representing a functional ortholog that corresponds to a KEGG pathway node or a BRITE hierarchy node. Thus, a set of genes in the metagenome with assigned K numbers can be mapped to KEGG reference pathways and BRITE reference hierarchies to generate organism-specific pathways and hierarchies. In KEGG the EC numbers are attributes of the K numbers, and they are indirectly assigned through the KO system. The KO identifiers in this field have been assigned by KAAS (KEGG Automatic Annotaton System).
|
|
Taxonomy
|
Tentative taxonomy is assigned based on the BLAST hits to KEGG GENES calculated by KAAS. Organisms whose genes have hits to this gene with 50% or more identity are listed in the descending order of their similarity scores. The best bit score is also shown.
|
|
Pathway
|
KEGG pathways to which the gene has been mapped through the KO system, suggesting functional roles of the gene product in the context of molecular networks. By clicking on the link, the rectangular object of this gene product is marked red in the pathway map.
|
|
Other DBs
|
Links to the outside database resources.
|
|
Source
|
Taxonomic and other links to NCBI.
|
|
Position
|
The genomic position of the gene in the scaffold, if available.
|
|
AA seq
|
The number of amino acids and the sequence data. The AA seq link generates the sequence data in the FASTA format. The DB search link can be used for sequence similarity search by BLAST or FASTA against various databases.
|
|
NT seq
|
The number of nucleotides and the sequence data. The NT seq link generates the sequence data of the coding region in the FASTA format.
|