A functional update of the Escherichia coli K-12 genome
A functional update of the Escherichia coli K-12 genome
Date
2001-08-20
Authors
Serres, Margrethe H.
Gopal, Shuba
Nahum, Laila A.
Liang, Ping
Gaasterland, Terry
Riley, Monica
Gopal, Shuba
Nahum, Laila A.
Liang, Ping
Gaasterland, Terry
Riley, Monica
Linked Authors
Person
Person
Person
Person
Person
Alternative Title
Citable URI
As Published
Date Created
Location
DOI
10.1186/gb-2001-2-9-research0035
Related Materials
Replaces
Replaced By
Keywords
Escherichia coli K-12
Abstract
Background: Since the genome of Escherichia coli K-12 was initially annotated in 1997, additional
functional information based on biological characterization and functions of sequence-similar
proteins has become available. On the basis of this new information, an updated version of the
annotated chromosome has been generated.
Results: The E. coli K-12 chromosome is currently represented by 4,401 genes encoding 116
RNAs and 4,285 proteins. The boundaries of the genes identified in the GenBank Accession
U00096 were used. Some protein-coding sequences are compound and encode multimodular
proteins. The coding sequences (CDSs) are represented by modules (protein elements of at
least 100 amino acids with biological activity and independent evolutionary history). There are
4,616 identified modules in the 4,285 proteins. Of these, 48.9% have been characterized, 29.5%
have an imputed function, 2.1% have a phenotype and 19.5% have no function assignment. Only
7% of the modules appear unique to E. coli, and this number is expected to be reduced as more
genome data becomes available. The imputed functions were assigned on the basis of manual
evaluation of functions predicted by BLAST and DARWIN analyses and by the MAGPIE genome
annotation system.
Conclusions: Much knowledge has been gained about functions encoded by the E. coli K-12 genome
since the 1997 annotation was published. The data presented here should be useful for analysis of
E. coli gene products as well as gene products encoded by other genomes.
Description
Author Posting. © 2001 Serres et al. The definitive version was published in Genome Biology 2 (2001): research0035.1–0035.7, doi:10.1186/gb-2001-2-9-research0035.
Embargo Date
Citation
Genome Biology 2 (2001): research0035.1–0035.7