Nahum, Laila A.

EGenBio : a data management system for evolutionary genomics and biodiversity

(BioMed Central, 2006-09-26) Nahum, Laila A. ; Reynolds, Matthew T. ; Wang, Zhengyuan O. ; Faith, Jeremiah J. ; Jonna, Rahul ; Jiang, Zhi J. ; Meyer, Thomas J. ; Pollock, David D.

EGenBio is a system for manipulation and filtering of large numbers of sequences, integrating curated sequence alignments and phylogenetic trees, managing evolutionary analyses, and visualizing their output. EGenBio is organized into three conceptual divisions, Evolution, Genomics, and Biodiversity. The Genomics division includes tools for selecting pre-aligned sequences from different genes and species, and for modifying and filtering these alignments for further analysis. Species searches are handled through queries that can be modified based on a tree-based navigation system and saved. The Biodiversity division contains tools for analyzing individual sequences or sequence alignments, whereas the Evolution division contains tools involving phylogenetic trees. Alignments are annotated with analytical results and modification history using our PRAED format. A miscellaneous Tools section and Help framework are also available. EGenBio was developed around our comparative genomic research and a prototype database of mtDNA genomes. It utilizes MySQL-relational databases and dynamic page generation, and calls numerous custom programs.

A functional update of the Escherichia coli K-12 genome

(BioMed Central, 2001-08-20) Serres, Margrethe H. ; Gopal, Shuba ; Nahum, Laila A. ; Liang, Ping ; Gaasterland, Terry ; Riley, Monica

Background: Since the genome of Escherichia coli K-12 was initially annotated in 1997, additional functional information based on biological characterization and functions of sequence-similar proteins has become available. On the basis of this new information, an updated version of the annotated chromosome has been generated.

Results: The E. coli K-12 chromosome is currently represented by 4,401 genes encoding 116 RNAs and 4,285 proteins. The boundaries of the genes identified in the GenBank Accession U00096 were used. Some protein-coding sequences are compound and encode multimodular proteins. The coding sequences (CDSs) are represented by modules (protein elements of at least 100 amino acids with biological activity and independent evolutionary history). There are 4,616 identified modules in the 4,285 proteins. Of these, 48.9% have been characterized, 29.5% have an imputed function, 2.1% have a phenotype and 19.5% have no function assignment. Only 7% of the modules appear unique to E. coli, and this number is expected to be reduced as more genome data becomes available. The imputed functions were assigned on the basis of manual evaluation of functions predicted by BLAST and DARWIN analyses and by the MAGPIE genome annotation system.

Conclusions: Much knowledge has been gained about functions encoded by the E. coli K-12 genome since the 1997 annotation was published. The data presented here should be useful for analysis of E. coli gene products as well as gene products encoded by other genomes.
