Josephine Bay Paul Center in Comparative Molecular Biology and Evolution
Permanent URI for this collection
The Josephine Bay Paul Center for Comparative Molecular Biology and Evolution explores the evolution and interaction of genomes of diverse organisms that play significant roles in environmental biology and human health. This dynamic research program integrates the powerful tools of genome science, molecular phylogenetics, and molecular ecology to advance our understanding of how living organisms are related to each other, to provide the tools to quantify and assess biodiversity, and to identify genes and underlying mechanisms of biomedical importance. Projects span all evolutionary time scales, ranging from deep phylogenetic divergence of ancient eukaryotic and prokaryotic lineages, to ecological analyses of how members of diverse communities contribute and respond to environmental change.
Browse
Browsing Josephine Bay Paul Center in Comparative Molecular Biology and Evolution by Issue Date
Results Per Page
Sort Options
-
ArticleDecoupling of genome size and sequence divergence in a symbiotic bacterium(American Society for Microbiology, 2000-07) Wernegreen, Jennifer J. ; Ochman, Howard ; Jones, Isaac B. ; Moran, Nancy A.In contrast to genome size variation in most bacterial taxa, the small genome size of Buchnera sp. was shown to be highly conserved across genetically diverse isolates (630 to 643 kb). This exceptional size conservation may reflect the inability of this obligate mutualist to acquire foreign DNA and reduced selection for genetic novelty within a static intracellular environment.
-
ArticleVertical transmission of biosynthetic plasmids in aphid endosymbionts (Buchnera)(American Society for Microbiology, 2001-01) Wernegreen, Jennifer J. ; Moran, Nancy A.This study tested for horizontal transfer of plasmids among Buchnera aphidicola strains associated with ecologically and phylogenetically related aphid hosts (Uroleucon species). Phylogenetic congruence of Buchnera plasmid (trpEG and leuABC) and chromosomal (dnaN and trpB) genes supports strictly vertical long-term transmission of plasmids, which persist due to their contributions to host nutrition rather than capacity for infectious transfer. Synonymous divergences indicate elevated mutation on plasmids relative to chromosomal genes.
-
ArticleA functional update of the Escherichia coli K-12 genome(BioMed Central, 2001-08-20) Serres, Margrethe H. ; Gopal, Shuba ; Nahum, Laila A. ; Liang, Ping ; Gaasterland, Terry ; Riley, MonicaBackground: Since the genome of Escherichia coli K-12 was initially annotated in 1997, additional functional information based on biological characterization and functions of sequence-similar proteins has become available. On the basis of this new information, an updated version of the annotated chromosome has been generated. Results: The E. coli K-12 chromosome is currently represented by 4,401 genes encoding 116 RNAs and 4,285 proteins. The boundaries of the genes identified in the GenBank Accession U00096 were used. Some protein-coding sequences are compound and encode multimodular proteins. The coding sequences (CDSs) are represented by modules (protein elements of at least 100 amino acids with biological activity and independent evolutionary history). There are 4,616 identified modules in the 4,285 proteins. Of these, 48.9% have been characterized, 29.5% have an imputed function, 2.1% have a phenotype and 19.5% have no function assignment. Only 7% of the modules appear unique to E. coli, and this number is expected to be reduced as more genome data becomes available. The imputed functions were assigned on the basis of manual evaluation of functions predicted by BLAST and DARWIN analyses and by the MAGPIE genome annotation system. Conclusions: Much knowledge has been gained about functions encoded by the E. coli K-12 genome since the 1997 annotation was published. The data presented here should be useful for analysis of E. coli gene products as well as gene products encoded by other genomes.
-
ArticleSmall genome of Candidatus Blochmannia, the bacterial endosymbiont of Camponotus, implies irreversible specialization to an intracellular lifestyle(Society for General Mircobiology, 2002) Wernegreen, Jennifer J. ; Lazarus, Adam B. ; Degnan, Patrick H.Blochmannia (Candidatus Blochmannia gen. nov.) is the primary bacterial endosymbiont of the ant genus Camponotus. Like other obligate endosymbionts of insects, Blochmannia occurs exclusively within eukaryotic cells and has experienced long-term vertical transmission through host lineages. In this study, PFGE was used to estimate the genome size of Blochmannia as approximately 800 kb, which is significantly smaller than its free-living relatives in the enterobacteria. This small genome implies that Blochmannia has deleted most of the genetic machinery of related free-living bacteria. Due to restricted gene exchange in obligate endosymbionts, the substantial gene loss in Blochmannia and other insect mutualists may reflect irreversible specialization to a host cellular environment.
-
ArticleEvidence for lateral transfer of genes encoding ferredoxins, nitroreductases, NADH oxidase, and alcohol dehydrogenase 3 from anaerobic prokaryotes to Giardia lamblia and Entamoeba histolytica(American Society for Microbiology, 2002-04) Nixon, Julie E. J. ; Wang, Amy ; Field, Jessica ; Morrison, Hilary G. ; McArthur, Andrew G. ; Sogin, Mitchell L. ; Loftus, Brendan J. ; Samuelson, JohnGiardia lamblia and Entamoeba histolytica are amitochondriate, microaerophilic protists which use fermentation enzymes like those of bacteria to survive anaerobic conditions within the intestinal lumen. Genes encoding fermentation enzymes and related electron transport peptides (e.g., ferredoxins) in giardia organisms and amebae are hypothesized to be derived from either an ancient anaerobic eukaryote (amitochondriate fossil hypothesis), a mitochondrial endosymbiont (hydrogen hypothesis), or anaerobic bacteria (lateral transfer hypothesis). The goals here were to complete the molecular characterization of giardial and amebic fermentation enzymes and to determine the origins of the genes encoding them, when possible. A putative giardia [2Fe-2S]ferredoxin which had a hypothetical organelle-targeting sequence at its N terminus showed similarity to mitochondrial ferredoxins and the hydrogenosomal ferredoxin of Trichomonas vaginalis (another luminal protist). However, phylogenetic trees were star shaped, with weak bootstrap support, so we were unable to confirm or rule out the endosymbiotic origin of the giardia [2Fe-2S]ferredoxin gene. Putative giardial and amebic 6-kDa ferredoxins, ferredoxin-nitroreductase fusion proteins, and oxygen-insensitive nitroreductases each tentatively supported the lateral transfer hypothesis. Although there were not enough sequences to perform meaningful phylogenetic analyses, the unique common occurrence of these peptides and enzymes in giardia organisms, amebae, and the few anaerobic prokaryotes suggests the possibility of lateral transfer. In contrast, there was more robust phylogenetic evidence for the lateral transfer of G. lamblia genes encoding an NADH oxidase from a gram-positive coccus and a microbial group 3 alcohol dehydrogenase from thermoanaerobic prokaryotes. In further support of lateral transfer, the G. lamblia NADH oxidase and adh3 genes appeared to have an evolutionary history distinct from those of E. histolytica.
-
ArticleMicrobial diversity of hydrothermal sediments in the Guaymas Basin : evidence for anaerobic methanotrophic communities(American Society for Microbiology, 2002-04) Teske, Andreas ; Hinrichs, Kai-Uwe ; Edgcomb, Virginia P. ; de Vera Gomez, Alvin ; Kysela, David ; Sylva, Sean P. ; Sogin, Mitchell L. ; Jannasch, Holger W.Microbial communities in hydrothermally active sediments of the Guaymas Basin (Gulf of California, Mexico) were studied by using 16S rRNA sequencing and carbon isotopic analysis of archaeal and bacterial lipids. The Guaymas sediments harbored uncultured euryarchaeota of two distinct phylogenetic lineages within the anaerobic methane oxidation 1 (ANME-1) group, ANME-1a and ANME-1b, and of the ANME-2c lineage within the Methanosarcinales, both previously assigned to the methanotrophic archaea. The archaeal lipids in the Guaymas Basin sediments included archaeol, diagnostic for nonthermophilic euryarchaeota, and sn-2-hydroxyarchaeol, with the latter compound being particularly abundant in cultured members of the Methanosarcinales. The concentrations of these compounds were among the highest observed so far in studies of methane seep environments. The δ-13C values of these lipids (δ-13C = -89 to -58%) indicate an origin from anaerobic methanotrophic archaea. This molecular-isotopic signature was found not only in samples that yielded predominantly ANME-2 clones but also in samples that yielded exclusively ANME-1 clones. ANME-1 archaea therefore remain strong candidates for mediation of the anaerobic oxidation of methane. Based on 16S rRNA data, the Guaymas sediments harbor phylogenetically diverse bacterial populations, which show considerable overlap with bacterial populations of geothermal habitats and natural or anthropogenic hydrocarbon-rich sites. Consistent with earlier observations, our combined evidence from bacterial phylogeny and molecular-isotopic data indicates an important role of some novel deeply branching bacteria in anaerobic methanotrophy. Anaerobic methane oxidation likely represents a significant and widely occurring process in the trophic ecology of methane-rich hydrothermal vents. This study stresses a high diversity among communities capable of anaerobic oxidation of methane.
-
ArticleBacterial group II introns in a deep-sea hydrothermal vent environment(American Society for Microbiology, 2002-12) Podar, Mircea ; Mullineaux, Lauren S. ; Huang, Hon-Ren ; Perlman, Philip S. ; Sogin, Mitchell L.Group II introns are catalytic RNAs and mobile retrotransposable elements known to be present in the genomes of some nonmarine bacteria and eukaryotic organelles. Here we report the discovery of group II introns in a bacterial mat sample collected from a deep-sea hydrothermal vent near 9°N on the East Pacific Rise. One of the introns was shown to self-splice in vitro. This is the first example of marine bacterial introns from molecular population structure studies of microorganisms that live in the proximity of hydrothermal vents. These types of mobile genetic elements may prove useful in improving our understanding of bacterial genome evolution and may serve as valuable markers in comparative studies of bacterial communities.
-
ArticleGene expression level influences amino acid usage, but not codon usage, in the tsetse fly endosymbiont Wigglesworthia(Society for General Mircobiology, 2003) Herbeck, Joshua T. ; Wall, Dennis P. ; Wernegreen, Jennifer J.Wigglesworthia glossinidia brevipalpis, the obligate bacterial endosymbiont of the tsetse fly Glossina brevipalpis, is characterized by extreme genome reduction and AT nucleotide composition bias. Here, multivariate statistical analyses are used to test the hypothesis that mutational bias and genetic drift shape synonymous codon usage and amino acid usage of Wigglesworthia. The results show that synonymous codon usage patterns vary little across the genome and do not distinguish genes of putative high and low expression levels, thus indicating a lack of translational selection. Extreme AT composition bias across the genome also drives relative amino acid usage, but predicted high-expression genes (ribosomal proteins and chaperonins) use GC-rich amino acids more frequently than do low-expression genes. The levels and configuration of amino acid differences between Wigglesworthia and Escherichia coli were compared to test the hypothesis that the relatively GC-rich amino acid profiles of high-expression genes reflect greater amino acid conservation at these loci. This hypothesis is supported by reduced levels of protein divergence at predicted high-expression Wigglesworthia genes and similar configurations of amino acid changes across expression categories. Combined, the results suggest that codon and amino acid usage in the Wigglesworthia genome reflect a strong AT mutational bias and elevated levels of genetic drift, consistent with expected effects of an endosymbiotic lifestyle and repeated population bottlenecks. However, these impacts of mutation and drift are apparently attenuated by selection on amino acid composition at high-expression genes.
-
ArticleMolecular characterization of sulfate-reducing bacteria in the Guaymas Basin(American Society for Microbiology, 2003-05) Dhillon, Ashita ; Teske, Andreas ; Dillon, Jesse ; Stahl, David A. ; Sogin, Mitchell L.The Guaymas Basin (Gulf of California) is a hydrothermal vent site where thermal alteration of deposited planktonic and terrestrial organic matter forms petroliferous material which supports diverse sulfate-reducing bacteria. We explored the phylogenetic and functional diversity of the sulfate-reducing bacteria by characterizing PCR-amplified dissimilatory sulfite reductase (dsrAB) and 16S rRNA genes from the upper 4 cm of the Guaymas sediment. The dsrAB sequences revealed that there was a major clade closely related to the acetate-oxidizing delta-proteobacterial genus Desulfobacter and a clade of novel, deeply branching dsr sequences related to environmental dsr sequences from marine sediments in Aarhus Bay and Kysing Fjord (Denmark). Other dsr clones were affiliated with gram-positive thermophilic sulfate reducers (genus Desulfotomaculum) and the delta-proteobacterial species Desulforhabdus amnigena and Thermodesulforhabdus norvegica. Phylogenetic analysis of 16S rRNAs from the same environmental samples resulted in identification of four clones affiliated with Desulfobacterium niacini, a member of the acetate-oxidizing, nutritionally versatile genus Desulfobacterium, and one clone related to Desulfobacula toluolica and Desulfotignum balticum. Other bacterial 16S rRNA bacterial phylotypes were represented by non-sulfate reducers and uncultured lineages with unknown physiology, like OP9, OP8, as well as a group with no clear affiliation. In summary, analyses of both 16S rRNA and dsrAB clone libraries resulted in identification of members of the Desulfobacteriales in the Guaymas sediments. In addition, the dsrAB sequencing approach revealed a novel group of sulfate-reducing prokaryotes that could not be identified by 16S rRNA sequencing.
-
ArticleExtending knowledge of Escherichia coli metabolism by modeling and experiment(BioMed Central, 2003-10-28) Voit, Eberhard O. ; Riley, MonicaOne of the challenges for ‘post-genomic’ biology is the integration of data from many different sources. Two recent studies independently take steps towards this goal for Escherichia coli, using mathematical modeling and a combination of gene expression and protein levels to predict new gene functions and metabolic behaviors.
-
ArticleSequence motif within trypanosome precursor tRNAs influences abundance and mitochondrial localization(American Society for Microbiology, 2003-12) Sherrer, R. Lynn ; Yermovsky-Kammerer, Audra E. ; Hajduk, Stephen L.Trypanosoma brucei lacks mitochondrial genes encoding tRNAs and must import nuclearly encoded tRNAs from the cytosol. The mechanism and specificity of this process remain unclear. We have identified a unique sequence motif, YGG(C/A)RRC, upstream of the genes encoding mitochondrially localized tRNAs in T. brucei. Both in vitro import studies and in vivo transfection studies indicate that deletion of the YGG(C/A)RRC sequence alters mitochondrial localization of tRNALeu, and in vivo studies also show a decrease in the cellular abundance of tRNALeu. These studies provide direct evidence for cis-acting RNA motifs within precursor tRNAs that facilitate the selection of tRNAs for mitochondrial import in trypanosomes. Furthermore, we found that mutations to the YGG(C/A)RRC sequence also altered the intracellular distribution of other endogenous tRNAs, suggesting a general role for this sequence in tRNA trafficking in trypanosomes.
-
ArticleEvolution of eukaryotic transcription : insights from the genome of Giardia lamblia(Cold Spring Harbor Laboratory Press, 2004) Best, Aaron A. ; Morrison, Hilary G. ; McArthur, Andrew G. ; Sogin, Mitchell L. ; Olsen, Gary J.The Giardia lamblia genome sequencing project affords us a unique opportunity to conduct comparative analyses of core cellular systems between early and late-diverging eukaryotes on a genome-wide scale. We report a survey to identify canonical transcription components in Giardia, focusing on RNA polymerase (RNAP) subunits and transcription-initiation factors. Our survey revealed that Giardia contains homologs to 21 of the 28 polypeptides comprising eukaryal RNAPI, RNAPII, and RNAPIII; six of the seven RNAP subunits without giardial homologs are polymerase specific. Components of only four of the 12 general transcription initiation factors have giardial homologs. Surprisingly, giardial TATA-binding protein (TBP) is highly divergent with respect to archaeal and higher eukaryotic TBPs, and a giardial homolog of transcription factor IIB was not identified. We conclude that Giardia represents a transition during the evolution of eukaryal transcription systems, exhibiting a relatively complete set of RNAP subunits and a rudimentary basal initiation apparatus for each transcription system. Most class-specific RNAP subunits and basal initiation factors appear to have evolved after the divergence of Giardia from the main eukaryotic line of descent. Consequently, Giardia is predicted to be unique in many aspects of transcription initiation with respect to paradigms derived from studies in crown eukaryotes.
-
ArticleHost-symbiont stability and fast evolutionary rates in an ant-bacterium association : cospeciation of Camponotus species and their endosymbionts, Candidatus Blochmannia(Taylor and Francis, 2004-02) Degnan, Patrick H. ; Lazarus, Adam B. ; Brock, Chad D. ; Wernegreen, Jennifer J.Bacterial endosymbionts are widespread across several insect orders and are involved in interactions ranging from obligate mutualism to reproductive parasitism. Candidatus Blochmannia gen. nov. (Blochmannia) is an obligate bacterial associate of Camponotus and related ant genera (Hymenoptera: Formicidae). The occurrence of Blochmannia in all Camponotus species sampled from field populations and its maternal transmission to host offspring suggest that this bacterium is engaged in a long-term, stable association with its ant hosts. However, evidence for cospeciation in this system is equivocal because previous phylogenetic studies were based on limited gene sampling, lacked statistical analysis of congruence, and have even suggested host switching. We compared phylogenies of host genes (the nuclear EF-1alphaF2 and mitochondrial COI/II) and Blochmannia genes (16S ribosomal DNA [rDNA], groEL, gidA, and rpsB), totaling more than 7 kilobases for each of 16 Camponotus species. Each data set was analyzed using maximum likelihood and Bayesian phylogenetic reconstruction methods. We found minimal conflict among host and symbiont phylogenies, and the few areas of discordance occurred at deep nodes that were poorly supported by individual data sets. Concatenated protein-coding genes produced a very well-resolved tree that, based on the Shimodaira-Hasegawa test, did not conflict with any host or symbiont data set. Correlated rates of synonymous substitution (dS) along corresponding branches of host and symbiont phylogenies further supported the hypothesis of cospeciation. These findings indicate that Blochmannia-Camponotus symbiosis has been evolutionarily stable throughout tens of millions of years. Based on inferred divergence times among the ant hosts, we estimated rates of sequence evolution of Blochmannia to be sim0.0024 substitutions per site per million years (s/s/MY) for the 16S rDNA gene and sim0.1094 s/s/MY at synonymous positions of the genes sampled. These rates are several-fold higher than those for related bacteria Buchnera aphidicola and Escherichia coli. Phylogenetic congruence among Blochmannia genes indicates genome stability that typifies primary endosymbionts of insects.
-
ArticleMicrobial biogeography along an estuarine salinity gradient : combined influences of bacterial growth and residence time(American Society for Microbiology, 2004-03) Crump, Byron C. ; Hopkinson, Charles S. ; Sogin, Mitchell L. ; Hobbie, John E.Shifts in bacterioplankton community composition along the salinity gradient of the Parker River estuary and Plum Island Sound, in northeastern Massachusetts, were related to residence time and bacterial community doubling time in spring, summer, and fall seasons. Bacterial community composition was characterized with denaturing gradient gel electrophoresis (DGGE) of PCR-amplified 16S ribosomal DNA. Average community doubling time was calculated from bacterial production ([14C]leucine incorporation) and bacterial abundance (direct counts). Freshwater and marine populations advected into the estuary represented a large fraction of the bacterioplankton community in all seasons. However, a unique estuarine community formed at intermediate salinities in summer and fall, when average doubling time was much shorter than water residence time, but not in spring, when doubling time was similar to residence time. Sequencing of DNA in DGGE bands demonstrated that most bands represented single phylotypes and that matching bands from different samples represented identical phylotypes. Most river and coastal ocean bacterioplankton were members of common freshwater and marine phylogenetic clusters within the phyla Proteobacteria, Bacteroidetes, and Actinobacteria. Estuarine bacterioplankton also belonged to these phyla but were related to clones and isolates from several different environments, including marine water columns, freshwater sediments, and soil.
-
ArticleEndosymbiosis : lessons in conflict resolution(Public Library of Science (PLoS), 2004-03-16) Wernegreen, Jennifer J.Symbiosis, an interdependent relationship between two species, is an important driver of evolutionary novelty and ecological diversity. Microbial symbionts in particular have been major evolutionary catalysts throughout the 4 billion years of life on earth and have largely shaped the evolution of complex organisms. Endosymbiosis is a specifi c type of symbiosis in which one—typically microbial—partner lives within its host and represents the most intimate contact between interacting organisms. Mitochondria and chloroplasts, for example, result from endosymbiotic events of lasting significance that extended the range of acceptable habitats for life. The wide distribution of intracellular bacteria across diverse hosts and marine and terrestrial habitats testifies to the continued importance of endosymbiosis in evolution. Among multicellular organisms, insects as a group form exceptionally diverse associations with microbial associates, including bacteria that live exclusively within host cells and undergo maternal transmission to offspring. These microbes have piqued the interest of evolutionary biologists because they represent a wide spectrum of evolutionary strategies, ranging from obligate mutualism to reproductive parasitism (Buchner 1965; Ishikawa 2003) (Box 1; Table 1).
-
ArticleLight and electron microscopical observations of the effects of high-density lipoprotein on growth of Plasmodium falciparum in vitro(Cambridge University Press, 2004-05-13) Imrie, H. ; Ferguson, D. J. P. ; Carter, M. ; Drain, J. ; Schiflett, A. ; Hajduk, Stephen L. ; Day, K. P.Human serum high-density lipoprotein (HDL) is necessary and sufficient for the short-term maintenance of Plasmodium falciparum in in vitro culture. However, at high concentrations it is toxic to the parasite. A heat-labile component is apparently responsible for the stage-specific toxicity to parasites within infected erythrocytes 12–42 h after invasion, i.e. during trophozoite maturation. The effects of HDL on parasite metabolism (as determined by nucleic acid synthesis) are evident at about 30 h after invasion. Parasites treated with HDL show gross abnormalities by light and electron microscopy.
-
ArticleComparative sequence analysis of IS50/Tn5 transposase(American Society for Microbiology, 2004-12) Reznikoff, William S. ; Bordenstein, Seth R. ; Apodaca, JenniferComparative sequence analysis of IS50 transposase-related protein sequences in conjunction with known structural, biochemical, and genetic data was used to determine domains and residues that play key roles in IS50 transposase function. BLAST and ClustalW analyses have been used to find and analyze six complete protein sequences that are related to the IS50 transposase. The protein sequence identity of these six homologs ranged from 25 to 55% in comparison to the IS50 transposase. Homologous motifs were found associated with each of the three catalytic residues. Residues that play roles in transposase-DNA binding, protein autoregulation, and DNA hairpin formation were also found to be conserved in addition to other residues of unknown function. On the other hand, some homologous sequences did not appear to be competent to encode the inhibitor regulatory protein. The results were also used to compare the IS50 transposase with the more distantly related transposase encoded by IS10.
-
ArticleMolecular phylogenies of Blastocystis isolates from different hosts : implications for genetic diversity, identification of species, and zoonosis(American Society for Microbiology, 2005-01) Noel, Christophe ; Dufernez, Fabienne ; Gerbod, Delphine ; Edgcomb, Virginia P. ; Delgado-Viscogliosi, Pilar ; Ho, Lip-Chuen ; Singh, Mulkit ; Wintjens, Rene ; Sogin, Mitchell L. ; Capron, Monique ; Pierce, Raymond ; Zenner, Lionel ; Viscogliosi, EricSmall-subunit (SSU) rRNA gene sequences were obtained by PCR from 12 Blastocystis isolates from humans, rats, and reptiles for which elongation factor 1{alpha} (EF-1{alpha}) gene sequences are already available. These new sequences were analyzed by the Bayesian method in a broad phylogeny including, for the first time, all Blastocystis sequences available in the databases. Phylogenetic trees identified seven well-resolved groups plus several discrete lineages that could represent newly defined clades. Comparative analysis of SSU rRNA- and EF-1{alpha}-based trees obtained by maximum-likelihood methods from a restricted sampling (13 isolates) revealed overall agreement between the two phylogenies. In spite of their morphological similarity, sequence divergence among Blastocystis isolates reflected considerable genetic diversity that could be correlated with the existence of potentially ≥12 different species within the genus. Based on this analysis and previous PCR-based genotype classification data, six of these major groups might consist of Blastocystis isolates from both humans and other animal hosts, confirming the low host specificity of Blastocystis. Our results also strongly suggest the existence of numerous zoonotic isolates with frequent animal-to-human and human-to-animal transmissions and of a large potential reservoir in animals for infections in humans.
-
ArticleGene fusions and gene duplications : relevance to genomic annotation and functional analysis(BioMed Central, 2005-03-09) Serres, Margrethe H. ; Riley, MonicaBackground: Escherichia coli a model organism provides information for annotation of other genomes. Our analysis of its genome has shown that proteins encoded by fused genes need special attention. Such composite (multimodular) proteins consist of two or more components (modules) encoding distinct functions. Multimodular proteins have been found to complicate both annotation and generation of sequence similar groups. Previous work overstated the number of multimodular proteins in E. coli. This work corrects the identification of modules by including sequence information from proteins in 50 sequenced microbial genomes. Results: Multimodular E. coli K-12 proteins were identified from sequence similarities between their component modules and non-fused proteins in 50 genomes and from the literature. We found 109 multimodular proteins in E. coli containing either two or three modules. Most modules had standalone sequence relatives in other genomes. The separated modules together with all the single (un-fused) proteins constitute the sum of all unimodular proteins of E. coli. Pairwise sequence relationships among all E. coli unimodular proteins generated 490 sequence similar, paralogous groups. Groups ranged in size from 92 to 2 members and had varying degrees of relatedness among their members. Some E. coli enzyme groups were compared to homologs in other bacterial genomes. Conclusion: The deleterious effects of multimodular proteins on annotation and on the formation of groups of paralogs are emphasized. To improve annotation results, all multimodular proteins in an organism should be detected and when known each function should be connected with its location in the sequence of the protein. When transferring functions by sequence similarity, alignment locations must be noted, particularly when alignments cover only part of the sequences, in order to enable transfer of the correct function. Separating multimodular proteins into module units makes it possible to generate protein groups related by both sequence and function, avoiding mixing of unrelated sequences. Organisms differ in sizes of groups of sequence-related proteins. A sample comparison of orthologs to selected E. coli paralogous groups correlates with known physiological and taxonomic relationships between the organisms.
-
PreprintDiscovery of a novel Wolbachia supergroup in Isoptera( 2005-05-23) Bordenstein, Seth R. ; Rosengaus, Rebeca B.Wolbachia are one of the most abundant groups of bacterial endosymbionts in the biosphere. Interest in these heritable microbes has expanded with the discovery of wider genetic diversity in undersampled host species. Here, we report on the putative discovery of a new genetic lineage, denoted supergroup H, which infects the Isopteran species Zootermopsis angusticollis and Z. nevadensis. Evidence for this novel supergroup is based on portions of new Wolbachia gene sequences from each species spanning 3.5 kilobases of DNA and the following genes: 16S rDNA, dnaA, gltA, groEL, and ftsZ. Single-gene and concatenated maximum likelihood phylogenies establish this new supergroup and validate the positioning of the other Wolbachia supergroups. This discovery is the first example of a termite Wolbachia that is highly divergent from the Isopteran Wolbachia previously described in supergroup F. This study highlights the importance of multilocus approaches to resolving Wolbachia supergroup relationships. It also suggests that surveys of Wolbachia in more earlier-originating (and undersampled) groups of arthropods are more apt to reveal novel genetic diversity.