Preface
The bacterial genome is the software that contains the information for running the cell. The information resides in the genetic material of the bacterium, metaphorically, in both digital (nucleotide sequence) and analogue (DNA topology) forms. From this information emerges the operational hardware, in the form of RNA and proteins, which compose and regulate the pathways for the building of the cell, for the operation of the processes that maintain cell function and which repair and replicate the cell and its contents. Expressing the information that is contained in the cellular software requires sophisticated molecular machines and involves complex processes. Over many decades, molecular microbiology has revealed the details of these machines and processes and has greatly deepened our understanding of the genome itself. By now, the breadth and depth of the information available about these topics can be intimidating, even to experienced investigators. This book will attempt to organize its most important features in ways that allow the reader to grasp the ‘big picture’.
Specialists who focus on ‘the nucleoid’, ‘the cell cycle’, ‘bacterial metabolism’, ‘gene regulation’, ‘transport’, etc. have often studied these bacterial cellular systems and processes in isolation from one another for the very good reason that each is an enormous subject, capable of occupying a whole scientific career. This compartmentalisation of information is valuable because it helps to organize information in discrete packets under clear headings. It is also consistent with the idea of science as an organized body of knowledge. However, even a cursory reading of the literature under the standard headings will reveal that there is tremendous overlap between distinct cellular systems in terms of their components and their governance architecture.
Recently, there has been a great growth in the quantity of data coming from whole‐genome studies of bacterial cells. Experienced investigators may feel that there is just too much information to absorb and the literature is now too vast to read and assimilate. Students entering the field are at an even greater disadvantage and can be forgiven for being discouraged by the mountain of facts. If the discipline of molecular microbiology is to thrive in the future, it is essential that new entrants to the field, and established investigators, have the ability to navigate the sea of information safely. It is the purpose of books like this one to be islands of meaning in this sea of information. This will be done by highlighting the most important components and processes of the bacterial cell, by providing context for cellular operations and by pointing out connections between the different systems and operations. The objective is to provide the reader with a unified picture of the bacterial genome at the structural and functional levels.
Detailed descriptions of the genome's container (the cell envelope) and of the metabolic processes and components that build and maintain it are not within the scope of this book. It will not be possible to provide comprehensive listings of all of the genes and gene products involved in every process that is included because there are just too many, and to attempt this would simply confirm in the reader's mind the impression that this is all just too complicated. Similarly, organism‐to‐organism comparisons will only be made to illustrate important principles; for the most part the narrative will be concerned with the model bacterium Escherichia coli and its close relatives. To attempt wider coverage of the prokaryotic world would simply create an unwieldy book and would defeat its main purposes.
The material in the book is based on the lectures I have delivered to my Junior and Senior Sophister (penultimate and final year) BA students in the Microbiology Moderatorship at Trinity College Dublin over the past 25 years. Although the Microbiology Moderatorship degree at Trinity College is an undergraduate degree, in terms of international comparisons, its advanced content and the demands it places on students are equivalent to taught MSc degrees elsewhere. The lectures seek to introduce the students to the complexities of global gene regulation in model bacteria and how these relate to the structure of the bacterial nucleoid. They are informed by my research in these fields since 1981 and by a close reading of the development of understanding in those fields over the intervening years. This work has been supported by grants from Science Foundation Ireland since 2003. I am grateful to Steve Busby and Jayaraman Gowrishankar for comments on the draft manuscript; the responsibility for any remaining errors of fact or of omission is mine alone.
Dublin 2019
Charles J. Dorman
1 The Bacterial Genome – Where the Genes Are
1.1 Genome Philosophy
The genome of a bacterium consists of its entire collective of genes, and these can be located in a chromosome (or chromosomes) and on extra‐chromosomal autonomous replicons such as plasmids. Chromosomes and plasmids replicate, copying the genes that they carry, with the replicon copies being segregated into the daughter cells at cell division. This process drives the vertical transmission of genetic information from one generation to the next and its fidelity determines the stability of the genetic information in the genome. If non‐lethal errors occur during the replication of the genome the resulting mutations will be transmitted to the next generation.
The vertical transmission of genetic copying errors is one of the driving forces of evolution in all types of organisms. Bacterial cells are also prone to the evolutionary influence that is horizontal gene transfer (HGT). Here, foreign genes are transmitted to the bacterium, adding to its genetic complement. Evolution through HGT is much less subtle than evolution through the vertical transmission of copying errors which often involves the gradual accumulation of single nucleotide changes to the genome over many generations. Other types of mutation that are transmitted vertically are inversions, insertions, and deletions of genomic DNA. In HGT a bacterium can acquire entire clusters of genes en bloc, resulting in the acquisition of novel capabilities in a single generation. Examples include the arrival of genes that make the bacterium resistant to an antibiotic or to a heavy metal that previously could kill it, or genes that allow the organism to colonise a niche in the environment from which it had previously been excluded.
HGT played a key role in the early research work that led to the mapping of the bacterial genome and to our understanding of the locations of its genes. Among the autonomously replicating plasmids found in bacteria are elements that can promote their own transfer from cell to cell. The fertility, or ‘F’, factor of Escherichia coli was among the first to be studied. F encodes proteins that can build a connecting bridge between the F+ (or male) cell and one that is F− (female). F transfers one of its DNA strands to the F− cell in a process called conjugation. This is the bacterial equivalent of sex. It resembles sex in higher organisms in that the participants are male (F+) and female (F−) but it differs from conventional sex because the process converts the female into an F+ male. The F plasmid has segments of DNA that are identical in sequence, or almost identical, to DNA segments found in the chromosome. Usually these are mobile genetic elements called insertion sequences (IS). The homologous recombination machinery of the cell can recombine the F‐associated region of DNA sequence identity with a chromosomal counterpart, causing F to become fused with that part of the chromosome (Hadley and Deonier 1980). Where this happens is determined by the location of the IS element, and these mobile elements can be found at sites distributed around the chromosome.
Once F has become one with the chromosome, it is replicated as a part of that molecule. It can still engage in conjugation, however. When this happens, the DNA that is transferred to the F− female bacterium consists of chromosomal DNA with F DNA in the vanguard. Strains that can act as DNA donors in these matings are called ‘Hfr’ (high frequency of recombination) (Reeves 1960) and homologous recombination between the incoming, horizontally transferred DNA and the resident chromosome allows the order of the genes on the chromosome to be determined. Experiments, in which chromosomal gene transfer mediated by the F plasmid was monitored as a function of time, allowed a rudimentary genetic map of the E. coli chromosome to be assembled (Bachmann 1983;