A version of this chapter is currently under review. A preprint is released as Chure, G, Kaczmarek, Z. A., and Phillips, R. Physiological adaptability and parametric versatility in a simple genetic circuit. bioRxiv 2019. DOI: 10.1101/2019.12.19.878462. G.C. and R.P. designed experiments and developed theoretical models. G.C. and Z.A.K. collected and analyzed data. G.C. and R.P. wrote the paper.
Abstract
The intimate relationship between the environment and cellular growth rate has remained a major topic of inquiry in bacterial physiology for over a century. Now, as it becomes possible to understand how the growth rate dictates the wholesale reorganization of the intracellular molecular composition, we can interrogate the biophysical principles underlying this adaptive response. Regulation of gene expression drives this adaptation, with changes in growth rate tied to the activation or repression of genes covering enormous swaths of the genome. Here, we dissect how physiological perturbations alter the expression of a circuit which has been extensively characterized in a single physiological state. Given a complete thermodynamic model, we map changes in physiology directly to the biophysical parameters which define the expression. Controlling the growth rate via modulating the available carbon source or growth temperature, we measure the level of gene expression from a LacI-regulated promoter where the LacI copy number is directly measured in each condition, permitting parameter-free prediction of the expression level. The transcriptional output of this circuit is remarkably robust, with expression of the repressor being largely insensitive to the growth rate. The predicted gene expression quantitatively captures the observations under different carbon conditions, indicating that the biophysical parameters are indifferent to the physiology. Interestingly, temperature controls the expression level in ways that are inconsistent with the prediction, revealing temperature-dependent effects that challenge current models. This work exposes the strengths and weaknesses of thermodynamic models in fluctuating environments, posing novel challenges and utility in studying physiological adaptation.
Introduction
Cellular physiology is inextricably tied to the extracellular environment. Fluctuations in nutrient availability and variations in temperature, for example, can drastically modulate the cell’s growth rate, which is often used as a measure of the evolutionary fitness (Schaechter, Maaløe, and Kjeldgaard 1958). In response to such environmental insults, cells have evolved myriad clever mechanisms by which they can adapt to their changing surroundings, many of which involve restructuring their proteome such that critical processes (i.e. protein translation) are allocated the necessary resources. Recent work exploring this level of adaptation using mass spectrometry, ribosomal profiling, and RNA sequencing have revealed that various classes of genes (termed “sectors”) are tuned such that the protein mass fraction of the translational machinery is prioritized over the metabolic and catabolic machinery in nutrient replete environments (Scott et al. 2014; Klumpp and Hwa 2014; Hui et al. 2015; Schmidt et al. 2016; Li et al. 2014). This drastic reorganization is mediated by the regulation of gene expression, relying on the concerted action of myriad transcription factors. Notably, each gene in isolation is regulated by only one or a few components (Gama-Castro et al. 2016). The most common regulatory architecture in Escherichia coli is the simple repression motif in which a transcriptional repressor binds to a single site in the promoter region, occluding binding of an RNA polymerase (Rydenfelt et al. 2014; Phillips et al. 2019). The simple activation architecture, in which the simultaneous binding of an activator and an RNA polymerase amplifies gene expression, is another common mode of regulation. Combinatorial regulation such as dual repression, dual activation, or combined activation and repression can also be found throughout the genome, albeit with lower frequency (Phillips et al. 2019). The ubiquity of the simple repression and simple activation motifs illustrate that, for many genes, the complex systems-level response to a physiological perturbation boils down the binding and unbinding of a single regulator to its cognate binding sites.
Despite our knowledge of these modes of regulation, there remains a large disconnect between concrete, physical models of their behavior and experimental validation. The simple repression motif is perhaps the most thoroughly explored theoretically and experimentally (Phillips et al. 2019) where equilibrium thermodynamic (Garcia and Phillips 2011; Garcia et al. 2012; Brewster et al. 2014; Razo-Mejia et al. 2018; Barnes et al. 2019) and kinetic (Jones, Brewster, and Phillips 2014; Ko 1991; Kepler and Elston 2001; Michel 2010) models have been shown to accurately predict the level of gene expression in a variety of contexts. While these experiments involved variations of repressor copy number, operator sequence, concentration of an external inducer, and amino acid substitutions, none have explored how the physiological state of the cell as governed by external factors influences gene expression. This is arguably one of the most critical variables one can experimentally tune to understand the roles these regulatory architectures play in cellular physiology writ large.
In this work, we interrogate the adaptability of a simple genetic circuit to various physiological stressors, namely carbon source quality and growth temperature. Following the aforementioned thermodynamic models, we build upon this theory-experiment dialogue by using environmental conditions as an experimentally tunable variable and determine their influence on various biophysical parameters. Specifically, we use physiological stressors to tune the growth rate. One mechanism by which we modulate the growth rate is by exchanging glucose in the growth medium for the poorer carbon sources glycerol and acetate, which decrease the growth rate by a factor of ≈ 1.5 and ≈ 4 compared to glucose, respectively. We hypothesize that different carbon sources should, if anything, only modulate the repressor copy number seeing as the relationship between growth rate and total protein content has been rigorously quantified (Schaechter, Maaløe, and Kjeldgaard 1958; Schmidt et al. 2016; Li et al. 2014; Jun et al. 2018). Using single-cell time-lapse fluorescence microscopy, we directly measure the copy number of the repressor in each condition. Under a simple hypothesis, all other parameters should be unperturbed, and we can thus rely on previously determined values to make parameter-free predictions of the fold-change in gene expression.
Despite the decrease in growth rate, both the fold-change in gene expression and the repressor copy number remains largely unaffected. We confirm this is the case by examining how the effective free energy of the system changes between carbon sources, a method we have used previously to elucidate parametric changes due to mutations within a transcription factor (Chure et al. 2019) and has been extensively discussed in Chapter 3. This illustrates that the energetic parameters defining the fraction of active repressors and their affinity for the DNA are ignorant of the carbon-dependent physiological states of the cell. Thus, in this context, the values of the biophysical parameters determined in one condition can be used to draw predictions in others.
We then examine how variations in temperature influence the transcriptional output. Unlike in the case of carbon source variation, temperature dependence is explicit in our model: the repressor-DNA binding energy and the energetic difference between the active and inactive states of the repressor are scaled to the thermal energy of the system at 37∘ C. This is defined via the Boltzmann distribution which states that the probability of a state pstate is related to the energy of that state εstate as
pstate ∝ e − εstate/kBT, (1)
where kB is the Boltzmann constant and T is the temperature of the system. Given knowledge of T for a particular experiment, we can easily draw predictions of the fold-change in gene expression. However, we find that the fold-change in gene expression is inconsistent with this simple model, revealing an incomplete description of the energetics. We then examine how entropic effects neglected in the initial estimation of the energetic parameters may play an important role; a hypothesis that is supported when we examine the change in the effective free energy.
The results presented here are, to our knowledge, the first attempts to systematically characterize the growth-dependent effects on biophysical parameters in thermodynamic models of transcription. While some parameters of our model are affected by changing the growth rate, they change in ways that are expected or fall close within our a priori predictions, suggesting that such modeling can still be powerful in understanding how adaptive processes influence physiology at the level of molecular interactions.
Results
Thermodynamic model
This chapter builds off the theoretical details presented in Chapters 2-3 of this thesis. Here, we once again consider the simple repression motif in which expression of a target gene is regulated by the action of a single allosteric repressor. The key measurable quantity of the following work is the fold-change in expression of the regulated gene. Thermodynamic models described previously (Garcia and Phillips 2011; Razo-Mejia et al. 2018; Phillips et al. 2019) and in Chapters 2-3 of this work result in a succinct input-output function to quantitatively describe the fold-change in gene expression and is of the form
$$
\text{fold-change} = \left(1 + p_{act}(c)\frac{R}{N_{NS}}e^{-\Delta\varepsilon_{RA}
/ k_BT}\right)^{-1},
\qquad(2)$$
where R is the total number of allosteric repressors per cell, NNS is the number of nonspecific binding sites for the repressor, ΔεRA is the repressor-DNA binding energy, and kBT is the thermal energy of the system. The prefactor pact(c) defines the probability of the repressor being in the active state at a given concentration of inducer c. In the absence of inducer, pact(c = 0) can be written as
pact(c = 0) = (1+e − ΔεAI/kBT) − 1, (3)
where ΔεAI is the energy difference between the active and inactive states. Conditioned on only a handful of experimentally accessible parameters, this model has been verified using the well-characterized LacI repressor of Escherichia coli where parameters such as the repressor copy number and DNA binding affinity (Garcia and Phillips 2011), copy number of the regulated promoter (Brewster et al. 2014), and the concentration of an extracellular inducer (Razo-Mejia et al. 2018, Chapter 2) can be tuned over orders of magnitude. Chapter 3 and the associated publication (Chure et al. 2019) illustrated that this model permits the mapping of mutations within the repressor protein directly to biophysical parameters in a manner that permits accurate prediction of double mutant phenotypes. All of these applications, however, have been performed in a single physiological state where cells are grown in a glucose-supplemented minimal medium held at 37∘ C with aeration. In this work, we challenge this model by changing the environmental conditions away from this gold-standard condition, perturbing the physiological state of the cell.
Experimental Setup
Seminal studies from the burgeoning years of bacterial physiology have demonstrated a strong dependence of the total cellular protein content on the growth rate (Schaechter, Maaløe, and Kjeldgaard 1958), a relationship which has been rigorously quantified in recent years using mass spectrometry (Schmidt et al. 2016) and ribosomal profiling (Li et al. 2014). Their combined results illustrate that modulation of the growth rate, either through controlling the available carbon source or the temperature of the growth medium, significantly alters the physiological state of the cell, triggering the reallocation of resources to prioritize expression of ribosome-associated genes. Eq. 2 has no explicit dependence on the available carbon source, but does depend on the temperature through the energetic parameters ΔεR and ΔεAI which are defined relative to the thermal energy, kBT. With this parametric knowledge, we are able to draw quantitative predictions of the fold-change in gene expression in these physiologically distinct states.
We modulated growth of Escherichia coli by varying either the quality of the available carbon source (differing ATP yield per C atom) or the temperature of the growth medium (Fig. 1 (A)). All experiments were performed in a defined M9 minimal medium supplemented with one of three carbon sources – glucose, glycerol, or acetate – at concentrations such that the total number of carbon atoms available to the culture remained the same. These carbon sources have been shown to drastically alter growth rate and gene expression profiles, indicating changes in the proteomic composition and distinct physiological states. These carbon sources yield an approximate four-fold modulation of the growth rate with doubling times ranging from ≈ 220 minutes to ≈ 65 minutes in an acetate or glucose supplemented medium, respectively Fig. 1 (B) and (C). While the growth temperature was varied over 10∘ C, both 32∘ and 42∘ C result in approximately the same doubling time of ≈ 90 min, which is 1.5 times slower than the optimal temperature of 37∘ C (Fig. 1 (B) and (C)).
The growth rate dependence of the proteome composition suggests that changing physiological conditions could change the total repressor copy number of the cell. As emphasized in Chapter 2, it can be difficult to differentiate between a change in repressor copy number R and the allosteric energy difference ΔεAI as there are many combinations of parameter values that yield the same fold-change. To combat this degeneracy, we used a genetically engineered strain of E. coli in which the expression of the repressor copy number and its regulated gene product (YFP) can be simultaneously measured. This strain, used previously to interrogate the transcription factor titration effect (Brewster et al. 2014), is diagrammed in Fig. 2 (A). A dimeric form of the LacI repressor N-terminally tagged with an mCherry fluorophore is itself regulated through the action of the TetR repressor whose level of activity can be modulated through the addition of the allosteric effector anhydrous tetracycline (ATC). This dual repression genetic circuit allows for the expression of the LacI repressor to be tuned over several orders of magnitude. This is demonstrated in Fig. 2 (B) where a titration of ATC in the growth medium results in a steady increase in the expression of the LacI-mCherry gene product (red lines and points) which in turn represses expression of the YFP reporter (yellow lines and points).
While the mCherry fluorescence is proportional to the repressor copy number, it is not a direct measurement as the fluorescence of a single LacI-mCherry dimer is unknown a priori. Using video microscopy, we measure the partitioning statistics of the fluorescence intensity into two sibling cells after division (Fig. 2 (C)). This method, described in detail in the Materials & Methods as well as in Rosenfeld, Elowitz, and Alon (2002), Rosenfeld et al. (2005), and Brewster et al. (2014), reveals a linear relationship between the variance in intensity between two sibling cells and the intensity of the parent cell, the slope of which is equal to the brightness of a single LacI repressor. Since this measurement is performed simultaneously with measurement of the expression of the YFP reporter, this calibration factor was determined for each unique experimental replicate. We direct the reader to the supplemental Chapter 8 for a more thorough discussion of this inference.
Scaling of Gene Expression With Growth Rate
Given the single-cell resolution of our experimental method, we examined how the cell volume and repressor copy number scaled across the different growth conditions at different levels of ATC induction. In agreement with the literature (Schaechter, Maaløe, and Kjeldgaard 1958; Jun et al. 2018; Shehata and Marr 1975), our measurement reveals a strong linear dependence of the cell volume on the choice of carbon source, but no significant dependence on temperature (Fig. 3 (A) and (B)). Additionally, these findings are consistent across different ATC induction regimes. Together, these observations confirm that the particular details of our experimental system does not introduce unintended physiological consequences.
Using a fluorescence calibration factor determined for each experimental replicate (see Fig. 2 (C) and the Materials & Methods), we estimated the number of repressors per cell from snapshots of the mCherry signal intensity of each induction condition. Fig. 3 (C) reveals a remarkable insensitivity of the repressor copy number on the growth rate under different carbon sources. Despite the change in cellular volume, the mean number of repressors expressed at a given induction condition is within error between all carbon sources. Previous work using mass spectrometry, a higher resolution method, has shown that there is a slight dependence of LacI copy number on growth rate expressed from its native promoter (Schmidt et al. 2016). It is possible that such a dependence exists in our experimental setup, but is not detectable with our lower resolution method. We also observe an insensitivity of copy number to growth rate when the temperature of the system is tuned (Fig. 3 (D)), though two aberrant points with large error obfuscate the presence of a growth rate dependence at high concentrations of ATC. For concentrations below 7 ng /mL, however, the repressor copy number remains constant across conditions. With no significant change in the repressor copy number and thus no dependence on the carbon source in our theoretical model, we can immediately draw predictions of the fold-change in gene expression in different growth media.
Fold-change Dependence on Temperature Variation
Unlike the identity of the carbon source, the temperature of the system is explicitly stated in Eq. 2 and Eq. 3 where ΔεRA and ΔεAI are defined relative to the thermal energy of the system in which they were determined. This scaling is mathematically quantified as kBT dividing the exponentiated terms in Eq. 2 and Eq. 3. As all biophysical parameters were determined at a reference temperature of 37∘ C, any change in the growth temperature must be included as a correction factor. The simplest approach is to rescale the energy by the relative change in temperature. This is a simple multiplicative factor of ϕT = kBTref/kBTexp where Tref is the reference temperature of 37∘ C and Texp is the experimental temperature. This is an intuitive result since an increase in temperature relative to the reference results in ϕT < 1, weakening the binding. Similarly, decreasing the temperature scales ϕT > 1, strengthening the binding relative to that of the reference temperature.
Fig. 4 (A) shows the measured fold-change in gene expression (points) plotted against the theoretical prediction with this correction factor (orange line). It is immediately evident that a simple rescaling of the energetic parameters is not sufficient for the 32∘ C condition and slightly underestimates the fold-change in the 42∘ C condition. To identify the source of this disagreement, we can again examine the free energy shift ΔF. As both ΔεAI and ΔεR are scaled to the thermal energy, ΔF defined as FT − Fref can be directly calculated as
$$
\Delta F = k_BT_{exp} \left[ -\log{1 + e^{-{\Delta\varepsilon_{AI} \over
k_BT_{ref}}} \over 1 + e^{-{\phi_T\Delta\varepsilon_{AI} \over
k_BT_{exp}}}}- \log{R_{T} \over R_{ref}}\right] +
\Delta\varepsilon_{R}\left(1 -\phi_T\right).
\qquad(4)$$
This prediction along with the empirically determined ΔF is shown in Fig. 4 (B). Again, we see that this simple correction factor significantly undershoots or overshoots the observed ΔF for 32∘ C and 42∘ C, respectively, indicating that there are temperature dependent effects that are not accounted for in the simplest null model of temperature dependence.
The model described by Eq. 2 and Eq. 3 subsumes the myriad rich dynamical processes underlying protein binding and conformational changes into two effective energies, ΔεR and ΔεAI. By no means is this done to undercut the importance of these details in transcriptional regulation. Rather, it reduces the degrees of freedom in this objectively complex system to the set of the details critical to particular conditions in which we want to draw predictions. All prior dissections of this thermodynamic model have been performed at a single temperature, abrogating the need to consider temperature dependent effects. As we now vary temperature, we must consider details that are swept into the effective energies.
The model presented here only considers entropy by enumerating the multiplicity of states in which the repressor can bind to the DNA nonspecifically, resulting in terms of the form R/NNS. However, there are many other temperature-dependent entropic contributions to the effective energies such as the fraction of repressors bound to DNA versus in solution (Elf, Li, and Xie 2007; Kao-Huang et al. 1977), the vibrational entropy of the repressor (Goethe, Fita, and Rubi 2015), or conformational entropy of the genome (Driessen et al. 2014; Mondal et al. 2011). We can consider the effective energies ΔεR and ΔεAI as having generic temperature dependent-entropic components ΔSR and ΔSAI,
ΔεR = ΔHR − TΔSR, (5)
and
ΔεAI = ΔHAI − TΔSAI, (6)
where ΔHR and ΔHAI is the enthalpic contribution to the energies ΔεR and ΔεAI, respectively. Given the fold-change measurements at 32∘ C and 42∘ C, we estimated the entropic parameters ΔSR and ΔSAI under the constraints that at 37∘ C, ΔεR = − 13.9 kBT and ΔεAI = 4.5 kBT (see supplemental Chapter 8 for more discussion on this parameter estimation). The grey shaded lines in Fig. 4 show the result of this fit where the width represents the 95% credible region of the prediction given the estimated values of ΔSR and ΔSAI. Including this phenomenological treatment of the entropy improves the prediction of the fold-change in gene expression (Fig. 4 (A)) as well as shift in free energy (Fig. 4 (B)). This phenomenological description suggests that even small shifts in temperature can drastically alter the expression of a genetic circuit simply by tuning hidden entropic effects rather than scaling the difference in affinity between specific and nonspecific binding.
Discussion
The past century of work in bacterial physiology has revealed a rich molecular complexity that drives cellular growth rate (Jun et al. 2018). A key finding of this body of work is that the composition of the proteome is highly dependent on the specific growth condition, with entire classes of genes being up- or down-regulated to ensure that enough resources are allocated towards maintaining a pool of active ribosomes (Hui et al. 2015; Scott et al. 2014). These studies have led to a coarse-grained view of global gene expression where physiological perturbations substantially change the molecular composition of the cell, obfuscating the utility of using thermodynamic models of individual regulatory elements across physiological states. In this work, we rigorously examine how robust the values of the various biophysical parameters are to changes in cellular physiology.
We first examined how nutrient fluctuations dictate the output of this architecture. We took three carbon sources with distinct metabolic pathways and varying quality and measured the level of gene expression, hypothesizing that the values of the biophysical parameters to be independent of the growth medium. We found that even when the growth rate is varied across a wide range (220 minute doubling time in acetate to 60 minute doubling time in glucose supported medium), there is no significant change to the fold-change in gene expression or in the expression of the transcription factor itself, within the resolution of our experiments. Given numerous quantitative studies of the proteomic composition reveal a dependence on protein content with growth rate (Hui et al. 2015; Schmidt et al. 2016; Li et al. 2014), we find this robustness to be striking.
Schmidt et al. (2016) found that the native expression of LacI has a weak positive correlation with the growth rate. The native LacI promoter region is solely regulated by activation via the cAMP Receptor Protein (CRP), a broadly acting dual regulator in E. coli (Gama-Castro et al. 2016). This is in contrast to the LacI expression system used in the present work where the promoter is negatively regulated by the TetR repressor, itself expressed from a low-copy number plasmid. Furthermore, the expression of LacI in this work is tuned by the addition of the allosteric effector of TetR, ATC, adding yet another layer of allosteric regulation on LacI expression. The significant difference in the regulatory mechanisms between the native and synthetic circuit used in this work makes the two findings difficult to directly compare. Regardless, our finding that the fold-change in gene expression is unaltered from one carbon source to another illustrates that the values of the biophysical parameters ΔεR and ΔεAI remain unperturbed, permitting quantitative prediction of gene expression across numerous physiological states.
However, in varying the temperature, we find that the predictive utility of the biophysical parameter values determined at 37∘ C is diminished, indicating that there are hidden effects not explicitly accounted for in our thermodynamic model. The measurements of the fold-change in gene expression are under- or over-estimated when the temperature is increased or decreased, respectively, when one simply rescales the energetic terms by the relative change in temperature. There are many features of transcriptional regulation that are not explicitly considered in our coarse-graining of the architecture into a two-state model. Recently, it has been suggested that the phenomenon of allostery writ large should be framed in the language of an ensemble of states rather than a simple active/inactive distinction (Motlagh et al. 2014). While our recent work illustrates that a two-state rendering of an allosteric repressor is highly predictive in a variety of situations (Razo-Mejia et al. 2018; Chure et al. 2019), we must now consider details which are dependent on the temperature of the system. In Fig. 4, we demonstrate that incorporating a temperature-dependent entropic cost to the energetic terms significantly improves the description of the experimental data. This is not to say, however, that this is now an open-and-closed case for what precisely defines this entropic cost. Rather, we conclude that the phenomenology of the temperature dependence can be better described by the inclusion of a correction factor that is linearly dependent on the system temperature. Biology is replete with phenomena which can introduce such an effect, including changes to the material properties of the repressor and DNA (Goethe, Fita, and Rubi 2015; Mondal et al. 2011), excluded volume effects (Driessen et al. 2014), and solubilities (Elf, Li, and Xie 2007; Kao-Huang et al. 1977; Yakovchuk, Protozanova, and Frank-Kamenetskii 2006). Understanding the mechanistic underpinnings of temperature dependence in elasticity theory was borne out of similar phenomenological characterization (Friedel 1974) and required a significant dialogue between theory and experiment (Phillips 2001). Further work is now needed to develop a theory of temperature effects in the regulation of gene expression.
The effective free energy F, as defined in Eq. ¿eq:growth_bohr_parameter?, is a state variable of the simple repression regulatory architecture. This is illustrated in Fig. 5 where fold-change measurements from a wide array of conditions (and measurement techniques) can be collapsed onto the same theoretical description. Evolutionary perturbations (such as mutations in the operator or repressor sequence), physiological changes (such as modulations of the growth rate), or changes in the level of activity of the repressor (due to changes in inducer concentration) do not change the fundamental physics of the system and can all be described by changes in the free energy relative to one another. While such a statement is not “surprising,” we can now say it with quantitative confidence and use this principle to probe the degree to which physiological perturbations influence the biophysical parameters writ large. With such a framework in hand, we are in the auspicious position to take a predictive approach towards understanding how this regulatory architecture evolves in experimental settings, shedding light on the interplay between biophysical parameters, organismal fitness, and the fundamental forces of evolution.
Materials & Methods
Bacterial Strains and Growth Media
Three genotypes were primarily used in this work, all in the genetic background of Escherichia coli MG1655-K12 and all derived from those used in Brewster et al. (2014). For each experiment, an autofluorescence control was used which contained no fluorescent reporters (except for a CFP volume marker used for segmentation in Brewster et al. 2014) which had the lacI and lacZYA genes deleted from the chromosome. The constitutive expression strain (ΔlacI; ΔlacZYA) included a YFP reporter gene integrated into the galK locus of the chromosome along with a kanamycin resistance cassette. The experimental strains in which LacI expression was controlled contained a lacI-mCherry fluorescent fusion integrated into the ybcN locus of the chromosome along with a chloramphenicol resistance cassette. This cassette was later deleted from the chromosome using FLP/FRT recombination (Schlake and Bode 1994; Zhu and Sadowski 1995). The strain was then transformed with plasmid (pZS3-pN25-tetR following notation described in Lutz and Bujard 1997) constitutively expressing the TetR repressor along with a chloramphenicol resistance cassette. All bacterial strains and plasmids used in this work are reported in the supplemental Chapter 8.
Bacterial Growth Curves
Bacterial growth curves were measured in a multi-well plate reader (BioTek Cytation5) generously provided by the David Van Valen lab at Caltech. Cells constitutively expressing YFP were grown overnight to saturation in LB broth (BD Medical) at 37∘ C with aeration. Once saturated, cells were diluted 1000 fold into 50 mL of the desired growth medium and were allowed to grow at the appropriate experimental temperature with aeration for several hours until an OD600nm ≈ 0.3 was reached. Cells were then diluted 1:10 into the desired growth media at the desired temperature. After being thoroughly mixed, 500 μL aliquots were transferred to a black-walled 96-well plate (Brooks Automation Incorporated, Cat No. MGB096-1-2-LG-L), leaving two rows and two columns of wells on each side of the plate filled with sterile growth medium to serve as blanks and buffer against temperature variation. The plate was then transferred to the pre-warmed plate reader. OD600nm measurements were made every five minutes for 12 to 24 hours until cultures had saturated. In between measurements, the plate incubated at the appropriate temperature with a linear shaking mode. We found that double-orbital shaking modes led to the formation of cell aggregates which gave inconsistent measurements.
Estimation of Bacterial Growth Rate
Non-parametric estimation of the maximum growth rate was performed using the FitDeriv Python package as described in Swain et al. (2016). Using this approach, the bacterial growth curve is modeled as a Gaussian process in which the measured growth at a given time point is modeled as a Gaussian distribution whose mean is dependent on the mean of the neighboring time points. This allows for smooth interpolation between adjacent measurements and calculation of second derivatives without an underlying parametric model. The reported growth rates are the maximum value inferred from the exponential phase of the experimental growth curve.
Growth Conditions
Parent strains (autofluorescence control, ΔlacI constitutive control, and the ATC-inducible LacI-mCherry strain) were grown in LB Miller broth (B.D. Medical, Cat. No. 244620 ) at 37∘ C with vigorous aeration until saturated. Cells were then diluted between 1000 and 5000 fold into 3 mL of M9 minimal medium (B.D. Medical, Cat. No. 248510). The bacterial strain expressing the tetracycline-inducible LacI-mCherry was diluted into six 3 mL cultures with differing concentrations of ATC (Chemodex, Cat. No. CDX-A0197-T78) ranging from 0.1 to 10 ng / mL to induce expression of the transcription factor. These concentrations were reached by dilution from 1 μg / mL stock in 50% ethanol. All cultures were shielded from ambient light using either aluminum foil or via an enclosure and were grown at the appropriate experimental temperature with aeration until an OD600nm of approximately 0.25 − 0.35. Due to pipetting errors, cultures reached OD600nm ≈ 0.3 at slightly different points in time. To ensure that strains could be directly compared, all strains were diluted by several fold until equivalent. When all samples reached the appropriate OD600nm, the cells were harvested for imaging.
Imaging Sample Preparation
Prior to the preparation of cell cultures for imaging, a 2% (w/v) agarose substrate (UltraPure, Thermo Scientific) was prepared and allowed to reach room temperature. For experiments conducted at 42∘C, 4% (w/v) agarose substrates were prepared. Briefly, the agarose was mixed with the appropriate growth medium in a 50 mL conical polystyrene tube and then microwaved until molten. A 300 to 500 μL aliquot was then sandwiched between two glass coverslips to ensure a flat imaging surface. Once solidified, the agarose pads were cut into squares approximately 0.5 cm per side.
To determine the calibration factor between fluorescence and protein copy number, production the fluorophore in question must be halted such that all differences in intensity between two daughter cells result from binomial partitioning of the fluorophores and not from continuing expression. This was achieved by removing the anhydrous tetracycline inducer from the growth medium through several washing steps as outlined in Brewster et al. (2014). Aliquots of 100 μL from each ATC-induced culture were combined and pelleted at 13000 × g for 2 minutes. The supernatant (containing ATC) was aspirated and replaced with 1 mL of M9 growth medium without ATC. The pellet was resuspended and pelleted at 13000 × g. This wash step was repeated three times to ensure that residual ATC had been removed and LacI-mCherry production was halted. After the final wash, the cell pellet was resuspended in 1 mL of M9 medium and diluted ten-fold. A 1 μL aliquot of the diluted mixture was then spotted onto an agarose substrate containing the appropriate carbon source. The remaining bacterial cultures (autofluorescence control, constitutive expression control, and the ATC-induced samples) were diluted ten-fold into sterile M9 medium. This dilution was thoroughly mixed and 1 μL aliquots were spotted onto agarose substrates lacking the carbon source.
Once the spotted cells had dried onto the agarose substrates (about 5 to 10 minutes after deposition), the agarose pads were inverted and pressed onto a glass bottom dish (Electron Microscopy Sciences, Cat. No. 70674-52) and sealed with parafilm. Strips of double stick tape were added to the edge of the dish to help immobilize the sample on the microscope stage and minimize drift.
Microscopy
All imaging was performed on a Nikon Ti-Eclipse inverted microscope outfitted with a SOLA LED fluorescence illumination system. All images were acquired on a Andor Zyla 5.5 sCMOS camera (Oxford Instruments Group). The microscope body and stage was enclosed in a plexiglass incubation chamber (Haison, approximately 1∘ C regulation control) connected to an external heater. Temperature of the stage was measured via a thermometer which controlled heating of the system.
All static images (i.e. images from which fold-change and repressor counts were calculated) were measured in an identical manner. Ten to fifteen fields of view containing on average 25 cells were imaged using phase contrast and fluorescence excitation. Fluorescence exposures were each 5 seconds while the phase contrast exposure time was between 75 ms and 150 ms. This procedure was repeated for each unique strain and ATC induction concentration.
To compute the calibration factor for that day of imaging, time-lapse images were taken of a separate agarose pad covered in cells containing various levels of LacI-mCherry. Fifteen to twenty positions were marked, choosing fields of view containing 20 to 50 cells. Cells were allowed to grow for a period of 90 to 120 minutes (depending on the medium-dependent growth rate) with phase contrast images taken every 5 to 10 minutes. At the final time-point, both phase contrast and fluorescence images were acquired using the same settings for the snapshots. Once the experiment was completed, images were exported to .tif
format and transferred to cold storage and a computational cluster for analysis.
Lineage Tracking
Cells were segmented and lineages reconstructed using the deep-learning-based bacterial segmentation software SuperSegger v1.1
(Stylianidou et al. 2016; Cass et al. 2017) operated through MATLAB (Mathworks, v2017b
). We found that the pre-trained network constants 100XEcM9
(packaged with the SuperSegger software) worked well for all growth conditions tested in this work. The generated files (clist.mat
) associated with each sample and position were parsed using bespoke Python scripts to reconstruct lineages and apply appropriate filtering steps before calculating the fluorescence calibration factor. We direct the reader to the SI text for details of our data validation procedure to ensure proper lineage tracking.
Calculation of the Calibration Factor
To estimate the calibration factor α, we used a Bayesian definition of probability to define a posterior distribution of α conditioned on intensity measurements of sibling cells after division. We direct the reader to supplemental Chapter 8 for a detailed discussion of the inferential procedures and estimation of systematic error. We give a brief description of the inference below.
We are interested in determining the fluorescence of a single LacI-mCherry repressor dimer given a set of intensity measurements of sibling cells, [I1, I2]. The intensity of a given cell I is related to the number of LacI-mCherry dimers it is expressing by a multiplicative factor α which can be enumerated mathematically as
I = αN, (7)
where N is the total number of LacI-mCherry dimers. We can define the posterior probability distribution of α conditioned on the intensity measurements using Bayes’ theorem as
$$
g(\alpha\,\vert\, [I_1, I_2]) = \frac{f([I_1, I_2]\,\vert\,
\alpha)g(\alpha)}{f([I_1, I_2])},
\qquad(8)$$
where we have used g and f as probability densities over parameters and data, respectively. The denominator of this expression (the evidence) is equivalent to the first term of the numerator (the likelihood) marginalized over α. In this work, this term serves as normalization factor and can be neglected.
Assuming that no more LacI-mCherry dimers are produced during cell division, the number of repressors that each sibling cell receives after division of the parent cell is binomially distributed with a probability p. We can make the approximation that partitioning of the repressors is even such that p = 1/2. The validity of this approximation is discussed in detail in Chapter 8. Using Eq. 7 and the change-of-variables formula, we can define the likelihood g([I1, I2] | α) as
$$
g([I_1, I_2]\,\vert\,\alpha) =
\frac{1}{\alpha^k}\prod_i^k\frac{\Gamma\left(\frac{ {I_1}_i + {I_2}_i}{\alpha} +
1\right)}{\Gamma\left(\frac{ {I_1}_i}{\alpha} +
1\right)\Gamma\left(\frac{ {I_2}_i}{\alpha} + 1\right)}2^{-\frac{ {I_1}_i +
{I_2}_i}{\alpha}},
\qquad(9)$$
where k is the total number of sibling pairs measured.
With a likelihood defined, we must now define a functional form for g(α) which describes all prior information known about the calibration factor knowing nothing about the actual measurements. Knowing that we design the experiments such that only ≈ 2/3 of the dynamic range of the camera is used and α cannot be less than or equal to zero, we can define a half-normal distribution with a standard deviation of σ as
$$
g(\alpha) = \sqrt{2 \over \pi\sigma^2}\exp\left[{-\alpha^2 \over
2\sigma^2}\right]\,;\, \forall \alpha > 0,
\qquad(10)$$
where the standard deviation is large, for example, σ = 500 a.u. / fluorophore. We evaluated the posterior distribution using Markov chain Monte Carlo (MCMC) as is implemented in the Stan probabilistic programming language (Carpenter et al. 2017). The .stan
file associated with this model along with the Python code used to execute it can be accessed on the paper website.
Counting Repressors
Given an estimation for α for each experiment, we calculate the total number of repressors per cell from
$$
R = \frac{I_{mCherry}}{\alpha}.
\qquad(11)$$
However, as discussed in detail in Chapter 8, a systematic error in the repressor count is introduced due to division in the asynchronous culture between the cessation of LacI-mCherry production and the actual imaging. The entire sample preparation procedure is ≈ 30 − 60 min, during which time some cells complete a cell division, thereby diluting the total repressor count. To ensure that the measured number of repressors corresponded to the measured YFP intensity, we restricted our dataset for all experiments to cells that had a pole-to-pole length ℓ ≥ 3.5 μm, indicating that they had likely not undergone a division during the sample preparation.
Code and Data Availability
All code used in this work is available on the paper website and associated GitHub repository(DOI: 0.5281/zenodo.3560369). This work also used the open-source software tools SuperSegger v.1.1
(Stylianidou et al. 2016; Cass et al. 2017) for lineage tracking and FitDeriv v.1.03
(Swain et al. 2016) for the nonparametric estimation of growth rates. Raw image files are large (1.8 Tb) and are therefore available upon request. The clist.mat files used to calculate fold-change and to assign sibling cells can be accessed via the associated GitHub repository via (DOI: 0.5281/zenodo.3560369) or through Caltech DATA under the DOI: 10.22002/D1.1315.
References
Barnes, Stephanie L., Nathan M. Belliveau, William T. Ireland, Justin B. Kinney, and Rob Phillips. 2019. “Mapping DNA Sequence to Transcription Factor Binding Energy in Vivo.” Edited by Gary D. Stormo. PLOS Computational Biology 15 (2). https://doi.org/10.1371/journal.pcbi.1006226.
Brewster, Robert C., Franz M. Weinert, Hernan G. Garcia, Dan Song, Mattias Rydenfelt, and Rob Phillips. 2014. “The Transcription Factor Titration Effect Dictates the Level of Gene Expression.” Cell 156 (6): 1312–23. https://doi.org/10.1016/j.cell.2014.02.022.
Carpenter, Bob, Andrew Gelman, Matthew D. Hoffman, Daniel Lee, Ben Goodrich, Michael Betancourt, Marcus Brubaker, Jiqiang Guo, Peter Li, and Allen Riddell. 2017. “Stan: A Probabilistic Programming Language.” Journal of Statistical Software 76 (1): 1–32. https://doi.org/10.18637/jss.v076.i01.
Cass, Julie A., Stella Stylianidou, Nathan J. Kuwada, Beth Traxler, and Paul A. Wiggins. 2017. “Probing Bacterial Cell Biology Using Image Cytometry.” Molecular Microbiology 103 (5): 818–28. https://doi.org/10.1111/mmi.13591.
Chure, Griffin, Manuel Razo-Mejia, Nathan M Belliveau, Tal Einav, Zofii A Kaczmarek, Stephanie L Barnes, and Rob Phillips. 2019. “Predictive Shifts in Free Energy Couple Mutations to Their Phenotypic Consequences.” Proceedings of the National Academy of Sciences 116 (37). https://doi.org/10.1073/pnas.1907869116.
Driessen, Rosalie P. C., Gerrit Sitters, Niels Laurens, Geri F. Moolenaar, Gijs J. L. Wuite, Nora Goosen, and Remus Th. Dame. 2014. “Effect of Temperature on the Intrinsic Flexibility of DNA and Its Interaction with Architectural Proteins.” Biochemistry 53 (41): 6430–8. https://doi.org/10.1021/bi500344j.
Elf, Johan, Gene-Wei Li, and X. Sunney Xie. 2007. “Probing Transcription Factor Dynamics at the Single-Molecule Level in a Living Cell.” Science 316 (5828): 1191–4. https://doi.org/10.1126/science.1141967.
Friedel, J. 1974. “On the Stability of the Body Centred Cubic Phase in Metals at High Temperatures.” Journal de Physique Lettres 35 (4): 59–63. https://doi.org/10.1051/jphyslet:0197400350405900.
Gama-Castro, Socorro, Heladia Salgado, Alberto Santos-Zavaleta, Daniela Ledezma-Tejeida, Luis Muñiz-Rascado, Jair Santiago García-Sotelo, Kevin Alquicira-Hernández, et al. 2016. “RegulonDB Version 9.0: High-Level Integration of Gene Regulation, Coexpression, Motif Clustering and Beyond.” Nucleic Acids Research 44 (D1): D133–D143. https://doi.org/10.1093/nar/gkv1156.
Garcia, Hernan G., Alvaro Sanchez, James Q. Boedicker, Melisa Osborne, Jeff Gelles, Jane Kondev, and Rob Phillips. 2012. “Operator Sequence Alters Gene Expression Independently of Transcription Factor Occupancy in Bacteria.” Cell Reports 2 (1): 150–61. https://doi.org/10.1016/j.celrep.2012.06.004.
Garcia, H. G., and R. Phillips. 2011. “Quantitative Dissection of the Simple Repression Input-Output Function.” Proceedings of the National Academy of Sciences 108 (29): 12173–8. https://doi.org/10.1073/pnas.1015616108.
Goethe, Martin, Ignacio Fita, and J. Miguel Rubi. 2015. “Vibrational Entropy of a Protein: Large Differences Between Distinct Conformations.” Journal of Chemical Theory and Computation 11 (1): 351–59. https://doi.org/10.1021/ct500696p.
Hui, Sheng, Josh M. Silverman, Stephen S. Chen, David W. Erickson, Markus Basan, Jilong Wang, Terence Hwa, and James R. Williamson. 2015. “Quantitative Proteomic Analysis Reveals a Simple Strategy of Global Resource Allocation in Bacteria.” Molecular Systems Biology 11 (2). https://doi.org/10.15252/msb.20145697.
Jones, Daniel L., Robert C. Brewster, and Rob Phillips. 2014. “Promoter Architecture Dictates Cell-to-Cell Variability in Gene Expression.” Science 346 (6216): 1533–6. https://doi.org/10.1126/science.1255301.
Jun, Suckjoon, Fangwei Si, Rami Pugatch, and Matthew Scott. 2018. “Fundamental Principles in Bacterial Physiology - History, Recent Progress, and the Future with Focus on Cell Size Control: A Review.” Reports on Progress in Physics 81 (5): 056601. https://doi.org/10.1088/1361-6633/aaa628.
Kao-Huang, Ying, Arnold Revzin, Andrew P. Butler, Pamela O’Conner, Daniel W. Noble, and Peter H. Von Hippel. 1977. “Nonspecific DNA Binding of Genome-Regulating Proteins as a Biological Control Mechanism: Measurement of DNA-Bound Escherichia Coli Lac Repressor in Vivo.” Proceedings of the National Academy of Sciences 74 (10): 4228–32. https://doi.org/10.1073/pnas.74.10.4228.
Kepler, Thomas B., and Timothy C. Elston. 2001. “Stochasticity on Transcriptional Regulation: Origins, Consequences, and Mathematical Representations.” Biophysical Journal 81 (6): 3116–36. https://doi.org/10.1016/S0006-3495(01)75949-8.
Klumpp, Stefan, and Terence Hwa. 2014. “Bacterial Growth: Global Effects on Gene Expression, Growth Feedback and Proteome Partition.” Current Opinion in Biotechnology 28 (August): 96–102. https://doi.org/10.1016/j.copbio.2014.01.001.
Ko, Minoru S. H. 1991. “A Stochastic Model for Gene Induction.” Journal of Theoretical Biology 153 (2): 181–94. https://doi.org/10.1016/S0022-5193(05)80421-7.
Li, Gene-Wei, David Burkhardt, Carol Gross, and Jonathan S. Weissman. 2014. “Quantifying Absolute Protein Synthesis Rates Reveals Principles Underlying Allocation of Cellular Resources.” Cell 157 (3): 624–35. https://doi.org/10.1016/j.cell.2014.02.033.
Lutz, Rolf, and Hermann Bujard. 1997. “Independent and Tight Regulation of Transcriptional Units in Escherichia Coli via the LacR/O, the TetR/O and AraC/I1-I2 Regulatory Elements.” Nucleic Acids Research 25 (6): 1203–10. https://doi.org/10.1093/nar/25.6.1203.
Michel, Denis. 2010. “How Transcription Factors Can Adjust the Gene Expression Floodgates.” Progress in Biophysics and Molecular Biology 102 (1): 16–37. https://doi.org/10.1016/j.pbiomolbio.2009.12.007.
Mondal, Jagannath, Benjamin P. Bratton, Yijie Li, Arun Yethiraj, and James C. Weisshaar. 2011. “Entropy-Based Mechanism of Ribosome-Nucleoid Segregation in E. coli Cells.” Biophysical Journal 100 (11): 2605–13. https://doi.org/10.1016/j.bpj.2011.04.030.
Motlagh, Hesam N., James O. Wrabl, Jing Li, and Vincent J. Hilser. 2014. “The Ensemble Nature of Allostery.” Nature 508 (7496): 331–39. https://doi.org/10.1038/nature13001.
Phillips, Rob. 2001. Crystals, Defects and Microstructures. Cambridge University Press. https://doi.org/10.1017/CBO9780511606236.
Phillips, Rob, Nathan M. Belliveau, Griffin Chure, Hernan G. Garcia, Manuel Razo-Mejia, and Clarissa Scholes. 2019. “Figure 1 Theory Meets Figure 2 Experiments in the Study of Gene Expression.” Annual Review of Biophysics 48 (1): 121–63. https://doi.org/10.1146/annurev-biophys-052118-115525.
Razo-Mejia, Manuel, Stephanie L. Barnes, Nathan M. Belliveau, Griffin Chure, Tal Einav, Mitchell Lewis, and Rob Phillips. 2018. “Tuning Transcriptional Regulation Through Signaling: A Predictive Theory of Allosteric Induction.” Cell Systems 6 (4): 456–469.e10. https://doi.org/10.1016/j.cels.2018.02.004.
Rosenfeld, N., M. B. Elowitz, and U. Alon. 2002. “Negative Autoregulation Speeds the Response Times of Transcription Networks.” Journal of Molecular Biology 323 (November): 785–93. https://doi.org/S0022283602009944\\\%0020[pii].
Rosenfeld, Nitzan, Jonathan W. Young, Uri Alon, Peter S. Swain, and Michael B. Elowitz. 2005. “Gene Regulation at the Single-Cell Level.” Science 307 (5717): 1962–5. https://doi.org/10.1126/science.1106914.
Rydenfelt, Mattias, Hernan G. Garcia, Robert Sidney Cox, and Rob Phillips. 2014. “The Influence of Promoter Architectures and Regulatory Motifs on Gene Expression in Escherichia Coli.” Edited by Jordi Garcia-Ojalvo. PLOS ONE 9 (12): e114347. https://doi.org/10.1371/journal.pone.0114347.
Schaechter, M., O. Maaløe, and N. O. Kjeldgaard. 1958. “Dependency on Medium and Temperature of Cell Size and Chemical Composition During Balanced Growth of Salmonella Typhimurium.” Microbiology 19 (3): 592–606. https://doi.org/10.1099/00221287-19-3-592.
Schlake, Thomas, and Juergen Bode. 1994. “Use of Mutated FLP Recognition Target (FRT) Sites for the Exchange of Expression Cassettes at Defined Chromosomal Loci.” Biochemistry 33 (43): 12746–51. https://doi.org/10.1021/bi00209a003.
Schmidt, Alexander, Karl Kochanowski, Silke Vedelaar, Erik Ahrné, Benjamin Volkmer, Luciano Callipo, Kèvin Knoops, Manuel Bauer, Ruedi Aebersold, and Matthias Heinemann. 2016. “The Quantitative and Condition-Dependent Escherichia Coli Proteome.” Nature Biotechnology 34 (1): 104–10. https://doi.org/10.1038/nbt.3418.
Scott, M., S. Klumpp, E. M. Mateescu, and T. Hwa. 2014. “Emergence of Robust Growth Laws from Optimal Regulation of Ribosome Synthesis.” Molecular Systems Biology 10 (8): 747–47. https://doi.org/10.15252/msb.20145379.
Shehata, T E, and A G Marr. 1975. “Effect of Temperature on the Size of Escherichia Coli Cells.” Journal of Bacteriology 124 (2): 857–62.
Stylianidou, Stella, Connor Brennan, Silas B. Nissen, Nathan J. Kuwada, and Paul A. Wiggins. 2016. “SuperSegger: Robust Image Segmentation, Analysis and Lineage Tracking of Bacterial Cells.” Molecular Microbiology 102 (4): 690–700. https://doi.org/10.1111/mmi.13486.
Swain, Peter S., Keiran Stevenson, Allen Leary, Luis F. Montano-Gutierrez, Ivan B. N. Clark, Jackie Vogel, and Teuta Pilizota. 2016. “Inferring Time Derivatives Including Cell Growth Rates Using Gaussian Processes.” Nature Communications 7 (December): 13766. https://doi.org/10.1038/ncomms13766.
Yakovchuk, Peter, Ekaterina Protozanova, and Maxim D. Frank-Kamenetskii. 2006. “Base-Stacking and Base-Pairing Contributions into Thermal Stability of the DNA Double Helix.” Nucleic Acids Research 34 (2): 564–74. https://doi.org/10.1093/nar/gkj454.
Zhu, Xu-Dong, and Paul D. Sadowski. 1995. “Cleavage-Dependent Ligation by the FLP Recombinase: Characterization of a Mutant Protein with an Alteration in a Catalytic Amino Acid.” Journal of Biological Chemistry 270 (39): 23044–54. https://doi.org/10.1074/jbc.270.39.23044.