Noise Control In Gene Regulatory Networks With Negative Feedback

Michael Hinczewski; D. Thirumalai

doi:10.1101/049502

Abstract

Genes and proteins regulate cellular functions through complex circuits of biochemical reactions. Fluctuations in the components of these regulatory networks result in noise that invariably corrupts the signal, possibly compromising function. Here, we create a practical formalism based on ideas introduced by Wiener and Kolmogorov (WK) for filtering noise in engineered communications systems to quantitatively assess the extent to which noise can be controlled in biological processes involving negative feedback. Application of the theory, which reproduces the previously proven scaling of the lower bound for noise suppression in terms of the number of signaling events, shows that a tetracycline repressor-based negative-regulatory gene circuit behaves as a WK filter. For the class of Hill-like nonlinear regulatory functions, this type of filter provides the optimal reduction in noise. Our theoretical approach can be readily combined with experimental measurements of response functions in a wide variety of genetic circuits, to elucidate the general principles by which biological networks minimize noise.

The genetic regulatory circuits that control all aspects of life are inherently stochastic. They depend on fluctuating populations of biomolecules interacting across the crowded, thermally agitated interior of the cell. Noise is also exacerbated by low copy numbers of particular proteins and mRNAs, as well as variability in the local environment.¹^-⁶ Yet the robust and reproducible functioning of key systems requires mechanisms to filter out fluctuations. For example, regulating noise is relevant in stabilizing cell-fate decisions in embryonic development,⁷ prevention of random switching to proliferating states in cancer-regulating miRNA networks,⁸ and maximization of the efficiency of bacterial chemotaxis along attrac-tant gradients.⁹ Comprehensive analysis of yeast protein expression reveals that proteins involved in translation initiation, ribosome formation, and protein degradation, have lower relative noise levels,¹⁰ suggesting natural selection could favor noise reduction for certain essential cellular components.¹¹,¹²

A common regulatory motif capable of suppressing noise is the negative feedback loop,¹,²,¹³^-¹⁸ as has been explicitly demonstrated in synthetic gene circuits.¹,¹⁴,¹⁵ Feedback pathways for a given chemical species can be mediated by numerous signaling molecules, each with its own web of interactions and stochastic characteristics that determine the ultimate effectiveness of the system in damping the fluctuations of the target population and maintaining home-ostasis. Thus, uncovering generic laws governing the behavior of such control networks is difficult. A major advance was made by Lestas, Vinnicombe, and Paulsson (LVP),¹⁹ who showed that information theory can set a rigorous lower bound on the magnitude of fluctuations within an arbitrarily complicated homeostatic negative feedback network. Since the bound scales like the fourth root of the number of signaling events, noise reduction is extremely expensive. This underscores the pervasiveness of biological noise, even in cases where there may be evolutionary pressure to minimize it.

The existence of a rigorous bound raises a number of intriguing issues. Can a biochemical network actually reach this lower bound, and thus optimally suppress fluctuations? What would be the dynamic behavior of such an optimal system, and how would it depend on the noise spectrum of the system components? Here we answer these equations using a theory related to the optimal linear noise-reduction filter, developed by Wiener²⁰ and Kolmogorov.²¹ Though the original context of Wiener-Kolmogorov (WK) filter theory was removing noise from corrupted signals in engineered communications systems, it has recently become a powerful tool for characterizing the constraints on signaling in biochemical networks.²²,²³ recently, we showed that the action of kinase and phosphatase enzymes on their protein substrates, the basic elements of many cellular signaling pathways, can in fact effectively be represented as an optimal WK filter.²² The WK theory also describes how systems like E. coli chemotaxis can optimally anticipate future changes in concentrations of extracellular ligands.²³ Although the classic WK theory is strictly defined for linear filtering of continuous signals (a reasonable approximation for certain biochemical networks), it can also be extended to yield constraints in the more general case of nonlinear production of molecular species with discrete population values.²²

Interestingly, for a broad class of systems the WK linear solution turns out to be the global optimum among all nonlinear or linear networks, allowing us to delineate where non-linearity is potentially advantageous in biochemical noise control. Most importantly, since the WK theory is formulated in terms of experimentally accessible dynamic response functions, it also provides a design template for realizing optimality in synthetic circuits. As an illustrative example, we predict that a synthetic autoregulatory TetR loop, engineered in yeast,²⁴ can be fine-tuned to approximate an optimal WK filter for TetR mRNA levels. Though a simple design, similar filters could be employed in nature to cope with Poisson noise arising from small copy numbers of mRNAs, often on the order of 10 per cell.²⁵ Based on the application of the theory to the synthetic gene network we propose that the extent of noise reduction in biological circuits is determined by competing factors such as functional efficiency, adaptation, and robustness.

Results

To make the paper readable and as self-contained as possible many of the details of the calculation are relegated to four Appendices. The main text contains only the necessary details needed to follow the results without the distraction of the mathematics.

Linear response theory for a general control network

To motivate the WK approach for a general control network, we start with the simple case where two species within the network are explicitly singled out:¹⁹ a target R with time-varying population r(t) fluctuating around mean , and one of the mediators in the feedback signaling pathway P, with population p(t) varying around . We assume a continuum Langevin description of the dynamics,¹³,¹⁶,²⁶,²⁷ where the rate for α = ror p, can be broken down into deterministic (κ_α) and stochastic (n_α) parts. The function κ_α(t) encapsul rates the entire web of biochemical reactions underlying synthesis and degradation of species α, and can be an arbitrary functional of the past history of the system up to time t. It is typically divided into two parts, , corresponding to the production (+) and destruction (-) rates of the species α. The term n_α(t) is the additive noise contribution, which can also be divided into two parts, . The first is the “intrinsic” or shot noise, arising from the stochastic Poisson nature of α generation, , where is the mean production rate, or equivalently the mean destruction rate, , and η_α(t) is a Gaussian white noise function with correlation . The second part, , is “extrinsic” noise, which arises due to fluctuations in cellular components affecting the dynamics of and P that are not explicitly taken into account in the two-species picture. These could include mediators in the signaling pathway, or global factors likeibosome and RNA polymerase levels. For simplicity, our main focus will be the case of no extrinsic noise. However, we will show later how a straightforward extension of the theory reveals that the same system can behave like an optimal WK filter under a variety of extrinsic noise conditions.

For small deviations δα(t) = α(t)-κ from the mean populations κ, κ_α(t) can be linearized withespect to δα(t), where G_αα'(t) are linear response functions, which express the dependence of κ_α(t) on the past history of δα'(t). The functions G_αα'(t) capture the essential characteristic responses of the control network to perturbations away from equilibrium (Fig. 1). In the static limit, G_αα'(t) have appeared in various guises as gains,⁶ susceptibilities,¹⁷ or steady-state Jacobian matrices,²⁷ and in the frequency-domain as loop transfer functions.¹³,¹⁶ Feedback between R and P is encoded in the cross-responses G_rp(t) and G_pr(t). In the simplest scenario, the only non-zero self-responses G_αα(t) are decay terms, , where τ_α is the decay time scale for species α. However, the theory works generally for more complicated self-response mechanisms.

Figure 1:

Schematic of a complex signaling network with the target species R and one mediator P singled out. In focusing on two species, the action of all the other components is effectively encoded in four response functions—G_rr(t), G_pp(t), G_rp(t), G_pr(t)—that describe how the entire dynamical system responds to perturbations in R and P.

Control network as a noise filter

The connection between the linearized dynamical description and WK filter theory arises from comparing the original system to the case where feedback is turned off (i.e. setting G_rp(ω) or G_pr(ω) to zero). Let us define a few terms to make the noise filter analogy clear. Without feedback, the target fluctuations are δr₀(t) = s(t), where we denote s(t) the signal. This is to distinguish it from δr(t) in the original system, which is the output. The difference between the two, which reflects the impact of the feedback network, we express as , where is referred to as the estimate. In this analogy, minimizing δr(t) requires a feedback loop where the estimate is as close as possible to the signal s(t). The only thing left to specify is the relationship between and s(t).

The dynamical system in Eqs. (1)-(2) takes a simple form in Fourier space, where the fluctuations δα(ω) satisfy:

We solve Eq. (3) for δr(ω) and break up the R fluctuation into two contributions, , with the signal s(ω) and estimate given by:

Here we have introduced a noise function n(ω), and a filter function H(ω):

Thus in the time domain the estimate is the convolution of the filter function H(t) and a noise-corrupted signal y(t) ≡ s(t) + n(t),

Eqs. (4)-(6) constitute a one-to-one mapping between the linear response and noise filter descriptions of the system in Fourier space. They relate the four filter quantities, s(ω), , n(ω), and H(ω), to the four linear response functions G_rr(ω), G_rp(ω), G_pr(ω), and G_pp(ω).

The entire noise filter system is illustrated schematically in Fig. 2. Note that the noise function in the filter analogy, n(t), is related to n_p(t) in Fourier space as n(ω) = n_p(ω)/G_pr(ω). Thus, the stochastic nature of the mediator P production makes estimation non-trivial, since the function H(t) must try to filter out the n(t) component in y(t) in order to produce close to s(t). Though we confine ourselves throughout this work to the case of a dynamical system with a single target and mediator species, one can easily generalize the entire approach to explicitly include many mediators, which could potentially be involved in a complex signaling pathway. The linearized dynamical system in Eqs. (1)-(2) would still have the same form (with index α running over all the species of interest), and the mapping onto the filter problem for the target species would be analogous. The only difference is that n(ω) and H(ω) would be more complicated functions of the various individual noise terms n_α(ω) and the response functions G_αα'(ω) of the mediators. In our reduced, two species description, the action of all the unspecified chemical components is effectively included in the four response functions described above, with their stochastic effects contributing to the extrinsic noise. Fig. 1 shows a schematic of such a reduction. The fine-grained details of the signaling pathways connecting our target R and mediator P, potentially involving many interacting species, are encoded in G_rr, G_pp, G_rp, and G_pr. As an example of how this two-species reduction would work in practice, in Appendix B we treat an important example of a feedback loop involving multiple mediators, representing a signaling cascade in series.

Figure 2:

Signal processing diagram illustrating noise suppression in a negative feedback loope-interpreted as a linear filter. The fluctuations in the target species δr(t) (lower left) are expressed as , where theaw signal s(t) (upper left) equals δr(t) in the absence of feedback control, and the estimate (loweright) is the contribution of the feedback loop. This estimate is given by the convolution of a filter function H(t) (center) and the corrupted signal s(t) + n(t), where n(t) is the noise (upperight). The goal of Wiener-Kolmogorov theory is to find a causal H(t) such that the standard deviation of δr(t) is minimized. All sample trajectories shown in the figure are generated from numerically solving the linearized version of the dynamical system in Eq. (10).

Wiener-Kolmogorov theory yields the optimal filter

The WK optimization problem consists of minimizing , the variance of target fluctuations, which are related to H(t), s(t), and n(t) through the frequency-domain integral²⁸ (see derivation in Appendix A): where H(ω) is the Fourier transform of H(t), and P_n(ω), P_s(ω) are the power spectral densities (PSD) of n(t) and s(t) respectively, i.e. the Fourier transforms of their autocorrelation functions. If P_n(ω) and P_s(ω) are given, the task is to minimize in Eq. (8) over all possible H(ω). The main constraint that makes the solution mathematically difficult is that H(ω) must correspond to a physically realizable control network, which imposes the crucial restriction that the time-domain convolution function H(t) must be causal, depending only on the past history of the input, H(t) = 0 for t<0. The great achievement of Wiener and Kolmogorov was to derive the form of the optimal causal solution H_opt(ω):

The c super/subscripts refer to two different decompositions in the frequency domain which enforce causality: (i) Any physical PSD, in this case P_y(ω) corresponding to the corrupted signal y(t) = s(t) + n(t), can be written as . The factor , if treated as a function over the complex ω plane, contains no zeros and poles in the upper half-plane (Imω > 0).²⁹ (ii) We also define an additive decomposition denoted by {F(ω)}_c (see Appendix A) for any function F(ω), which consists of all terms in the partial fraction expansion of F(ω) with no poles in the upper half-plane. In Appendix A we provide in detail a new derivation of Eq. (9), the heart of the WK theory.

Optimal noise control in a yeast gene circuit with feedback

To illustrate the nature of the optimal WK solution we choose as a case study the yeast negative autoregulatory gene circuit designed by Nevozhay et. al.,²⁴ drawn schematically in Fig. 3(a). The gene encoding for the TetR protein is under the control of the P_GAL1-D12 promoter, whose activity can be repressed by binding TetR dimers. The strength of the feedback can be modulated by changing the extracellular concentration A of the inducer anhydrotetracycline (ATc), which enters the cell, binds to TetR and prevents its association with the promoter, thus weakening repression.

Figure 3:

(a) The synthetic yeast gene circuit designed by Nevozhay et. al.²⁴ The TetR protein negatively regulates itself by binding to its own promoter. The inducer molecule ATc associates with TetR, inhibiting its repressor activity. The subsequent panels show results for this gene circuit using the linear filter theory applied to the dynamical model of Eq. (10), with experimentally-derived parameters (Table 1). (b) Filter functions H(t) and H_opt(t), sample signal s(t) and estimate time series for burst ratio B = 10 and three different values of extracellular ATc concentration A [ng/mL]. H(t) is from Eq. (19), while H_opt(t) is from Eq. (18). The sample time series trajectories are numerical solutions of the linearized Eq. (10). On the right are the resulting equilibrium probability distributions P(δr), where , which are Gaussians with variance . For A ≈ 54 ng/mL, the circuit approximately functions as an optimal WK filter (H(t) is close to H_opt(t)), maximally suppressing fluctuations in the population levels of TetR mRNA (minimizing ). (c) Mean populations of free intracellular TetR mRNA, , and TetR protein dimers, . (d) The decay rates of free mRNA and proteins, and , which are related to the network self-response functions G_rr and G_pp (both are constants in the frequency domain as shown in Eq. 11). (e) The magnitude of the network cross-response, |G_rp| (solid lines), plotted together with the optimal magnitude (dashed lines). Filled circles mark the intersection defining A = A_opt, where the system behaves approximately like an optimal WK filter. (f) The Fano factor (solid lines), compared to the optimal WK value (horizontal dashed lines). Filled circles mark the position A = A_opt.

In order to analyze the TetR negative feedback gene circuit, we start with the simple mathematical model introduced in Ref. 24, which provided results that are consistent with the experimental data. The simplified model, which captures the essence of the synthetic gene network, features as the main variables the population of free intracellular TetR dimer, p(t), and free intracellular ATc molecules, a(t). In addition to the regulatory loop, the experimental gene circuit has a parallel yEGFP reporter portion, which acts as a monitor of TetR protein levels. Because we focus on the system as a noise filter for the TetR mRNA population, and the yEGFP part does not influence this analysis,²⁴ we ignore the reporter circuit.

The production of the TetR dimers occurs in a single step, with the autoregulation of the rate described by a repressory Hill function. We divide this step into two parts, introducing as an additional variable the population of TetR mRNA r(t). The feedback loop (Fig. 3(a)) consists of mRNA production at a rate given by the Hill function κ_r(t) = κ₀θⁿ/(θⁿ + pⁿ(t)), followed by TetR dimer generation at a rate given by κ_pr(t). The degradation/dilution of the mRNA and dimers is modeled through decay terms γ_rr(t) and γ_pp(t). We could have modeled additional (comparatively fast) chemical substeps involved in this loop, such as TetR dimerization, the binding of the repressor to the individual promoter sites, or the role of RNAP and ribosomes in the transcription and translation processes. Though we limit ourselves to the two substep description to illustrate the filter theory, the stochastic effects of additional complexity can be approximately treated through general “extrinsic” noise terms incorporated into n_r(t) and n_p(t).

The main experimental variable that allows tuning of the yeast gene network behavior is the external ATc concentration A, which is assumed to be time independent. As illustrated in Fig. 2(a), there is an influx ΦA of ATc molecules into the cell. Once inside, the ATc molecules associate with the TetR at a rate βa(t)p(t). Additional loss of intracellular ATc through degradation, outflux, and dilution is modeled through an effective decay rate γ_aa(t). We assume that the dissociation of ATc from TetR occurs on long enough timescales that it can be ignored. Since the influx/association/outflux of ATc is fast compared to the transcription and translation processes of the main feedback loop, we further assume that a(t) instantaneously equilibriates at the current value of p(t). Thus, the dependence of a(t) on p(t) is determined by equating the influx and total loss rate, which leads to a(p(t)) = ΦA/(γ_a + βp(t)).

For the model described above, the dynamical equations for r(t) and p(t) are,

The parameters, with values derived from experimental fitting,²⁴ are listed in Table 1. The only quantity that is not independently known from the fit is the rate κ_p, which we allow to vary in the range κ_p/γ_r ≡ B = 2 - 10, comparable to typical experimentally measured protein burst sizes.³⁰ Setting the right sides of Eq. (10) to zero, and averaging over n_r(t) and n_p(t), we numerically solve for the equilibrium values and as a function of external ATc concentration A [Fig. 3(c)]. For A = 0, the promoter is nearly fully repressed, but with increasing A, the mean population of free TetR dimers is reduced, weakening the repression and boosting the mean mRNA population . Changing A allows us to explore a wide range of control network behavior. Note that since depends on B only through the the product κ₀B, and the value of this product is fixed at a constant value from the experimental fit (Table 1), is independent of B. On the other hand, , which is proportional to κ₀, is inversely proportional to B.

View this table:

Table 1:

Parameter values for the dynamical model of the yeast synthetic gene circuit (Eq. (10)). The cell volume V is assumed fixed. Unless otherwise noted, all values are taken from the experimental fit of Ref. 24.

^aRef. 31

^bThe burst ratio B = κ_p /γ_r. Though not independently determined by the experimental fit, we assume that B is in the range B = 2 - 10.³⁰

^cFor external ATc concentration A, 1 ng/mL corr responds to 2.25 nM.

Linearizing Eq. (10) around and , we extract the following frequency-domain response functions:

All the functions are constants in the frequency domain. Here τ_r and τ_p are effective decay times for the mRNA and proteins, respectively. The value of τ_r is fixed, and sets the intrinsic time scale of mRNA fluctuations, but τ_p and G_rp depend on , which is a function of the external ATc concentration A. In fact, association with intracellular ATc, described by the second term in the G_pp expression above, is the dominant form of decay for the free TetR dimers. Fig. 3(d) plots the effective decay constants and as a function of A. Except for A ≲ 8 ng/mL we are in the regime where , which is relevant in simplifying the optimality condition for G_rp(ω) discussed below.

The optimal filter calculation for the TetR gene circuit depends on the linear response functions of Eq. (11). We obtain the following power spectra for the signal and noise in the absence of extrinsic noise: where the burst ratio B ≡ κ_pτ_r is the mean number of proteins synthesized per mRNA during the lifetime τ_r. The problem is to evaluate Eq. (9) for H_opt(ω). The sum of signal plus noise, y(ω) = s(ω) + n(ω), has a power spectrum P_y(ω) = P_s(ω) + P_n(ω), which we canewrite as follows:

The expression within the absolute value brackets is zero only at , and has a simple pole at . Since all the zeros and poles are in the lower complex ω half-plane, it satisfies the criterion for the causal term in the factorization . Thus:

The other causal term in Eq. (9) involves the additive decomposition . This is calculated by looking at the partial fraction expansion of :

Of the two terms in the partial fraction expansion, only the first has poles solely in the lower complex ω half-plane. Hence, it is the only one that contributes to :

Inserting Eqs. (14) and (15) into Eq. (9), we finally find that the optimal filter is:

Transforming H_opt(ω) into the time domain, we find where , and Θ(t) is a unit step function ensuring that the filter operates only on the past history of its input. For B≫1 the prefactor in Eq. (18) is , and H_opt(t) has a straightforward interpretation: it approximately acts as a moving average of the corrupted signal y(t) = s(t)+n(t) over a time scale τ_avg. In order to get the best estimate , the averaging interval τ_avg can neither be too long, since it would blur out the features of the signal s(t) (which vary on the time scale τ_r), nor too short, since it would be ineffective at smoothing out the noise distortion n(t). Hence, there must exist an optimum τ_avg, which is naturally proportional to τ_r, the main time scale for the mRNA.

In Fig. 3(b), we show how the noise filter properties of the system vary with A for a burst ratio of B = 10. The filter function H(t) (solid red curve) differs substantially from H_opt(t) (dotteded curve) for large and small A, but approaches the optimal form near A = 54 ng/mL. Consequently, at this value of A we get the closest correspondence between the plotted sample trajectories of signal s(t) (cyan curve) and estimate (blue curve). Similarly, the equilibrium probability distribution of the output, P(δr), shown to the right of the trajectories, exhibits the smallest Fano factor . The latter is a measure of noise magnitude, and has a reference value of unity if mRNA production was a pure Poisson process, as would be the case without feedback. Optimality is realized in the intermediate A regime of partial repression, where the R to P responsiveness, as measured by |G_rp|, is large. Effective noise suppression requires that R be sensitive to changes in P, so that information about R fluctuations can be transmitted through the negative feedback loop.

In order to understand the optimality condition for H(t) in more detail, let us look at the explicit expression for H(t) in the TetR system, given by the inverse Fourier transform of Eq. (6) with the response functions of Eq. (11): where ω₁, ω₂ are the two ω roots of the denominator in Eq. (6). Assuming τ_p≪ τ_r (which holds good except for small values A ≲ 8 ng/mL, as seen in Fig. 3(d)), we can directly show the approach of H(t) to optimality at a specific intermediate value of G_rp. When G_rp equals , the roots , up to corrections of order . In this case, Eq. (19) becomes where the factor in the brackets on the right equals 1 in the limit τ_p → 0 for all t > 0. Up to this correction factor, we thus expect the system to behave optimally at A = A_opt, defined by the condition , so long as A_opt is large enough to satisfy τ_p ≪ τ_r. Fig. 3(e) shows G_rp and curves for B = 2, 5, 10, with dots marking the intersection points that define A_opt for each B. As explained above, |G_rp| is small at small and large A, and reaches a maximum in between. At fixed B, , so it increases monotonically with A, as larger concentrations of the inducer increase the effective decay rate of free proteins. Thus, for each B there is a single intersection point A_opt at an intermediate concentration of the inducer.

Fig. 3(f) shows the Fano factor versus A for various B. As the control network approxim rates optimality at A_opt for each B, the Fano factor nears its minimum, close to the theoretical limit marked by the horizontal dashed lines. This limit is the minimal possible , calculated from Eq. (8) using H_opt(t) from Eq. (18):

A few comments concerning the above equation are in order. (1) The result on the far right-hand side is the rigorous lower bound derived by LVP.¹⁹ In their case, the feedback mechanism through the rate function κ_r(t) could be any causal functional of p(t), linear or nonlinear. The Fano factor of the optimal linear filter differs in form only by the coefficient of B, and is always within a factor of 2 of the lower bound for any value of B. (2) For Gaussian-distributed signal s(t) and noise n(t) time series, the linear filter is optimal among all possible filters.²⁸ If the system fluctuates around a single stable state, and the copy numbers of the species are large enough that their Poisson distributions converge to Gaussians (mean populations ≳ 10), the signal and noise are usually approximately Gaussian. This is a wide class of systems where the rigorous lower bound (the last term in Eq. 21) can never be achieved. In other words, here the WK filter yields the most efficient feedback mechanism. Although, as pointed out by LVP, nonlinearity could lead to additional noise reduction, the benefits are likely to be restricted to those systems where the signal and/or noise are substantially non-Gaussian. However, since the form of the optimal control network has not been found in the general nonlinear case, it remains an interesting open question whether the LVP bound can actually be reached even within this category of systems. We willeturn to this issue in the next section. (3) The parameter B is the key determinant of noise reduction. For B ≪ 1, there are not enough signaling events to control the mRNA fluctuations, and as B → 0 we approach , the no-feedback Poissonesult. In the limit B ≫ 1 signaling is effective, and the Fano factor decreases with B as . For large enough B we approach perfect control, but at extreme expense: the standard deviation of the mRNA fluctuations σ_r,opt ∝ B^−1/4, the same scaling derived by LVP.

WK theory constrains the performance of a broad class of nonlinear, discrete regulatory networks

The results in Fig. 3 rely on a linearized, continuum approach to the TetR dynamical system. To assess if the conclusions based on the WK optimal filter hold if these approximations are relaxed, we first performed kinetic Monte Carlo simulations of the full nonlinear system (Eq. (10)) using the Gillespie algorithm.³² We chose a cell volume of V = V₀ = 60 fL, within the observed range for yeast,³³ which corresponds to the mean populations and shown in Fig. 4(a) as a function of A. (For example, at A = A_opt = 62.7 ng/mL when B = 5, ≈ 84 and ≈ 11. In addition to the nonlinearity, the discrete nature of the populations in the simulation might play a role at these low copy numbers.) The numerical results for the Fano factor are plotted in Fig. 4(b) at B = 2, 5, 10, for V = V₀ (circles) and also for comparison at a larger volume V = 10V₀ (squares). The blue curves show the linear theory results, and the dashed lines are the optimality predictions for . Although nonlinearity and discreteness effects do change the results, the linear theory gives a reasonable approximation, and the minimum is still near A_opt. The feedback mechanism is nonlinear in the simulations, but it does not do better than the linear predictions for for the parameters used to describe the experimental results. Though the intrinsic population noise is Poisson-distributed in the simulations, the Poisson distribution is very close to Gaussian, even for copy numbers as low as ~ . Since the linear filter is the true optimum for a Gaussian-distributed signal and noise,²⁸ we do not expect improvements in noise suppression by employing a nonlinear version. In the opposite limit of large copy numbers, V → ∞, the continuum approximation should be valid, and population fluctuations increasingly negligible relative to the mean. Thus, the linear theory should directly apply in this limit, and indeed we see that for V = 10V₀ the discrepancies between numerical and theory results are substantially reduced (Fig. 4(b)). It is worth emphasizing, that even at the realistically small cell volume V₀, the linear theory retains much of its predictive power. More generally, the conditions for WK optimality do not have to be perfectly satisfied in order for the filter to perform close to maximum efficiency There is an inherent adaptability and robustness in near-optimal networks, as reflected in the broad minima of as a function of A (Fig. 4(b)).

Figure 4:

Results of simulation and theory for the yeast synthetic gene circuit,²⁴ as a function of extracellular ATc concentration A. (a) Mean populations of free TetR mRNA and TetR dimer , assuming a cell volume V₀ = 60 fL. (b) The Fano factor for burst factor B = 2,5,10, as predicted by the linear filter theory (solid lines), versus stochastic numerical simulations at two different volumes, V = V₀ (circles) and V = 10V₀ (squares). The WK filter theory predicts the minimal Fano factor given by Eq. (21) (horizontal dashed lines). The system can be tuned to approach optimality near a particular A_opt obtained by the condition (filled circles).

The semi-quantitative agreement between the linearized theory and the simulation results displayed in Fig. 4 still leaves open the possibility that some type of nonlinear, discrete filter, not described by the experimentally fitted parameters of the TetR gene network, could perform better than the WK optimum at sufficiently small volumes. Fig. 5 plots both the WK value for the Fano factor (solid curve) and the rigorous lower bound of LVP (dashed curve) as a function of B (Eq. (21)). The above question can be posed as follows: is it possible to achieve a Fano factor that falls between the two curves by taking advantage of nonlinearity and discreteness? Ideally, one should do an optimization over all possible nonlinear regulatory functions that could describe feedback between the TetR protein and mRNA. In full generality, such an optimization appears intractable, but one can tackle a limited version of the nonlinear optimization. We will confine ourselves to Hill-like regulatory functions, which describe the experimental behavior of many cellular systems,³⁴ and explore whether it is possible to find any scenario where this type of nonlinear feedback outperforms the linear WK optimum. We consider the following generalized TetR feedback loop: with two Hill-like regulatory functions, involving arbitrary non-negative parameters A_i, n_i, θ_i, i = 1, 2. The original TetR system (Eq. (10)) is a special case of the equations above with:

Figure 5:

The Fano factor as a function of burst ratio B. The solid curve is the optimal result predicted by the WK linear theory, and the dashed curve is the rigorous lower bound derived by LVP.¹⁹ Symbols show numerical optimization for the generalized nonlinear TetR feedback system (Eq. (22)) at two volumes, V = V₀ and V = 0.1V₀.

The production function K_r(p) is a monotonically decreasing function of p, as is expected for negative feedback, while Γ_p(p) is monotonically increasing, a generalization of some regulatory network which effectively removes the TetR protein from the feedback loop (the role played by ATc binding in the experimental system). With these monotonicity constraints, there is always only one steady-state solution and to Eq. (22).

The optimization consists of searching for K_r(p) and Γ_p(p) that minimize the Fano factor . The following quantities are fixed during the search: the degradation rates γ_r, γ_p, the P production rate κ_p (or equivalently the burst ratio B = κ_p/γ_r), and the steady state values , . Note that in the general nonlinear case, the steady state values do not necessarily coincide with the mean values 〈r〉, 〈p〉, since the equilibrium distributions are generally asymmetric with respect to the steady state. Fixing and during the optimization is one way to set an overall copy number scale, to investigate the role of discreteness. It turns out that the optimization results described below end up being independent of and . In terms of the Hill function parameters, fixing and means setting A₁ and A₂ to the following values,

Thus the goal of optimization is to minimize over the four remaining free parameters: n₁, θ₁, n₂, θ₂.

In order to carry out this minimization, one needs an efficient procedure to calculate from Eq. (22), keeping both the full nonlinearity of the dynamical system, and the discreteness of the r(t) and p(t) populations. The system can always be simulated through the Gillespie algorithm,³² and accurate estimates of 〈r〉 and determined from sufficiently long trajectories. However this approach is too slow for searching over the four-dimensional parameter space, since each distinct set of parameters would require a separate long simulationun. An equivalent, faster alternative is to directly solve the system’s master equation for the steady state probability distribution, which then yields 〈r〉 and . The joint probability distribution P_r,p(t) of finding mRNAs and p proteins at time t is governed by the master equation,

The steady state distribution , is the solution obtained by setting to zero the right-hand side of the above equation, which we denote R _r,p:

The result is linear in the components for various r and p, and thus the set {R_rp = 0} for = 0,1,… and p = 0, 1,…, constitutes a linear system of equations for . The master equation can be solved by spectral methods, which are generally more efficient than brute force Gillespie simulations.³⁵ However we use a different approach, described below, to solve Eq. (27), which is sufficiently fast for our numerical optimization purposes. Since r and p can take on any integer values between 0 and ∞, we truncate the system to focus only on the non-negligible , in other words (r,p) within several standard deviations of the mean (〈r〉,〈p〉). Specifically, we keep only those equations R_rp = 0 which involve r_min ≤ r ≤ r_max and p_min ≤ p ≤ p_max. The largest truncation range required for accurate results was r_max - r_min = 100 and p_max - p_min = 50. All outside the range which appear in the truncated system of equations are set to a positive constant ∊ > 0. (The precise value of ∊ is unimportant since the distribution is subsequently normalized, and the truncation range is chosen large enough so that the boundary condition does not significantly affect the outcome.) The resulting finite linear system, which is sparse, can be efficiently solved using an unsymmetric-pattern multifrontal algorithm.³⁶ Knowing , we then directly calculate the moments of the distribution to find 〈r〉 and . The numerical accuracy of the procedure is verified by comparison to Gillespie simulation results.

In order to set a starting point for each round of nonlinear optimization, we use the following initialization procedure: we take the original TetR system at a given volume V and burst ratio B (fixing the Hill function parameters according to Eq. (24)) and find the ATc concentration A_min where is smallest, evaluating the Fano factor using the linear solver described above. The and at this concentration are then chosen to be fixed constants for the nonlinear optimization, where we vary the parameters n₁, θ₁, n₂, θ₂ from the initial values given by Eq. (24) to minimize . The minimization is carried out using Brent’s principal axis method,³⁷ which is feasible due to the fast evaluation of 〈r〉 and at each different parameter set through the linear solver.

Fig. 6 shows results of a typical minimization run, where the initial system is at volume V = V₀ with B = 10, with a corresponding A_min = 50 ng/mL. The dashed lines in Fig. 6(a) and (b) show the Hill functions K_r(p) and Γ_p(p) of the original TetR system at these parameter values, and the heat map in Fig. 6(c) represents the associated steady-state probability distribution . The dashed lines superimposed on the heat map are the loci of solutions to ṙ(t) = 0 and ṗ(t) = 0 (the right-hand sides of Eq. (22) set to zero), which intersect at the steady state . The Fano factor for this distribution, which represents the best the TetR system can perform given the experimentally fitted parameters, is . This is above the linear WK optimum for , and significantly larger than the rigorous LVP lower bound of . Once we relax the experimental constraints, and carry out the numerical minimization, the Fano factor decreases. The solid lines in Fig. 6(a) and (b) show K_r(p) and Γ_p(p) after several steps of the minimization algorithm, and Fig. 6(d) shows the corresponding . The Hill functions have become very steep steps around , while the average of the distribution 〈r〉 has been pushed above . The probabilities for p < p₀ become negligible, where is the largest integer value below . For p > p₀, rapidly decay to zero. The Fano factor, , approaches closer to the linear WK optimum, but is still above it. If we allow the minimization to proceed, these trends continue: at each iteration the Hill functions get steeper, 〈r〉 increases, for p < p₀ tends to zero, and approaches arbitrarily close to the linear WK optimum from above.

Figure 6:

Results for numerical optimization of the generalized nonlinear TetR feedback system of Eq. (22), with starting parameters B = 10 and V = V₀. (a) The mRNA production regulation function K_r(p) in its initial form before optimization (dashed curve), and after several steps of the minimization algorithm (solid curve). (b) Similar to (a), but showing the protein degradation function Γ_p(p). (c) Heat map of the steady-state probability distribution before optimization, corresponding to regulation governed by the dashed curves in the top panels. The nullclines ṙ(t) = 0 and ṗ(t) = 0 are superimposed. (d) Similar to (c), but after several steps of the minimization algorithm, corresponding to regulation governed by the solid curves in the top panels.

In fact, the same behavior is seen irrespective of the volume V and burst ratio B used to define the initial point of the optimization. Fig. 5 shows the results of nonlinear optimization for B = 2 - 10 at two volumes, V = V₀ and V = 0.1V₀. Even for the smallest volume, the nonlinear optimization results can get arbitrarily close to the WK optimum, but never do better. No generalized nonlinear system based on Hill function regulation brings us close to the theoretically possible LVP lower bound. This overall conclusion holds even when we change the functional form for the generalized feedback. We tried two alternatives: (i) using sigmoidal (logistic) functions instead of Hill functions; (ii) expanding K_r(p) and Γ_p(p) in a Taylor series around , truncating after the third order term, and minimizing with respect to the Taylor coefficients. In both cases numerical minimization of the Fano factor led to similar step-like behavior for K_r(p) and Γ_p(p), and the Fano factor tended to WK optimum from above.

From the distribution in Fig. 6(d) we see that the step-function limit leads to a system which is highly nonlinear along the p axis: in fact the gene network spends most of its time at p = p₀, just below the sudden change in regulation due to the steep Hill functions, and p > p₀ just above the sudden regulatory change. The feedback on the TetR mRNA population is mediated by p fluctuations between the two regimes, resulting in threshold-like regulatory behavior. Remarkably, despite this discrete, nonlinear character, the network can still approach the efficiency of an optimal WK linear filter. To gain a deeper understanding of how the step-like regulation can match WK optimality, we used the numerical optimization results described above to posit a limiting form of the nonlinear gene network that can be solved analytically (details in Appendix C). The analytic results explicitly show that we can asymptotically approach the WK optimum behavior from above, even in systems where the protein copy numbers are very small. Thus at least for a two-component TetR-like system regulated by biologically-realistic Hill functions, the constraint derived from the WK theory has a broader validity than one would guess from the underlying continuum, linear assumptions. It thus becomes an interesting and a non-trivial problem, left for future studies, to find an example of a gene network where the rigorous lower bound of LVP could be directly achieved.

Realizing optimality under the influence of extrinsic noise

Extrinsic noise is ubiquitous and hence must also be considered in any effective description of the control network. Inevitably, certain cellular components are not explicitly included in such a description, which in our case study could include RNA polymerase, ribosomes, and transcription factors that bind to the same promoter. Each of these components have their own stochastic characteristics and may contribute noise to a smaller or greater extent. Particularly for eukaryotes like yeast, the extrinsic noise contribution may be significantly larger than the intrinsic component.³⁸,³⁹ We adopt a simple model for the extrinsic noise based on earlier approaches,¹⁴,¹⁶ which assume that it is band-limited at a low frequency , where τ_e is on the order of the cell growth time scale. The justification is that higher frequency contributions to the extrinsic noise are filtered out by the gene circuits associated with its sources. This idea is consistent with the experimental observation of extrinsic noise in protein production in E. coli, which found long autocorrelation times for the extrinsic noise on the order of the cell cycle period.⁴⁰

For the TetR system, our theory is extended to the extrinsic noise case in Appendix D, with the results illustrated in Fig. 7. The outcome is that a given TetR gene circuit, tuned appropriately such that A = A_opt, can act as a WK filter for an entire family of extrinsic noise scenarios. A single set of parameters can approximately represent the optimal solution for a variety of extrinsic inputs. This makes the WK concept a versatile design tool for noise suppression in biological systems: the same control network can act with maximum efficiency in a variety of different contexts. It is possible that the requirement of adaptability to a wide range of conditions has resulted in the evolution of control networks acting as WK filters. It remains to be seen whether nature has exploited this feature in vivo.

Figure 7:

Comparison of simulation and theory results based on the dynamical model (Eq. (10)) of the yeast synthetic gene circuit,²⁴ in the presence of extrinsic noise given by Eq. (73). All quantities are plotted as a function of extracellular ATc concentration A for the burst ratio B = 5. Each set of curves shows the Fano factor , as predicted by the linear filter theory (solid lines), versus stochastic numerical simulations at two different volumes, V = V₀ = 60 fL (circles) and V = 10V₀ (squares). The two sets correspond to noise magnitudes c_p = 80, c_r = 23 and c_p = 160, c_r = 46. In both cases c_r and c_p are related through the condition in Eq. (83), and the minimal Fano factor predicted by WK filter theory (horizontal dashed lines) is modified as shown in Eq. (84). The system can be tuned to approach optimality near a particular A_opt obtained by the condition (filled circles).

Conclusion

The TetR feedback loop is a concrete example of how a WK filter can be implemented in a gene network driven by a complex set of biochemical reaction rates, but the overall approach outlined here has far reaching implications, thus highlighting the appeal of engineering paradigms in biology.⁴¹ With the entire network complexity encoded in a handful of response functions, we can derive fundamental limits and design principles governing biological regulation. The key step is to map the linear response picture onto a signal estimation problem, whose solution is given by WK theory. This idea allows us to predict the dynamic properties of the feedback pathway required to optimally filter noise in a broad class of negative feedback circuits. As already demonstrated in earlier works,²²,²³ the mapping, and the potential utility of the WK approach, is not unique to the negative feedback loop. Another important byproduct of the theory is that the behavior of gene circuits away from optimality can also be predicted. In this sense, our practical approach goes beyond just obtaining rigorous bounds, and allows us to characterize how close or far gene networks are from optimality for biologically relevant parameters.

We have derived response functions by linearizing a minimal model extracted from experimental observations, but it is also possible to directly apply small perturbations to a system, and measure the resulting time-dependent changes in populations of species. Recently, the yeast hyperosmolar signaling pathway has been probed by perturbations in the form of salt shocks.⁴²^-⁴⁴ Despite the underlying complex nonlinear network, the details of which are not completely characterized, a linear response description quantitatively captures the frequency-dependent behavior of the pathway over a wide range of inputs. E. Coli chemotaxis signaling also exhibits a linear regime,⁴⁵ where the fluctuation-dissipation relationship between the system’s unperturbed behavior and its reaction to external stimuli has been explicitly verified.

Linear response functions can thus become a fundamental tool in analyzing biochemical circuits, analogous to their established role in control engineering and signal processing. More extensive experimental measurements will be critical in this effort, in order to ascertain how varied the response relationships between regulatory components are in nature. Once we understand the essential dynamical building blocks out of which complex biological function is realized, we can map out the hidden constraints that control the behavior of living systems.

Graphical TOC Entry

Acknowledgement

This work was done while the authors were in the Institute for Physical Sciences and Technology in the University of Maryland, College Park. We are grateful to C. Güven, G. Reddy, Z. Zhang, and P. Zhuravlev for useful discussions. This work was supported by a grant from the National Science Foundation (CHE 13-61946).

Appendix A: Derivation of the optimal WK filter

In this section we derive Eqs. (8) and (9) in the main text. They describe the output variance and the linear filter H_opt(ω) that minimizes , which are the main quantities in the Wiener-Kolmogorov theory.

Output variance in terms of signal and noise power spectra P_s(ω) and P_n(ω)

From Eq. (4), which defines the signal s(ω) and estimate in the frequency domain, the Fourier transformed output δr(ω) for any H(ω) can be rewritten as,

In the time domain, s(ω) = -n_r(ω)/(G_rr(ω)+iω), is a convolution of the noise function n_r(t), and n(ω) = n_p(ω)/G_pr(ω) is a convolution of n_p(t). So long as the noise functions n_r(t) and n_p(t) are uncorrelated, s(t) and n(t) are also uncorrelated, so the frequency domain average . (The theory can also be generalized to correlated noise sources, but for simplicity we consider only the uncorrelated case.) As a result, the correlation , related to the output power spectrum P_δr(ω), can be written in terms of P_s(ω) and P_n(ω), the individual power spectra of the signal and noise:

In the above equation we have used the definition of the power spectrum, i.e. , and the relation H(-ω) = H*(ω) since H(ω) is the Fourier transform of a real function H(t). The power spectrum P_δr(ω) is the Fourier transform of the time autocorrelation function :

At t = 0, the autocorrelation function gives us the variance : which is Eq. (8) in the main text.

Minimizing over all causal H(ω) yields the optimal WK filter H_opt(ω)

The convolution of the filter function H(t) on the corrupted signal s(t) + n(t) must satisfy causality. The filter can only operate on the past history of s(t) + n(t), so H(t) = 0 for t<0. In the frequency domain, enforcing causality restricts H(ω) to have certain general properties as a function of complex ω:²⁹ it can have no poles or zeros in the upper half-plane Imω>0. Equivalently, the real and imaginary parts of H(ω) evaluated at real ω must satisfy the well-known Kramers-Kronig relation: where P is the Cauchy principal value of the integral. The goal of WK optimization is to minimize in Eq. (31) over all possible causal functions H(ω), given the power spectra P_s(ω) and P_n(ω).

Assume such an optimum H_opt(ω) exists, with the corresponding minimal variance . Let us add a small perturbation, H(ω) = H_opt(ω) + δH(ω), where δH(ω) is also a causal function of complex ω. From Eq. (31), the resulting variance change , to lowest order in δH(ω), is: where

For H_opt(ω) to be the WK optimum, in Eq. (33) must be zero for any causal perturbation δH(ω).

Out of all possible causal perturbations, we will focus on one with the specific form: where Imω₀ = 0 and A, ∊>0. It has no zeros, and the only pole, ω = ω₀ - i∊, is in the lower half-plane, so δH(ω) is causal. We will be interested in the limit as this pole approaches the real axis, ∊ → 0⁺, where the real and imaginary parts of δH(ω) are,

Substituting these into Eq. (33) for , we find that the optimality condition implies the following relation between the real and imaginary parts of F_opt(ω):

This has the same form as the Kramers-Kronig relation in Eq. (32), with the important difference of a minus sign in front. Consequently, F_opt(ω) must be anticausal, which we define as a function with no poles or zeros in the lower complex ω half-plane.

In order to use this result to derive a solution for H_opt(ω), we define two types of decompositions, described briefly in the main text. In practice, all the frequency domain power spectral density and filter functions we work with in the linear response formalism are mero-morphic functions over the complex ω plane. Any meromorphic function F(ω) can be written as a partial fraction expansion of the form F(ω) = Σ_{n, k} c_ik/(ω - ω_n)^k, where {ω_n} is the set of poles of F(ω), and c_ik are constants. Most generally, the expansion could include a polynomial term, but the functions F(ω) we encounter have well-defined inverse Fourier transforms, which require |F(ω)| → 0 as |ω| → ∞ (decay at least as fast as 1/|ω|). Thus, all the terms in the expansion are of the form c_ik/(ω - ω_n)^k, and we can segregate them according to whether the pole ω_n is in the upper half plane. The causal part {F(ω)}_c is defined as all those terms where ω_n is not in the upper half plane, and the anticausal part {F(ω)}_ac contains the remaining terms in the expansion. The overall function F(ω) = {F(ω)}_c + {F(ω)}_ac.

The second type of decomposition, an example of Wiener-Hopf factorization,²⁰ concerns power spectral density functions like P_y(ω), which are meromorphic and also real-valued on the real ω axis. Let us factor P_y(ω) as the product of two meromorphic functions, . The function contains all the zeros and poles in P_y(ω) which are not in the upper half plane. Such a decomposition is always possible, since a meromorphic function can always be written as a ratio of two holomorphic functions. Hence, the numerator and denominator of P_y(ω) can be decomposed individually into a product of elementary factors by the Weierstrass factorization theorem, with each factor containing a single zero. Because P_y(ω) is real for real ω, so P_y(ω)^* = P_y(ω) when Imω = 0. Thus, . Since for real ω has all its zeros and poles in the upper half plane, we must have , and similarly . By appropriately absorbing an overall constant into , we can factor P_y(ω) as .

With these decompositions defined, we return now to the condition in Eq. (37), which shows that F_opt(ω) is anticausal. Thus, its causal part in the additive decomposition must be zero, {F_opt(ω)}_c = 0. From the definition of F_opt(ω), Eq. (34), it follows that where P_y(ω) = P_s(ω) + P_n(ω) is the power spectrum of the noise-corrupted signal y(t) = s(t) + n(t). Equivalently, since we can substitute {F(ω)}_c = F(ω) - {F(ω)}_ac for any F(ω), the optimality condition can be rewritten as:

Divide both sides of Eq. (39) by , and then take the causal additive part {·}_c of both sides. The result is:

The second terms on both the left and right hand sides are the causal parts of a ratio between two anticausal functions. Since a ratio of anticausal functions is also anticausal, these terms are zero. On the left hand side the first term , since H_opt(ω) and are causal, and hence their product is also causal. Making these simplifications, we can then solve for H_opt(ω) as: which is the optimal WK filter result shown as Eq. (9) in the main text.

Appendix B: Linear response and noise filter analysis for a regulatory cascade

As an example of how our theory generalizes to control networks with multiple mediator species, we will consider the case where the feedback loop consists of a regulatory cascade. We will still explicitly single out a target species R and a mediator P, but now the signaling pathway which communic rates changes from R to P will be more complicated, consisting of a cascade of N species Uj, j = 1,…, N, with populations u_j. The production of the j^th species will depend on the population of the (j - 1)^th species (with j = 0 corresponding to), and P will depend on the last member of the cascade, U_N. In terms of Fourier-transformed fluctuations δu_j, the dynamical equations for the pathway have the form:

Thus the dynamics includes three parts: (i) the self-responses G_{u_j}_{u_j} which we can assume in the simplest case to be given by the inverse decay lifetimes of the species, ; (ii) the cross-response terms G_{u_j}_{u_j-1} which describe how the j_th member of the cascade is related to the (j - 1)th member; (iii) the stochastic noise terms n_uj. To complete the description of the feedback loop, we specify the equations for R and P:

Instead of the simple cross-response G_pr from R to P, P is influenced by the final species of the U_j pathway through G_{pu_N}.

The regulatory cascade system described by Eqs. (42)-(43) can in fact be simplified extensively, by solving for the dynamics of the mediator species U_j and substituting the results into Eq. (43). This yields equations for R and P which have the same form as in the two-species case in the main text, but with an effective cross-response function and noise term , where:

In this effective two-species reduction of the full system, all the stochastic effects of the mediators in the U_i pathway enter in as “extrinsic” noise contributions to . This is a particular example that shows how extrinsic noise encapsul rates the stochastic influence of all the species that are not explicitly specified in the dynamical equations.

The mapping of the two-species system onto the noise filter formalism, and the calculation of the optimal filter, can be carried out by the methods outlined in the main text. While this in general results in a more complicated problem than the simple example analyzed in the main text, in one scenario the noise filter optimization problem for the cascade is relatively straightforward: (i) we assume linear production functions for all U_j, so the cross-responses are constants in frequency space, G_{u_j}_{u_j-1}(ω) = κ_{u_j}. Similarly, the P production function is κ_pu_N(t), so G_{pu_N}(ω) = κ_p. (ii) We assume the decay timescales of all the cascade species are negligible, τ_{u_j} ≪ τ_r, so we can take the limits τ_{u_j} → 0 in Eq. (45). However, the products κ_{u_j}τ_{u_j} remain finite for all j, since from the equilibrium conditions of the cascade (balance of production and destruction), they are related to ratios of the steady-state populations ū_j:

Hence rapid decay goes hand in hand with fast production. This is the same type of serial cascade analyzed in Ref. 19, where it was shown to maximize information transfer along the pathway. (iii) Finally, we assume that each species in the original, full description of the system is subject only to intrinsic noise, so the noise functions are given by: where the η_α(ω) for different α are independent Fourier-transformed Gaussian white noise functions.

With these assumptions the effective cross-response and noise functions in Eq. (45) become: where the P burst ratio is analogous to B in the main text, i.e. the average number of P molecules produced per R during the time interval τ_r. Similarly the burst ratio is the average number of U_k molecules produced per R during τ_r.

The resulting signal and noise power spectra within the filter formalism are: where

Since the power spectra in Eq. (49) have the same form as Eq. (12), with B replaced by B_eff, all the subsequent optimality results are identical, but expressed in terms of the effective total burst ratio B_eff of the signaling pathway. This agrees with the effective burst ratio for the cascade derived by the information theory approach in Ref. 19, under the assumptions of rapid production/decay outlined above. Physically, this result implies that B_eff will be dominated by the smallest values among the B and B_{u_k}. Hence, the efficiency of the noise filtration in the cascade is limited by the weakest links.

Appendix C: Analytic limiting form of the generalized nonlinear feedback network

We will use the numerical optimization results described in the main text for the generalized nonlinear TetR feedback network (Eq. (22)) to derive a limiting form of the system that can be solved analytically. Since the optimization algorithm results in steep step-like functions K_r(p) and Γ_p(p) with thresholds at , let us assume that optimal limit for these Hill functions looks like: where the Heaviside step function Θ (x) = 0 for x<0 and Θ (x) = 1 for x>0. The plateau heights are assumed to be large, with a well defined ratio . Since and thus Γ_p(p) acts as the dominant protein degradation term, we will set γ_p = 0 for simplicity. (This has negligible effect on the resulting , particularly since h was already the longest time scale in the system.)

Under these assumptions, we would like to find an analytical steady-state probability distribution which satisfies R_rp = 0 from Eq. (27) for all r, p>0. We cannot solve the system of equations directly, but we will introduce an ansatz for and verify that it is a solution to Eq. (27). The first part of the ansatz is trivial: we assume for . This satisfies R_rp = 0 for p<p₀ exactly, regardless of the values of at p ≥ p₀. To motivate the second part of the ansatz, which covers the p ≥ p₀ region, we need some more information about the moments of the distribution. This can be gathered by defining the generating function,

Summing the steady-state conditions R_rp = 0 in Eq. (27) over all r>0, p≥p0, we obtain an equation that can be expressed in terms of F: where . Taking the z₁ derivative of Eq. (53), and evaluating the result at z₁ = z1, z2 = 1, gives:

Similarly, differentiating Eq. (53) with respect to z₂ yields:

Using the fact that F(1, 1) = 1 from the normalization of , and F^(1,0)(1,1) = 〈r〉, from the definition of the generating function in Eq. (52), we can use Eqs. (54) and (55) to find:

Thus we have an analytical expression for 〈r〉, one of the moments necessary for calculating the Fano factor. If we proceed to the next order of derivation, applying , and ∂_z₁∂_z₂ on Eq. (53) and evaluating at z₁ = 1, z₂ = 1, we can extract from these three equations the following moment relations: where ∆ is defined as

Thus the Fano factor , but unfortunately we do not have an explicit solution for ∆ from the generating function approach. (Higher order partial derivatives of Eq. (53) do not form a closed system of equations.) However, the moment relations in Eq. (57) will prove useful below.

From Eq. (56) we note that 〈r〉 → ∞ as , so the distribution is pushed toward larger r as the step functions become steeper, just as we saw in the numerical optimization (Fig. 6). In the large r limit, we can approximate , as a continuous function of r (though it remains discrete in p). Based on the numerical optimization results, we choose the following Gaussian ansatz for , the first non-negligible p slice of the distribution:

The parameters λ₀ and s₀ are to be determined, while A₀ must be chosen to satisfy from Eq. (56). In the continuum, large limit we can approximate the sum as which implies that

Similarly, Eq. (58) gives so finding λ₀ is equivalent to finding ∆.

Let us now show that the ansatz of Eq. (59) yields a solution for p≥p₀ that satisfies Eq. (27) in the large limit. Using Eq. (51) and the continuum approximation along the direction, we canewrite Eq. (27) for p≥p₀ as

Plugging the ansatz for from Eq. (59) into Eq. (62) for p = p₀, we can solve for

Similarly, once are known, Eq. (62) for p = p₀ + 1 yields

We can iterate this procedure, using Eq. (62) to generate analytical expressions for all , m> 0, which depend on the unknown parameters λ₀ and s₀. To solve for these parameters, let us first enforce the normalization condition,

Though tedious, the integrals on the right-hand side of Eq. (65) can be explicitly carried out for each m, since has the form of a Gaussian exp(-(r - λ₀)²/(2s₀)) times a polynomial in r. Since we are interested in the large limit, we can Taylor expand the integrals up to first order in the small variable , which gives the following result: where , and we have used Eq. (61) to write λ₀ = 〈r〉 - ∆, and Eq. (56) for 〈r〉. Plugging Eq. (66) into Eq. (65) and carrying out the sum over m, the normalization condition becomes

Thus the term of order on the right must be zero, implying the following relation between and ∆,

In order to complete the derivation and solve for ∆, we need to calculate the moment 〈p-p₀〉,

Plugging in Eq. (66) for the integral, we carry out the sum over m and simplify using Eq. (68), giving

Setting this equal to the 〈p-p₀〉 result from Eq. (57), we finally can solve for ∆, or equivalently the Fano factor keeping the leading terms in the Taylor expansion for small . The Fano factor achieves a minimum value equal to the WK linear optimum, at , where B = κ_p/γ_r. Thus we see explicitly that nonlinear threshold regulation with K_r(p) and Γ_p(p) behaving like step functions can directly match (but not improve on) the efficiency of the optimal WK linear filter, so long as is large and the ratio of the step function heights assumes a particular value ξ_min. Counter intuitively, this occurs despite the fact that the p copy numbers can be very small in our system, with a narrow range of fluctuations in which discreteness plays a major role.

Appendix D: Optimality for the TetR gene network under extrinsic noise

In the frequency domain, we will model , the extrinsic part of the noise associated with species α using, where c_α is a coefficient measuring the strength of the noise, and η^ext(ω) is a Fourier-space Gaussian white noise function. Comparing to the definition of the intrinsic noise, , we see that c_α is the ratio of the extrinsic to intrinsic noise PSD for species α at ω = 0. The (1 - iωτ_e)⁻¹ factor acts as a cutoff that suppresses frequencies . The total noise function for species α is the sum of intrinsic and extrinsic contributions, . We will focus on how the addition of extrinsic noise affects the optimality conditions using the TetR yeast gene circuit example.

The calculation of H_opt(ω) proceeds analogously to the no-extrinsic-noise procedure described in the main text. The power spectra of the signal and noise are,

The first and second terms in the square brackets represent the intrinsic and extrinsic contributions respectively. The latter is parameterized by the coefficients c_r and c_p, and the timescale τ_e, which is assumed to be much larger than the dominant timescale, τ_r, characterizing the R fluctuations. The signal plus noise power spectrum, P_y(ω) = P_s(ω) + P_n(ω), can be rewritten as a causal decomposition in the following manner: where ∊ ≡ τ_r/τ_e, and

The expression and its additive causal decomposition is given by:

Using Eqs. (78) and (75) in Eq. (9), we obtain the form for the optimal filter function: where

Since ∊ is presumed small, we will expand H_opt to lowest order in ∊, giving the approximate expression:

The first rational term is just the optimal filter result in the intrinsic-only case, Eq. (17), while the second term represents the modification needed to accommodate the extrinsic noise. As expected, the latter term approaches 1 when c_r, c_p → 0, since v → 1 + B in this limit.

There is a different non-trivial scenario where the second term is equal to 1. If the noise magnitudes c_r and c_p are related such that, then the numerator and denominator exactly cancel each other out, removing the τ_e dependence from the optimal filter. Using the definition v = 1 + B(1 + c_r) + c_p, Eq. (82) can be simplified to yield the relation:

If this condition is satisfied, H_opt(ω) is identical to the intrinsic-only optimal filter of Eq. (17) (to lowest order in ∊), and hence the approximate optimality is also achieved at the same feedback value, .

Thus, the yeast gene circuit can still be fine-tuned to approach a WK optimal filter even in the presence of extrinsic noise. However, this tuning requires the relative strengths c_r and c_p of the R and P extrinsic noise to be related (at least approximately) by Eq. (83). The resulting minimal possible Fano factor is:

This is the intrinsic-only result of Eq. (21) in the main text plus an extrinsic noise contribution in the second term. Not surprisingly, with more total noise in the system, the standard deviation of the optimally filtered output increases. Since the second term is of the order τ_r/τ_e it follows that the bigger the difference in time scales between the extrinsic noise (τ_e) and the mRNA dynamics (τ_r), the easier it is to filter out the extrinsic influence on the mRNA fluctuations. For B ≫ 1, the fundamental limit on the noise suppression still arises from the intrinsic term in , which scales like ~ B^−1/2; the extrinsic contribution decays more rapidly, ~ B⁻¹.

The blue curves in Fig. 7 show the linear theory predictions for as a function of A in two cases: (i) c_p = 80, c_r = 23; (ii) c_p = 160, c_r = 46. The burst ratio B = 5, and τ_e is set equal to , the longest time scale among the experimentally fitted parameters. For both these cases the noise strengths c_p and c_r satisfy the relation in Eq. (83), and hence it is possible to tune the system to approximately achieve WK optimality, just as in the intrinsic-only scenario. The noise magnitudes were chosen so that the system is noticeably perturbed by the extrinsic contribution. For example, if the signal s(t) is split into intrinsic and extrinsic parts s^int(t) and s^ext(t), the ratios of their respective standard deviations are for case (i) and 1.6 for case (ii). The value of is marked by horizontal dashed lines, and the point A = A_opt, where is satisfied, by a filled circle. In all cases the system approaches near A = A_opt, verifying the optimality prediction.

As in the intrinsic-only scenario discussed in the main text, we can test the usefulness of the linear theory through Gillespie simulations (results shown as open squares and circles in Fig. 7), and reach a similar conclusion even in the presence of extrinsic noise. At large volumes, V = 10V₀, the simulations converge to the linear theory, whereas for the more realistic volume V = V₀ we see discrepancies due to nonlinearity and low copy numbers (V₀ = 60 fL). Nevertheless, the Fano factor still reaches a minimum close to the predicted A_opt and values.

References

(1).↵
Becskei, A.; Serrano, L. Engineering stability in gene networks by autoregulation. Nature 2000, 405, 590–593.
OpenUrl CrossRef PubMed Web of Science
(2).↵
Thattai, M.; van Oudenaarden, A. Intrinsic noise in gene regulatory networks. Proc. Natl. Acad. Sci. U. S. A. 2001, 98, 8614–8619.
OpenUrl Abstract/FREE Full Text
(3).
Swain, P. S.; Elowitz, M. B.; Siggia, E. D. Intrinsic and extrinsic contributions to stochasticity in gene expression. Proc. Natl. Acad. Sci. U. S. A. 2002, 99, 12795–12800.
OpenUrl Abstract/FREE Full Text
(4).
Ozbudak, E. M.; Thattai, M.; Kurtser, I.; Grossman, A. D.; van Oudenaarden, A. Regulation of noise in the expression of a single gene. Nature Genet. 2002, 31, 69–73.
OpenUrl CrossRef PubMed Web of Science
(5).
Elowitz, M. B.; Levine, A. J.; Siggia, E. D.; Swain, P. S. Stochastic gene expression in a single cell. Science 2002, 297, 1183–1186.
OpenUrl Abstract/FREE Full Text
(6).↵
Paulsson, J. Summing up the noise in gene networks. Nature 2004, 427, 415–418.
OpenUrl CrossRef PubMed Web of Science
(7).↵
Arias, A. M.; Hayward, P. Filtering transcriptional noise during development: concepts and mechanisms. Nat. Rev. Genet. 2006, 7, 34–44.
OpenUrl CrossRef PubMed Web of Science
(8).↵
Tsang, J.; Zhu, J.; van Oudenaarden, A. MicroRNA-mediated feedback and feedforward loops are recurrent network motifs in mammals. Mol. Cell 2007, 26, 753–767.
OpenUrl CrossRef PubMed Web of Science
(9).↵
Andrews, B. W.; Yi, T.-M.; Iglesias, P. A. Optimal noise filtering in the chemotactic response of Escherichia coli. PLoS Comput. Biol. 2006, 2, 1407–1418.
OpenUrl
(10).↵
Newman, J. R. S.; Ghaemmaghami, S.; Ihmels, J.; Breslow, D. K.; Noble, M.; De-Risi, J. L.; Weissman, J. S. Single-cell proteomic analysis of S-cerevisiae reveals the architecture of biological noise. Nature 2006, 441, 840–846.
OpenUrl CrossRef PubMed Web of Science
(11).↵
Fraser, H. B.; Hirsh, A. E.; Giaever, G.; Kumm, J.; Eisen, M. B. Noise minimization in eukaryotic gene expression. PLoS Biol. 2004, 2, 834–838.
OpenUrl Web of Science
(12).↵
Lehner, B. Selection to minimise noise in living systems and its implications for the evolution of gene expression. Mol. Syst. Biol. 2008, 4, 170.
OpenUrl Abstract/FREE Full Text
(13).↵
Simpson, M. L.; Cox, C. D.; Sayler, G. S. Frequency domain analysis of noise in au-toregulated gene circuits. Proc. Natl. Acad. Sci. U. S. A. 2003, 100, 4551–4556
OpenUrl Abstract/FREE Full Text
(14).↵
Austin, D. W.; Allen, M. S.; McCollum, J. M.; Dar, R. D.; Wilgus, J. R.; Sayler, G. S.; Samatova, N. F.; Cox, C. D.; Simpson, M. L. Gene network shaping of inherent noise spectra. Nature 2006, 439, 608–611.
OpenUrl CrossRef PubMed Web of Science
(15).↵
Dublanche, Y.; Michalodimitrakis, K.; Kummerer, N.; Foglierini, M.; Serrano, L. Noise in transcription negative feedback loops: simulation and experimental analysis. Mol. Syst. Biol. 2006, 2, 41.
OpenUrl Abstract/FREE Full Text
(16).↵
Cox, C. D.; McCollum, J. M.; Austin, D. W.; Allen, M. S.; Dar, R. D.; Simpson, M. L. Frequency domain analysis of noise in simple gene circuits. Chaos 2006, 16, 026102.
OpenUrl CrossRef PubMed
(17).↵
Zhang, J. J.; Yuan, Z. J.; Zhou, T. S. Physical limits of feedback noise-suppression in biological networks. Phys. Biol. 2009, 6, 046009.
OpenUrl PubMed
(18).↵
Singh, A.; Hespanha, J. P. Optimal Feedback Strength for Noise Suppression in Au-toregulatory Gene Networks. Biophys. J. 2009, 96, 4013–4023.
OpenUrl CrossRef PubMed Web of Science
(19).↵
Lestas, I.; Vinnicombe, G.; Paulsson, J. Fundamental limits on the suppression of molecular fluctuations. Nature 2010, 467, 174–178.
OpenUrl CrossRef PubMed Web of Science
(20).↵
Wiener, N. Extrapolation, Interpolation and Smoothing of Stationary Times Series; Wiley: New York, 1949.
(21).↵
Kolmogorov, A. N. Interpolation and extrapolation of stationary random sequences. Izv. Akad. Nauk SSSR., Ser. Mat. 1941, 5, 3–14.
OpenUrl
(22).↵
Hinczewski, M.; Thirumalai, D. Cellular signaling networks function as generalized Wiener-Kolmogorov filters to suppress noise. Phys. Rev. X 2014, 4, 041017.
OpenUrl
(23).↵
Becker, N. B.; Mugler, A.; ten Wolde, P. R. Optimal prediction by cellular signaling networks. Phys. Rev. Lett. 2015, 115, 258103.
OpenUrl
(24).↵
Nevozhay, D.; Adams, R. M.; Murphy, K. F.; Josic, K.; Balazsi, G. Negative autoregulation linearizes the dose-response and suppresses the heterogeneity of gene expression. Proc. Natl. Acad. Sci. U. S. A. 2009, 106, 5123–5128.
OpenUrl Abstract/FREE Full Text
(25).↵
Schwanhaeusser, B.; Busse, D.; Li, N.; Dittmar, G.; Schuchhardt, J.; Wolf, J.; Chen, W.; Selbach, M. Global quantification of mammalian gene expression control. Nature 2011, 473, 337–342.
OpenUrl CrossRef PubMed Web of Science
(26).↵
Gillespie, D. T. The chemical Langevin equation. J. Chem. Phys. 2000, 113, 297–306.
OpenUrl CrossRef Web of Science
(27).↵
de Ronde, W. H.; Tostevin, F.; ten Wolde, P. R. Effect of feedback on the fidelity of information transmission of time-varying signals. Phys. Rev. E 2010, 82, 031914.
OpenUrl
(28).↵
Bode, H. W.; Shannon, C. E. A simplified derivation of linear least square smoothing and prediction theory. Proc. Inst. Radio. Engin. 1950, 38, 417–425.
(29).↵
Chaikin, P. M.; Lubensky, T. C. Principles of condensed matter physics; Cambridge University Press, 1995.
(30).↵
Cai, L.; Friedman, N.; Xie, X. S. Stochastic protein expression in individual cells at the single molecule level. Nature 2006, 440, 358–362.
OpenUrl CrossRef PubMed Web of Science
(31).
Garcia-Martinez, J.; Aranda, A.; Perez-Ortin, J. E. Genomic run-on evaluates transcription rates for all yeast genes and identifies gene regulatory mechanisms. Mol. Cell 2004, 15, 303–313.
OpenUrl CrossRef PubMed Web of Science
(32).↵
Gillespie, D. T. Exact stochastic simulation of coupled chemical-reactions. J. Phys. Chem. 1977, 81, 2340–2361.
OpenUrl CrossRef Web of Science
(33).↵
Jorgensen, P.; Nishikawa, J. L.; Breitkreutz, B. J.; Tyers, M. Systematic identification of pathways that couple cell growth and division in yeast. Science 2002, 297, 395–400.
OpenUrl Abstract/FREE Full Text
(34).↵
Alon, U. An Introduction to Systems Biology: Design Principles of Biological Circuits; Chapman and Hall/CRC, 2006.
(35).↵
Mugler, A.; Walczak, A. M.; Wiggins, C. H. Spectral solutions to stochastic models of gene expression with bursts and regulation. Phys. Rev. E. 2009, 80, 041921.
OpenUrl
(36).↵
Davis, T. A. Algorithm 832: UMFPACK V4.3—an unsymmetric-pattern multifrontal method. ACM Trans. Math. Software 2004, 30, 196–199.
OpenUrl
(37).↵
Brent, R. Algorithms for Minimization without Derivatives; Dover, 2002.
(38).↵
Raser, J. M.; O’Shea, E. K. Control of stochasticity in eukaryotic gene expression. Science 2004, 304, 1811–1814.
OpenUrl Abstract/FREE Full Text
(39).↵
Volfson, D.; Marciniak, J.; Blake, W. J.; Ostroff, N.; Tsimring, L. S.; Hasty, J. Origins of extrinsic variability in eukaryotic gene expression. Nature 2006, 439, 861–864.
OpenUrl CrossRef PubMed
(40).↵
Rosenfeld, N.; Young, J. W.; Alon, U.; Swain, P. S.; Elowitz, M. B. Gene regulation at the single-cell level. Science 2005, 307, 1962–1965.
OpenUrl Abstract/FREE Full Text
(41).↵
Csete, M. E.; Doyle, J. C. Reverse engineering of biological complexity. Science 2002, 295, 1664–1669.
OpenUrl Abstract/FREE Full Text
(42).↵
Mettetal, J. T.; Muzzey, D.; Gomez-Uribe, C.; van Oudenaarden, A. The frequency dependence of osmo-adaptation in Saccharomyces cerevisiae. Science 2008, 319, 482–484.
OpenUrl Abstract/FREE Full Text
(43).
Hersen, P.; McClean, M. N.; Mahadevan, L.; Ramanathan, S. Signal processing by the HOG MAP kinase pathway. Proc. Natl. Acad. Sci. U. S. A. 2008, 105, 7165–7170.
OpenUrl Abstract/FREE Full Text
(44).↵
Muzzey, D.; Gomez-Uribe, C. A.; Mettetal, J. T.; van Oudenaarden, A. A Systems-Level Analysis of Perfect Adaptation in Yeast Osmoregulation. Cell 2009, 138, 160–171.
OpenUrl CrossRef PubMed Web of Science
(45).↵
Park, H.; Pontius, W.; Guet, C. C.; Marko, J. F.; Emonet, T.; Cluzel, P. Interdependence of behavioural variability and response to small stimuli in bacteria. Nature 2010, 468, 819–U114.
OpenUrl CrossRef PubMed Web of Science

View the discussion thread.

Posted April 20, 2016.

Download PDF

Citation Tools

Subject Area

Biophysics

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11718)
Bioengineering (8724)
Bioinformatics (29132)
Biophysics (14936)
Cancer Biology (12051)
Cell Biology (17360)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14146)
Epidemiology (2067)
Evolutionary Biology (18269)
Genetics (12223)
Genomics (16768)
Immunology (11844)
Microbiology (28016)
Molecular Biology (11560)
Neuroscience (60822)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10401)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] (1).↵
Becskei, A.; Serrano, L. Engineering stability in gene networks by autoregulation. Nature 2000, 405, 590–593.
OpenUrl CrossRef PubMed Web of Science

[2] (2).↵
Thattai, M.; van Oudenaarden, A. Intrinsic noise in gene regulatory networks. Proc. Natl. Acad. Sci. U. S. A. 2001, 98, 8614–8619.
OpenUrl Abstract/FREE Full Text

[3] (3).
Swain, P. S.; Elowitz, M. B.; Siggia, E. D. Intrinsic and extrinsic contributions to stochasticity in gene expression. Proc. Natl. Acad. Sci. U. S. A. 2002, 99, 12795–12800.
OpenUrl Abstract/FREE Full Text

[4] (4).
Ozbudak, E. M.; Thattai, M.; Kurtser, I.; Grossman, A. D.; van Oudenaarden, A. Regulation of noise in the expression of a single gene. Nature Genet. 2002, 31, 69–73.
OpenUrl CrossRef PubMed Web of Science

[5] (5).
Elowitz, M. B.; Levine, A. J.; Siggia, E. D.; Swain, P. S. Stochastic gene expression in a single cell. Science 2002, 297, 1183–1186.
OpenUrl Abstract/FREE Full Text

[6] (6).↵
Paulsson, J. Summing up the noise in gene networks. Nature 2004, 427, 415–418.
OpenUrl CrossRef PubMed Web of Science

[7] (7).↵
Arias, A. M.; Hayward, P. Filtering transcriptional noise during development: concepts and mechanisms. Nat. Rev. Genet. 2006, 7, 34–44.
OpenUrl CrossRef PubMed Web of Science

[8] (8).↵
Tsang, J.; Zhu, J.; van Oudenaarden, A. MicroRNA-mediated feedback and feedforward loops are recurrent network motifs in mammals. Mol. Cell 2007, 26, 753–767.
OpenUrl CrossRef PubMed Web of Science

[9] (9).↵
Andrews, B. W.; Yi, T.-M.; Iglesias, P. A. Optimal noise filtering in the chemotactic response of Escherichia coli. PLoS Comput. Biol. 2006, 2, 1407–1418.
OpenUrl

[10] (10).↵
Newman, J. R. S.; Ghaemmaghami, S.; Ihmels, J.; Breslow, D. K.; Noble, M.; De-Risi, J. L.; Weissman, J. S. Single-cell proteomic analysis of S-cerevisiae reveals the architecture of biological noise. Nature 2006, 441, 840–846.
OpenUrl CrossRef PubMed Web of Science

[11] (11).↵
Fraser, H. B.; Hirsh, A. E.; Giaever, G.; Kumm, J.; Eisen, M. B. Noise minimization in eukaryotic gene expression. PLoS Biol. 2004, 2, 834–838.
OpenUrl Web of Science

[12] (12).↵
Lehner, B. Selection to minimise noise in living systems and its implications for the evolution of gene expression. Mol. Syst. Biol. 2008, 4, 170.
OpenUrl Abstract/FREE Full Text

[13] (13).↵
Simpson, M. L.; Cox, C. D.; Sayler, G. S. Frequency domain analysis of noise in au-toregulated gene circuits. Proc. Natl. Acad. Sci. U. S. A. 2003, 100, 4551–4556
OpenUrl Abstract/FREE Full Text

[14] (14).↵
Austin, D. W.; Allen, M. S.; McCollum, J. M.; Dar, R. D.; Wilgus, J. R.; Sayler, G. S.; Samatova, N. F.; Cox, C. D.; Simpson, M. L. Gene network shaping of inherent noise spectra. Nature 2006, 439, 608–611.
OpenUrl CrossRef PubMed Web of Science

[15] (15).↵
Dublanche, Y.; Michalodimitrakis, K.; Kummerer, N.; Foglierini, M.; Serrano, L. Noise in transcription negative feedback loops: simulation and experimental analysis. Mol. Syst. Biol. 2006, 2, 41.
OpenUrl Abstract/FREE Full Text

[16] (16).↵
Cox, C. D.; McCollum, J. M.; Austin, D. W.; Allen, M. S.; Dar, R. D.; Simpson, M. L. Frequency domain analysis of noise in simple gene circuits. Chaos 2006, 16, 026102.
OpenUrl CrossRef PubMed

[17] (17).↵
Zhang, J. J.; Yuan, Z. J.; Zhou, T. S. Physical limits of feedback noise-suppression in biological networks. Phys. Biol. 2009, 6, 046009.
OpenUrl PubMed

[18] (18).↵
Singh, A.; Hespanha, J. P. Optimal Feedback Strength for Noise Suppression in Au-toregulatory Gene Networks. Biophys. J. 2009, 96, 4013–4023.
OpenUrl CrossRef PubMed Web of Science

[19] (19).↵
Lestas, I.; Vinnicombe, G.; Paulsson, J. Fundamental limits on the suppression of molecular fluctuations. Nature 2010, 467, 174–178.
OpenUrl CrossRef PubMed Web of Science

[20] (20).↵
Wiener, N. Extrapolation, Interpolation and Smoothing of Stationary Times Series; Wiley: New York, 1949.

[21] (21).↵
Kolmogorov, A. N. Interpolation and extrapolation of stationary random sequences. Izv. Akad. Nauk SSSR., Ser. Mat. 1941, 5, 3–14.
OpenUrl

[22] (22).↵
Hinczewski, M.; Thirumalai, D. Cellular signaling networks function as generalized Wiener-Kolmogorov filters to suppress noise. Phys. Rev. X 2014, 4, 041017.
OpenUrl

[23] (23).↵
Becker, N. B.; Mugler, A.; ten Wolde, P. R. Optimal prediction by cellular signaling networks. Phys. Rev. Lett. 2015, 115, 258103.
OpenUrl

[24] (24).↵
Nevozhay, D.; Adams, R. M.; Murphy, K. F.; Josic, K.; Balazsi, G. Negative autoregulation linearizes the dose-response and suppresses the heterogeneity of gene expression. Proc. Natl. Acad. Sci. U. S. A. 2009, 106, 5123–5128.
OpenUrl Abstract/FREE Full Text

[25] (25).↵
Schwanhaeusser, B.; Busse, D.; Li, N.; Dittmar, G.; Schuchhardt, J.; Wolf, J.; Chen, W.; Selbach, M. Global quantification of mammalian gene expression control. Nature 2011, 473, 337–342.
OpenUrl CrossRef PubMed Web of Science

[26] (26).↵
Gillespie, D. T. The chemical Langevin equation. J. Chem. Phys. 2000, 113, 297–306.
OpenUrl CrossRef Web of Science

[27] (27).↵
de Ronde, W. H.; Tostevin, F.; ten Wolde, P. R. Effect of feedback on the fidelity of information transmission of time-varying signals. Phys. Rev. E 2010, 82, 031914.
OpenUrl

[28] (28).↵
Bode, H. W.; Shannon, C. E. A simplified derivation of linear least square smoothing and prediction theory. Proc. Inst. Radio. Engin. 1950, 38, 417–425.

[29] (29).↵
Chaikin, P. M.; Lubensky, T. C. Principles of condensed matter physics; Cambridge University Press, 1995.

[30] (30).↵
Cai, L.; Friedman, N.; Xie, X. S. Stochastic protein expression in individual cells at the single molecule level. Nature 2006, 440, 358–362.
OpenUrl CrossRef PubMed Web of Science

[31] (31).
Garcia-Martinez, J.; Aranda, A.; Perez-Ortin, J. E. Genomic run-on evaluates transcription rates for all yeast genes and identifies gene regulatory mechanisms. Mol. Cell 2004, 15, 303–313.
OpenUrl CrossRef PubMed Web of Science

[32] (32).↵
Gillespie, D. T. Exact stochastic simulation of coupled chemical-reactions. J. Phys. Chem. 1977, 81, 2340–2361.
OpenUrl CrossRef Web of Science

[33] (33).↵
Jorgensen, P.; Nishikawa, J. L.; Breitkreutz, B. J.; Tyers, M. Systematic identification of pathways that couple cell growth and division in yeast. Science 2002, 297, 395–400.
OpenUrl Abstract/FREE Full Text

[34] (34).↵
Alon, U. An Introduction to Systems Biology: Design Principles of Biological Circuits; Chapman and Hall/CRC, 2006.

[35] (35).↵
Mugler, A.; Walczak, A. M.; Wiggins, C. H. Spectral solutions to stochastic models of gene expression with bursts and regulation. Phys. Rev. E. 2009, 80, 041921.
OpenUrl

[36] (36).↵
Davis, T. A. Algorithm 832: UMFPACK V4.3—an unsymmetric-pattern multifrontal method. ACM Trans. Math. Software 2004, 30, 196–199.
OpenUrl

[37] (37).↵
Brent, R. Algorithms for Minimization without Derivatives; Dover, 2002.

[38] (38).↵
Raser, J. M.; O’Shea, E. K. Control of stochasticity in eukaryotic gene expression. Science 2004, 304, 1811–1814.
OpenUrl Abstract/FREE Full Text

[39] (39).↵
Volfson, D.; Marciniak, J.; Blake, W. J.; Ostroff, N.; Tsimring, L. S.; Hasty, J. Origins of extrinsic variability in eukaryotic gene expression. Nature 2006, 439, 861–864.
OpenUrl CrossRef PubMed

[40] (40).↵
Rosenfeld, N.; Young, J. W.; Alon, U.; Swain, P. S.; Elowitz, M. B. Gene regulation at the single-cell level. Science 2005, 307, 1962–1965.
OpenUrl Abstract/FREE Full Text

[41] (41).↵
Csete, M. E.; Doyle, J. C. Reverse engineering of biological complexity. Science 2002, 295, 1664–1669.
OpenUrl Abstract/FREE Full Text

[42] (42).↵
Mettetal, J. T.; Muzzey, D.; Gomez-Uribe, C.; van Oudenaarden, A. The frequency dependence of osmo-adaptation in Saccharomyces cerevisiae. Science 2008, 319, 482–484.
OpenUrl Abstract/FREE Full Text

[43] (43).
Hersen, P.; McClean, M. N.; Mahadevan, L.; Ramanathan, S. Signal processing by the HOG MAP kinase pathway. Proc. Natl. Acad. Sci. U. S. A. 2008, 105, 7165–7170.
OpenUrl Abstract/FREE Full Text

[44] (44).↵
Muzzey, D.; Gomez-Uribe, C. A.; Mettetal, J. T.; van Oudenaarden, A. A Systems-Level Analysis of Perfect Adaptation in Yeast Osmoregulation. Cell 2009, 138, 160–171.
OpenUrl CrossRef PubMed Web of Science

[45] (45).↵
Park, H.; Pontius, W.; Guet, C. C.; Marko, J. F.; Emonet, T.; Cluzel, P. Interdependence of behavioural variability and response to small stimuli in bacteria. Nature 2010, 468, 819–U114.
OpenUrl CrossRef PubMed Web of Science