Toward a Genome Scale Sequence Specific Dynamic Model of Cell-Free Protein Synthesis in Escherichia coli

Nicholas Horvath; Michael Vilkhovoy; Joseph A. Wayman; Kara Calhoun; James Swartz; Jeffrey D. Varner

doi:10.1101/215012

Abstract

Cell-free protein expression systems have become widely used in systems and synthetic biology. In this study, we developed an ensemble of dynamic E. coli cell-free protein synthesis (CFPS) models. Model parameters were estimated from a training dataset for the cell-free production of a protein product, chloramphenicol acetyltransferase (CAT). The dataset consisted of measurements of glucose, organic acids, energy species, amino acids, and CAT. The ensemble accurately predicted these measurements, especially those of the central carbon metabolism. We then used the trained model to evaluate the optimality of protein production. CAT was produced with an energy efficiency of 12%, suggesting that the process could be further optimized. Reaction group knockouts showed that protein productivity and the metabolism as a whole depend most on oxidative phosphorylation and glycolysis and gluco-neogenesis. Amino acid biosynthesis is also important for productivity, while the overflow metabolism and TCA cycle affect the overall system state. In addition, the translation rate is shown to be more important to productivity than the transcription rate. Finally, CAT production was robust to allosteric control, as was most of the network, with the exception of the organic acids in central carbon metabolism. This study is the first to use kinetic modeling to predict dynamic protein production in a cell-free E. coli system, and should provide a foundation for genome scale, dynamic modeling of cell-free E. coli protein synthesis.

Introduction

Cell-free protein expression has become a widely used research tool in systems and synthetic biology, and a promising technology for personalized point of use biotechnology [1]. Cell-free systems offer many advantages for the study, manipulation and modeling of metabolism compared to in vivo processes. Central amongst these, is direct access to metabolites and the biosynthetic machinery without the interference of a cell wall, or complications associated with cell growth. This allows us to interrogate (and potentially manipulate) the chemical microenvironment while the biosynthetic machinery is operating, potentially at a fine time resolution. Cell-free protein synthesis (CFPS) systems are arguably the most prominent examples of cell-free systems used today [2]. However, CFPS is not new; CFPS in crude E. coli extracts has been used since the 1960s to explore fundamental biological mechanisms. For example, Matthaei and Nirenberg used E. coli cell-free extract in ground-breaking experiments to decipher the sequencing of the genetic code [3, 4]. Spirin and coworkers later improved protein production in cell free extracts by continuously exchanging reactants and products; however, while these extracts could run for tens of hours, they could only synthesize a single product and were energy limited [5]. More recently, energy and cofactor regeneration in CFPS has been significantly improved; for example ATP can be regenerated using substrate level phosphorylation [6] or even oxidative phosphorylation [2]. Today, cell-free systems are used in a variety of applications ranging from therapeutic protein production [7] to synthetic biology [8, 1]. Moreover, there are also several CFPS technology platforms, such as the PANOx-SP and Cytomin platforms developed by Swartz and coworkers [9, 2], and the TX/TL platform of Noireaux [10]. However, if CFPS is to become a mainstream technology for applications such as point of care biomanufacturing, we must first understand the performance limits of these systems, and eventually optimize their yield and productivity. A critical tool towards this goal is the development of a CFPS mathematical model.

Mathematical modeling has long contributed to our understanding of metabolism [11]. Decades before the genomics revolution, mechanistically structured metabolic models arose from the desire to predict microbial phe-notypes resulting from changes in intracellular or extracellular states [12]. The single cell E. coli models of Shuler and coworkers pioneered the con-struction of large-scale, dynamic metabolic models that incorporated multiple regulated catabolic and anabolic pathways constrained by experimentally determined kinetic parameters [13]. Shuler and coworkers generated many single cell kinetic models, including single cell models of eukaryotes [14, 15], minimal cell architectures [16], and DNA sequence based whole-cell models of E. coli [17]. More recent studies have extended the approach to integrate disparate models of cellular processes in M. genitalium [18], describe dozens of mutant strains in E. coli with a single kinetic model [19], and identify industrially useful target enzymes in E. coli to improve 1,4-butanediol production [20]. However, cell-free genome scale kinetic models of industrially important organisms such as E. coli have yet to be constructed.

In this study, we developed an ensemble of kinetic cell-free protein synthesis (CFPS) models using dynamic metabolite measurements from an early glucose powered Cytomin E. coli cell-free extract. While cell-free technology has evolved considerably since these measurements were taken, developing a model using a previous generation CFPS platform offers several unique advantages. First and foremost, is the ability to directly compare the dif-ferent improvements established by purely experimental means, to those estimated from a mathematical model. The CFPS model equations were formulated using the hybrid cell-free modeling framework of Wayman et al. [21], which integrates traditional kinetic modeling with a logical rule-based description of allosteric regulation. Model parameters were estimated from measurements of glucose, organic acids, energy species, amino acids, and the protein product, chloramphenicol acetyltransferase (CAT) over the course of a three hour protein synthesis reaction. A constrained Markov Chain Monte Carlo (MCMC) approach was used to minimize the squared difference between model simulations and experimental measurements, where a plausible range for each kinetic parameter was established from BioNumbers [22]. The ensemble of parameter sets described the training data with a median cost greater than two orders of magnitude smaller than a population of random parameter sets constructed using the same literature parameter constraints. We then used the ensemble of kinetic models to analyze the performance of the CFPS system, and to estimate the pathways most important to protein production. We calculated that CAT was produced with an energy efficiency of 12%, suggesting that much of the energy resources for protein synthesis were diverted to non-productive pathways. By knocking out metabolic enzymes in groups, we showed that metabolism and protein production in particular depended upon oxidative phosphorylation and glycolysis /gluconeogenesis. Taken together, this study provides a foundation for sequence specific genome scale, dynamic modeling of cell-free E. coli protein synthesis.

Results

The cell-free E. coli metabolic network was constructed by removing growth associated reactions from the iAF1260 reconstruction of K-12 MG1655 E. coli [23], and by adding reactions describing chloramphenicol acetyltransferase (CAT) biosynthesis (Fig. 1). In addition, reactions that were knocked out in the host strain used to prepare the extract were removed from the network (ΔspeA, ΔtnaA, ΔsdaA, ΔsdaB, ΔgshA, ΔtonA, ΔendA). Lastly, we added the transcription and translation template reactions of Allen and Palsson for the specific proteins of interest [24]. The metabolic network, which contained XX metabolites and YY reactions, is available in the supplemental materials. The dynamic CFPS model equations were formulated using the hybrid cell-free modeling framework of Wayman et al. [21]. An ensemble of model parameter sets (N = 3,000) was estimated from measurements of glucose, CAT, organic acids (pyruvate, lactate, acetate, succinate, malate), energy species (A(x)P, G(x)P, C(x)P, U(x)P), and 18 of the 20 proteinogenic amino acids [25] using a constrained Markov Chain Monte Carlo (MCMC) approach.

Figure 1:

Schematic of the core portion of the cell-free E. coli metabolic network. Metabolites of glycolysis, pentose phosphate pathway, Entner-Doudoroff pathway, and TCA cycle are shown. Metabolites of oxidative phosphorylation, amino acid biosynthesis and degra-dation, transcription/translation, chorismate metabolism, and energy metabolism are not shown.

The MCMC algorithm minimized the squared difference (residual) between the training data and model simulations starting from an initial parameter set assembled from literature and inspection. Bounds on permissible parameter values were established using studies from the BioNumbers database [22]. For each newly generated parameter set, we re-solved the bal-ance equations and calculated the cost function; all sets with a lower cost (and some with higher cost) were accepted into the ensemble. Parameter sets were also required to meet strict ordinary differential equation solver tolerances, to ensure numerical stability. Approximately N = 3,000 sets were accepted into an initial ensemble; N = 100 sets were then selected based upon error for the final ensemble. The final ensemble had a mean Pearson correlation coefficient of 0.78; this suggested parameter sets were not over sampled in the region of a local minimum. The median maximum reaction rate (V_max) across the ensemble was 11.6 mM/h, assuming a total cell-free enzyme concentration of 170 nM. This V_max corresponded to a median catalytic rate of 19 s⁻¹ across the ensemble; this was in agreement with the 13.7 s⁻¹ median catalytic rate found by Bar-Even and coworkers [26]. The median enzyme activity decay constant was 0.0045 h⁻¹, corresponding to an enzyme activity half life of 6 days. The median saturation constant was 1.0 mM; this is within one order of magnitude of the 130 μM reported by Bar-Even and coworkers. Lastly, both the median control gain parameter, and the control order parameter in the ensemble were order 1. While the maximum reaction rates of the ensemble were distributed evenly across the allowed range (Fig. S1A), the saturation constants were clustered around the upper and lower bounds (Fig. S1B).

Figure S1:

Histograms of model parameters, across the ensemble of 100 sets. A. Histogram of rate maxima. B. Histogram of saturation constants.

The ensemble of kinetic CFPS models captured the time evolution of protein biosynthesis, and the consumption and production of organic acid, amino acid and energy species. Central carbon metabolites (Fig. 2, top), energy species (Fig. 4), and amino acids (Fig. 3) were captured by the ensemble and the best-fit set. The constrained MCMC approach estimated parameter sets with a median error more than two orders of magnitude less than random parameter sets generated within the same parameter bounds established from literature (Fig. 5); thus, we have confidence in the predictive capability of the estimated parameters. For 29 of the 37 measurements in the training dataset, the mean Akaike information criterion (AIC) of the ensemble was lower than that of the random sets, signifying a better fit of the data (Table 3). For the other 8 measurements, the random AIC was lower than the ensemble by an amount less than the standard deviation of either the random AIC or ensemble AIC (with the exception of isoleucine, which was quite close: . Taken together, these results suggested that the parameter ensemble modeled cell free metabolism and protein production, significantly better than if sampled randomly, not just overall but for the majority of individual measurements.

Figure 2:

Central carbon metabolism in the presence (top) and absence (bottom) of allosteric control, including glucose (substrate), CAT (product), and intermediates, as well as total concentration of energy species. Best-fit parameter set (orange line) versus experimental data (points). 95% confidence interval (blue or gray shaded region) over the ensemble of 100 sets.

Figure 3:

Amino acids in the presence of allosteric control. Best-fit parameter set (orange line) versus experimental data (points). 95% confidence interval (blue shaded region) over the ensemble of 100 sets.

Figure 4:

Energy species and energy totals by base in the presence of allosteric control. Best-fit parameter set (orange line) versus experimental data (points). 95% confidence interval (blue shaded region) over the ensemble of 100 sets.

Figure 5:

Log of cost function (residual between training data and model simulations) across 37 datasets for data-trained ensemble (blue) and randomly generated ensemble (red, gray background). Median (bars), interquartile range (boxes), range excluding outliers (thin lines), and outliers (circles) for each dataset. Median across all datasets (large bar overlaid).

The model captured the biphasic time course of CAT production. During the first hour glucose powered protein production, and CAT was produced at 8 μM/h; subsequently, pyruvate and lactate reserves were consumed to power metabolism, and CAT was produced less quickly at 5 μM/h. Allosteric control was important to central carbon metabolism, especially pyruvate, acetate, and succinate (Fig. 2, bottom). The difference between the allosteric control and no-control cases was mostly seen in the second phase of CAT production, following glucose exhaustion. Specifically, pyruvate, succinate, and malate consumption and acetate accumulation increased following glucose exhaustion without the allosteric control mechanisms. The rate of acetate accumulation increased by 172%, while the rates of malate, pyruvate, and lactate consumption increased by 146%, 82%, and 9%, respectively. Succinate went from accumulating slightly in the second phase, in the presence of allosteric control, to being fully consumed. However, CAT production was robust to the removal of allosteric control, as seen in both the fits against data and the metabolic fluxes (see supplementary information). While ATP generation varied when allosteric control was removed, ATP expenditure toward CAT production did not. Most of the fluxes that differed between the two cases involved PEP and pyruvate, which directly participated in many of the reactions modulated by allosteric control. Taken together, the ensemble of kinetic models was consistent with time series measurements of the cell free production of a model protein. Although the ensemble described the experimental data, it was unclear which kinetic parameters and pathways most influenced CAT production. To explore this question, we performed reaction group knockout analysis.

The importance of CFPS pathways was estimated using pathway group knockout analysis (Fig. 7). The metabolic network was divided into 19 reaction groups, spanning central carbon metabolism, energetics, and amino acid biosynthesis. The response in the productivity or overall system state was calculated for single or pairwise deletion of each of these reaction groups. Lastly, the overall effect of the deletion of a pathway was estimated by sum-ming the single and pairwise effects (summation across the columns of the response array). Glycolysis/gluconeogenesis and oxidative phosphorylation had the greatest effect on both productivity and system state. This supports previous studies that have suggested oxidative phosphorylation is occurring in a cell-free system [2]; Jewett and coworkers observed a decrease in CAT yield, ranging from 1.5-fold to 4-fold, when inhibiting oxidative phosphorylation reactions in the Cytomim cell-free platform, using both pyruvate and glutamate as substrates. CAT productivity was also affected by two sectors of amino acid biosynthesis: alanine/aspartate/asparagine, and glutamate/ glutamine biosynthesis. This was consistent with aspartate, glutamate, and glutamine being key reactants in the biosynthesis of many other amino acids, all of which are required for CAT synthesis. Meanwhile, the TCA cycle and overflow metabolism (which included acetyl-coA/acetate reactions and the interconversion of pyruvate and lactate) also had a significant effect on the system state. These reactions directly impacted key system species: succinate and malate in the TCA cycle, and acetate, pyruvate, and lactate in the overflow metabolism. In addition, the relative influence of transcription and translation were interrogated via Sobol sampling [27]. Productivity was seen to have a sensitivity of 0.43 ± 0.06 with respect to the maximum reaction rate of transcription, and 0.66 ± 0.08 for the maximum reaction rate of translation. Thus, translation was the limiting step of cell free protein synthesis.

Figure 6:

Key reaction fluxes of the network, in the first (gray boxes, top row) and second (gray boxes, bottom row) phases of metabolism. A. Fluxes of ATP generation and consumption, and GTP consumption toward protein synthesis. B. Fluxes of glycolysis and lactate and acetate metabolism. Fluxes are normalized to the first-phase glucose uptake rate. For PEP and pyruvate, accumulation (normalized to glucose uptake) is also shown.

Figure 7:

Effect of group knockouts on system. A. Change in CAT productivity when one (diagonal) or two (off-diagonal) reaction groups are turned off. B. Change in system state (only species for which data exist) when one (diagonal) or two (off-diagonal) reaction groups are turned off. Total-order effect for each group calculated as the sum of first-order effect and all pairwise effects. Larger and darker circles represent greater effects.

The energy efficiency of CAT production, as well as the sources of energy generation and consumption, were tracked for the best-fit set. Energy efficiency was calculated as the ratio of transcription and translation rates (weighted by the associated ATP costs of each step) to the amount of ATP generated by all sources. During the first phase of protein production, with glucose as the substrate, CAT was produced with a productivity of 8 μM/h and an energy efficiency of 10%. Oxidative phosphorylation accounted for greater than 50% of the ATP generated during the rapid phase of protein production (Table 1). The organic acids that accumulated in the first phase (with the exception of acetate) were then utilized as substrates in the second phase, once glucose was depleted. We assumed the second phase of CAT production was powered largely by pyruvate; although malate was consumed in the second phase, it only accounted for 11% of substrate consumption. Furthermore, lactate is connected in the stoichiometry only to pyruvate. Thus, it is reasonable to consider the second phase as pyruvate-driven production. Interestingly, while this mode of protein production was slower (5 μM/h), it exhibited a higher energy efficiency (14%). Of the ATP generated, about half was observed to come from oxidative phosphorylation in each of the two phases of production (Table 1, R_atp). Another 30% was generated by glycolysis during the first phase (R_pgk,R_pyk), which decreased to approximately 20% following glucose exhaustion. However, glycolysis was also amongst the largest consumers of ATP during first phase of production (R_glk_atp, R_pfk) (Table 2). The TCA cycle (R_sucCD) contributed 3% of to the overall ATP generation in the first phase and 5% in the second. The hypothesis that pyruvate drives the second phase explains this; stores of accumulated pyruvate can be converted to acetyl-CoA, as well as OAA (via PEP), and thus power the TCA cycle just as when glucose was available. Interestingly, ATP generation through acetate metabolism (R_ackA) increased from 12% in the first phase to 28% in the second. Amino acid degradation also contributes a negligible amount to energy production. While the efficiency of production was higher for the pyruvate-driven phase, it was still relatively low, suggesting that there is room for platform optimization. Taken together, this strengthens the importance of glycolysis and oxidative phosphorylation, and presents a trade-off between productivity and energy efficiency in CFPS.

View this table:

Table 1:

Breakdown of ATP generation. Flux through ATP-generating pathways in the first and second phases as percentages of total ATP generation in that phase.

View this table:

Table 2:

Breakdown of ATP consumption. Flux through ATP-consuming pathways in the first and second phases as percentages of total ATP consumption in that phase.

View this table:

Table 3:

Mean and standard deviation of Akaike information criterion (AIC), by measurement, for the ensemble and random ensemble.

Discussion

In this study, we developed an ensemble of kinetic cell-free protein synthesis (CFPS) models using dynamic metabolite measurements from an early glucose powered Cytomin E. coli cell-free extract. We used the hybrid cell-free modeling approach of Wayman and coworkers, which integrates traditional kinetic modeling with a logic-based description of allosteric regula-tion, to describe the time evolution of the CFPS reaction. The ensemble captured dynamic metabolite measurements over 2-orders of magnitude better than random parameter sets generated in the same region of parameter space. The ensemble captured the biphasic time course of CAT production, relying on glucose during the first hour and pyruvate and lactate following glucose exhaustion. Allosteric control was essential to the description of the organic acid trajectories; without allosteric control, pyruvate, lactate, succinate, and malate were predicted to be consumed more quickly following glucose exhaustion, to power CAT synthesis. Interestingly, CAT production was robust to the removal of allosteric control; because the amino acids and energy species that are reactants for CAT synthesis were also not affected by allosteric control. We then used the ensemble of kinetic models to analyze the performance of the CFPS system, and to estimate the pathways most important to protein production. We calculated that CAT was produced with an energy efficiency of 12%, suggesting that much of the energy resources for protein synthesis were diverted to non-productive pathways. By knocking out metabolic enzymes in groups, we showed that metabolism and protein production in particular depended upon oxidative phosphorylation and glycolysis /gluconeogenesis. Using the Sobol sampling technique we demonstrated the greater importance of translation rate than transcription. Taken together, this study provides a foundation for sequence specific genome scale, dynamic modeling of cell-free E. coli protein synthesis.

The ensemble of models quantitatively described the dynamic time evolution of the cell-free protein production. Thus, the model could serve as a surrogate to rationally design cell-free production processes to optimize production rate and yield. In analyzing the effect of reaction groups on CAT production and the system state, the regions of metabolism associated with substrate utilization and energy generation were the most important. Oxidative phosphorylation was vital, since it provides most of the energetic needs of CFPS. While it is unknown how active oxidative phosphorylation is compared to that of in vivo systems, our modeling approach suggested it was critical to CFPS performance. However, the biphasic operation of CFPS highlights the ability of the system to respond to an absence of glucose. During the first phase, there is an accumulation of central carbon metabolites with the majority of flux going toward acetate and some toward pyruvate, lactate, succinate and malate. While acetate continued to accumulate as a byproduct, the other organic acids were consumed as secondary substrates after glucose was no longer available. Glutamate also served as a substrate throughout both phases, powering amino acid synthesis. These results confirm experimental findings that CAT production can be sustained by other substrates in the absence of glucose, providing alternative strategies to optimize CFPS performance. While CAT synthesis can be powered by other substrates, the productivity is significantly lower (5 μM/h, as opposed to 8 μM/h). This is in accordance with literature, where pyruvate provided a relatively slow but continuous supply of ATP [28]. However, the energy efficiency is slightly higher (14% as compared with 10%). Taken together, this shows CFPS can be designed towards a specified application, either requiring a slow stable energy source or faster production.

This work represents the first dynamic model of E. coli cell-free protein synthesis. We apply a hybrid modeling framework to capture an experimental dataset for production of a test protein, and identify system limitations and areas of improvement for production efficiency. This work could be extended through further experimentation to gain a deeper understanding of model performance under a variety of conditions. Specifically, CAT production performed in the absence of amino acids could inform the system’s ability to manufacture them, while experimentation in the absence of glucose or oxygen could shed light on the importance of those substrates. In addition, the approach should be extended to other protein products. CAT is only a test protein used for model identification; the modeling framework, and to some extent the parameter values, should be protein agnostic. An important extension of this study would be to apply its insights to other protein applications, where possible. Having captured the experimental data, we in-vestigated if CFPS performance could be further improved. We showed that the model predicts CAT production with an energy efficiency of 10% under glucose and 14% under pyruvate. The accumulation of glycolytic intermediates and byproducts such as acetate and carbon dioxide were responsible for this sub-optimal performance. If fluxes could be balanced such that intermediates were fully utilized, CAT production would increase. Knocking out sections of network metabolism revealed that glycolysis/gluconeogenesis and oxidative phosphorylation were the most important to CAT production and the system as a whole. Productivity was also heavily dependent on the synthesis reactions of alanine, aspartate, asparagine, glutamate, and glutamine, while TCA cycle and overflow reactions affected the system state. Taken together, these findings represent the first dynamic model of E. coli cell-free protein synthesis, an important step toward a functional genome scale description of cell free systems.

Materials and Methods

Formulation and solution of the model equations

We used ordinary differential equations (ODEs) to model the time evolution of metabolite (x_i), scaled enzyme activity (ϵ_i), transcription (m) and translation in an E. coli cell free metabolic network:

The quanity denotes the number of metabolic reactions, denotes the number of metabolites and denotes the number of metabolic enzymes in the model. The quantity r_j(x, ϵ, k) denotes the rate of reaction j. Typically, reaction j is a non-linear function of metabolite and enzyme abundance, as well as unknown kinetic parameters . The quantity σ_ij denotes the stoichiometric coefficient for species i in reaction j. If σ_ij > 0, metabolite i is produced by reaction j. Conversely, if σ_ij < 0, metabolite i is consumed by reaction j, while σ_ij = 0 indicates metabolite i is not connected with reaction j. Lastly, λ_i denotes the scaled enzyme activity decay constant. The system material balances were subject to the initial conditions x (t_o) = x_o and ϵ (t_o) = 1 (initially we have 100% cell-free enzyme activity).

Metabolic reaction rates were written as the product of a kinetic term and a control term . We used multiple saturation kinetics to model the reaction term : where denotes the maximum rate for reaction j, ϵ_i denotes the scaled enzyme activity which catalyzes reaction j, K_js denotes the saturation constant for species s, in reaction j and denotes the set of reactants for reaction j.

The control term 0 ≤ V_j ≤ 1 depended upon the combination of factors which influenced rate process j. For each rate, we used a rule-based approach to select from competing control factors. If rate j was influenced by 1,…, m factors, we modeled this relationship as where 0 ≤ f_ij (⋅) ≤ 1 denotes a transfer function quantifying the influence of factor i on rate j. The function (⋅) is an integration rule which maps the output of regulatory transfer functions into a control variable. We used hill-like transfer functions and in this study [21]. We included 17 allosteric regulation terms, taken from literature, in the CFPS model. PEP was modeled as an inhibitor for phosphofructokinase [29, 30], PEP carboxykinase [29], PEP synthetase [29, 31], isocitrate dehydrogenase [29, 32], and isocitrate lyase/malate synthase [29, 32, 33], and as an activator for fructose-biphosphatase [29, 34, 35, 36]. AKG was modeled as an inhibitor for citrate synthase [29, 37, 38] and isocitrate lyase/malate synthase [29, 33]. 3PG was modeled as an inhibitor for isocitrate lyase/malate synthase [29, 33]. FDP was modeled as an activator for pyruvate kinase [29, 39] and PEP carboxylase [29, 40]. Pyruvate was modeled as an inhibitor for pyruvate dehydrogenase [29, 41, 42] and as an activator for lactate dehydrogenase [43]. Acetyl CoA was modeled as an inhibitor for malate dehydrogenase [29].

The symbol denotes the transcription rate, u denotes a promoter specific activation model, and denotes the transcript degradation rate. The transcription rate was modeled as: where denotes the maximum transcription rate, R_T denotes the RNA polymerase concentration, G_P denotes the gene concentration, denotes the gene saturation constant, denotes the saturation constant for species s, and denotes the set of reactants for transcription: ATP, GTP, CTP, UTP, and water. In this study, we considered only the T7 promoter; we have previously estimated u ≃0.95 for a T7 [REF-MIKE]. While transcription was modeled as saturating with respect to gene concentration, the gene was not considered a reactant in the stoichiometry as it was not consumed. Transcript degradation was modeled as first-order in transcript: where k^d denotes the transcript degradation rate constant.

The symbol denotes the translation rate, which was modeled as: where denotes the maximum translation rate, R_X denotes the ribosome concentration, m denotes the transcript concentration, denotes the transcript saturation constant, denotes the saturation constant for species s, and denotes the set of reactants for translation: GTP, water, and the 20 species representing tRNA charged with amino acids. While translation was modeled as saturating with respect to transcript concentration, the transcript was not considered a reactant in the stoichiometry as it is not consumed.

Estimation of kinetic model parameters

We estimated an ensemble of kinetic parameter sets using a constrained Markov Chain Monte Carlo (MCMC) random walk strategy. We have used this technique previously to estimate numerically stable low-error parameter sets for signal transduction models [44, 45]. Starting from a small number of parameter sets estimated by inspection and literature, we calculated the cost function, equal to the sum-squared-error between experimental data and model predictions: where denotes the number of datasets , w_i denotes the weight of the i^th dataset, denotes the number of timepoints in the i^th dataset, t(j) denotes the j^th timepoint, y_ij denotes the measurement value of the i^th dataset at the j^th timepoint, and x_i|_t(j) denotes the simulated value of the metabolite corresponding to the i^th dataset, interpolated to the j^th timepoint. Lastly, the cost function was scaled by the maximum experimental value in the i^th dataset, . We then perturbed each model parameter between an upper and lower bound that varied by parameter type: where denotes the number of parameters , which includes 204 maximum reaction rates (V^max), 204 enzyme activity decay constants, 548 saturation constants (K_js), and 34 control parameters, denotes the new value of the i^th parameter, k_i denotes the current value of the i^th parameter, a denotes a distribution variance, r_i denotes a random sample from the normal distribution, l_i denotes the lower bound for that parameter type, and u_i denotes the upper bound for that parameter type. Model parameters were constrained by literature collected using the BioNumbers database [22]. Transcription, translation, and mRNA degradation were bounded within a factor of two of their reference values. A characteristic cell-free enzyme concentration of 170 nM was calculated by diluting the one-tenth maximal con-centration of lacZ (5 μM, BNID 100735) by a cell-free dilution factor of 30. This enzyme level was then used to calculate rate maxima from turnover numbers for various enzymes from BioNumbers (Table 4). Rate maxima were bounded within one order of magnitude of the reference value where available; all other rate maxima were bounded within two orders of magnitude of the geometric mean of the available values. Enzyme activity decay constants were bounded between 0 and 1 h⁻¹, corresponding to half lives of 42 minutes and infinity. Saturation constants were bounded between 0.0001 and 10 mM. Control gain parameters were bounded between 0.05 and 10 (dimensionless), while order parameters were bounded between 0.02 and 10 (dimensionless).

For each newly generated parameter set, we re-solved the balance equations and calculated the cost function. All sets with a lower cost were accepted into the ensemble. Sets with a higher cost were also accepted into the ensemble, if they satisfied the acceptance constraint: where denotes a random number taken from a uniform distribution between 0 and 1, cost denotes the cost of the current parameter set, cost_new denotes the cost of the new parameter set, and a denotes a tunable parameter to control the tolerance to high-error sets. A total of 3,875 sets were accepted into the initial ensemble, from which we selected N = 100 with minimal error for the final ensemble.

View this table:

Table 4:

Reference values for reaction rate maxima (V_max) from BioNumbers. V_max values calculated from turnover numbers (k_cat) from BioNumbers, and a characteristic enzyme concentration of 170 nM. Characteristic rate maximum for all other reactions calculated as geometric mean of calculated rate maxima.

View this table:

Table 5:

Reference values for transcription, translation, and mRNA degradation from literature. Transcription rate calculated from elongation rate, mRNA length, and promoter activity level. Translation rate calculated from elongation rate, protein length, and polysome amplification constant. mRNA degradation rate calculated from mRNA degradation time.

View this table:

Table 6:

Nomenclature

Lastly, a random ensemble of 100 parameter sets was generated within the same parameter bounds as the trained ensemble. The randomized parameter sets were generated using a Monte Carlo approach: each parameter was taken from a uniform distribution constructed between its upper and lower bounds. The model equations were then solved and the cost function, and the Akaike information criterion (AIC) were calculated for each of the 37 separate experimental datasets.

Reaction group knockouts

The metabolic network was divided into 19 reaction groups: glycolysis/gluconeogenesis, pentose phosphate, Entner-Doudoroff, TCA cycle, oxidative phosphorylation, cofactor reactions, anaplerotic/glyoxylate reactions, overflow metabolism, folate synthesis, purine/pyrimidine reactions, alanine/ aspartate/asparagine synthesis, glutamate/glutamine synthesis, arginine/proline synthesis, glycine/serine synthesis, cysteine/methionine synthesis, threonine/ lysine synthesis, histidine synthesis, tyrosine/tryptophan/phenylalanine synthesis, and valine/leucine/isoleucine synthesis. Each reaction group and pair of reaction groups were removed and the model was re-solved; the CAT productivity was then calculated and subtracted from that of the base case (no knockouts): where P_ii denotes the first-order productivity knockout effect for reaction group i, P_ij denotes the pairwise productivity knockout effect for reaction groups i and j, denotes the total-order productivity knockout effect for reaction group i, ΔCAT denotes the base case CAT productivity, ΔCAT_ΔRi denotes the CAT productivity when reaction group i is knocked out, ΔCAT_ΔRiΔRj denotes the CAT productivity when reaction groups i and j are knocked out, and |x| denotes the absolute value of x. The system state, defined as the model predictions for all species for which experimental data exists, was also recorded for each knockout and compared to the base case: where S_ii denotes the first-order system state knockout effect for reaction group i, Sij denotes the pairwise system state knockout effect for reaction groups i and j, denotes the total-order system state knockout effect for reaction group i, x^data denotes the base-case system state, denotes the system state when reaction group i is knocked out, denotes the system state when reaction groups i and j are knocked out, and ||x||₂ denotes the l² norm of x. In order to not dominate the colorbar, the total-order knockout effects were normalized to the same ranges as the main arrays (first-order and pairwise effects).

Sensitivity of CAT productivity to transcription and translation

The catalytic rates of transcription and translation were sampled within one order of magnitude on each side from the best-fit values. The parameter bounds were set as the base-10 logarithms of the upper and lower bound for each rate; then, 10 was taken to the power of each parameter sample to obtain the catalytic rates: where denotes the sample of the transcription catalytic rate, denotes the sample of the translation catalytic rate, denotes the best-fit value of the transcription catalytic rate, and denotes the best-fit value of the translation catalytic rate. The sampling was performed using the Sensitivity Analysis Library in Python (Numpy) with 3000 samples [46].

Calculation of energy efficiency

Energy efficiency was calculated as the ratio of transcription and translation (weighted by the appropriate energy species coefficients) to ATP generation: where Δ_TmRNA denotes the net accumulation of mRNA in phase τ (first, second, or overall), Δ_TCAT denotes the net accumulation of protein in phase τ, α_T denotes the energy cost of transcription, α_x denotes the energy cost of translation, R_ATP denotes the set of ATP-producing reactions, and denotes the ATP coefficient for reaction j. ATP_T, CTP_T, GTP_T, UTP_T denote the stoichiometric coefficients of each energy species for transcription, and ATP_X, GTP_X denote the stoichiometric coefficients of ATP and GTP for translation. During transcription and tRNA charging, triphosphate molecules are consumed with monophosphates as byproducts; this is the reason for the factors of 2 on ATP_T, CTP_T, GTP_T, UTP_T, and ATP_X.

Availability of model code

The cell free model equations, and the parameter estimation procedure, were implemented in the Julia programming language. The model equations were solved using the CVODE solver of the SUNDIALS suite [47], with an absolute tolerance and relative tolerance of 1e⁻⁹; any sets exhibiting CVODEerrors were discarded. Thus, the numerical stability of all parame-ters in the ensemble was ensured. The model code and parameter ensemble is freely available under an MIT software license and can be downloaded from http://www.varnerlab.org.

Competing interests

The authors declare that they have no competing interests.

Author’s contributions

J.V directed the modeling study. K.C and J.S conducted the cell-free protein synthesis experiments. J.V, J.W, and N.H developed the cell-free protein synthesis mathematical model, and parameter ensemble. The manuscript was prepared and edited for publication by J.S, N.H, M.V, J.W and J.V.

Funding

This study was supported by a National Science Foundation Graduate Research Fellowship (DGE-1333468) to N.H. Research reported in this publication was also supported by the Systems Biology Coagulopathy of Trauma Program with support from the US Army Medical Research and Materiel Command under award number W911NF-10-1-0376.

Acknowledgements

We gratefully acknowledge the suggestions from the anonymous reviewers to improve this manuscript.

References

[1].↵
K. Pardee, S. Slomovic, P. Q. Nguyen, J. W. Lee, N. Donghia, D. Burrill, T. Ferrante, F. R. McSorley, Y. Furuta, A. Vernet, M. Lewandowski, C. N. Boddy, N. S. Joshi, J. J. Collins, Portable, on-demand biomolecular manufacturing, Cell 167 (2016) 248–59.e12.
OpenUrl CrossRef
[2].↵
M. C. Jewett, K. A. Calhoun, A. Voloshin, J. J. Wuu, J. R. Swartz, An integrated cell-free metabolic platform for protein production and synthetic biology, Mol Syst Biol 4 (2008) 220.
OpenUrl Abstract/FREE Full Text
[3].↵
J. H. Matthaei, M. W. Nirenberg, Characteristics and stabilization of dnaase-sensitive protein synthesis in e. coli extracts, Proc Natl Acad Sci U S A 47 (1961) 1580–8.
OpenUrl FREE Full Text
[4].↵
M. W. Nirenberg, J. H. Matthaei, The dependence of cell-free protein synthesis in e. coli upon naturally occurring or synthetic polyribonu-cleotides, Proc Natl Acad Sci U S A 47 (1961) 1588–602.
OpenUrl FREE Full Text
[5].↵
A. Spirin, V. Baranov, L. Ryabova, S. Ovodov, Y. Alakhov, A continuous cell-free translation system capable of producing polypeptides in high yield, Science 242 (1988) 1162–1164.
OpenUrl Abstract/FREE Full Text
[6].↵
D.-M. Kim, J. R. Swartz, Regeneration of adenosine triphosphate from glycolytic intermediates for cell-free protein synthesis, Biotechnology and Bioengineering 74 (2001) 309–316.
OpenUrl CrossRef PubMed Web of Science
[7].↵
Y. Lu, J. P. Welsh, J. R. Swartz, Production and stabilization of the trimeric influenza hemagglutinin stem domain for potentially broadly protective influenza vaccines, Proc Natl Acad Sci U S A 111 (2014) 125–30.
OpenUrl Abstract/FREE Full Text
[8].↵
C. E. Hodgman, M. C. Jewett, Cell-free synthetic biology: thinking outside the cell, Metab Eng 14 (2012) 261–9.
OpenUrl CrossRef PubMed
[9].↵
M. C. Jewett, J. R. Swartz, Mimicking the escherichia coli cytoplasmic environment activates long-lived and efficient cell-free protein synthesis, Biotechnology and Bioengineering 86 (2004) 19–26.
OpenUrl CrossRef PubMed Web of Science
[10].↵
J. Garamella, R. Marshall, M. Rustad, V. Noireaux, The all e. coli tx-tl toolbox 2.0: A platform for cell-free synthetic biology, ACS Synth Biol 5 (2016) 344–55.
OpenUrl CrossRef PubMed
[11].↵
J. A. Wayman, J. D. Varner, Biological systems modeling of metabolic and signaling networks, Curr Opin Chem Eng 2 (2013).
[12].↵
A. G. Fredrickson, Formulation of structured growth models, Biotechnol Bioeng 18 (1976) 1481–6.
OpenUrl PubMed
[13].↵
M. M. Domach, S. K. Leung, R. E. Cahn, G. G. Cocks, M. L. Shuler, Computer model for glucose-limited growth of a single cell of escherichia coli b/r-a, Biotechnol Bioeng 26 (1984) 203–16.
OpenUrl CrossRef PubMed Web of Science
[14].↵
D. Steinmeyer, M. Shuler, Structured model for Saccharomyces cerevisiae., Chem Eng Sci 44 (1989) 2017–30.
OpenUrl
[15].↵
P. Wu, N. G. Ray, M. L. Shuler, A single-cell model for cho cells, Ann N Y Acad Sci 665 (1992) 152–87.
OpenUrl PubMed
[16].↵
M. Castellanos, D. B. Wilson, M. L. Shuler, A modular minimal cell model: purine and pyrimidine transport and metabolism, Proc Natl Acad Sci U S A 101 (2004) 6681–6.
OpenUrl Abstract/FREE Full Text
[17].↵
J. C. Atlas, E. V. Nikolaev, S. T. Browning, M. L. Shuler, Incorporating genome-wide dna sequence information into a dynamic whole-cell model of escherichia coli: application to dna replication, IET Syst Biol 2 (2008) 369–82.
OpenUrl CrossRef PubMed
[18].↵
J. R. Karr, J. C. Sanghvi, D. N. Macklin, M. V. Gutschow, J. M. Jacobs, B. Bolival, N. Assad-Garcia, J. I. Glass, M. W. Covert, A whole-cell computational model predicts phenotype from genotype, Cell 150 (2012) 389–401.
OpenUrl CrossRef PubMed Web of Science
[19].↵
A. Khodayari, C. D. Maranas, A genome-scale Escherichia coli kinetic metabolic model k-ecoli457 satisfying flux data for multiple mutant strains, Nat Commun 7 (2016) 13806.
OpenUrl CrossRef
[20].↵
S. Andreozzi, A. Chakrabarti, K. C. Soh, A. Burgard, T. H. Yang, S. Van Dien, L. Miskovic, V. Hatzimanikatis, Identification of metabolic engineering targets for the enhancement of 1,4-butanediol production in recombinant E. coli using large-scale kinetic models, Metab. Eng. 35 (2016) 148–159.
OpenUrl CrossRef
[21].↵
J. A. Wayman, A. Sagar, J. D. Varner, Dynamic modeling of cell-free biochemical networks using effective kinetic models, Processes 3 (2015) 138.
OpenUrl
[22].↵
R. Milo, P. Jorgensen, U. Moran, G. Weber, M. Springer, Bionumbers-the database of key numbers in molecular and cell biology, Nucleic Acids Res 38 (2009) 750–3.
OpenUrl
[23].↵
A. M. Feist, C. S. Henry, J. L. Reed, M. Krummenacker, A. R. Joyce, P. D. Karp, L. J. Broadbelt, V. Hatzimanikatis, B. Ø. Palsson, A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information, Mol Syst Biol 3 (2007) 121.
OpenUrl Abstract/FREE Full Text
[24].↵
T. E. Allen, B. Ø. Palsson, Sequence-based analysis of metabolic demands for protein synthesis in prokaryotes, J Theor Biol 220 (2003) 1–18.
OpenUrl CrossRef PubMed
[25].↵
M. Vilkhovoy, N. Horvath, C.-h. Shih, J. Wayman, K. Calhoun, J. Swartz, J. Varner, Sequence specific modeling of e. coli cell-free protein synthesis, bioRxiv (2017).
[26].↵
A. Bar-Even, E. Noor, Y. Savir, W. Liebermeister, D. Davidi, D. S. Tawfik, R. Milo, The moderately efficient enzyme: Evolutionary and physicochemical trends shaping enzyme parameters, Biochemistry 50 (2011).
[27].↵
I. Sobol, Global sensitivity indices for nonlinear mathematical models and their monte carlo estimates., Math Comput Simulat 55 (2001) 271–80.
OpenUrl CrossRef
[28].↵
J. Swartz, A pure approach to constructive biology, Nat Biotechnol 19 (2001) 732–3.
OpenUrl CrossRef PubMed Web of Science
[29].↵
O. Kotte, J. B. Zaugg, M. Heinemann, Bacterial adaptation through distributed sensing of metabolic fluxes, Mol Syst Biol 6 (2010) 355.
OpenUrl Abstract/FREE Full Text
[30].↵
R. Cabrera, M. Baez, H. M. Pereira, A. Caniuguir, R. C. Garratt, J. Babul, The crystal complex of phosphofructokinase-2 of Escherichia coli with fructose-6-phosphate: kinetic and structural analysis of the allosteric ATP inhibition, J Biol Chem 286 (2011) 5774–83.
OpenUrl Abstract/FREE Full Text
[31].↵
M. Chulavatnatol, D. E. Atkinson, Phosphoenolpyruvate synthetase from Escherichia coli. Effects of adenylate energy charge and modifier concentrations, J Biol Chem 248 (1973) 2712–5.
[32].↵
T. Ogawa, K. Murakami, H. Mori, N. Ishii, M. Tomita, M. Yoshin, Role of phosphoenolpyruvate in the NADP-isocitrate dehydrogenase and isocitrate lyase reaction in Escherichia coli, J Bacteriol 189 (2007) 1176–8.
OpenUrl Abstract/FREE Full Text
[33].↵
C. MacKintosh, H. G. Nimmo, Purification and regulatory properties of isocitrate lyase from Escherichia coli ML308, Biochem J 250 (1988) 25–31.
OpenUrl Abstract/FREE Full Text
[34].↵
J. L. Donahue, J. L. Bownas, W. G. Niehaus, T. J. Larson, Purification and characterization of glpX-encoded fructose 1, 6-bisphosphatase, a new enzyme of the glycerol 3-phosphate regulon of Escherichia coli, J Bacteriol 182 (2000) 5624–7.
OpenUrl Abstract/FREE Full Text
[35].↵
J. K. Hines, H. J. Fromm, R. B. Honzatko, Novel allosteric activation site in Escherichia coli fructose-1,6-bisphosphatase, J Biol Chem 281 (2006) 18386–93.
OpenUrl Abstract/FREE Full Text
[36].↵
J. K. Hines, H. J. Fromm, R. B. Honzatko, Structures of activated fructose-1,6-bisphosphatase from Escherichia coli. Coordinate regulation of bacterial metabolism and the conservation of the R-state, J Biol Chem 282 (2007) 11696–704.
OpenUrl Abstract/FREE Full Text
[37].↵
D. S. Pereira, L. J. Donald, D. J. Hosfield, H. W. Duckworth, Active site mutants of Escherichia coli citrate synthase. Effects of mutations on catalytic and allosteric properties, J Biol Chem 269 (1994) 412–7.
OpenUrl Abstract/FREE Full Text
[38].↵
M. S. Robinson, R. A. Easom, M. J. Danson, P. D. Weitzman, Citrate synthase of Escherichia coli. Characterisation of the enzyme from a plasmid-cloned gene and amplification of the intracellular levels, FEBS Lett 154 (1983) 51–4.
OpenUrl PubMed
[39].↵
T. Zhu, M. F. Bailey, L. M. Angley, T. F. Cooper, R. C. Dobson, The quaternary structure of pyruvate kinase type 1 from Escherichia coli at low nanomolar concentrations, Biochimie 92 (2010) 116–20.
OpenUrl CrossRef PubMed
[40].↵
R. C. Wohl, G. Markus, Phosphoenolpyruvate carboxylase of Escherichia coli. Purification and some properties, J Biol Chem 247 (1972) 5785–92.
OpenUrl Abstract/FREE Full Text
[41].↵
S. Kale, P. Arjunan, W. Furey, F. Jordan, A dynamic loop at the active center of the Escherichia coli pyruvate dehydrogenase complex E1 component modulates substrate utilization and chemical communication with the E2 component, J Biol Chem 282 (2007) 28106–16.
OpenUrl Abstract/FREE Full Text
[42].↵
P. Arjunan, N. Nemeria, A. Brunskill, K. Chandrasekhar, M. Sax, Y. Yan, F. Jordan, J. R. Guest, W. Furey, Structure of the pyruvate de-hydrogenase multienzyme complex E1 component from Escherichia coli at 1.85 A resolution, Biochemistry 41 (2002) 5213–21.
OpenUrl CrossRef PubMed Web of Science
[43].↵
S. Okino, M. Suda, K. Fujikura, M. Inui, H. Yukawa, Production of D-lactic acid by Corynebacterium glutamicum under oxygen deprivation, Appl Microbiol Biotechnol 78 (2008) 449–54.
OpenUrl CrossRef PubMed
[44].↵
R. Tasseff, S. Nayak, S. Salim, P. Kaushik, N. Rizvi, J. D. Varner, Analysis of the molecular networks in androgen dependent and independent prostate cancer revealed fragile and robust subsystems, PLoS One 5 (2010) e8864.
OpenUrl CrossRef PubMed
[45].↵
R. Tasseff, S. Nayak, S. O. Song, A. Yen, J. D. Varner, Modeling and analysis of retinoic acid induced differentiation of uncommitted precursor cells, Integr Biol (Camb) 3 (2011) 578–91.
OpenUrl
[46].↵
J. D. Herman, http://jdherman.github.io/salib/, ????
[47].↵
A. C. Hindmarsh, P. N. Brown, K. E. Grant, S. L. Lee, R. Serban, D. E. Shumaker, C. S. Woodward, SUNDIALS: Suite of nonlinear and differential/algebraic equation solvers, ACM T Math Software (TOMS) 31 (2005) 363–396.
OpenUrl
[48].
T. Kigawa, Y. Muto, S. Yokoyama, Cell-free synthesis and amino acid-selective stable isotope labeling of proteins for NMR analysis, J Biomolec NMR 6 (1995) 129–134.
OpenUrl

View the discussion thread.

Posted November 06, 2017.

Download PDF

Citation Tools

Subject Area

Synthetic Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5217)
Biochemistry (11755)
Bioengineering (8757)
Bioinformatics (29209)
Biophysics (14980)
Cancer Biology (12103)
Cell Biology (17416)
Clinical Trials (138)
Developmental Biology (9425)
Ecology (14187)
Epidemiology (2067)
Evolutionary Biology (18314)
Genetics (12246)
Genomics (16807)
Immunology (11870)
Microbiology (28101)
Molecular Biology (11599)
Neuroscience (60995)
Paleontology (452)
Pathology (1872)
Pharmacology and Toxicology (3238)
Physiology (4961)
Plant Biology (10429)
Scientific Communication and Education (1683)
Synthetic Biology (2887)
Systems Biology (7341)
Zoology (1651)

[1] [1].↵
K. Pardee, S. Slomovic, P. Q. Nguyen, J. W. Lee, N. Donghia, D. Burrill, T. Ferrante, F. R. McSorley, Y. Furuta, A. Vernet, M. Lewandowski, C. N. Boddy, N. S. Joshi, J. J. Collins, Portable, on-demand biomolecular manufacturing, Cell 167 (2016) 248–59.e12.
OpenUrl CrossRef

[2] [2].↵
M. C. Jewett, K. A. Calhoun, A. Voloshin, J. J. Wuu, J. R. Swartz, An integrated cell-free metabolic platform for protein production and synthetic biology, Mol Syst Biol 4 (2008) 220.
OpenUrl Abstract/FREE Full Text

[3] [3].↵
J. H. Matthaei, M. W. Nirenberg, Characteristics and stabilization of dnaase-sensitive protein synthesis in e. coli extracts, Proc Natl Acad Sci U S A 47 (1961) 1580–8.
OpenUrl FREE Full Text

[4] [4].↵
M. W. Nirenberg, J. H. Matthaei, The dependence of cell-free protein synthesis in e. coli upon naturally occurring or synthetic polyribonu-cleotides, Proc Natl Acad Sci U S A 47 (1961) 1588–602.
OpenUrl FREE Full Text

[5] [5].↵
A. Spirin, V. Baranov, L. Ryabova, S. Ovodov, Y. Alakhov, A continuous cell-free translation system capable of producing polypeptides in high yield, Science 242 (1988) 1162–1164.
OpenUrl Abstract/FREE Full Text

[6] [6].↵
D.-M. Kim, J. R. Swartz, Regeneration of adenosine triphosphate from glycolytic intermediates for cell-free protein synthesis, Biotechnology and Bioengineering 74 (2001) 309–316.
OpenUrl CrossRef PubMed Web of Science

[7] [7].↵
Y. Lu, J. P. Welsh, J. R. Swartz, Production and stabilization of the trimeric influenza hemagglutinin stem domain for potentially broadly protective influenza vaccines, Proc Natl Acad Sci U S A 111 (2014) 125–30.
OpenUrl Abstract/FREE Full Text

[8] [8].↵
C. E. Hodgman, M. C. Jewett, Cell-free synthetic biology: thinking outside the cell, Metab Eng 14 (2012) 261–9.
OpenUrl CrossRef PubMed

[9] [9].↵
M. C. Jewett, J. R. Swartz, Mimicking the escherichia coli cytoplasmic environment activates long-lived and efficient cell-free protein synthesis, Biotechnology and Bioengineering 86 (2004) 19–26.
OpenUrl CrossRef PubMed Web of Science

[10] [10].↵
J. Garamella, R. Marshall, M. Rustad, V. Noireaux, The all e. coli tx-tl toolbox 2.0: A platform for cell-free synthetic biology, ACS Synth Biol 5 (2016) 344–55.
OpenUrl CrossRef PubMed

[11] [11].↵
J. A. Wayman, J. D. Varner, Biological systems modeling of metabolic and signaling networks, Curr Opin Chem Eng 2 (2013).

[12] [12].↵
A. G. Fredrickson, Formulation of structured growth models, Biotechnol Bioeng 18 (1976) 1481–6.
OpenUrl PubMed

[13] [13].↵
M. M. Domach, S. K. Leung, R. E. Cahn, G. G. Cocks, M. L. Shuler, Computer model for glucose-limited growth of a single cell of escherichia coli b/r-a, Biotechnol Bioeng 26 (1984) 203–16.
OpenUrl CrossRef PubMed Web of Science

[14] [14].↵
D. Steinmeyer, M. Shuler, Structured model for Saccharomyces cerevisiae., Chem Eng Sci 44 (1989) 2017–30.
OpenUrl

[15] [15].↵
P. Wu, N. G. Ray, M. L. Shuler, A single-cell model for cho cells, Ann N Y Acad Sci 665 (1992) 152–87.
OpenUrl PubMed

[16] [16].↵
M. Castellanos, D. B. Wilson, M. L. Shuler, A modular minimal cell model: purine and pyrimidine transport and metabolism, Proc Natl Acad Sci U S A 101 (2004) 6681–6.
OpenUrl Abstract/FREE Full Text

[17] [17].↵
J. C. Atlas, E. V. Nikolaev, S. T. Browning, M. L. Shuler, Incorporating genome-wide dna sequence information into a dynamic whole-cell model of escherichia coli: application to dna replication, IET Syst Biol 2 (2008) 369–82.
OpenUrl CrossRef PubMed

[18] [18].↵
J. R. Karr, J. C. Sanghvi, D. N. Macklin, M. V. Gutschow, J. M. Jacobs, B. Bolival, N. Assad-Garcia, J. I. Glass, M. W. Covert, A whole-cell computational model predicts phenotype from genotype, Cell 150 (2012) 389–401.
OpenUrl CrossRef PubMed Web of Science

[19] [19].↵
A. Khodayari, C. D. Maranas, A genome-scale Escherichia coli kinetic metabolic model k-ecoli457 satisfying flux data for multiple mutant strains, Nat Commun 7 (2016) 13806.
OpenUrl CrossRef

[20] [20].↵
S. Andreozzi, A. Chakrabarti, K. C. Soh, A. Burgard, T. H. Yang, S. Van Dien, L. Miskovic, V. Hatzimanikatis, Identification of metabolic engineering targets for the enhancement of 1,4-butanediol production in recombinant E. coli using large-scale kinetic models, Metab. Eng. 35 (2016) 148–159.
OpenUrl CrossRef

[21] [21].↵
J. A. Wayman, A. Sagar, J. D. Varner, Dynamic modeling of cell-free biochemical networks using effective kinetic models, Processes 3 (2015) 138.
OpenUrl

[22] [22].↵
R. Milo, P. Jorgensen, U. Moran, G. Weber, M. Springer, Bionumbers-the database of key numbers in molecular and cell biology, Nucleic Acids Res 38 (2009) 750–3.
OpenUrl

[23] [23].↵
A. M. Feist, C. S. Henry, J. L. Reed, M. Krummenacker, A. R. Joyce, P. D. Karp, L. J. Broadbelt, V. Hatzimanikatis, B. Ø. Palsson, A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information, Mol Syst Biol 3 (2007) 121.
OpenUrl Abstract/FREE Full Text

[24] [24].↵
T. E. Allen, B. Ø. Palsson, Sequence-based analysis of metabolic demands for protein synthesis in prokaryotes, J Theor Biol 220 (2003) 1–18.
OpenUrl CrossRef PubMed

[25] [25].↵
M. Vilkhovoy, N. Horvath, C.-h. Shih, J. Wayman, K. Calhoun, J. Swartz, J. Varner, Sequence specific modeling of e. coli cell-free protein synthesis, bioRxiv (2017).

[26] [26].↵
A. Bar-Even, E. Noor, Y. Savir, W. Liebermeister, D. Davidi, D. S. Tawfik, R. Milo, The moderately efficient enzyme: Evolutionary and physicochemical trends shaping enzyme parameters, Biochemistry 50 (2011).

[27] [27].↵
I. Sobol, Global sensitivity indices for nonlinear mathematical models and their monte carlo estimates., Math Comput Simulat 55 (2001) 271–80.
OpenUrl CrossRef

[28] [28].↵
J. Swartz, A pure approach to constructive biology, Nat Biotechnol 19 (2001) 732–3.
OpenUrl CrossRef PubMed Web of Science

[29] [29].↵
O. Kotte, J. B. Zaugg, M. Heinemann, Bacterial adaptation through distributed sensing of metabolic fluxes, Mol Syst Biol 6 (2010) 355.
OpenUrl Abstract/FREE Full Text

[30] [30].↵
R. Cabrera, M. Baez, H. M. Pereira, A. Caniuguir, R. C. Garratt, J. Babul, The crystal complex of phosphofructokinase-2 of Escherichia coli with fructose-6-phosphate: kinetic and structural analysis of the allosteric ATP inhibition, J Biol Chem 286 (2011) 5774–83.
OpenUrl Abstract/FREE Full Text

[31] [31].↵
M. Chulavatnatol, D. E. Atkinson, Phosphoenolpyruvate synthetase from Escherichia coli. Effects of adenylate energy charge and modifier concentrations, J Biol Chem 248 (1973) 2712–5.

[32] [32].↵
T. Ogawa, K. Murakami, H. Mori, N. Ishii, M. Tomita, M. Yoshin, Role of phosphoenolpyruvate in the NADP-isocitrate dehydrogenase and isocitrate lyase reaction in Escherichia coli, J Bacteriol 189 (2007) 1176–8.
OpenUrl Abstract/FREE Full Text

[33] [33].↵
C. MacKintosh, H. G. Nimmo, Purification and regulatory properties of isocitrate lyase from Escherichia coli ML308, Biochem J 250 (1988) 25–31.
OpenUrl Abstract/FREE Full Text

[34] [34].↵
J. L. Donahue, J. L. Bownas, W. G. Niehaus, T. J. Larson, Purification and characterization of glpX-encoded fructose 1, 6-bisphosphatase, a new enzyme of the glycerol 3-phosphate regulon of Escherichia coli, J Bacteriol 182 (2000) 5624–7.
OpenUrl Abstract/FREE Full Text

[35] [35].↵
J. K. Hines, H. J. Fromm, R. B. Honzatko, Novel allosteric activation site in Escherichia coli fructose-1,6-bisphosphatase, J Biol Chem 281 (2006) 18386–93.
OpenUrl Abstract/FREE Full Text

[36] [36].↵
J. K. Hines, H. J. Fromm, R. B. Honzatko, Structures of activated fructose-1,6-bisphosphatase from Escherichia coli. Coordinate regulation of bacterial metabolism and the conservation of the R-state, J Biol Chem 282 (2007) 11696–704.
OpenUrl Abstract/FREE Full Text

[37] [37].↵
D. S. Pereira, L. J. Donald, D. J. Hosfield, H. W. Duckworth, Active site mutants of Escherichia coli citrate synthase. Effects of mutations on catalytic and allosteric properties, J Biol Chem 269 (1994) 412–7.
OpenUrl Abstract/FREE Full Text

[38] [38].↵
M. S. Robinson, R. A. Easom, M. J. Danson, P. D. Weitzman, Citrate synthase of Escherichia coli. Characterisation of the enzyme from a plasmid-cloned gene and amplification of the intracellular levels, FEBS Lett 154 (1983) 51–4.
OpenUrl PubMed

[39] [39].↵
T. Zhu, M. F. Bailey, L. M. Angley, T. F. Cooper, R. C. Dobson, The quaternary structure of pyruvate kinase type 1 from Escherichia coli at low nanomolar concentrations, Biochimie 92 (2010) 116–20.
OpenUrl CrossRef PubMed

[40] [40].↵
R. C. Wohl, G. Markus, Phosphoenolpyruvate carboxylase of Escherichia coli. Purification and some properties, J Biol Chem 247 (1972) 5785–92.
OpenUrl Abstract/FREE Full Text

[41] [41].↵
S. Kale, P. Arjunan, W. Furey, F. Jordan, A dynamic loop at the active center of the Escherichia coli pyruvate dehydrogenase complex E1 component modulates substrate utilization and chemical communication with the E2 component, J Biol Chem 282 (2007) 28106–16.
OpenUrl Abstract/FREE Full Text

[42] [42].↵
P. Arjunan, N. Nemeria, A. Brunskill, K. Chandrasekhar, M. Sax, Y. Yan, F. Jordan, J. R. Guest, W. Furey, Structure of the pyruvate de-hydrogenase multienzyme complex E1 component from Escherichia coli at 1.85 A resolution, Biochemistry 41 (2002) 5213–21.
OpenUrl CrossRef PubMed Web of Science

[43] [43].↵
S. Okino, M. Suda, K. Fujikura, M. Inui, H. Yukawa, Production of D-lactic acid by Corynebacterium glutamicum under oxygen deprivation, Appl Microbiol Biotechnol 78 (2008) 449–54.
OpenUrl CrossRef PubMed

[44] [44].↵
R. Tasseff, S. Nayak, S. Salim, P. Kaushik, N. Rizvi, J. D. Varner, Analysis of the molecular networks in androgen dependent and independent prostate cancer revealed fragile and robust subsystems, PLoS One 5 (2010) e8864.
OpenUrl CrossRef PubMed

[45] [45].↵
R. Tasseff, S. Nayak, S. O. Song, A. Yen, J. D. Varner, Modeling and analysis of retinoic acid induced differentiation of uncommitted precursor cells, Integr Biol (Camb) 3 (2011) 578–91.
OpenUrl

[46] [46].↵
J. D. Herman, http://jdherman.github.io/salib/, ????

[47] [47].↵
A. C. Hindmarsh, P. N. Brown, K. E. Grant, S. L. Lee, R. Serban, D. E. Shumaker, C. S. Woodward, SUNDIALS: Suite of nonlinear and differential/algebraic equation solvers, ACM T Math Software (TOMS) 31 (2005) 363–396.
OpenUrl

[48] [48].
T. Kigawa, Y. Muto, S. Yokoyama, Cell-free synthesis and amino acid-selective stable isotope labeling of proteins for NMR analysis, J Biomolec NMR 6 (1995) 129–134.
OpenUrl

Toward a Genome Scale Sequence Specific Dynamic Model of Cell-Free Protein Synthesis in Escherichia coli

Abstract

Introduction

Results

Discussion

Materials and Methods

Formulation and solution of the model equations

Estimation of kinetic model parameters

Reaction group knockouts

Sensitivity of CAT productivity to transcription and translation

Calculation of energy efficiency

Availability of model code

Competing interests

Author’s contributions

Funding

Acknowledgements

References

Citation Manager Formats

Subject Area