Analysis of Two-State Folding Using Parabolic Approximation III: Non-Arrhenius Kinetics of FBP28 WW Part-I

Robert S. Sade

doi:10.1101/038331

Abstract

A model which treats the denatured and the native conformers as being confined to harmonic Gibbs energy wells has been used to analyse the non-Arrhenius behaviour of spontaneously-folding fixed two-state systems. The results demonstrate that when pressure and solvent are constant: (i) a two-state system is physically defined only for a finite temperature range; (ii) irrespective of the primary sequence, the 3-dimensional structure of the native conformer, the residual structure in the denatured state, and the magnitude of the folding and unfolding rate constants, the equilibrium stability of a two-state system is a maximum when its denatured conformers bury the least amount of solvent accessible surface area (SASA) to reach the activated state; (iii) the Gibbs barriers to folding and unfolding are not always due to the incomplete compensation of the activation enthalpies and entropies; (iv) the difference in heat capacity between the reaction-states is due to both the size of the solvent-shell and the non-covalent interactions; (v) the position of the transition state ensemble along the reaction coordinate (RC) depends on the choice of the RC; and (vi) the atomic structure of the transiently populated reaction-states cannot be inferred from perturbation-induced changes in their energetics.

Introduction

It was shown elsewhere, henceforth referred to as Papers I and II, that the equilibrium and kinetic behaviour of spontaneously-folding fixed two-state systems can be analysed by a treatment that is analogous to that given by Marcus for electron transfer.^1-3 In this framework termed the parabolic approximation, the Gibbs energy functions of the denatured state ensemble (DSE) and the native state ensemble (NSE) are represented by parabolas whose curvature is given by their temperature-invariant force constants, α and ω, respectively. The temperature-invariant mean length of the reaction coordinate (RC) is given by m_D-N and is identical to the separation between the vertices of the DSE and the NSE-parabolas along the abscissa. Similarly, the position of the transition state ensemble (TSE) relative to the DSE and the NSE are given by m_TS-D(T) and m_TS-N(T), respectively, and are identical to the separation between the curve-crossing and the vertices of the DSE and the NSE-parabolas, respectively. The Gibbs energy of unfolding at equilibrium, ΔG_D-N(T), is identical to the separation between the vertices of the DSE and the NSE-parabolas along the ordinate. Similarly, the Gibbs activation energy for folding (ΔG_TS-D(T)) and unfolding (ΔG_TS-N(T)) are identical to the separation between the curve-crossing and the vertices of the DSE and the NSE-parabolas along the ordinate, respectively.

The purpose of this article is to use the framework described in Papers I and II to analyse the non-Arrhenius behaviour of the 37-residue FBP28 WW domain, at an unprecedented range and resolution.⁴

Equations

The expressions for the position of the TSE relative to the vertices of the DSE and the NSE Gibbs parabolas are given by where the discriminant φ = λω + ΔG_D-N(T) (ω − α), and λ = α(m_D-N)² is the Marcus reorganization energy for two-state protein folding. The expressions for the activation energies for folding and unfolding are given by where the parameters β_T(fold)(T) (= m_TS-D(T)/m_D-N) and β_T(unfold)(T) (= m_TS-N(T)/m_D-N) are according to Tanford’s framework.⁵ The expressions for the rate constants for folding (k_f(T)) and unfolding (k_u(T)), and ΔG_D-N(T) are given by where, k⁰ is the temperature-invariant prefactor with units identical to those of the rate constants (s⁻¹), R is the gas constant, T is the absolute temperature. If the temperature-dependence of ΔG_D-N(T) and the values of α, ω, and m_D-N are known for any two-state system at constant pressure and solvent conditions (see Methods), the temperature-dependence of the curve-crossing relative to the ground states may be readily ascertained. The temperature-dependence of curve-crossing is central to this analysis since all other parameters can be readily derived by manipulating the same using standard kinetic and thermodynamic relationships.

The activation entropies for folding (ΔS_TS-D(T)) and unfolding (ΔS_TS-N(T)) are given by the first derivatives of ΔG_TS-D(T) and ΔG_TS-N(T) functions with respect to temperature where T_S is the temperature at which the entropy of unfolding at equilibrium is zero (ΔS_D-N(T) = 0) and ΔC_pD-N is the temperature-invariant difference in heat capacity between the DSE and the NSE.⁶ The activation enthalpies for folding (ΔH_TS-D(T)) and unfolding (ΔH_TS-N(T)) may be readily obtained by recasting the Gibbs equation: ΔH_(T) = ΔG_(T) + TΔS_(T), or from the temperature-dependence of k_f(T) and k_u(T) to give

The difference in heat capacity between the DSE and the TSE (i.e., for the partial unfolding reaction [TS] ⇌ D) is given by

Similarly, the difference in heat capacity between the TSE and the NSE (for the partial unfolding reaction N ⇌ [TS]) is given by

The reader may refer to Papers I and II for the derivations.

Results and Discussion

As mentioned earlier and discussed in sufficient detail in Papers I and II, the analysis we are going to perform has an explicit requirement for a minimal experimental dataset which are: (i) an experimental chevron obtained at constant temperature, pressure and solvent conditions (except for the denaturant); (ii) an equilibrium thermal denaturation curve obtained under constant pressure, and in solvent conditions identical to those in which the chevron was acquired but without the denaturant, using either calorimetry or spectroscopy; and (iii) the calorimetrically determined ΔC_pD-N value (i.e., the slope of the linear regression of a plot of model-independent ΔH_{D-N(T_m)} vs T_m, where ΔH_{D-N(T_m)} is the enthalpy of unfolding at the midpoint of thermal denaturation, T_m; see Fig. 4 in Privalov, 1989).⁷ Fitting the chevron to a modified chevron-equation using non-linear regression yields the values of m_D-N, the force constants α and ω, and the prefactor k⁰ (k⁰ is assumed to be temperature-invariant; see Methods in Paper I). Fitting a spectroscopic sigmoidal equilibrium thermal denaturation curve using standard two-state approximation (van’t Hoff analysis using temperature-invariant ΔC_pD-N) yields van’t Hoff ΔH_{D-N(T_m)} and T_m and enables the temperature-dependence of ΔH_D-N(T), ΔS_D-N(T) and ΔG_D-N(T) functions to be ascertained across a wide temperature regime (Eqs. (A1)-(A3), Figure 1 and Figure 1−figure supplement 1).⁶ Once the values of m_D-N, the force constants, the prefactor, and the temperature dependence of ΔG_D-N(T) are known, the rest of the analysis is fairly straightforward. The values of all the reference temperatures that appear in this article are given in Table 1.

Figure 1-figure supplement 1. Stability curve for the folding reaction N ⇌ D.

(A) Temperature-dependence of ΔG_N-D(T), ΔH_N-D(T), and TΔS_N-D(T). The green pointers identify T_c and T_m. The slopes of the red and black curves are given by ∂ΔG_N-D(T)/∂T=-ΔS_N-D(T) and ∂ΔH_N-D(T)/∂T=ΔC_pN-D, respectively. (B) An appropriately scaled version of plot on the left. The reference temperatures are as described in the parent figure.

Figure 1 Stability curve for the unfolding reaction N ⇌ D.

(A) Temperature-dependence of ΔH_D-N(T), ΔS_D-N(T) and ΔG_D-N(T) according to Eqs. (A1), (A2) and (A3), respectively. The green pointers identify the cold (T_c) and heat (T_m) denaturation temperatures. The slopes of the red and black curves are given by ∂ΔG_D-N(T)/∂T=-ΔS_D-N(T) and ∂ΔH_D-N(T)/∂T=ΔC_pD-N, respectively. (B) An appropriately scaled version of the plot on the left. T_H is the temperature at which ΔH_D-N(T) = 0, and T_S is the temperature at which ΔS_D-N(T) = 0. The values of the reference temperatures are given in Table 1.

View this table:

Table 1

Reference temperatures

Temperature-dependence of m_TS-D(T) and m_TS-N(T)

Substituting the expression for the temperature-dependence of G_D-N(T) (Eq. (A3), Figure 1) in Eqs. (1) and (2) enables the temperature-dependence of the curve-crossing relative to the DSE and the NSE to be ascertained (Figure 2; substituted expressions not shown). Because by postulate the force constants, ΔC_pD-N, and m_D-N are temperature-invariant for any given primary sequence that folds in a two-state manner at constant pressure and solvent conditions, we get from inspection of Eqs. (1) and (2) that the discriminant φ, and must be a maximum when ΔG_D-N(T) is a maximum. Because ΔG_D-N(T) is a maximum at T_S (the temperature at which the entropy of unfolding at equilibrium, ΔS_D-N(T), is zero),⁶ a corollary is that φ and must be a maximum at T_S; and any deviation in the temperature from T_S will only lead to their decrease. Consequently, m_TS-D(T) and β_T(fold)(T) (= m_TS-D(T)/m_D-N) are always a minimum, and m_TS-N(T) and β_T(unfold)(T) (= m_TS-N(T)/m_D-N) are always a maximum at T_S. This gives rise to two further corollaries: Any deviation in the temperature from T_S can only lead to: (i) an increase in m_TS-D(T) and β_T(fold)(T); and (ii) a decrease in m_TS-N(T) and β_T(unfold)(T) (Figure 2 and Figure 2−figure supplement 1). In other words, when T = T_S, the TSE is the least native-like in terms of the SASA (solvent accessible surface area), and any deviation in temperature causes the TSE to become more native-like. A further consequence of m_TS-D(T) being a minimum at T_S is that if for a two-state-folding primary sequence there exists a chevron with a well-defined linear folding arm at T_S, then m_TS-D(T) > 0 and β_T(fold)(T) > 0 for all temperatures (Figure 2A and Figure 2−figure supplement 1A). Since the curve-crossing is physically undefined for φ < 0 owing to there being no real roots, the maximum theoretically possible value of m_TS-D(T) will occur when φ = 0 and is given by: where T_α and T_ω are the temperature limits such that for T < T_α and T > T_ω, a two-state system is not physically defined (see Paper II). Because m_D-N = m_TS-D(T) + m_TS-N(T) for a two-state system, and m_D-N is temperature-invariant by postulate, the theoretical minimum of m_TS-N(T) is given by: . Now, since m_TS-N(T) is a maximum and positive at T_S but its minimum is negative, a consequence is that m_TS-N(T) = β_T(unfold)(T) = 0 at two unique temperatures, one in the ultralow (T_S(α)) and the other in the high (T_S(ω)) temperature regime, and negative for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω (Figure 2B and Figure 2−figure supplement 1B). Obviously, m_TS-D(T) = m_D-N and β_T(fold)(T) is unity at T_S(α) and T_S(ω). To summarize, unlike m_TS-D(T) and β_T(fold)(T) which are positive for all temperatures and a minimum at T_S, m_TS-N(T) and β_T(unfold)(T) are a maximum at T_S, zero at T_S(α) and T_S(ω), and negative for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω.

Figure 2−figure supplement 1 Temperature-dependence of β_T(fold)(T) and β_T(unfold)(T).

(A) β_T(fold)(T) is a minimum at T_S, unity at T_S(α) and T_S(ω), and greater than unity for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω. The slope of this curve is given by (B) β_T(unfold)(T) is a maximum at T_S, zero at T_S(α) and T_S(ω), and negative for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω. The slope of this curve is given by . From the perspective of Tanford’s framework, the SASA of the TSE is the least native-like at T_S but becomes progressively more native-like as the temperature deviates from the T_S, and is identical to the SASA of the NSE at T_S(α) and T_S(ω); and for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω, the TSE is more compact than the NSE.

Figure 2 Temperature-dependence of m_TS-D(T) and m_TS-N(T).

(A) m_TS-D(T) is a minimum at T_S, is identical to m_D-N at T_S(α) and T_S(ω), and is greater than m_D-N for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω. The slope of this curve is given by (B) m_TS-N(T) is a maximum at T_S, zero at T_S(α) and T_S(ω), and negative for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω. The slope of this curve is given by . While the slopes of these curves are related to the activation entropies, the second derivatives of these functions with respect to temperature are related to the heat capacities of activation as shown in Paper-II. The values of the reference temperatures are given in Table 1.

The predicted Leffler-Hammond shift, which must be valid for any two-state system, is in agreement with the experimental data on the temperature-dependent behaviour of other two-state systems (Table 1 in Dimitriadis et al., 2004; Table 1 in Taskent et al., 2008; Fig. 5C in Otzen and Oliveberg, 2004),^8-12 with the rate at which the curve-crossing shifts with stability (relative to the vertex of the DSE-parabola) being given by . Importantly, just as the Leffler-Hammond movement is rationalized in physical organic chemistry using Marcus theory,^13-15 we can similarly rationalize these effects in protein folding using parabolic approximation (Figures 3, 4, and Figure 4−figure supplement 1). When T = T_S, ΔG_TS-D(T) is a minimum, and ΔG_D-N(T) and ΔG_TS-N(T) are both a maximum; and any increase or decrease in the temperature relative to T_S leads to a decrease in ΔG_TS-N(T), and an increase in ΔG_TS-D(T), consequently, leading to a decrease in ΔG_D-N(T) (Figures 1, 3B and 5). Naturally at T_c and T_m, ΔG_TS-D(T) = ΔG_TS-N(T), k_f(T) = k_u(T), and ΔG_D-N(T)= 0 (Figure 3C). The reason why m_TS-D(T) = m_D-N, and m_TS-N(T) = 0 at T_S(α) and T_S(ω) is apparent from Figures 4A, 4C and Figure 4−figure supplement 1A: The right arm of the DSE-parabola intersects the vertex of the NSE-parabola leading to ΔG_TS-D(T) = α(m_TS-D(T))² = α(m_D-N)² = λ, ΔG_TS-N(T) = ω(m_TS-N(T))² = 0, and ΔG_D-N(T) = − λ. Importantly, in contrast to unfolding which can become barrierless at T_S(α) and T_S(ω), folding is barrier-limited at all temperatures, with the absolute minimum of ΔG_TS-D(T) occurring at T_S; and any deviation in the temperature from T_S will only lead to an increase in ΔG_TS-D(T) (Figure 5A). Thus, a corollary is that if folding is barrier-limited at T_S (i.e., the chevron has a well-defined linear folding arm with a finite slope at T_S), then a protein that folds via two-state mechanism can never spontaneously (i.e., unaided by ligands, co-solvents etc.) switch to a downhill mechanism (Type 0 scenario according to the Energy Landscape Theory; see Fig. 6 in Onuchic et al., 1997), no matter what the temperature, and irrespective of how fast or slow it folds. Although unfolding is barrierless at T_S(α) and T_S(ω), it is once again barrier-limited for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω, with the curve-crossing occurring to the right of the vertex of the NSE-parabola (Figures 4A, 4B, Figure 4−figure supplement 1B and 5B), such that m_TS-D(T) > m_D-N, m_TS-N(T) < 0, β_T(fold)(T) > 1 and β_T(unfold)(T) < 0 (Figure 2 and Figure 2−figure supplement 1).

Figure 4−figure supplement 1 An appropriately scaled view of Marcus curve-crossings at T_S(ω) and T_ω.

(A) Curve-crossing at T_S(α) and T_S(ω) where m_TS-D(T) = m_D-N, m_TS-N(T) = 0, ΔG_TS-N(T) = 0, ΔG_TS-D(T) = α(m_D-N)² = λ, ΔG_D-N(T) = − λ, and k_u(T) = k⁰. (B) Curve-crossing at T_ω where m_TS-D(T) > m_D-N.

Figure 3 Marcus curve-crossings at T_S and T_m.

(A) Figure 2A reproduced for comparison. (B) Curve-crossing at T_S where ΔG_D-N(T) is a maximum and purely enthalpic (Figure 1). The relevant parameters are as follows: ΔG_TS-D(T) = 2.547 kcal.mol^-1, ΔG_TS-N(T) = 4.964 kcal.mol^-1, ΔG_D-N(T) = 2.417 kcal.mol^-1, k_f(T) = 22009 s^-1, k_u(T) = 280.8 s^-1, m_TS-D(T) = 0.5792 kcal.mol^-1.M^-1 and m_TS-N(T) = 0.2408 kcal.mol^-1.M^-1. (C) Curve-crossing at T_m and T_c where ΔG_TS-D(T) = ΔG_TS-N(T) = 3.032 kcal.mol^-1, ΔG_D-N(T) = 0, k_f(T) = k_u(T) = 23618 s^-1, m_TS-D(T) = 0.6319 kcal.mol^-1.M^-1 and m_TS-N(T) = 0.1881 kcal.mol^-1.M^-1.The DSE and the NSE-parabolas are given by G_DSE(r,T)=αr² and G_NSE(r,T) = ω(m_D-N − r)² − ΔG_D-N(T), respectively, α = 7.594 M².mol.kcal^-1, ω = 85.595 M².mol.kcal^-1, m_D-N (0.82 kcal.mol^-1.M^-1) is the separation between the vertices of the DSE and the NSE-parabolas along the abscissa, and r is any point on the abscissa. The abscissae are identical for plots B and C. The values of the reference temperatures are given in Table 1.

Figure 5 Temperature-dependence of the Gibbs activation energies for folding and unfolding.

(A) ΔG_TS-D(T) is a minimum at T_S, identical to λ = α(m_D-N)² = 5.106 kcal.mol^-1 at T_S(α) and T_S(ω), and greater than λ for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω. Note that ∂ΔG_TS-D(T)/∂T = −ΔS_TS-D(T) = 0 at T_S. (B) In contrast to ΔG_TS-D(T) which has only one extremum, ΔG_TS-N(T) is a maximum at T_S and a minimum (zero) at T_S(α) and T_S(ω); consequently, ∂ΔG_TS-N(T)/∂T = −∂S_TS-N(T) = 0 at T_S(α), T_S and T_S(ω). Although unfolding is barrierless at T_S(α) and T_S(ω), it is once again barrier-limited for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω; however, unlike the conventional barrier-limited unfolding which is characteristic for T_S(α) < T < T_S(ω), these two regimes fall under the Marcus-inverted-region and can be rationalized from Figures 2, 4, and their figure supplements. The values of the reference temperatures are given in Table 1.

Figure 6 Arrhenius plots for the temperature-dependence of the rate constants.

(A) k_f(T) is a maximum and ΔHTS-D(T)= 0 at T_H(TS-D). The slope of this curve is given by −ΔH_TS-D(T)/R. (B) Unlike k_f(T) which has only one extremum, k_u(T) is a minimum at T_H(TS-N) and a maximum at T_S(α) and T_S(ω). Consequently, ΔH_TS-N(T) = 0 at T_S(α), T_H(TS-N) and T_S(ω). The slope of this curve is given by −ΔH_TS-N(T)/R. When T = T_S(α) or T_S(ω), we have a unique scenario: m_TS-N(T) = ΔG_TS-N(T) = ΔH_TS-N(T) = 0 ⇒ ΔSTS-N(T) = 0, and k_u(T) = k⁰. Although unfolding is barrier-limited for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω, leading to k_u(T) < k⁰, these ultra-low and high temperature regimes fall under the Marcus-inverted-regime as compared to the conventional barrier-limited unfolding which is characteristic for T_S(α) < T < T_S(ω) (the curve-crossing occurs in-between the vertices of the DSE and the NSE Gibbs basins) and can be rationalized comprehensively when considered in conjunction with Figures 2, 4, and 5 (see also their figure supplements if any). The maxima of k_f(T) and k_u(T), as well as the inverted-region can be better appreciated on a linear scale as shown in the figure supplement. The values of the reference temperatures are given in Table 1.

Figure 7 Temperature-dependence of the observed rate constant.

(A) k_obs(T) is a maximum at T_S(α) and T_S(ω), and a minimum around T_c (blue pointer). The red pointer indicates T_m. The steep increase in k_obs(T) at very low and high temperatures is due to ΔG_TS-N(T) approaching zero as described in previous figures. (B) An overlay of k_f(T), k_u(T) and k_obs(T) to illuminate how the features of k_obs(T) arise from the sum of k_f(T) and k_u(T). The slopes of the red and blue curves are given by ∆H_TS-D(T)/RT² and ∆H_TS-N(T)/RT², respectively.

To summarize, for any two-state folder, unfolding is conventional barrier-limited for T_S(α) < T < T_S(ω) and the position of the TSE or the curve-crossing occurs in between the vertices of the DSE and the NSE parabolas. As the temperature deviates from T_S, the SASA of the TSE becomes progressively native-like, with a concomitant increase and a decrease in ΔG_TS-D(T) and ΔG_TS-N(T), respectively. When T = T_S(α) and T_S(ω), the curve-crossing occurs precisely at the vertex of the NSE-parabola, the SASA of the TSE is identical to that of the NSE, and unfolding is barrierless; and for T_α ≤ T < T_Sα and T_S(ω) < T ≤ T_ω, unfolding is once again barrier-limited but falls under the Marcus-inverted-regime with the curve-crossing occurring on the right-arm of the NSE-parabola, leading to the SASA of the NSE being greater than that of the TSE (i.e., the TSE is more compact than the NSE). Importantly, for T < T_α and T > T_ω, the TSE cannot be physically defined owing to being mathematically undefined for φ > 0. A consequence is that k_f(T) and k_u(T) become physically undefined, leading to ΔG_D-N(T) = RT ln (k_f(T)/k_u(T)) being physically undefined, such that all of the conformers will be confined to a single Gibbs energy well, which is the DSE, and the protein will cease to function.¹⁶ Thus, from the view point of the physics of phase transitions, T_α ≤ T ≤ T_ω denotes the coexistence temperature-range where the DSE and the NSE, which are in a dynamic equilibrium, will coexist as two distinct phases; and for T < T_α and T > T_ω there will be a single phase, which is the DSE, with T_α and T_ω being the limiting temperatures for coexistence, or phase boundary temperatures from the view point of the DSE.^17-23 This is roughly analogous to the operating temperature range of a logic circuit such as a microprocessor; and just as this range is a function of its constituent material, the physically definable temperature range of a two-state system is a function of the primary sequence when pressure and solvent are constant, and importantly, can be modulated by a variety of cis-acting and trans-acting factors (see Paper-I). The limit of equilibrium stability below which a two-state system becomes physically undefined is given by: ΔG_D-N(T)|_{T = T_α, T_ω} = −λω/(ω − α). Consequently, the physically meaningful range of equilibrium stability for a two-state system is given by: ΔG_D-N_{(T_S)} + [λω/(ω − α)], where ΔG_D-N_{(T_S)} is the stability at T_S and is apparent from inspection of Figure 5−figure supplement 1. This is akin to the stability range over which Marcus theory is physically realistic (see Kresge, 1973, page 494).²⁴

Figure 5−figure supplement 1 The principle of least displacement.

(A) The stability of a two-state system at constant pressure and solvent conditions is the greatest when the denatured conformers are displaced the least from the mean of their ensemble along the SASA-RC to reach the TSE. The length of the green dotted line is identical to ΔG_{D-N(T_s)} + [ λω/(ω-α)], where ΔG_{D-N(T_S)} is the stability at T_S. The slope of this curve equals . (B) ΔG_D-N(T) will be the greatest when the native conformers expose the greatest amount of SASA to reach the TSE. The slope of this curve equals .

Because by postulate m_D-N, m_TS-D(T) and m_TS-N(T) are true proxies for ΔSASA_D-N, ΔSASA_D-TS(T) and ΔSASA_TS-N(T), respectively (see Paper I), we have three fundamentally important corollaries that must hold for all two-state systems at constant pressure and solvent conditions: (i) the Gibbs barrier to folding is the least when the denatured conformers bury the least amount of SASA to reach the TSE (Figure 5−figure supplement 2A); (ii) the Gibbs barrier to unfolding is the greatest when the native conformers expose the greatest amount of SASA to reach the TSE (Figure 5−figure supplement 2B); and (iii) equilibrium stability is the greatest when the conformers in the DSE are displaced the least from the mean of their ensemble along the SASA-RC to reach the TSE (the principle of least displacement; Figure 5−figure supplement 1).

Figure 5−figure supplement 2 Gibbs activation energies as a function of the position of the TSE along the RC.

(A) ΔG_TS-D(T) is the least when the denatured conformers bury the least amount of SASA to reach the TSE. The slope of this curve equals 2λβ_T(fold)(T). (B) ΔG_TS-N(T) is the greatest when the native conformers expose the greatest amount of SASA to reach the TSE. The green pointer indicates T_S(α) and T_S(ω) where m_TS-D(T) = m_D-N, m_TS-N(T) = β_T(unfold)(T) = 0, ΔG_TS-N(T) = 0, ΔG_TS-D(T) = α(m_D-N)² = λ, and ΔG_D-N(T) = − λ. The slope of this curve equals 2ωm_TS-N(T)m_D-N. Because m_TS-N(T) < 0 for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω, the slope is negative for the part that is to the left of the green pointer.

Temperature-dependence of the folding, unfolding, and the observed rate constants

Inspection of Figures 6A and Figure 6−figure supplement 1A demonstrates that Eq. (5) makes a remarkable prediction that k_f(T) has a non-linear dependence on temperature. Starting from the lowest temperature (T_α) at which a two-state system is physically defined, k_f(T) initially increases with an increase in the temperature and reaches a maximal value at T = T_H(TS-D) where ∂ ln k_f(T)/∂T = ΔH_TS-D(T)/RT² = 0; and any further increase in temperature beyond this point will cause a decrease in k_f(T) until the temperature T_ω is reached, such that for T > T_ω, k_f(T) is undefined. Inspection of Figures 6B and Figure 6−figure supplement 1B demonstrates that the temperature-dependence of k_u(T) is far more complex: Starting from T_α, k_u(T) increases with temperature for the regime T_α ≤ T < T_S(α) (the low-temperature Marcus-inverted-regime), reaches a maximum when T = T_S(α) (k_u(T) = k⁰; the first extremum of k_u(T)), and decreases with further rise in temperature for the regime T_S(α) < T < T_H(TS-N) such that when T = T_H(TS-N), k_u(T) is a minimum (the second extremum of k_u(T)). And for T_H(TS-N) < T < T_S(ω), an increase in temperature will lead to an increase in k_u(T), eventually leading to its saturation at T = T_S(ω) (k_u(T) = k⁰; the third extremum of k_u(T)), and decreases with further rise in temperature for T_S(ω) < T ≤ T_ω (the high-temperature Marcus-inverted-regime). Thus, in contrast to k_f(T) which has only one extremum, k_u(T) is characterised by three extrema where ∂ ln k_u(T)/∂T = ΔH_TS-N(T)/R² = 0, and may be rationalized from the temperature-dependence of m_TS-D(T) and m_TS-N(T), the Gibbs barrier heights for folding and unfolding, and the intersection of the DSE and the NSE Gibbs parabolas (Figures 2-5 and their figure supplements). We will show in subsequent publications that the inverted behaviour at very low and high temperatures is not common to all fixed two-state systems and depends on the mean and variance of the Gaussian distribution of the SASA of the conformers in the DSE and the NSE.

Figure 6−figure supplement 1 Temperature-dependence of k_f(T) and k_u(T) on a linear scale.

(A) k_f(T) is a maximum and ΔH_TS-D(T) = 0 at T_H(TS-D). The slope of this curve is given by k_f(T)∆H_TS-D(T)/RT². (B) Unlike k_f(T) which has only one extremum, k_u(T) is a minimum at T_H(TS-N) and a maximum at T_S(α) and T_S(ω). Although the minimum of k_u(T) is not apparent on a linear scale, the barrierless and inverted-regimes for unfolding are readily apparent. The slope of this curve is given by k_u(T)∆H_TS-N(T)/RT². The features of these curves arise primarily from the temperature-dependence of the equilibrium constants for the partial folding (D ⇌ [TS]) and unfolding (N ⇌ [TS]) reactions as shown later. The green dots represent T_S.

Since the ultimate test of any hypothesis is experiment, the most important question now is how well do the calculated rate constants compare with experiment? Although Nguyen et al. have investigated the non-Arrhenius behaviour of the FBP28 WW, they find that the behaviour of its wild type is erratic, with its folding being three-state for T < T_m and two-state for T > T_m (Fig. 3A in Nguyen et al., 2003). Consequently, non-Arrhenius data for the wild type FBP28 WW are lacking. Incidentally, this atypical behaviour is probably artefactual since the protein aggregates and forms fibrils under the experimental conditions in which the measurements were made (see Figs. 2, 3 and 6 in Ferguson et al., 2003).^25,26 Nevertheless, data for ΔNΔC Y11R W30F, a variant of FBP28 WW are available between ∽ 298 and ∽357 K (Fig. 4A in Nguyen et al., 2003). Now since the relaxation time constants for the fast phase of wild type FBP28 WW (∽ 30 μs at 39.5 °C and < 15 μs at 65 °C, page 3950, Fig. 3A, Nguyen et al., 2003) are very similar to those of ΔNΔC Y11R W30F (∽ 28 μs at 40 °C and 11 μs at 65 °C, page 3952), a reasonable approximation is that the temperature-dependence of k_f(T) and k_u(T) of the wild type and the mutant must be similar. Consequently, the temperature-dependence of the rate constants for the wild type FBP28 WW calculated using parabolic approximation must be very similar to the data for ΔNΔC Y11R W30F reported by Nguyen et al. The remarkable agreement between the said datasets is readily apparent from a comparison of Fig. 4A of Nguyen et al., and Figure 6−figure supplement 2, and serves an important test of the hypothesis.

Figure 6−figure supplement 2 Arrhenius plot for the temperature-dependence of the rate constants with the ordinate on a Log scale (base10).

A combined and appropriately rescaled version of Figure 6 to enable a ready comparison of the rate constants for FBP28 WW wild type (calculated using parabolic approximation) and the experimental rate constants for ΔNΔC Y11R W30F, a variant of FBP28 WW (reported by Nguyen et al., 2003, Fig. 4A). Note that the intersection of k_f(T) and k_u(T) is shifted to the left along the abscissa for the wild type FBP28 WW since its T_m is ∽ 10 K greater than that of ΔNΔC Y11R W30F (see Table 1 in Nguyen et al., 2003).²⁵

Since the temperature-dependence of k_f(T) and k_u(T) across a wide temperature range is known, the variation in the observed rate constant (k_obs(T)) with temperature may be readily ascertained using (see Appendix)

Inspection of Figure 7 demonstrates that ln(k_obs(T)) vs temperature is a smooth ‘W-shaped’ curve, with k_obs(T) being dominated by k_f(T) around T_H(TS-N), and by k_u(T) for T < T_c and T > T_m, which is precisely why the kinks in ln(k_obs(T)) occur around these temperatures. It is easy to see that at T_c or T_m, k_f(T) = k_u(T) ⇒ k_obs(T) = 2k_f(T) = 2k_u(T), ΔG_D-N(T) = RT ln (k_f(T)/k_u(T))) = 0 or ΔG_TS-D(T) = ΔG_TS-N(T) (Figures 3C and Figure 7−figure supplement 1). In other words, for a two-state system, T_c and T_m determined at equilibrium must be identical to the temperatures at which k_f(T) and k_u(T) intersect. This is a consequence of the principle of microscopic reversibility, i.e., the equilibrium and kinetic stabilities must be identical for a two-state system at all temperatures.²⁷ It is precisely for this reason that the value of the prefactor in the Arrhenius expressions for the rate constants must be identical for both the folding and the unfolding reactions at all temperatures (Eqs. (5) and (6)). The steep increase in k_obs(T) for T < T_c and T > T_m is due to the ΔG_TS-N(T) approaching zero as described earlier. The argument that the shapes of the curves must be conserved across two-state systems applies not only to the temperature-dependence of m_TS-D(T), m_TS-N(T), ΔG_TS-D(T) and ΔG_TS-N(T) described so far, but to the rest of the state functions that will be described in this article (see Paper-I).

Figure 7−figure supplement 1 The principle of microscopic reversibility.

(A) k_f(T) is a maximum at T_H(TS-D) and k_u(T) is a minimum at T_H(TS-N). The slopes of the black and grey curves are given by ∆H_TS-D(T)/RT² and ∆H_TS-N(T)/RT², respectively. (B) ΔGTS-D(T) and ΔGTS-N(T) are a minimum and a maximum, respectively, at T_S (red pointers) leading to ΔG_D-N(T) being a maximum at T_S (Figure 1). Equilibrium stability is thus a consequence or the equilibrium manifestation of the underlying kinetic behaviour. The rate constants are identical at T_c and T_m, leading to ∆G_D-N(T)=RT ln (k_f(T)/k_u(T))=∆G_TS-N(T)-∆G_TS-D(T)=0.

An important conclusion that we may draw from these data is the following: Because we have assumed a temperature-invariant prefactor and yet find that the kinetics are non-Arrhenius, it essentially implies that one does not need to invoke a super-Arrhenius temperature-dependence of the configurational diffusion constant to explain the non-Arrhenius behaviour of proteins.^28-32 Instead, as long as the enthalpies and the entropies of unfolding/folding at equilibrium display a large variation with temperature, and equilibrium stability is a non-linear function of temperature, both k_f(T) and k_u(T) will have a non-linear dependence on temperature. This leads to two corollaries: (i) since the large variation in equilibrium enthalpies and entropies of unfolding, including the pronounced curvature in ΔG_D-N(T) of proteins with temperature is due to the large and positive ΔC_pD-N, “non-Arrhenius kinetics can be particularly acute for reactions that are accompanied by large changes in the heat capacity”; and (ii) because the change in heat capacity upon unfolding is, to a first approximation, proportional to the change in SASA that accompanies it, and since the change in SASA upon unfolding/folding increases with chain-length,^33,34 “non-Arrhenius kinetics, in general, can be particularly pronounced for large proteins, as compared to very small proteins and peptides.”

Temperature-dependence of activation enthalpies

Inspection of Figure 8 demonstrates that for the partial folding reaction D ⇌ [TS]: (i) ΔH_TS-D(T) > 0 for T_α ≤ T < T_H(TS-D); (ii) ΔH_TS-D(T) < 0 for T_H(TS-D) < T ≤ T_ω and (iii) ΔH_TS-D(T) = 0 for T = T_H(TS-D). Thus, the activation of the denatured conformers to the TSE is enthalpically: (i) unfavourable for T_α ≤ T < T_H(TS-D); (ii) favourable for T_H(TS-D) < T ≤ T_ω; and (iii) neutral when T = T_H(TS-D). Consequently, at T_H(TS-D), ΔG_TS-D(T) is purely due to the difference in entropy between the DSE and the TSE (ΔG_TS-D(T) = −TΔS_TS-D(T)) with k_f(T) being given by

Figure 8 Temperature-dependence of the activation enthalpy for folding.

(A) The variation in ΔH_TS-D(T) function with temperature. The slope of this curve varies with temperature, equals ΔC_pTS-D(T), and is algebraically negative. (B) An appropriately scaled version of the plot on the left to illuminate the three important scenarios: (i) ΔH_TS-D(T) > 0 for T_α ≤ T < T_H(TS-D); (ii) ΔH_TS-D(T) < 0 for T_H(TS-D) < T ≤ T_ω; and (iii) ΔH_TS-D(T) = 0 when T = T_H(TS-D). Note that k_f(T) is a maximum at T_H(TS-D).

Because k_f(T) is a maximum at T_H(TS-D) (∂ ln k_f(T)/∂T = 0), a corollary is that “for a two-state folder at constant pressure and solvent conditions, if the prefactor is temperature-invariant, then k_f(T) will be a maximum when the Gibbs barrier to folding is purely entropic.” This statement is valid only if the prefactor is temperature-invariant. Now since ΔG_TS-D(T) > 0 for all temperatures (Figure 5A and Table 1), it is imperative that ΔS_TS-D(T) < 0 at T_H(TS-D) (see activation entropy for folding).

Unlike the ΔH_TS-D(T) function which changes its algebraic sign only once across the entire temperature range over which a two-state system is physically defined, the behaviour of ΔH_TS-N(T) function is far more complex (Figure 9): (i) ΔH_TS-N(T) > 0 for T_α ≤ T < T_S(α) and T_H(TS-N) < T < T_S(ω); (ii) ΔH_TS-N(T) < 0 for T_S(α) < T < T_H(TS-N) and T_S(ω) < T ≤ T_ω; and (iii) ΔH_TS-N(T) = 0 at T_S(α), T_H(TS-N), and T_S(ω). Consequently, we may state that the activation of native conformers to the TSE is enthalpically: (i) unfavourable for T_α ≤ T < T_S(α) and T_H(TS-N) < T < T_S(ω); (ii) favourable for T_S(α) < T < T_H(TS-N) and T_S(ω) < T ≤ T_ω; and (iii) neutral at T_S(α), T_H(TS-N), and T_S(ω). If we reverse the reaction-direction, the algebraic signs invert leading to a change in the interpretation. Thus, for the partial folding reaction [TS] ⇌ N, the flux of the conformers from the TSE to the NSE is enthalpically: (i) favourable for T_α ≤ T < T_S(α) and T_H(TS-N) < T < T_S(ω) (ΔH_N-TS(T) < 0); (ii) unfavourable for T_S(α) < T < T_H(TS-N) and T_S(ω) < T ≤ T_ω (ΔH_N-TS(T) > 0); and (iii) neither favourable nor unfavourable at T_S(α), T_H(TS-N), and T_S(ω) (Figure 9−figure supplement 1A). Note that the term “flux” implies “diffusion of the conformers from one reaction state to the other on the Gibbs energy surface,” and as such is an “operational definition.”

Figure 9−figure supplement 1 The variation in ΔH_N-TS(T) with temperature and the intersection of ΔH_TS-D(T) and ΔH_TS-N(T) functions.

(A) An appropriately scaled view of the change in enthalpy for the partial folding reaction [TS] ⇌ N. The flux of the conformers from the TSE to the NSE is enthalpically: (i) favourable for T_α ≤ T < T_S(α) and T_H(TS-N) < T < T_S(ω) (ΔH_N-TS(T) < 0); (ii) unfavourable for T_S(α) < T < T_H(TS-N) and T_S(ω) < T ≤ T_ω (ΔH_N-TS(T) > 0); and (iii) neither favourable nor unfavourable at T_S(α), T_H(TS-N), and T_S(ω). The blue pointers indicate the temperatures where ΔC_pN-TS(T) (or −ΔC_pTS-N(T)) is zero. (B) The intersection of the ΔH_TS-D(T) and ΔH_TS-N(T) functions occurs precisely at T_H. The requirement that both ΔH_TS-D(T) and ΔH_TS-N(T) be positive at the point of intersection is a consequence of the theoretical relationship: T_H(TS-N) < T_H < T_S < T_H(TS-D) and must be satisfied by all two-state systems (see Paper II). Note that the net flux of the conformers from the DSE to the NSE at T_H is driven purely by entropy (ΔG_D-N(T) = –T ΔS_D-N(T)).

Figure 9 Temperature-dependence of the activation enthalpy for unfolding.

(A) The variation in ΔHTS-N(T) function with temperature. The slope of this curve equals ΔC_pTS-N(T) and is zero at T_{C_pTS-N(α)} and T_{C_pTS-N(ω)}. (B) An appropriately scaled version of the figure on the left to illuminate the various temperature-regimes and their implications: (i) ΔH_TS-N(T) > 0 for T_α ≤ T < T_S(α) and T_H(TS-N) < T < T_S(ω); (ii) ΔH_TS-N(T) < 0 for T_S(α) < T < T_H(TS-N) and T_S(ω) < T ≤ T_ω; and (iii) ΔH_TS-N(T) = 0 at T_S(α), T_H(TS-N), and T_S(ω). Note that at T_S(α) and T_S(ω), we have the unique scenario: m_TS-N(T) = ΔG_TS-N(T) = ΔS_TS-N(T) = ΔH_TS-N(T) = 0, and k_u(T) = k⁰. The values of the reference temperatures are given in Table 1.

Importantly, although ∂ ln k_u(T)/∂T = 0 ⇒ ΔH_TS-N(T) = 0 at T_S(α), T_H(TS-N), and T_S(ω), the behaviour of the system at T_S(α) and T_S(ω) is distinctly different from that at T_H(TS-N): While m_TS-N(T) = ΔG_TS-N(T) = ΔH_TS-N(T) = ΔS_TS-N(T) = 0, m_TS-D(T) = m_D-N, ΔG_TS-D(T) = ΔG_N-D(T) = λ, and k_u(T) = k⁰ at T_S(α) and T_S(ω) (note that if both ΔG_TS-N(T) and ΔH_TS-N(T) are zero, then ΔS_TS-N(T) must also be zero, see activation entropies), k_u(T) is a minimum (k_u(T) ≪ k⁰) with the Gibbs barrier to unfolding being purely entropic (ΔG_TS-N(T) = −TΔS_TS-N(T)) at T_H(TS-N). Consequently, we may write

Thus, a corollary is that “for two-state system at constant pressure and solvent conditions, if the prefactor is temperature-invariant, then k_u(T) will be a minimum when the Gibbs barrier to unfolding is purely entropic.” Since ΔG_TS-N(T) > 0 at T_H(TS-N) (Figure 5B and Table 1), it is imperative that ΔS_TS-N(T) be negative at T_H(TS-N) (see activation entropy for unfolding).

The criteria for two-state folding from the viewpoint of enthalpy are the following: (i) the condition that ΔH_D-N(T) = ΔH_TS-N(T) − ΔH_TS-D(T) must be satisfied at all temperatures; (ii) the intersection of ΔH_TS-D(T) and ΔH_TS-N(T) functions calculated directly from the temperature-dependence of the experimentally determined k_f(T) and k_u(T), respectively, must be identical to the independently estimated T_H from equilibrium thermal denaturation experiments; and (iii) the condition that T_H(TS-N) < T_H < T_S < T_H(TS-D) must be satisfied. A corollary of the last statement is that both ΔH_TS-D(T) and ΔH_TS-N(T) functions must be positive at the point of intersection. These aspects are readily apparent from Figure 9−figure supplement 1B and Figure 9−figure supplement 2.

Figure 9−figure supplement 2 Comparison of equilibrium and activation enthalpies.

(A) ΔH_D-N(T) for the reaction N ⇌ D is zero at the temperature where ΔH_TS-D(T) and ΔH_TS-N(T) functions intersect (the intersection of green curve and zero reference line must align vertically with the point where the blue and the red curves intersect). The intersection of ΔH_D-N(T) and ΔH_TS-N(T) functions (green and blue curves) occurs precisely when T = T_H(TS-D). This is expected since ΔH_TS-D(T) = 0 at T_H(TS-D). The similarity in the slopes of the ΔH_D-N(T) and ΔH_TS-N(T) functions between ∽ 240 K and ∽ 320 K implies that most of ΔC_pD-N stems from the first-half of the unfolding reaction N ⇌ [TS]. (B) An appropriately scaled view of the encircled area in the figure on the left. When T = T_H(TS-N), ΔH_TS-D(T) is identical to |ΔH_D-N(T)| or ΔH_N-D(T). Further, at the temperature where ΔH_TS-D(T) and ΔH_D-N(T) functions intersect (i.e., the intersection of the red and the green curves), the absolute enthalpy of the DSE (H_D(T)) is exactly half the algebraic sum of the absolute enthalpies of the TSE (H_TS(T)) and the NSE (H_N(T)), i.e., H_D(T) = (H_TS(T)+H_N(T))/2. The various auxiliary relationships that may obtained from the intersection of various state functions are addressed in subsequent publications.

Temperature-dependence of activation entropies

Inspection of Figure 10 shows that for the partial folding reaction D ⇌ [TS], ΔS_TS-D(T) which is positive at low temperature, decreases in magnitude with an increase in temperature and becomes zero at T_S, where the SASA of the TSE is the least native-like, ΔG_TS-D(T) is a minimum (∂ΔG_TS-D(T)/∂T = −ΔS_TS-D(T) = 0) and ΔG_D-N(T) is a maximum (∂ΔG_D-N(T)/∂T = −ΔS_D-N(T) = 0; Figures 1, 2, 5A, Figure 10−figure supplements 1 and 2); and any further increase in temperature beyond this point causes ΔS_TS-D(T) to become negative. Thus, the activation of denatured conformers to the TSE is entropically: (i) favourable for T_α ≤ T < T_S; (ii) unfavourable for T_S < T ≤ T_ω; and (iii) neutral when T = T_S. At T_S the Gibbs barrier to folding is purely due to the difference in enthalpy between the DSE and the TSE with k_f(T) being given by

Figure 10−figure supplement 1 Activation entropy for folding vs curve-crossing.

(A) ΔS_TS-D(T) is zero when the denatured conformers are displaced the least from the mean of their ensemble to reach the TSE along the SASA-RC. The slope of this curve is given by (B) ΔS_TS-D(T) is zero when the SASA of the TSE is the least native-like. The slope of this curve is given by . The blue and the red sections of the curves represent the temperature regimes T_α ≤ T ≤ T_S and T_S ≤ T ≤ T_ω, respectively.

Figure 10 Temperature-dependence of the activation entropy for folding

(A) The variation in ΔS_TS-D(T) function with temperature. The slope of this curve varies with temperature and equals ΔC_pTS-D(T)/T. (B) An appropriately scaled version of the figure on the left to illuminate the three temperature regimes and their implications: (i) ΔS_TS-D(T) > 0 for T_α ≤ T < T_S; (ii) ΔS_TS-D(T) < 0 for T_S < T ≤ T_ω; and (iii) ΔS_TS-D(T) = 0 when T = T_S.

Inspection of Figure 11 demonstrates that the behaviour of the ΔS_TS-N(T) function is far more complex than the ΔS_TS-D(T) function: (i) ΔS_TS-N(T) > 0 for T_α ≤ T < T_S(α) and T_S < T < T_S(ω); (ii) ΔS_TS-N(T) < 0 for T_S(α) < T < T_S and T_S(ω) < T ≤ T_ω; and (iii) ΔS_TS-N(T) = 0 at T_S(α), T_S, and T_S(ω). Consequently, we may state that the activation of native conformers to the TSE is entropically: (i) favourable for T_α ≤ T < T_S(α) and T_S < T < T_S(ω); (ii) unfavourable for T_S(α) < T < T_S and T_S(ω) < T ≤ T_ω; and (iii) neutral at T_S(α), T_S, and T_S(ω). If we reverse the reaction-direction (Figure 11−figure supplement 1A), the algebraic signs invert leading to a change in the interpretation. Consequently, we may state that for the partial folding reaction [TS] ⇌ N, the flux of the conformers from the TSE to the NSE is entropically: (i) unfavourable for T_α ≤ T < T_S(α) and T_S < T < T_S(ω) (ΔS_N-TS(T) < 0); (ii) favourable for T_S(α) < T < T_S and T_S(ω) < T ≤ T_ω (ΔS_N-TS(T) > 0); and (iii) neutral at T_S(α), T_S, and T_S(ω).

Figure 10−figure supplement 2 Activation entropy vs ΔG_TS-D(T) and ΔG_D-N(T).

(A) ΔG_TS-D(T) is always the least when it is purely enthalpic. The slope of this curve equals-TΔS_TS-D(T)/ΔC_pTS-D(T). (B) The stability is always the greatest when the activation entropy for folding is the zero. The slope of this curve equals-TΔS_TS-D(T)/ΔC_pTS-D(T). The blue and the red sections of the curves represent the temperature regimes T_α ≤ T ≤ T_S and T_S ≤ T ≤ T_ω, respectively.

Figure 11−figure supplement 1 The variation in ΔS_N-TS(T) with temperature and comparison of equilibrium and activation entropies.

(A) An appropriately scaled view of the change in entropy for the partial folding reaction [TS] ⇌ N. The slope of this curve equals ΔC_pN-TS(T)/T (or −ΔC_pN-TS(T)/T and is zero at T_{C_pTS-N(α)} and T_{C_pTS-N(ω)}. The flux of the conformers from the TSE to the NSE is entropically: (i) unfavourable for T_α ≤ T < T_S(α) and T_S < T < T_S(ω) (ΔS_N-TS(T) < 0); (ii) favourable for T_S(α) < T < T_S and T_S(ω) < T ≤ T_ω (ΔS_N-TS(T) > 0); and (iii) neutral at T_S(α), T_S, and T_S(ω). (B) An overlay of ΔS_D-N(T), ΔS_TS-D(T) and ΔS_TS-N(T) functions. Unlike the ΔH_TS-D(T) and ΔH_TS-N(T) functions which must be positive at the point of intersection (Figure 9−figure supplement 1B), theory dictates that both ΔS_TS-D(T) and ΔS_TS-N(T) functions must independently be equal to zero at T_S, leading to the unique scenario: S_D(T) = S_TS(T) = S_N(T). The similarity in the slopes of the ΔS_DN(T) and ΔS_TS-N(T) functions between ∽ 240 K and ∽ 320 K implies that most of ΔC_pD-N stems from the first-half of the unfolding reaction N ⇌ [TS]. Consequently at T_S, ΔG_D-N(T) = ΔH_DN(T), i.e., the net flux of the conformers from the DSE to the NSE is driven purely by enthalpy.

Figure 11 Temperature-dependence of the activation entropy for unfolding

(A) The variation in ΔS_TS-N(T) function with temperature. The slope of this curve, given by ΔC_pTS-N(T)/T, varies with temperature, and is zero at T_{C_pTS-N(α)} and T_{C_pTS-N(ω)}. (B) An appropriately scaled version of the figure on the left to illuminate the temperature regimes and their implications: (i) ΔS_TS-N(T) > 0 for T_α ≤ T < T_S(α) and T_S < T < T_S(ω); (ii) ΔS_TS-N(T) < 0 for T_S(α) < T < T_S and T_S(ω) < T ≤ T_ω; and (iii) ΔS_TS-N(T) = 0 at T_S(α), T_S, and T_S(ω). Note that at T_S(α) and T_S(ω), we have the unique scenario: m_TS-N(T) = ΔG_TS-N(T) = ΔS_TS-N(T) = ΔH_TS-N(T) = 0, and k_u(T) = k⁰. The values of the reference temperatures are given in Table 1.

At T = T_S, the Gibbs barrier to unfolding is purely due to the difference in enthalpy between the TSE and the NSE (ΔG_TS-N(T) = ΔH_TS-N(T)) with k_u(T) being given by

Although ΔS_TS-N(T) = 0 ⇒ S_TS(T) = S_N(T) at T_S(α), T_S, and T_S(ω), the underlying thermodynamics is fundamentally different at T_S as compared to T_S(α) and T_S(ω). While both ΔG_TS-N(T) and m_TS-N(T) are positive and a maximum, and ΔG_TS-N(T) is purely enthalpic at T_S (ΔG_TS-N(T) = ΔH_TS-N(T)), at T_S(α) and T_S(ω) we have m_TS-N(T) = 0 ⇒ ΔG_TS-N(T) = ω(m_TS-N(T)² = 0 ⇒ ΔH_TS-N(T) = 0, and ΔG_N-D(T) = ΔG_TS-D(T) = λ; and because ΔG_TS-N(T) = 0 at T_S(α) and T_S(ω), the rate constant for unfolding will reach an absolute maximum for that particular solvent and pressure at these two temperatures. To summarize, while at T_S we have G_TS(T) ≫ G_N(T), S_D(T) = S_TS(T) = S_N(T), and k_u(T) ≪ k⁰, when T = T_S(α) and T_S(ω), we have G_TS(T) = G_N(T), H_TS(T) = H_N(T), S_TS(T) = S_N(T), and k_u(T) = k⁰ (Figure 11−figure supplements 2 and 3). Thus, a fundamentally important conclusion that we may draw from these relationships is that “if two reaction-states on the folding pathway of a two-state system have identical SASA and Gibbs energy under identical environmental conditions, then their absolute enthalpies and entropies must be identical.” This must hold irrespective of whether or not the two reaction-states have identical, similar or dissimilar structures. We will revisit this scenario when we discuss the heat capacities of activation and the inapplicability of the Hammond postulate to protein folding reactions.

Figure 11−figure supplement 2 Activation entropy for unfolding vs curve-crossing

(A) Unlike the ΔS_TS-D(T) function which is zero only once, ΔS_TS-N(T) is zero once when the native conformers are displaced the greatest to reach the TSE (T_S), and twice when this displacement is zero (green pointer; T_S(α) and T_S(ω)). The slope of this curve is given by . (B) ΔS_TS-N(T) is zero once when the difference in SASA between the TSE and the NSE is the greatest (T_S), and twice when the SASA of the TSE is identical to that of the NSE (green pointer; T_S(α) and T_S(ω)). The slope of this curve is given by . The blue and the red sections of the curves represent the temperature regimes T_α ≤ T ≤ T_S and T_S ≤ T ≤ T_ω, respectively.

The criteria for two-state folding from the viewpoint of entropy are the following: (i) the condition that ΔS_D-N(T) = ΔS_TS-N(T) — ΔS_TS-D(T) must be satisfied at all temperatures; (ii) the intersection of ΔS_TS-D(T) and ΔS_TS-N(T) functions calculated directly from the slopes of the temperature-dependent shift in the curve-crossing relative to the DSE and the NSE, respectively, must be identical to the independently estimated T_S from equilibrium thermal denaturation experiments (Figure 11−figure supplements 1B, 4 and 5); and (iii) both ΔS_TS-D(T) and ΔS_TS-N(T) functions must independently be equal to zero at T_S.

Temperature-dependence of the Gibbs activation energies

Although the general features of the temperature-dependence of ΔG_TS-D(T) and ΔG_TS-N(T) were described earlier (Figure 5 and its figure supplements), it is instructive to discuss the same in terms of their constituent enthalpies and entropies.

The determinants of ΔG_TS-D(T) in terms of its activation enthalpy and entropy may be readily deduced by partitioning the entire temperature range over which the two-state system is physically defined (T_α ≤ T ≤ T_ω) into three distinct regimes using four unique reference temperatures: T_α, T_S, T_H(TS-D), and T_ω (Figure 12 and Figure 12−figure supplement 1). (1) For T_α ≤ T < T_S, the activation of conformers from the DSE to the TSE is entropically favoured (TΔS_TS-D(T) > 0) but is more than offset by the endothermic activation enthalpy (ΔH_TS-D(T) > 0), leading to incomplete compensation and a positive ΔG_TS-D(T) (ΔH_TS-D(T) – TΔS_TS-D(T). When T = T_S, ΔG_TS-D(T) is a minimum (its lone extremum), and is purely due to the endothermic enthalpy of activation (ΔG_TS-D(T) = ΔH_TS-D(T) > 0. (2) For T_S < T < T_H(TS-D), the activation of denatured conformers to the TSE is enthalpically and entropically disfavoured (ΔH_TS-D(T) > 0 and TΔS_TS-D(T)< 0) leading to a positive ΔG_TS-D(T). (3) In contrast, for T_H(TS-D) < T ≤ T_ω, the favourable exothermic activation enthalpy (ΔH_TS-D(T) < 0) is more than offset by the unfavourable entropy of activation (TΔS_TS-D(T) < 0), leading once again to a positive ΔG_TS-D(T). When T = T_H(TS-D), ΔG_TS-D(T) is purely due to the negative change in the activation entropy or the negentropy of activation (ΔG_TS-D(T) = –TΔS_TS-D(T) > 0), ΔG_TS-D(T)/T is a minimum, and k_f(T) is a maximum (their lone extrema; see Massieu-Planck functions below). An important conclusion that we may draw from these analyses is the following: While it is true that for the temperature regimes T_α ≤ T < T_S and T_H(TS-D) < T ≤ T_ω, ΔG_TS-D(T) is due to the incomplete compensation of the opposing activation enthalpy and entropy, this is clearly not the case for T_S < T < T_H(TS-D) where both these two state functions are unfavourable and complement each other to generate a positive Gibbs activation barrier.

Figure 11−figure supplement 3 Activation entropy vs ΔG_TS-N(T) and ΔG_D-N(T)

(A) ΔG_TS-N(T) is the greatest and also the least (zero) when it is purely enthalpic, with the former occurring at T_S, and the latter occurring at T_S(α) and T_S(ω) (green pointer). The slope of this curve equals −TΔS_TS-N(T)/ΔC_pTS-N(T). (B) The stability is always the greatest at T_S where the Gibbs barrier to unfolding is purely enthalpic; and at T_S(α) and T_S(ω) (green pointer), ΔG_D-N(T) = − λ. The slope of this curve equals −TΔS_D-N(T)/ΔC_pTS-N(T). The blue and the red sections of the curves represent the temperature regimes T_α ≤ T ≤ T_S and T_S ≤ T ≤ T_ω, respectively.

Figure 11−figure supplement 4 The first derivatives of m_TS-D(T) and the square of m_TS-D(T) with respect to temperature.

(A) ∂m_TS-D(T) is negative for T_α ≤ T < T_S, positive for T_S < T ≤ T_ω, and zero at T_S and is dictated by Eq. (A4). (B) Because ∂(m_TS-D(T))²/∂T=2m_TS-D(T)(∂m_TS-D(T)/∂T) and m_TS-D(T) > 0 throughout the temperature regime, the variation of its algebraic sign is identical to that of ∂m_TS-D(T)/∂T. The relationship between ∂m_TS-D(T)/∂T and ΔS_TS-D(T) is given by Eq. (8).

Figure 11−figure supplement 5. The first derivatives of m_TS-N(T) and the square of m_TS-N(T) with respect to temperature.

(A) ∂m_TS-N(T)/∂T is positive for T_α≤T < T_S, negative for T_S < T ≤ T_ω, and zero at T_S and is governed by Eq. (A6). (B) Because ∂(m_TS-N(T))²/∂T = 2m_TS-N(T)(∂m_TS-N(T)/∂T) and m_TS-N(T) can be negative, zero or positive depending on the temperature, the variation of its algebraic sign with temperature is far more complex: (i) ∂(m_TS-N(T)²/∂T is negative for T_α≤ T < T_S(α) and T_S < T < T_S(ω); (ii) positive for T_S(α) < T < T_S and T_S(ω) < T ≤ T_ω; and (iii) zero at T_S(α), T_S, and T_S(ω). The relationship between ∂m_TS-N(T)/∂T and ΔS_TS-N(T) is given by Eq. (9). The blue pointers indicate the temperatures at which the second derivative of the square of m_TS-N(T) is zero and are identical to the temperatures at which ΔC_pTS-N(T) is zero.

Figure 12−figure supplement 1. Deconvolution of the Gibbs activation energy for the reaction D ⇌ [TS].

This is an appropriately scaled view of Figure 12B. For T_α ≤ T < T_S, TΔS_TS-D(T) > 0 but is more than offset by unfavourable ΔH_TS-D(T), leading to incomplete compensation and a positive ΔG_TS-D(T) − (ΔH_TS-D(T) TΔS_TS-D(T). When T = T_S, ΔG_TS-D(T) is a minimum and purely enthalpic (ΔG_TS-D(T) = ΔH_TS-D(T) > 0). For T_S < T < T_H(TS-D), the activation is enthalpically and entropically disfavoured (ΔH_TS-D(T) > 0 and TΔS_TS-D(T)< 0) leading to a positive ΔG_TS-D(T). In contrast, for T_H(TS-D) < T ≤ T_ω, ΔH_{TS-D(T) < 0} but is more than offset by the unfavourable entropy (TΔS_TS-D(T) < 0), leading once again to a positive ΔG_TS-D(T). When T = T_H(TS-D), ΔG_TS-D(T) is purely entropic (ΔG_TS-D(T) = −TΔS_TS-D(T) > 0) and k_f(T) is a maximum.

Figure 12. Entropy-enthalpy compensation for the partial folding reaction D ⇌ [TS].

Despite large changes in ΔH_TS-D(T) (∽ 400 kcal.mol^-1) ΔG_TS-D(T) which is a minimum at T_S, varies only by ∽3.4 kcal.mol^-1 due to compensating changes in ΔS_TS-D(T). See the appropriately scaled figure supplement for description. The physical basis for entropy-enthalpy compensation is addressed in the accompanying article.

Similarly, the determinants of ΔG_TS-N(T) in terms of its activation enthalpy and entropy may be readily divined by partitioning the entire temperature range into five distinct regimes using six unique reference temperatures: T_α, T_S(α), T_H(TS-N), T_S, T_S(ω), and T_ω (Figure 13 and Figure 13−figure supplement 1). (1) For T_α ≤ T < T_S(α), which is the ultralow temperature Marcus-inverted-regime for unfolding, the activation of the native conformers to the TSE is entropically favoured (TΔS_TS-N(T) > 0) but is more than offset by the unfavourable enthalpy of activation (ΔH_TS-N(T) > 0) leading to incomplete compensation and a positive ΔG_TS-N(T) (ΔH_TS-N(T) – ΔTΔS_TS-N(T) > 0). When T = T_S(α), ΔS_TS-N(T) = ΔH_TS-N(T) = 0 ⇒ ΔG_TS-N(T) = 0. The first extrema of ΔG_TS-N(T) and ΔG_TS-N(T)/T (which are a minimum), and the first extremum of k_u(T) (which is a maximum, k_u(T) = k⁰) occur at T_S(α). (2) For T_S(α) < T < T_H_(TS-N), the activation of the native conformers to the TSE is enthalpically favourable (ΔH_TS-N(T) < 0) but is more than offset by the unfavourable negentropy of activation (TΔS_TS-N(T) < 0) leading to ΔG_TS-N(T) > 0. When T = T_H(TS-N), ΔH_TS-N(T) = 0 for the second time, and the Gibbs barrier to unfolding is purely due to the negentropy of activation (ΔG_TS-N(T) = –TΔS_TS-N(T) > 0. The second extrema of ΔG_TS-N(T)/T (which is a maximum) and k_u(T) (which is a minimum) occur at T_H(TS-N). (3) For T_H(TS-N) < T < T_S, the activation of the native conformers to the TSE is entropically and enthalpically unfavourable (ΔH_TS-N(T) > 0 and TΔS_TS-N(T) < 0) leading to ΔG_TS-N(T) > 0. When T = T_S, ΔSTS-N(T) = 0 for the second time, and the Gibbs barrier to unfolding is purely due to the endothermic enthalpy of activation (ΔG_TS-N(T) = ΔH_TS-N(T) > 0). The second extremum of ΔG_TS-N(T) (which is a maximum) occurs at T_S. (4) For T_S < T < T_S(ω), the activation of the native conformers to the TSE is entropically favourable (TΔS_TS-N(T) > 0) but is more than offset by the endothermic enthalpy of activation (ΔH_TS-N(T) > 0) leading to incomplete compensation and a positive ΔG_TS-N(T). When T = T_S(ω), ΔS_TS-N(T) = ΔH_TS-N(T) = 0 for the third and the final time, and ΔG_TS-N(T) = 0 for the second and final time. The third extrema of ΔG_TS-N(T) and ΔG_TS-N(T)/T (which are a minimum), and the third extremum of k_u(T) (which is a maximum, k_u(T) = k⁰) occur at T_S(ω). (5) For T_S(ω)< T ≤ T_ω, which is the high-temperature Marcus-inverted-regime for unfolding, the activation of the native conformers to the TSE is enthalpically favourable (ΔH_TS-N(T) < 0) but is more than offset by the unfavourable negentropy of activation (TΔS_TS-N(T) < 0), leading to ΔG_TS-N(T) > 0. Once again we note that although the Gibbs barrier to unfolding is due to the incomplete compensation of the opposing enthalpies and entropies of activation for the temperature regimes T_α ≤ T < T_S(α), T_S(α) < T < T_H(TS-N), T_S < T < T_S(ω), and T_S(ω)< T ≤ T_ω, both the enthalpy and the entropy of activation are unfavourable and collude to generate the Gibbs barrier to unfolding for the temperature regime T_H(TS-N) < T < T_S. Thus, a fundamentally important conclusion that we may draw from this analysis is that “the Gibbs barriers to folding and unfolding are not always due to the incomplete compensation of the opposing enthalpy and entropy.”

Figure 13-figure supplement 1. Deconvolution of the Gibbs activation energy for unfolding.

These are appropriately scaled split views of Figure 13B. (A) For T_α ≤ T < T_S(α), N ⇌ [TS] entropically favoured (TΔS_TS-N(T) > 0) but is more than offset by endothermic enthalpy (ΔH_TS-N(T) > 0) leading to ΔH_TS-N(T) − TΔS_TS-N(T) > 0. When T = T_S(α), ΔS_TS-N(T) = ΔH_TS-N(T) = 0 ⇒ ΔG_TS-N(T) = 0, and k_u(T) = k⁰. For T_S(α) < T < T_H(TS-N), N ⇌ [TS] is enthalpically favourable (ΔH_TS-N(T) < 0) but is more than offset by the unfavourable negentropy (TΔS_TS-N(T) < 0) leading to ΔG_TS-N(T) > 0. When T = T_H_(TS-N), ΔH_TS-N(T) = 0 for the second time, ΔG_TS-N(T) is purely due to the negentropy (ΔG_TS-N(T) = −TΔS_TS-N(T) 0), and k_u(T) is a minimum. For T_H(TS-N) < T < T_S, N ⇌ [TS] is entropically and enthalpically unfavourable (ΔH_TS-N(T) > 0 and TΔS_TS-N(T) < 0) leading to ΔG_TS-N(T) > 0. When T = T_S, ΔS_TS-N(T) = 0 for the second time, and ΔG_TS-N(T) is a minimum and purely enthalpic (ΔG_TS-N(T) = ΔH_TS-N(T) 0). (B) For T_S < T < T_S(ω), N ⇌ [TS] is entropically favourable (ΔH_TS-N(T) > 0) but is more than offset by the endothermic enthalpy (ΔH_TS-N(T) > 0) leading to a positive ΔG_TS-N(T). When T = T_S(ω), ΔS_TS-N(T) = ΔH_TS-N(T) = 0 for the third and the final time, ΔG_TS-N(T) = 0 for the second and final time, and k_u(T) = k⁰. For T_S(ω) < T ≤ T_ω, N ⇌ [TS is enthalpically favourable (ΔH_TS-N(T) < 0) but is more than offset by the unfavourable negentropy (TΔS_TS-N(T) < 0), leading to ΔG_TS-N(T) > 0 and k_u(T) < k⁰.

Figure 13 Entropy-enthalpy compensation for the partial unfolding reaction N ⇌ [TS]

Despite large changes in ΔH_TS-N(T), ΔG_TS-N(T) which is a maximum at T_S, varies only by ∽5 kcal.mol^-1 due to compensating changes in ΔS_TS-N(T). See the appropriately scaled figure supplement for description.

In a protein folding scenario where the activated conformers diffuse on the Gibbs energy surface to reach the NSE, the algebraic signs of the state functions invert leading to a change in the interpretation (Figure 13−figure supplements 2 and 3). Thus, for the partial folding reaction[TS] ⇌: (1) For T_α ≤ T < T_S(α), the flux of the conformers from the TSE to the NSE is entropically disfavoured (TΔS_TS-N(T) > 0 ⇒ TΔS_N-TS(T) < 0) but is more than compensated by the favourable change in enthalpy (ΔH_TS-N(T) > 0 ⇒ ΔH_N-TS(T) < 0), leading to ΔG_N-TS(T) < 0. (2) For T_S(α) < T < T_H(TS-N), the flux of the conformers from the TSE to the NSE is enthalpically unfavourable (ΔH_TS-N(T) < 0 ⇒ ΔH_N-TS(T) > 0) but is more than compensated by the favourable change in entropy (TΔS_TS-N(T) < 0 ⇒ TΔS_N-TS(T) > 0) leading to ΔG_N-TS(T) < 0. When T = T_H(TS-N), the flux is driven purely by the positive change in entropy (ΔG_N-TS(T) = –TΔS_N-TS(T) > 0). (3) For T_H(TS-N) < T < T_S, the flux of the conformers from the TSE to the NSE is entropically and enthalpically favourable (ΔH_N-TS(T) < 0 and TΔS_N-TS(T) > 0) leading to ΔG_N-TS(T) < 0. When T = T_S, the flux is driven purely by the exothermic change in enthalpy (ΔG_N-TS(T) = ΔH_N-TS(T) < 0). (4) For T_S < T < T_S(ω), the flux of the conformers from the TSE to the NSE is entropically unfavourable (TΔS_TS-N(T) > 0 ⇒ TΔS_N-TS(T) < 0) but is more than compensated by the exothermic change in enthalpy (ΔH_TS-N(T) > 0 ⇒ ΔH_N-TS(T) < 0) leading to ΔG_N-TS(T) < 0. (5) For T_S(ω)< T ≤ T_ω, the flux of the conformers from the TSE to the NSE is enthalpically unfavourable (ΔH_TS-N(T) < 0 ⇒ ΔH_N_TS(T) > 0) but is more than compensated by the favourable change in entropy (TΔS_TS-N(T) < 0 ⇒ TΔS_N-TS(T) > 0), leading to ΔG_N-TS(T) < 0.

Figure 13-figure supplement 2. Entropy-enthalpy compensation for the partial folding reaction [TS] ⇌ N

Despite large changes in ΔH_N-TS(T), ΔG_N-TS(T) which is a minimum at T_S, varies only by ∽5 kcal.mol^-1 due to compensating changes in ΔS_N-TS(T). See the appropriately scaled figure supplement for description.

Thus, the criteria for two-state folding from the viewpoint of Gibbs energy are the following: (i) the condition that ΔG_D-N(T) = ΔG_TS-N(T) – ΔG_TS-D(T) must be satisfied at all temperatures; (ii) the cold and heat denaturation temperatures estimated from equilibrium thermal denaturation must be identical to independently determined temperatures at which k_f(T) and k_u(T) are identical, i.e., the temperatures at which ΔG_TS-D(T) and ΔG_TS-N(T) functions intersect must be identical to the temperatures at which ΔH_D-N(T) – TΔS_D-N(T)= ΔG_D-N(T) = 0. The basis for these relationships, as mentioned earlier, is the principle of microscopic reversibility;²⁷ (iii) ΔG_TS-D(T) and ΔG_TS-N(T) must be a minimum and a maximum, respectively, at T_S; and (iv) the condition that T_H(TS-N) < T_H < T_S < T_H(TS-D) must be satisfied. A far more detailed explanation in terms of chain and desolvation entropies and enthalpies is given in the accompanying article.

Massieu-Planck functions

The Massieu-Planck function, ΔG/T, or its equivalent –RlnK (K is the equilibrium constant) predates the Gibbs energy function by a few years and is especially useful when analysing temperature-dependent changes in protein behaviour (see Schellman, 1997, on the use of Massieu-Planck functions to analyse protein folding, and why the use of ΔG versus T curves can sometimes lead to ambiguous conclusions).^6,35 Comparison of Figure 6−figure supplement 1A and Figure 14A demonstrates that although ΔG_TS-D(T) is a minimum at T_S (Figure 5A), k_f(T) will be a maximum not at T_S but instead at T_H(TS-D) where the Massieu-Planck activation potential for folding (ΔG_TS-D(T)/T ≡ –R ln K_TS-D(T)) is a minimum, and is readily apparent if we recast the Arrhenius expression for k_f(T) in terms of the equilibrium constant for the partial folding reaction D ⇌ [TS].

Figure 14. Temperature-dependence of the Massieu-Planck activation potentials.

(A) The Massieu-Planck activation potential for folding is a minimum at T_H(TS-D). The slope of this curve is given by −ΔH_TS-D(T)/T². (B) The Massieu-Planck activation potential for unfolding is a maximum at T_H(TS-N) and a minimum (zero) at T_S(α) and T_S(ω). The slope of this curve is given by −ΔH_TS-N(T)/T². The temperature T_S at which ΔG_TS-D(T) and ΔG_TS-N(T) are a minimum and a maximum, respectively, are indicated by green circles.

Eq. (19) shows that the rate determining K_TS-D(T) ([TS]/[D]) or the population of activated conformers relative to those that nestle at the bottom of the denatured Gibbs energy well is a maximum not at T_S but at T_H(TS-D) (Figure 14−figure supplement 1A). Similarly, comparison of Figure 6−figure supplement 1B and Figure 14B shows that although ΔG_TS-N(T) is a maximum at T_S (Figure 5B), the minimum in k_u(T) will occur not at T_S but instead at T_H(TS-N) where the Massieu-Planck activation potential for unfolding (ΔG_TS-N(T)/T ≡ –R ln K_TS-D(T)) is a maximum (Eq. (20)).

Figure 13−figure supplement 3. Deconvolution of the change in Gibbs energy for the partial folding reaction [TS] ⇌ N.

These are appropriately scaled split views of Figure 13−figure supplement 2B. (A) For T_α ≤ T < T_S(α), [TS] ⇌ N is entropically disfavoured (TΔS_N-TS(T) < 0) but is more than compensated by the exothermic enthalpy (ΔH_N-TS(T) < 0), leading to ΔG_N-TS(T) < 0. When T = T_S(α), ΔS_N-TS(T) = ΔH_N-TS(T) = ΔG_N-TS(T) = 0, and the net flux of the conformers from the TSE to the NSE is zero. For T_S(α) < T < T_H(TS-N), [TS] ⇌ N is enthalpically unfavourable (ΔH_N-TS(T) > 0) but is more than compensated by entropy (TΔS_N-TS(T) > 0) leading to ΔG_N-TS(T) < 0. When T = T_H(TS-N), the net flux from the TSE to the NSE is driven purely by the favourable change in entropy (ΔG_N-TS(T) = –TΔS_N-TS(T) < 0). For T_H(TS-N) < T < T_S, the net flux of the conformers from the TSE to the NSE is entropically and enthalpically favourable (ΔH_N-TS(T) < 0 and TΔS_N-TS(T) > 0) leading to ΔG_N-TS(T) < 0. When T = T_S, the net flux is driven purely by the exothermic change in enthalpy (ΔG_N-TS(T) = ΔH_N-TS(T) < 0). (B) For T_S < T < T_S(ω),[TS] ⇌ N is entropically unfavourable (TΔS_N-TS(T) < 0) but is more than compensated by the exothermic enthalpy (ΔH_N-TS(T) < 0) leading to ΔG_N-TS(T) < 0. When T = T_S(ω), ΔS_N-TS(T) = ΔH_N-TS(T) = ΔG_N-TS(T) = 0, and the net flux of the conformers from the TSE to the NSE is zero. For T_S(ω)< T ≤ T_ω, [TS] ⇌ N is enthalpically unfavourable (ΔH_N-TS(T) > 0) but is more than compensated by the favourable change in entropy (TΔS_N-TS(T) > 0), leading to ΔG_N-TS(T) < 0.

Figure 14−figure supplement 1. Temperature-dependence of K_TS-D(T) and K_TS-N(T).

(A) Temperature-dependence of K_TS-D(T) = [TS]/[D] for the partial folding reaction D ⇌ [TS]. K_TS-D(T) is a maximum not when ΔG_TS-D(T) is a minimum (green circle) but when the Massieu-Planck activation potential for folding, ΔG_TS-D(T)/T, is a minimum, and occurs precisely when T = T_H(TS-D). The slope of this curve is given by K_TS-D(T) ΔH_TS-D(T)/RT². (B) Temperature-dependence of K_TS-N(T) = [TS]/[N] for the partial unfolding reaction N ⇌ [TS]. K_TS-N(T) is a minimum not when ΔG_TS-N(T) is a maximum (green circle) but when the Massieu-Planck activation potential for unfolding, ΔG_TS-N(T)/T, is a maximum, and occurs precisely when T = T_H(TS-N). The slope of this curve is given by K_TS-N(T) ΔH_TS-N(T)/RT². Note that K_TS-N(T) is unity at T_S(α) and T_S(ω). It is not possible to capture the minimum of K_TS-N(T) on a linear scale; hence the ordinate is shown on a log scale (base 10). The green circles represent the temperature T_S at which ΔG_D-N(T) and ΔG_TS-N(T) are both a maximum, ΔG_TS-D(T) is a minimum, and the absolute entropies of the DSE, the TSE and the NSE are identical.

Thus, for the partial unfolding reaction N ⇌ [TS], the rate determining K_TS-N(T) ([TS]/[N]) or the population of activated conformers relative to those at the bottom of the native Gibbs basin is a minimum not at T_S but at T_H(TS-N) (Figure 14−figure supplement 1B). Similarly, we see that although the ΔG_N-D(T) is a minimum or the most negative at T_S (Figure 1−figure supplement 1), K_N-D(T) ([N]/[D]) is a maximum not at T_S but at T_H where ΔH_N-D(T)= 0 and k_f(T)/k_u(T) is a maximum (Figure 14−figure supplement 2A).⁶ Because the ratio of the solubilities of any two reaction-states is identical to the equilibrium constant, we may state that for any two-state folder at constant pressure and solvent conditions: (i) the solubility of the TSE as compared to the DSE is the greatest when the Gibbs barrier to folding is purely entropic, and this occurs precisely at T_H(TS-D) (Figure 14−figure supplement 3A); (ii) the solubility of the TSE as compared to the NSE is the least when the Gibbs barrier to unfolding is purely entropic and occurs precisely at T_H(TS-N) (Figure 14−figure supplement 3B); (iii) the solubilities of the TSE and the NSE are identical at T_S(α) and T_S(ω)where ΔS_TS-N(T) = ΔH_TS-N(T) = ΔG_TS-N(T) = 0, and k_u(T) = k⁰ (Figure 14−figure supplement 3B); and (iv) the solubility of the NSE as compared to the DSE is the greatest when the net flux of the conformers from the DSE to the NSE is driven purely by the difference in entropy between these two reaction-states and occurs precisely at T_H (Figure 14−figure supplement 2B). The notion that “certain aspects of the temperature-dependent protein behaviour are greatly simplified when the Massieu-Planck functions are used in preference to the Gibbs energy” is readily apparent from inspection of Figure 14−figure supplements 4 and 5: While the natural logarithms of k_f(T) and k_u(T) have a complex dependence on their respective Gibbs barriers, a simple linear relationship exists between the rate constants and their respective Massieu-Planck functions.

Figure 14−figure supplement 2. Temperature-dependence of the equilibrium constant for the reaction D ⇌ N.

(A) An overlay of the ratio of the rate constants for folding and unfolding and the equilibrium constant derived from the Gibbs energy of folding at equilibrium. The curve fits to Boltzmann distribution and is a maximum at T_H. The slope of this curve is given by K_N-D(T) ΔH_N-D(T)/RT². Although the value of ΔH_D-N(T) can be calculated for any temperature above absolute zero using Eq. (A1), it has physical meaning only for T_α ≤ T ≤ T_ω. This applies to ΔS_D-N(T) and ΔG_D-N(T) as well (Eqs. (A2) and (A3)). (B) The solubility of the NSE as compared to the DSE is the greatest when the net flux of the conformers from the DSE to the NSE is driven purely by the difference in entropy between these two reaction-states. The slope of this curve is given by K_N-D(T) ΔH_N-D(T)/ΔC_pN-DRT²

Figure 14−figure supplement 3. The solubility of the TSE relative to the DSE and the NSE across a broad temperature regime.

(A) The solubility of the TSE as compared to the DSE is the greatest when ΔH_TS-D(T) = 0, or equivalently, when the Gibbs barrier to folding is purely entropic. The slope of this curve is given by K_TS-D(T) ΔH_TS-D(T)/ΔC_pTS-D(T)RT². The blue and red sections of the curve represent the temperature regimes T_α ≤ T ≤ T_H(TS-D) and T_H(TS-D) ≤ T ≤ T_ω, respectively. (B) The solubility of the TSE as compared to the NSE is the least when ΔH_TS-N(T)= 0 and when the Gibbs barrier to unfolding is purely entropic. The slope of this curve is given by K_TS-N(T) ΔH_TS-N(T)/ΔC_pTS-N(T)RT². The point where the solubility of the TSE is identical to that of the NSE is indicated by the unlabelled black pointer, and described earlier, occurs precisely at T_S(α) and T_S(ω). The blue and red sections of the curve represent the temperature regimes T_α ≤ T ≤ T_H(TS-N) and T_H(TS-N) ≤ T ≤ T_ω, respectively. Note that the ordinate is on a log scale (base 10).

Figure 14−figure supplement 4. The natural logarithm of k_f(T) is linearly dependent on the Massieu-Planck activation potential for folding.

(A) The natural logarithm of k_f(T) has a complex dependence on the Gibbs barrier to folding when explored over a large temperature range. The slope of this curve is given by −ΔH_TS-D(T)/ΔS_TS-D(T)RT². (B) The natural logarithm of k_f(T) decreases linearly with an increase in the Massieu-Planck activation potential for folding, with the magnitude of the negative slope being given by the reciprocal of the gas constant. The y-intercept at zero Massieu-Planck potential yields the value of the prefactor. Naturally, k_f(T) is a maximum when the magnitude of the Massieu-Planck function for folding is a minimum, and this occurs precisely at T_H(TS-D).

Temperature-dependence of ΔC_pD-TS(T) and ΔC_pTS-N(T)

In order to provide a rational explanation for the temperature-dependence of the ΔC_pD-TS(T) and ΔC_pTS-N(T) functions, it is instructive to first discuss the inter-relationships between ΔSASA_D-N, m_D-N, and ΔC_pD-N. According to the “liquid-liquid transfer” model (LLTM) the greater heat capacity of the DSE as compared to the NSE (i.e., ΔC_pD-N > 0 and substantial) is predominantly due to anomalously high heat capacity and low entropy of water that surrounds the exposed non-polar residues in the DSE (referred to as “microscopic icebergs” or “clathrates”; see references in Baldwin, 2014).³⁶ Because the size of the solvation shell depends on the SASA of the non-polar solute, it naturally follows that the change in the heat capacity must be proportional to the change in the non-polar SASA that accompanies a reaction. Consequently, protein unfolding reactions which are accompanied by large changes in non-polar SASA lead to large and positive changes in the heat capacity.^33,37,38 Because the denaturant m values are also directly proportional to the change in SASA that accompanies protein unfolding reactions, the expectation is that m_D-N and ΔC_pD-N values must also be proportional to each other: The greater the m_D-N value, the greater is the ΔC_pD-N value and vice versa (Figs. 2, 3 and 5 in Myers et al., 1995). However, since the residual structure in the DSEs of proteins under folding conditions is both sequence and solvent-dependent (i.e., the SASAs of the DSEs two proteins of identical chain lengths but dissimilar primary sequences need not necessarily be the same even under identical solvent conditions),^39,40 and because we do not yet have reliable theoretical or experimental methods to accurately quantify the SASA of the DSEs of proteins under folding conditions (i.e., the values are model-dependent),^41-43 the data scatter in plots that show correlation between the experimentally determined m_D-N or ΔC_pD-N values (which reflect the true ΔSASA_D-N) and the calculated values of ΔSASA_D-N can be significant (Fig. 2 in Myers et al., 1995, and Fig. 3 in Robertson and Murphy, 1997). Now, since the solvation shell around the DSEs of large proteins is relatively greater than that of small proteins even when the residual structure in the DSEs under folding conditions is taken into consideration, large proteins on average expose relatively greater amount of non-polar SASA upon unfolding than do small proteins; consequently, both m_D-N and ΔC_pD-N values also correlate linearly with chain-length, albeit with considerable scatter since chain length, owing to the residual structure in the DSEs, is unlikely to be a true descriptor of the SASA of the DSEs of proteins under folding conditions (note that the scatter can also be due to certain proteins having anomalously high or low number of non-polar residues). The point we are trying to make is the following: Because the native structures of proteins are relatively insensitive to small variations in pH and co-solvents,⁴⁴ and since the number of ways in which foldable polypeptides can be packed into their native structures is relatively limited (as inferred from the limited number of protein folds, see SCOP: www.mrc-lmb.cam.ac.uk and CATH: www.cathdb.info databases), one might find a reasonably good correlation between chain lengths and the SASAs of the NSEs of proteins of differing primary sequences under varying solvents (Fig. 1 in Miller et al., 1987).^45,46 However, since the SASAs of the DSEs under folding conditions, owing to residual structure are variable, until and unless we find a way to accurately simulate the DSEs of proteins, and if and only if these theoretical methods are sensitive to point mutations, changes in pH, co-solvents, temperature and pressure, it is almost impossible to arrive at a universal equation that will describe how the ΔSASA_D-N under folding conditions will vary with chain length, and by logical extension, how m_D-N and ΔC_pD-N will vary with SASA or chain length. Nevertheless, if we consider a single two-state-folding primary sequence under constant pressure and solvent conditions and vary the temperature, and if the properties of the solvent are temperature-invariant (for example, no change in the pH due to the temperature-dependence of the pK_a of the constituent buffer), then the manner in which the ΔC_pD-TS(T) and ΔC_pTS-N(T) functions vary with temperature must be consistent with the temperature-dependence of m_TS-D(T) and m_TS-N(T), respectively, and by logical extension, with ΔSASA_D-TS(T) and ΔSASA_TS-N(T), respectively.

Inspection of Figures 15 and Figure 15−figure supplements 1, 2 and 3 demonstrate that: (i) both ΔC_pD-TS(T) and ΔC_pTS-N(T) vary with temperature; and (ii) their gross features stem primarily from the second derivatives of the temperature-dependence of the curve-crossing with respect to the DSE and the NSE. The prediction that the change in heat capacities for the partial unfolding reactions, N ⇌ [TS] and [TS] ⇌ D, must vary with temperature is due to Eqs. (12) and (13). Although this may not be readily apparent from a casual inspection of the equations, even a cursory examination of Figures 8 and 9 shows that it is simply not possible for ΔC_pD-TS(T) and ΔC_pTS-N(T) functions to be temperature-invariant since the slopes of the ΔHTS-D(T) and the ΔH_TS-N(T) functions are continuously changing with temperature. If we recall that the force constants are temperature-invariant, it becomes readily apparent that the second terms in the brackets on the right-hand-side (RHS) of Eqs. (12) and (13) i.e., ωT(ΔS_D-N(T) and αT(ΔS_D-N(T))², respectively, will be parabolas with a minimum (zero) at T_S. This is due to ΔS_D-N(T) being negative for T < T_S, positive for T > T_S, and zero for T = T_S. Furthermore, since φ, and m_TS-N(T) are a maximum, and m_TS-D(T) a minimum at T_S, the expectation is that ΔC_pD-TS(T) must be a minimum (or ΔC_pTS-D(TS) is the least negative), and ΔC_pTS-N(T) must be a maximum at T_S. Thus, for T = T_S, Eqs. (12) and (13) become

Figure 14−figure supplement 5. The natural logarithm of k_u(T) is linearly dependent on the Massieu-Planck activation potential for unfolding.

(A) The natural logarithm of k_u(T) has a complex dependence on the Gibbs barrier to unfolding when explored over a large temperature range. The slope of this curve is given by −ΔH_TS-N(T)/ΔS_TS-N(T)RT². (B) The natural logarithm of k_u(T) decreases linearly with an increase in the Massieu-Planck activation potential for unfolding, with the magnitude of the negative slope being given by the reciprocal of the gas constant. The y-intercept at zero Massieu-Planck potential yields the value of the prefactor. Naturally, k_u(T) is a minimum when the magnitude of the Massieu-Planck function for unfolding is a maximum, and this occurs precisely at T_H(TS-N). The reason why the data points for the unfolding rate constants extend all the way to the intercept is because the Gibbs barrier to unfolding becomes zero at T_S(α) and T_S(ω).

Figure 15−figure supplement 1. Appropriately scaled ΔC_pD-TS(T) and ΔC_pTS-N(T) functions to showcase their features.

(A) ΔC_pTS-N(T) which is a maximum and positive at T_S, decreases with any deviation in temperature from T_S, is zero at T_{C_pTS-N(α)} and T_{C_pTS-N(ω)}, and negative for T_α ≤ T < T_{C_pTS-N(α)} and T_{C_pTS-N(ω)} < T ≤ T_ω. (B) At the temperatures where the ΔC_pD-TS(T) and ΔC_pTS-N(T) functions intersect (214.1K and 345.9 K), the absolute heat capacity of the TSE is exactly half the sum of the absolute heat capacities of the DSE and the NSE. The black pointers indicate that the extrema of ΔC_pD-TS(T) and ΔC_pTS-N(T) functions, while the green pointers indicate their intersection. Inspection shows that ΔC_pTS-N(T) > ΔC_pD-TS(T) for 240 K < T < 320 K (shaded region), and is approximately five fold greater than ΔC_pTS-N(T) at T_S (343.7/73.3 = ∽ 4.7) despite ∽30% and ∽70% of the total change in SASA for the unfolding reaction N ⇌ D, occurring in the partial unfolding reactions N ⇌ [TS] and [TS] ⇌ D, respectively (Figure 2−figure supplement 1).

The prediction that the extrema of ΔC_pD-TS(T) and ΔC_pTS-N(T) functions must occur at T_S is readily apparent from Figure 15 and Figure 15−figure supplement 1B. Importantly, consistent with the relationship between m_D-N and ΔC_pD-N values, comparison of these two figures with Figure 2 and Figure 2−figure supplement 1 demonstrates that just as m_TS-D(T) and m_TS-N(T) are a minimum and a maximum at T_S, respectively, so too are ΔC_pD-TS(T)and ΔC_pTS-N(T) functions. This leads to two obvious corollaries: (i) the difference in heat capacity between the DSE and the TSE is a minimum when the difference in SASA between the DSE and the TSE is a minimum; and (ii) the difference in heat capacity between the TSE and the NSE is a maximum when the difference in SASA between the TSE and the NSE is a maximum. Because ΔS_TS-D(T) = ΔS_TS-N(T) = 0, ΔG_TS-D(T) is a minimum, and both ΔG_TS-N(T) and ΔG_D-N(T) are a maximum, at T_S (Figures 1, 5 and Figure 11−figure supplement 1B), a fundamentally important conclusion is that the Gibbs barriers to folding and unfolding are a minimum and a maximum, respectively, and equilibrium stability is a maximum, and are all purely enthalpic when ΔC_pD-TS(T) and ΔC_pTS-N(T) are a minimum and a maximum, respectively.

Figure 15. Temperature-dependence of ΔC_pD-TS(T) and ΔC_pTS-N(T) functions.

(A) ΔC_pD-TS(T) is positive throughout the temperature range and is a minimum at T_S (or ΔC_pTS-D(T) is a maximum or the least negative at T_S). (B) ΔC_pTS-N(T) is a maximum at T_S, positive for T_{C_pTS-N(α)} < T < T_{C_pTS-N(ω)}, negative for T_α ≤ T < T_{C_pTS-N(α)} and T_{C_pTS-N(ω)} < T ≤ T_ω, and as described earlier, zero at T_{C_pTS-N(α)} and T_{C_pTS-N(ω)}. These aspects can be better appreciated from the appropriately scaled views shown in the figure supplement. Note that the algebraic sum of ΔC_pD-TS(T) and ΔC_pTS-N(T) must equal ΔC_pD-N throughout the temperature-regime.

Inspection of Figure 15 and Figure 15−figure supplement 1 demonstrates that unlike ΔC_pD-TS(T) which is positive across the entire temperature range, ΔC_pTS-N(T) which is a maximum and positive at T_S, decreases with any deviation in temperature from T_S, and is zero at T_{C_pTS-N(α)} and T_{C_pTS-N(ω)}; consequently, ΔC_pTS-N(T) < 0 for T_α ≤ T < T_{C_pTS-N(α)} and T_{C_pTS-N(ω)} < T ≤ T_ω. The reason for this behaviour is apparent from inspection of Figures 9 and 11: The slope of the ΔH_TS-N(T) and ΔS_TS-N(T) functions becomes zero at T_{C_pTS-N(α)} and T_{C_pTS-N(ω)}; and any further decrease or increase in temperature, respectively, causes the slope to invert. This can be mathematically shown as follows: Since m_TS-N(T) = 0 at T_S(α) and T_S(ω), we have and φ = (αm_D-N)² at T_S(α) and T_S(ω). Substituting these relationships in Eq. (13) leads to

Further, since ΔC_pD-N = ΔC_pD-TS(T) + ΔC_pTS-N(T) for a two-state system, we have

Because ΔC_pTS-N(T) < 0 at T_S(α) and T_S(ω), and the lone extremum of ΔC_pTS-N(T) (which is algebraically positive and a maximum) occurs at T_S, it implies that there will be two unique temperatures at which ΔC_pTS-N(T) = 0, one in the low temperature (T_{C_pTS-N(α)}) such that T_S(α) < T_{C_pTS-N(α)} < T_S, and the other in the high temperature regime (T_{C_pTS-N(ω)}) such that T_S < T_{C_pTS-N(ω)} < T_S(ω). Thus, at the these two unique temperatures T_{C_pTS-N(α)} and T_{C_pTS-N(ω)}, we have ΔC_pD-TS(T) = ΔC_pD-N ⇒ β_H(fold)(T) = 1 and βH(unfold)(T) = 0; and for the temperature regimes T_α ≤ T < T_{C_pTS-N(α)} and T_{C_pTS-N(ω)} < T ≤ T_ω, we have ΔC_pD-TS(T) > ΔC_pD-N ⇒ β_H(fold)(T) > 1, and ΔC_pTS-N(T) < 0 ⇒ β_H(unfold)(T) < 0 (see heat capacity RC below for the definition of β_H(fold)(T) and β_H(unfold)(T)).

Although the prediction that ΔC_pTS-N(T) must approach zero at very low and high temperatures may not be readily verified by experiment for the low-temperature regime owing to technical difficulty in making a measurement, the prediction for the high-temperature regime is strongly supported by the data on CI2 from the Fersht lab: Despite the temperature-range not being substantial (320 to 340 K), and the data points that define the ΔH_TS-N(T) function being sparse (7 in total), it is apparent even from a cursory inspection that it is clearly non-linear with temperature (Fig. 5B in Tan et al., 1996).⁴⁷ Although Fersht and co-workers have fitted the data to a linear function and reached the natural conclusion that the heat capacity of activation for unfolding is temperature-invariant, they nevertheless explicitly mention that if the non-linearity of ΔH_TS-N(T) were given due consideration, and the data are fit to an empirical-quadratic instead of a linear function, ΔC_pTS-N(T) indeed becomes temperature-dependent and is predicted to approach zero at ∽ 360 K (see text in page 382 in Tan et al., 1996).⁴⁷ Now, since ΔC_pTS-N(T) > 0 and a maximum, and ΔC_pD-TS(T) is a minimum and positive at T_S, and decrease and increase, respectively, with any deviation in temperature from T_S, and since ΔC_pTS-N(T) becomes zero at T_{C_pTS-N(α)} and T_{C_pTS-N(ω)}, the obvious mathematical consequence is that ΔC_pD-TS(T) and ΔC_pTS-N(T) functions must intersect at two unique temperatures. Because at the points of intersection we have the relationship: ΔC_pD-TS(T) = ΔC_pTS-N(T) = ΔC_pD-N/2, a consequence is that ΔC_pTS-N(T) must be positive at the said temperatures, with the low-temperature intersection occurring between T_{C_pTS-N(α)} and T_S, and the high-temperature intersection between T_S and T_{C_pTS-N(ω)}. This is readily apparent from inspection of Figure 15−figure supplement 1B: Both ΔC_pD-TS(T) and ΔC_pTS-N(T) are identical at 214.1 K and 345.9 K. An equivalent interpretation is that at these temperatures, the absolute heat capacity of the TSE is exactly half the algebraic sum of the absolute heat capacities of the DSE and the NSE. As we shall show in subsequent publications, the intersection of various state functions is a source of interesting relationships that may be used as constraints in simulations (see also Figure 9−figure supplement 2).

The position of the TSE along the heat capacity RC

Inspection and comparison of Figure 2−figure supplement 1 and Figure 15−figure supplement 1B demonstrates that although the manner in which the ΔC_pD-TS(T) and ΔC_pTS-N(T) functions vary with temperature is consistent with the relationship between m_D-N and ΔC_pD-N values, there is nevertheless an intriguing anomaly that is at odds with the LLTM for heat capacity. If we consider the partial folding reaction D ⇌ [TS], it is readily apparent from these figures that although the denatured conformer diffuses > ∽ 70% along the normalized SASA-RC to reach the TSE for 240 K < T < 320 K, ΔC_pD-TS(T) ≪ ΔC_pTS-N(T) throughout this regime. Conversely, if we consider the total unfolding reaction N ⇌ D, a large fraction of ΔC_pD-N is accounted for not by the second-half of the unfolding reaction ([TS]) ⇌ D but by the first-half ( N ⇌ [TS]), despite the native conformer diffusing less than ∽30% along the SASA-RC to reach the TSE. To put things into perspective, we will need to normalize the heat capacities of activation. Adopting Leffler’s framework for the relative sensitivities of the activation and equilibrium enthalpies in response to a perturbation in temperature,⁴⁸ we may write where β_H(fold)(T) = β_S(fold)(T) and β_H(unfold)(T) = β_S(unfold)(T) (see Paper-II) are classically interpreted to be a measure of the position of the TSE along the heat capacity RC.⁴⁹ Naturally, for a two-state system the algebraic sum of β_H(fold)(T) and β_H(unfold)(T) is unity. Recasting Eqs. (24) and (25) in terms of (12) and (13) gives

When T = T_S, ΔS_D-N(T) = 0 and Eqs. (26) and (27) reduce to

As explained earlier, because ΔC_pD-N is temperature-invariant by postulate, and ΔC_pD-TS(T) is a minimum, and ΔC_pTS-N(T) is a maximum at T_S, β_H(fold)(T) and β_H(unfold)(T) are a minimum and a maximum, respectively, at T_S. How do β_H(fold)(T) and β_H(unfold)(T) compare with their counterparts, β_T(fold)(T) and β_T(unfold)(T)? This is important because a statistically significant correlation exists between m_D-N and ΔC_pD-N, and both these two parameters independently correlate with ΔSASA_D-N. Recasting Eqs. (28) and (29) gives

Since m_TS-N(T) > 0 and a maximum, and m_TS-D(T) > 0 and a minimum, respectively, at T_S, it is readily apparent from inspection of Eqs. (1) and (2) that and at T_S. Consequently, we have: β_T(fold)(T)|_{T=T_S} > β_H(fold)(T)|_{T=T_S} and β_T(unfold)(T)|_{T=T_S} < β_H(unfold)(T)|_{T=T_S}.

In agreement with the predictions of Eqs. (30) and (31), inspection of Figure 16 demonstrates that although the denatured conformer advances by > ∽ 70% along the SASA-RC to reach the TSE when T = T_S, it accounts for < ∽20% of the total change in ΔC_pD-N (i.e., β_T(fold)(T)|_{T=T_S} > β_H(fold)(T)|_{T=T_S}), with the rest of the change (> ∽ 80%) in heat capacity coming from a mere ∽ 30% diffusion of the activated conformer along the SASA-RC to reach the bottom of the native Gibbs basin (i.e., β_T(unfold)(T)|_{T=T_S} < β_H(unfold)(T)|_{T=T_S}). The theoretical prediction that β_T(fold)(T) > β_H(fold)(T) across a substantial temperature range is supported by the finding by Gloss and Matthews (1998) that the position of the TSE relative to the DSE along the heat capacity RC is consistently lower than the same along the SASA-RC (see also page 178 in Bilsel and Matthews, 2000, and references therein).^50,51

Figure 16 Comparison of the position of the TSE along the heat capacity RC and the SASA-RC.

(A) Although the temperature-dependences of β_H(fold)(T) is consistent with β_T(fold)(T), and both are a minimum at T_S, their magnitudes are not even remotely similar across a large temperature regime (240 K < T < 320 K, shaded area); and when T = T_S, β_T(fold)(T) is four fold greater than β_H(fold)(T) (0.7063/0.1759 = 4.0). Note that the position of the TSE relative to the DSE along the heat capacity and SASA-RCs are identical at the points of intersection (203.6 K and 358.3 K). (B) Although the temperature-dependence of β_H(unfold)(T) is consistent with β_T(unfold)(T), and both are a maximum at T_S, β_H(unfold)(T) > β_T(unfold)(T) for 240 K < T < 320 K; and when T = T_S, β_H(unfold)(T) is ∽2.8 fold greater than β_T(unfold)(T) (0.8241/0.2937 = 2.81). The position of the TSE relative to the NSE along the heat capacity and SASA-RCs are identical at the points of intersection (203.6 K and 358.3 K). See Figure 16−figure supplement 1 for unscaled plots of β_H(fold)(T) and β_H(unfold)(T).

Figure 15-figure supplement 2 The second derivatives of m_TS-D(T) and m_TS-N(T) with respect to temperature. (A) The second derivative of m_TS-D(T) according to Eq. (A9). (B) The second derivative of m_TS-N(T) according to Eq. (A10). The sole intent of these figures is to demonstrate that the gross features of the temperature-dependence of the heat capacity functions arise primarily from the second derivatives of the temperature-dependent shift in the position of the TSE relative to the vertices of the DSE or the NSE Gibbs parabolas along the RC. See Figure 15−figure supplement 3 for the location of the extrema of these two functions.

Figure 15-figure supplement 3 The extrema of the second derivatives of m_TS-D(T) and m_TS-N(T) with respect to temperature are not at T_S.

(A) The sole intent of these appropriately scaled figures is to demonstrate that although the gross features of the temperature-dependence of the heat capacity functions arise predominantly from the second derivatives of the temperature-dependent shift in the position of the TSE along the RC, the minimum of ∂²m_TS-D(T)/∂T² and the maximum of ∂²m_TS-N(T)/∂T² do not occur at T_S (green circles), and is apparent from comparison of Eqs. (12), (13), (A9) and (A10).

Figure 16-figure supplement 1 Temperature-dependence of β_H(fold)(T) and β_H(unfold)(T).

(A) Variation in β_H(fold)(T) with temperature according to Eq. (26). (B) Variation in β_H(unfold)(T) with temperature according to Eq. (27). The location of the extrema is not apparent in these figures. Note that although the algebraic sum of β_H(fold)(T) and β_H(unfold)(T) must always be unity for a two-state system, they need not be individually restricted to a canonical range of 0 to 1.

Now, if we accept the long held premise that the greater heat capacity of the DSE as compared to the NSE is purely or predominantly due to structured water around the exposed non-polar residues in the DSE, then the only way we can explain why ΔC_pD-TS(T) ≪ ΔC_pTS-N(T) despite β_T(fold)(T) > ∽70% for the partial folding reaction D ⇌ [TS] is that the non-polar SASA of both the DSE and the TSE are very similar at T_S. Because it is physically near-impossible for the denatured conformer to advance by > ∽ 70% along the SASA-RC to reach the TSE, and yet keep the non-polar SASA fairly constant such that ΔC_pD-TS(T) is just about 20% of ΔC_pD-N, the natural conclusion is that “the large and positive difference in heat capacity between the DSE and the NSE cannot be only due to the clathrates of water molecules around exposed non-polar residues in the DSE.”^38,52-54 This brings us to two studies on the heat capacities of proteins, one by Sturtevant almost four decades ago, and the other by Lazaridis and Karplus.^55,56 While Sturtevant identified six possible sources of heat capacity which are: (i) the hydrophobic effect; (ii) electrostatic charges; (iii) hydrogen bonds; (iv) conformational entropy; (v) intramolecular vibrations; and (vi) changes in equilibria, and concluded that the most important of these are the hydrophobic, conformational and vibrational effects, Lazaridis and Karplus concluded from their molecular dynamics simulations on truncated CI2 that the heat capacity can have a significantly large and a positive contribution from intra-protein non-covalent interactions. What these two studies essentially imply is that when the pressure and solvent properties are defined and temperature-invariant, the ability of the conformers in a protein reaction-state to absorb thermal energy and yet resist an increase in temperature is dependent on: (i) its molecular structure; and (ii) the size and the character of its molecular surface (i.e., the relative proportion of polar and non-polar SASA). While the first variable determines the capacity of the reaction-state to absorb thermal energy and distribute it across its various internal modes of motion (the vibrational, rotational, and to some extent, the translational entropy from elements such as the N and C-terminal regions, loops etc. that can flap around in the solvent), the second variable determines not only the size and thickness of the solvent shell but also how tightly or loosely the solvent molecules are bound to the protein surface and to themselves (i.e., the dynamics of water in the solvation shell as compared to bulk water; see Fig. 1 in Frauenfelder et al., 2009), and by extension, the amount of excess thermal energy needed to disrupt the solvent shell as the reaction-states interconvert due to thermal noise.^36,52,57-61 Further discussion on the determinants of heat capacity is beyond the scope of this article and will be addressed elsewhere.

On the inapplicability of the Hammond postulate to protein folding

Although it is difficult to provide a detailed physical explanation for the temperature-dependence of the heat capacities of activation without deconvoluting the activation enthalpies and entropies into their constituent chain and desolvation enthalpies and entropies (shown in the accompanying article), it is instructive to give one extreme example to emphasize why both the solvent shell and the non-covalent interactions make a significant contribution to heat capacity (note that as long as the difference in the number of covalent bonds between the reaction-states is zero, to a first approximation, their contribution to the difference in heat capacity between the reaction-states can be ignored; see Lecture II in Finkelstein and Ptitsyn, 2002, and references therein).^38,56,62,63

It was shown earlier that when T = T_S(α) and T_S(ω), we have m_TS-N(T) = 0 ⇒ ΔSASA_TS-N(T) = 0, leading to a unique set of relationships: G_TS(T) = G_N(T), H_TS(T) = H_N(T), S_TS(T) = S_N(T), and k_u(T) = k⁰ (Figures 2B, Figure 2−figure supplement 1B, 4C, 5B, 6B, 9, and 11). However, we note from Eq. (22) that ΔC_pTS-N(T) < 0 at these two temperatures and is ∽ −6.2 kcal.mol^-1.K^-1 for FBP28 WW (Figure 15B). Since the molar concentration of the TSE is identical to that of the NSE at T_S(α) and T_S(ω), what this physically means is that if we were to take a mole of NSE and a mole of TSE and heat them at constant pressure under identical solvent conditions, we will find that the NSE, relative to the TSE, will absorb thermal energy equivalent to ∽6.2 calories before both the TSE and the NSE will independently register a 10⁻³ K rise in temperature. Because at these two temperatures the SASA, the Gibbs energy, the enthalpy, and the entropy of the TSE and the NSE are identical, this large difference in heat capacity which is ∽15-fold greater than ΔC_pD-N (6.2/0.417 = 14.8) must stem from a complex combination of: (i) a difference in the number and kinds of non-covalent interactions;⁶⁴ (ii) the precise 3D-arrangement of the non-covalent interactions (i.e., the network of interactions) leading to a difference in their fundamental frequencies;^55,56 and (iii) the character of the surface exposed to the solvent (i.e., polar vs non-polar SASA) between the said reaction-states.^65-67 Thus, a fundamentally important conclusion that we may draw from this behaviour is that “two reaction-states on a protein folding pathway need not necessarily have the same structure even if their interconversion proceeds with concomitant zero net-change in SASA, enthalpy, entropy, and Gibbs energy.” A corollary is that the reaction-states on a protein folding pathway are distinct entities with respect to both their internal structure and the character of their molecular surface. What this implies is that the Hammond postulate which states that “if two states, as for example, a transition state and an unstable intermediate, occur consecutively during a reaction process and have nearly the same energy content, their interconversion will involve only a small reorganization of the molecular structures,”⁶⁸ although may be applicable to reactions of small molecules, is inapplicable to protein folding. The inapplicability stems primarily from the profound differences between non-covalent protein folding reactions and covalent reactions of small molecules. In the simplest reactions of small molecules, except for the one or two bonds that are being reconfigured, the rest of the reactant-structure, to a first approximation, usually remains fairly intact as the reaction proceeds (this need not necessarily hold for all simple chemical reactions and probably not for complex reactions). Consequently, if we were to use the bond-length of the bond that is being reconfigured as the RC, and find that the difference in Gibbs energy between any two reaction-states that occur consecutively along the RC are very similar, a reasonable assumption/expectation would be that their structures must be very similar.^69-77 However, such an assumption cannot be valid for protein folding since an incredibly large number of chain and solvent configurations can lead to conformers having exactly the same Gibbs energy. Consequently, it is difficult to imagine how one can infer the structure of the transiently populated protein reaction-states, including the TSEs, to a near-atomic resolution purely from energetics (see Φ-value analysis later).^78-80

The position of the TSE along the entropic RC

The Leffler parameters for the relative sensitivities of the activation and equilibrium Gibbs energies in response to a perturbation in temperature are given by the ratios of the derivatives of the activation and equilibrium Gibbs energies with respect to temperature.^13-15,81 Thus, for the partial folding reaction D ⇌ [TS], we have where β_G(fold)(T) is classically interpreted to be a measure of the position of the TSE relative to the DSE along the entropic RC.⁴⁹ Recasting Eq. (32) in terms of (8) and (A4) and rearranging gives

Similarly for the partial unfolding reaction N ⇌ [TS] we have Where β_G(unfold)(T) is a measure of the position of the TSE relative to the NSE along the entropic RC. Substituting Eqs. (9) and (A6) in (34) gives

Inspection of Eqs. (32) and (34) shows that β_G(fold)(T) + β_G(unfold)(T) = 1 for any given reaction-direction. Now, since ΔS_D-N(T) = ΔS_TS-D(T) = ΔS_TS-N(T) = 0 at T_S, β_G(fold)(T) and β_G(unfold)(T) will be undefined for T = T_S. However, these are removable discontinuities as is apparent from Eqs. (33) and (35); consequently, curves simulated using the latter set of equations will have a hole at T_S. If we ignore the hole at T_S to enable a physical description and their comparison to other RCs, the extremum of β_G(fold)(T) (which is positive and a minimum) and the extremum of β_G(unfold)(T) (which is positive and a maximum) will occur at T_S (Figure 17 and Figure 17−figure supplement 1) and is a consequence of m_TS-D(T) being a minimum, and both m_TS-N(T) and φ being a maximum, respectively, at T_S. This can also be demonstrated by differentiating Eqs. (32) and (34) with respect to temperature (not shown). Comparison of Eqs. (28) and (33), and Eqs. (29) and (35) demonstrate that when T = T_S, we have β_H(fold)(T) = β_G(fold)(T) and β_H(unfold)(T) = β_G(unfold)(T), i.e., the position of the TSE along the heat capacity and entropic RCs are identical at T_S, and non-identical for T ≠ T_S (Figure 17). Further, since m_TS-N(T) = β_T(unfold)(T) = 0 at T_S(α) and T_S(ω) (Figure 2B and Figure 2−figure supplement 1B), β_G(unfold)(T) ≡ β_T(unfold)(T) = 0 and β_G(fold)(T) ≡ β_T(fold)(T) = 1, and not identical for T ≠ T_S(α) and T_S(ω); and for T_α ≤ T < T_S(α) and T_S(ω) < T ≤ T_ω (the ultralow and high temperature Marcus-inverted-regimes, respectively), β_G(fold)(T) and β_T(fold)(T) are greater than unity, and β_G(unfold)(T) and β_T(unfold)(T) are negative (Figure 18). Note that although β_G(fold)(T) is unity at T_S(α) and T_S(ω), the structures of the TSE and the NSE cannot be assumed to be identical as explained earlier.

Figure 17-figure supplement 1 Temperature-dependence of β_G(fold)(T) and β_G(unfold)(T).

(A) Variation in β_G(fold)(T) with temperature according to Eq. (33). (B) Variation in β_G(unfold)(T) with temperature according to Eq. (35). The location of the extrema is not apparent in these figures. Note that although the algebraic sum of β_G(fold)(T) and β_G(unfold)(T) must always be unity for a two-state system, they need not be individually restricted to a canonical range of 0 to 1.

Figure 17 Comparison of the position of the TSE along the heat capacity and the entropic RCs.

(A) The position of the TSE with respect to the DSE along the heat capacity and the entropic RCs are identical at T_S, and non-identical for T ≠ T_S. (B) The position of the TSE with respect to the NSE along the heat capacity and the entropic RCs are identical at T_S, and non-identical for T ≠ T_S. See Figure 17−figure supplement 1 for unscaled plots of β_G(fold)(T) and β_G(unfold)(T).

Figure 18 Comparison of the position of the TSE along the heat capacity and the entropic RCs.

(A) The position of the TSE with respect to the DSE along the SASA and the entropic RCs are identical at T_S(α) and T_S(ω), and dissimilar for T ≠ T_S(α) and T_S(ω). (B) The position of the TSE with respect to the NSE along the SASA and the entropic RCs are identical at T_S(α) and T_S(ω), and dissimilar for T ≠ T_S(α) and T_S(ω). The green pointers indicate T_S(α) and T_S(ω).

Although it is beyond the scope of this manuscript to perform a large-scale survey of literature for corroborating evidence, the notion that these equations must hold for any two-state folder (as long as they conform to the postulates laid out in Paper-I) is readily apparent from the experimental data of Kelly, Gruebele and colleagues.^25,82-84 However, the reader will note that what Gruebele and coworkers refer to as Φ_T(T, P) (see Eq. (8) in Crane et al., 2000 and Jäger et al., 2001, Eq. (5) in Ervin and Gruebele, 2002, and Eq. (3) in Nguyen et al., 2003) is equivalent to β_G(T) in this article. We will reserve the letter Φ for Φ-value analysis which we will address later.⁷⁹ Inspection of Fig. 7a in Crane et al., 2000 demonstrates that β_G(fold)(T) increases with temperature for T > T_S for both the wild type hYAP WW domain and its mutant W39F (∽0.4 at 38 °C and ∽0.8 at 78 °C). This pattern is once again repeated for the wild type and several mutants of Pin WW domain (Fig. 8 in Jäger et al., 2001) and more importantly for ΔNΔC Y11R W30F, a variant of FBP28 WW (inset in Fig. 4B in Nguyen et al., 2003). Nevertheless, all is not in agreement since the shapes of their β_G(fold)(T) curves are distinctly different from what is expected from the formalism discussed in this article. This discrepancy most probably has to do with their use of Taylor expansion with three adjustable parameters to calculate the temperature-dependence of equilibrium stability and the Gibbs activation energies. While it is stated that the use of this non-classical model and the associated adjustable parameters in preference to the physically realistic Schellman formalism (which requires the model-independent calorimetrically determined value of ΔC_pDN)⁶ makes little or no difference to the temperature-dependence of equilibrium stability over an extended temperature range, this may not be true for the activation energy. Once again in good agreement with prediction that β_G(unfold)(T) must decrease with temperature for T > T_S, Tokmakoff and coworkers find that β_G(unfold)(T) for ubiquitin decreases with temperature (0.77 at 53 °C and 0.67 at 67 °C).⁸⁵ Note that although raw data of the said groups and their conclusion that the position of the TSE shifts closer to the NSE as the temperature is raised for T > T_S is in agreement with the predictions of the equations derived here, their Hammond-postulate-based inference of the structure of the TSE is flawed from the perspective of the parabolic approximation.

Now, at the midpoint of thermal (T_m) or cold denaturation (T_c), ΔG_D-N(T) = 0; therefore, Eqs. (1) and (2) become

Substituting Eqs. (36) and (37), and in (33) and (35), respectively, and simplifying gives

Simply put, at the midpoint of cold or heat denaturation, the position of the TSE relative to the DSE along the entropic RC is identical to the position of the TSE relative to the NSE along the SASA-RC (Figure 19A). Similarly, the position of the TSE relative to the NSE along the entropic RC is identical to the position of the TSE relative to the DSE along the SASA-RC (Figure 19B). Dividing Eq. (38) by (39) gives

This seemingly obvious relationship has far deeper physical meaning. Simplifying further and recasting gives

Figure 19 The intersection of β_G(T) and β_T(T) functions.

(A) At the midpoint of cold or heat denaturation, the position of the TSE relative to the DSE along the entropic RC is identical to the position of the TSE relative to the NSE along the SASA-RC. (B) The position of the TSE relative to the NSE along the entropic RC is identical to the position of the TSE relative to the DSE along the SASA-RC at the midpoint of cold or heat denaturation.

Thus, at the temperatures T_c and T_m where the concentration of the DSE and the NSE are identical, the ratio of the slopes of the folding and unfolding arms of the chevron determined at the said temperatures are a measure of the ratio of the change in entropies for the partial folding reactions [TS] ⇋ N and D ⇋[TS], or the square root of the ratio of the Gaussian variances of the DSE and the NSE along the SASA-RC, or equivalently, the ratio of the standard deviations of the DSE σ_DES(T) and the NSE σ_NES(T) Gaussians (Figure 19−figure supplement 1; see Paper-I for the relationship between force constants, Gaussian variances and equilibrium stability). A corollary is that irrespective of the primary sequence, or the topology of the native state, or the residual structure in the DSE, if for a spontaneously folding two-state system at constant pressure and solvent conditions it is found that at a certain temperature the ratio of the distances by which the denatured and the native conformers must travel from the mean of their ensemble to reach the TSE along the SASA RC is identical to the ratio of the standard deviations of the Gaussian distribution of the SASA of the conformers in the DSE and the NSE, then at this temperature the Gibbs energy of unfolding or folding must be zero.

Figure 19-figure supplement 1 The correspondence between Gibbs parabolas and Gaussian PDFs.

(A) Parabolic Gibbs energy curves with α = 7.594 M².mol.kcal^-1 and ω = 85.595 M².mol.kcal^-1, m_D-N = 0.82 kcal.mol^-1.M^-1 and ΔG_D-N(T) = 2.138 kcal.mol^-1. The separation between curve-crossing and the vertices of the DSE and the NSE-parabolas along the abscissa are 0.5848 kcal.mol^-1.M^-1 and 0.2352 kcal.mol^-1.M^-1, respectively. The absolute values of ΔG_TS-D(T) and ΔG_TS-N(T) are 2.597 kcal.mol^-1 and 4.735 kcal.mol^-1, respectively. The parabolas have been generated as described in the legend for Figure 3. (B) Gaussian PDFs for the DSE and NSE generated using , where r is any point on the abscissa, μ = 0 kcal.mol^-1.M^-1 and σ² = RT/2α for the DSE-Gaussian, and μ = 0.82 kcal.mol^-1.M^-1 and σ² = RT/2ω for the NSE-Gaussian. The units for the Gaussian variances are in kcal².mol^-2.M^-2. The relationship between equilibrium stability and the areas enclosed by the DSE and the NSE Gaussians has been addressed in Paper-I.

As an aside, the reader will note that β_G(fold)(T) and β_G(unfold)(T) are equivalent to the Brønsted exponents alpha and beta, respectively, in physical organic chemistry; and their classical interpretation is that they are a measure of the structural similarity of the transition state to either the reactants or the products.⁸¹ If the introduction of a systematic perturbation (often a change in structure via addition or removal of a substituent, pH, solvent etc.) generates a reaction-series, and if for this reaction-series it is found that alpha is close to zero (or beta close to unity), then it implies that the energetics of the transition state is perturbed to the same extent as that of the reactant, and hence inferred that the structure of the transition state is very similar to that of the reactant. Conversely, if alpha is close to unity (or beta is almost zero), it implies that the energetics of the transition state is perturbed to the same extent as the product, and hence inferred that the transition state is structurally similar to the product. Although the Brønsted exponents in many cases can be invariant with the degree of perturbation (i.e., a constant slope leading to linear free energy relationships),^70,86 this is not necessarily true, especially if the degree of perturbation is substantial (Fig. 3 in Cohen and Marcus, 1968; Fig. 1 in Kresge, 1975).^y14,72,81 Further, this seemingly straightforward and logical Hammond-postulate-based conversion of Brønsted exponents to similarity or dissimilarity of the structure of the transition states to either of the ground states nevertheless fails for those systems with Brønsted exponents greater than unity and less than zero (see page 1897 in Kresge, 1974).^24,81,87-91

To summarise, a comparison of the position of the TSE along the solvent (β_T(T)), heat capacity (β_H(T)), and entropic (β_G(T)) RCs leads to three important general conclusions (Figure 20): (i) as long as ΔSASA_D-N is large, and by extension ΔC_pD-N is large and positive, the position of the TSE relative to the ground states along the various RCs is neither constant nor a simple linear function of temperature when investigated over a large temperature range; (ii) for a given temperature, the position of the TSE along the RC depends on the choice of the RC; and (iii) although the algebraic sum of β_T(fold)(T) and β_T(unfold)(T), β_H(fold)(T) and β_H(unfold)(T), and β_G(fold)(T) and β_G(unfold)(T) must be unity for a two-state system for any particular temperature, individually they can be positive, negative, or zero. Consequently, the notion that the atomic structure of the transiently populated reaction-states in protein folding can be inferred from their position along the said RCs is flawed.⁷⁸

Figure 20 Comparison of the position of the TSE along the various reaction coordinates.

The position of the TSE relative to the ground states depends on the choice of the RC and changes in a complex manner with temperature. (A) For 210 K∽ < T < ∽350 K (shaded region), the position of the TSE relative to the DSE is the most advanced along the solvent RC as compared to the heat capacity and entropic RCs; and for T ≠ T_S, is the most advanced along the heat capacity RC as compared to the entropic RC (B) In contrast, for 210 K∽ < T < ∽350 K and T ≠ T_S, the position of the TSE relative to the NSE is the most advanced along the entropic RC as compared to the heat capacity RC, and is the least advanced along the SASA-RC.

Temperature-dependence of Φ-values

Φ-value analysis is a variation of the Brønsted procedure introduced by Fersht and coworkers which when properly implemented claims to provide a near-atomic-level description of the transiently populated reaction-states in protein folding.^79,80 In this procedure, the primary sequence of the target protein is modified using protein engineering, and the effect of these perturbations are quantified through a parameter Φ (0 ≤ Φ ≤ 1) which by definition is the ratio of mutation-induced change in the Gibbs activation energy of folding/unfolding to the corresponding change in equilibrium stability. According to the canonical formulation, when Φ_F(T) = 0 (Φ-value for folding), it implies that the energetics of the TSE is perturbed to the same extent as that of the DSE upon mutation, and hence inferred that the said reaction-states are structurally identical with respect to the site of mutation. In contrast, when Φ_F(T) = 1, it implies that the energetics of the TSE is perturbed to the same extent as that of the NSE, and hence inferred that the structure at the site of mutation is identical in both the TSE and the NSE. Partial Φ-values are difficult to interpret and are thought to be due to partially developed interactions in the TSE, or multiple routes to the TSE. Thus, while Φ per se is the slope a two-point Brønsted plot, the conversion of this value to relative-structure is based on the Hammond postulate and the canonical range: The Hammond postulate provides the licence to infer structure from energetics, and the canonical scale enables one to infer how similar or dissimilar the TSE is to either the DSE or the NSE. Assuming that the prefactor is identical for the wild type and the mutant proteins, we may write for the partial folding (D ⇌ [TS]) and unfolding (N ⇌ [TS]) reactions where the subscripts “wt” and “mut” denote the reference or the wild type, and the structurally perturbed protein, respectively, and Φ_U(T) is the Φ-value for unfolding. Inspection of Eqs. (42) and (43) shows that for a two-state system, Φ_F(T) + Φ_U(T) = 1. Now, although the primary sequence is intact in thermal denaturation experiments, we can readily calculate the temperature-dependence of Φ values for folding and unfolding using the protein at one unique temperature as the internal reference or the wild type, and protein at all the rest of the temperatures as the mutants. Thus, if the protein at T_S is defined as the internal reference or the wild type, Eqs. (42) and (43) become

Similarly, if the protein at T_m is defined as the internal reference or the wild type, Eqs. (42) and (43) become Where x = ΔG_{TS-D(T_m)} = ΔG_{TS-N(T_m)} and y = ΔG_N-D(T) ≡ −ΔG_D-N(T) (the denominator reduces to a single quantity since ΔG_{D-N(T_m)} ≡ −ΔG_{N-D(T_m)} = 0). The parameters Φ_{F(internal)(T)} and Φ_{U(internal)(T)} (which are obviously undefined for the reference temperatures) when interpreted according to the canonical Φ-value framework (i.e., the notion that 0 ≤ Φ ≤ 1) are a measure of the global similarity or dissimilarity of the structure of the TSE to either the DSE or the NSE. Thus, if Φ_{F(internal)(T)} = 0, it implies that the energetics of the TSE is perturbed to the same extent as that of the DSE upon a perturbation in temperature, and hence inferred that the global structure of the TSE is identical to that of the DSE. Conversely, if Φ_{F(internal)(T)} = 1, it implies that the energetics of the TSE is perturbed to the same extent as the NSE upon a perturbation in temperature, and hence inferred that the global structure of the TSE is identical to that of the NSE.

Inspection of Figures 21 and Figure 21−figure supplements 1, 2, 3 and 4 immediately demonstrates that: (i) irrespective of which temperature is defined as the internal reference (i.e., the wild type), Φ_{F(internal)(T)} must be a minimum and Φ_{U(internal)(T)} must be a maximum at T_S (see Appendix); (ii) the magnitude of Φ_{F(internal)(T)} is always the least, and the magnitude of Φ_{U(internal)(T)} is always the greatest when the protein at T_S is defined as the reference or the wild type protein, and any deviation in the definition of the reference temperature from T_S must lead to a uniform increase in Φ_{F(internal)(T)} and a uniform decrease in Φ_{U(internal)(T)} for all temperatures; (iii) although the algebraic sum of Φ_{F(internal)(T)} and Φ_{U(internal)(T)} is unity for all temperatures, the notion that they must independently be restricted to 0 ≤ Φ ≤ 1 is flawed; and (iv) although both Leffler β_G(T) and Fersht Φ values are derived from changes in Gibbs activation energies for folding and unfolding relative to changes in equilibrium stability upon a perturbation in temperature, their response is not the same since the equations that govern their behaviour are not the same. While the magnitude of the Leffler β_G(T) is independent of the reference owing to it being the ratio of the derivatives of the change in Gibbs energies with respect to temperature, the magnitude of Φ(internal)(T) is dependent on the definition of the reference state. For example, if the protein at T_S is defined as the wild type, then β_G(fold)(T) ≈ Φ_{F(internal)(T)} and β_G(unfold)(T) ≈ Φ_{U(internal)(T)} around the temperature of maximum stability; but as the temperature deviates from T_S, β_G(fold)(T) increases far more steeply than Φ_{F(internal)(T)}, and β_G(unfold)(T) decreases far more steeply than Φ_{U(internal)(T)} such that for T ≠ T_S we have β_G(fold)(T) > Φ_{F(internal)(T)} and β_G(unfold)(T) < Φ_{U(internal)(T)} (Figure 21−figure supplement 3). In contrast, if the protein at T_m is defined as the wild type, then we have: (i) β_G(fold)(T) < Φ_{F(internal)(T)} for T_c < T < T_m and β_G(fold)(T) > Φ_{F(internal)(T)} for T < T_c and T > T_m; and (ii) β_G(unfold)(T) > Φ_{U(internal)(T)} for T_c < T < T_m and β_G(unfold)(T) < Φ_{U(internal)(T)} for T < T_c and T > T_m(Figure 21−figure supplement 4). The point we are trying to make is that a comparison of the position of the TSE along Leffler β_G(T) and Φ_{(internal)(T)} RCs is not straightforward since both β_G(T) and Φ_{(internal)(T)} are temperature-dependent, and importantly respond differently to temperature-perturbation; and even if we restrict the comparison to one particular temperature, the answer we get is still subjective since the magnitude of Φ_{(internal)(T)} is dependent on how we define the wild type.⁹²

Figure 21-figure supplement 1 The magnitude of Φ_{(internal)(T)} is dependent on the definition of the wild type.

(A) Φ_{F(internal)(T)} calculated using the protein at T_S as the wild type must always be lower than Φ_{F(internal)(T)} calculated using protein at T ≠ T_S as the wild type. (B) Φ_{U(internal)(T)} calculated using the protein at T_S as the wild type must always be greater than Φ_{U(internal)(T)} calculated using protein at T ≠ T_S as the wild type. For the blue curves we have ΔG_(wt) ≡ ΔG_{(T_S)}, and for the red curves, we have ΔG_(wt) ≡ ΔG_{(T_m)}. This notation applies to both equilibrium and activation energies. The blue curves are undefined (0/0) at T_S, and the red curves are undefined at T_c and T_m. Note that the mathematical stipulation that Φ_{F(internal)(T)} + Φ_{U(internal)(T)} = 1 for a two-state system is satisfied for both the blue and the red curves for all temperatures.

Although the mathematical formalism for why the extrema of Φ_{F(internal)(T)} (which is a minimum) and Φ_{U(internal)(T)} (which is a maximum) must always occur precisely at T_S has been shown in the appendix, it is instructive to examine the same graphically. Inspection of Figure 21−figure supplements 5, 6 and 7 demonstrates that this is a consequence of ΔG_TS-D(T) and ΔG_N-D(T) being a minimum, and ΔG_TS-N(T) and ΔG_D-N(T) being a maximum at T_S. Subtracting the reference Gibbs energies from the numerator and the denominator (Eq. (44)) has the effect of lowering the ΔG_TS-D(T) curve and raising the ΔG_N-D(T), such that the value of the said curves are zero at the reference temperature, but the shapes of the curves are not altered in any way (Figure 21−figure supplement 5). On the other hand, for ΔG_TS-N(T) and ΔG_D-N(T) curves (Eq. (45)), apart from the value of the curves becoming zero at the reference, it causes them to flip vertically (Figure 21−figure supplement 6). Consequently, if we divide the transformed Gibbs activation energies by the transformed equilibrium Gibbs energies, we end up with Φ_{F(internal)(T)} and Φ_{U(internal)(T)} which are a minimum and a maximum, respectively, at T_S (Figure 21−figure supplement 7).

Now that the process that leads to the temperature-dependence of Φ has been addressed, the question is “Can we infer the structure of the TSE as being similar to either the DSE or the NSE from these data?” The answer is “no” for several reasons. First, as argued earlier, the Hammond postulate cannot be valid for protein folding; and because the structural interpretation of Φ values is based on the Hammond postulate, it too must be deemed fallacious. Second, even if we accept the premise that Hammond postulate is applicable to protein folding, the inference that the global structure of the TSE as being denatured-like for Φ_{F(internal)(T)} = 0, and native-like for Φ_{F(internal)(T)} = 1 is flawed since Φ values need not necessarily be restricted to 0 ≤ Φ ≤ 1 (Figure 21−figure supplement 2). Third, even if we summarily exclude those wild types that lead to anomalous Φ values as being unsuitable for Φ analysis, we still have a problem since even within the restricted set of wild types that yield 0 ≤ Φ ≤ 1, their magnitude depends on the definition of the wild type; consequently, for the same temperature, the degree of structure in the TSE relative to that in the DSE appears to increase as the definition of the wild type deviates from T_S (Figure 21−figure supplement 1). If we try to circumvent this interpretational problem by arguing that the “inference of the structure of the TSE” is always relative to the residual structure in the DSE, and that changing the definition of what constitutes the wild type will invariably affect Φ values, then we can’t really say much about the structure of the TSE without first solving the structure of the DSE. Fourth, even if through a judicious combination of various structural and biophysical methods (residual dipolar couplings, paramagnetic relaxation enhancement, small angle X-ray scattering, single molecule spectroscopy etc.), and computer simulation, we are able to determine the residual structure in the DSE,^93-96 the structural interpretation of Φ values leads to physically unrealistic scenarios. For example, inspection of Figure 21A shows that around room temperature (298 K) Φ_{F(internal)(T)} ≈ 0.18. A canonical interpretation of this number implies that the global structure of the TSE is very similar to that of the DSE. However, inspection of Figure 2−figure supplement 1A shows that the denatured conformer has buried ∽70% of the total SASA to reach the TSE (i.e., advanced by about 70% along the SASA-RC). Similarly, inspection of Figure 5A shows that ΔG_TS-D(T) = 2.6 kcal.mol^-1 at 298 K (note that this is not a small number that can be ignored since ΔG_D-N(T) = 2.1 kcal.mol^-1 at 298 K). Further, we have shown earlier in the section on the “Inapplicability of the Hammond postulate to protein folding,” that even when two reaction-states have identical SASA, Gibbs energies, enthalpies, and entropies, there need not necessarily have identical structure. Thus, the question is: How can we conclude with any measure of certainty that the global structure of the TSE is very similar to that of the DSE at 298 K when they have such a large difference in SASA, and a substantial difference in Gibbs energy? To illustrate why it is difficult to rationalize the theoretical basis of Φ analysis, it is instructive to directly examine the ratio of the Gibbs activation energies and the difference in Gibbs energy between the ground states (Figure 21−figure supplement 8). It is immediately apparent that the ratios are a complex function of temperature; and although we can readily provide an explanation for the particular features of these complex dependences, it is difficult to see how subtracting reference energies from the numerator and denominator of the ratios ΔG_TS-D(T)/ΔG_N-D(T) and ΔG_TS-N(T)/ΔG_D-N(T) allows us to divine the structure of the TSE to a near-atomic resolution. This is once again readily apparent from the complex non-linear relationship between equilibrium stability and the rate constants (Figure 21−figure supplement 9).

Figure 21-figure supplement 2 Φ values can be greater than unity, negative or zero depending on the definition of the reference state.

(A) Φ_{F(internal)(T)} calculated by defining the protein at T_ω as the wild type. (B) Φ_{U(internal)(T)} calculated by defining the protein at T_ω as the wild type. Although the mathematical stipulation that Φ_{F(internal)(T)} + Φ_{U(internal)(T)} = 1 for a two-state system is satisfied for all temperatures, Φ values for folding and unfolding are not restricted to the canonical range of 0 ≤ Φ ≤ 1 when the protein at T_ω is defined as the reference or the wild type. Note that the curves are undefined at T_ω.

Figure 21-figure supplement 3 Comparison of Leffler β_G(T) and Fersht Φ_{(internal)(T)} when the protein at T_S is defined as the wild type.

(A) β_G(fold)(T) is almost identical to Φ_{F(internal)(T)} around the temperature of maximum stability, but as the temperature deviates from T_S, β_G(fold)(T) increases far more steeply than Φ_{F(internal)(T)}, such that for T ≠T_S we have β_G(fold)(T) > Φ_{F(internal)(T)}. (B) β_G(unfold)(T) is almost identical to Φ_{U(internal)(T)} around the temperature of maximum stability, but as the temperature deviates from T_S, β_G(unfold)(T) decreases far more steeply than Φ_{U(internal)(T)}, such that for T ≠T_S we have β_G(unfold)(T) < Φ_{U(internal)(T)}.

Figure 21-figure supplement 4 Comparison of the Leffler β_G(T) and Fersht Φ_{(internal)(T)} when the protein at T_m is defined as the wild type.

(A) β_G(fold)(T) < Φ_{F(internal)(T)} for T_c < T < T_m and β_G(fold)(T) > Φ_{F(internal)(T)} for T < T_c and T > T_m. (B) β_G(unfold)(T) > Φ_{U(internal)(T)} for T_c < T < T_m and β_G(unfold)(T) < Φ_{U(internal)(T)} for T < T_c and T > T_m. Note that Φ_{F(internal)(T)} and Φ_{U(internal)(T)} are undefined for T_c and T_m (green pointers).

Figure 21-figure supplement 5 Transformation of ΔGTS-D(T) and ΔG_N-D(T)to generateΦ_{F(internal)(T)} when the protein at T_S is defined as the wild type.

(A) The transformation ΔG_TS-D(T) − ΔG_{TS-D(T_S)} (the numerator in Eq. (44)) lowers the ΔG_TS-D(T) function such that the value of ΔΔG_{TS-D(T-T_S)} is zero at the reference temperature. (B) The transformation ΔG_N-D(T) – ΔG_{N-D(T_S)} (the denominator in Eq. (44)) raises the ΔG_N-D(T) function such that the value of ΔΔG_{N-D(T-T_S)} is zero at the reference temperature. The unmodified and the transformed curves are shown in black and red, respectively.

Figure 21-figure supplement 6 Transformation of ΔG_TS-N(T) and ΔG_D-N(T) to generate Φ_{U(internal)(T)} when the protein at T_S is defined as the wild type.

(A) The transformation ΔG_{TS-N(T_S)} − ΔG_TS-N(T) (the numerator in Eq. (45)) flips the ΔG_TS-N(T) function vertically and concomitantly shifts it along the ordinate such that the value of ΔΔGTS-N(TS-T) at the reference temperature is zero. (B) The transformation ΔG_D-N(TS) – ΔG_DN(T) (the denominator in Eq. (45)) flips the ΔG_D-N(T) function vertically and concomitantly shifts it along the ordinate such that the value of ΔΔG_D-N(TS-T) at the reference temperature is zero. The unmodified and the transformed curves are shown in black and red, respectively.

Figure 21-figure supplement 7 An overlay of transformed curves and Φ_{(internal)(T)} when the protein at T_S is defined as the wild type.

(A) Dividing ΔΔG_{TS-D(T-T_S)} by ΔΔG_{N-D(T-T_S)} generates Φ_{F(internal)(T)} with its minimum at T_S. (B) Dividing ΔΔG_{TS-N(T_S-T)} by ΔΔG_{D-N(T_S-T)} generates Φ_{U(internal)(T)} with its maximum at T_S. Note that the dimensions of the ordinate apply only to the red and the blue curves since Φ_{(internal)(T)} is dimensionless.

Figure 21-figure supplement 8 Temperature-dependence of the ratio of the Gibbs activation energies to stability.

(A) The ratio ΔG_TS-D(T)/ΔG_N-D(T) is negative for T_c < T < T_m and positive for T < T_c and T < T_m. (B) The ratio ΔG_TS-N(T)/ΔG_D-N(T) is positive for T_c < T < T_m and negative for T < T_c and T > T_m. The vertical asymptotes are a consequence of ΔG_D-N(T) = −ΔG_N-D(T) approaching zero as the temperature approaches T_c and T_m. Note that the ordinate is dimensionless, and that (ΔG_TS-D(T)/ΔG_N-D(T)) + (ΔG_TS-N(T)/ΔG_D-N(T)) = 1 for a two-state system.

Figure 21-figure supplement 9 The complex non-linear relationship between the rate constants and the difference in Gibbs energies between the ground states.

(A) k_f(T) vs the Gibbs energy of folding at equilibrium. The slope of this plot equals −k_f(T) ΔH_TS-D(T)/ΔS_TS-D(T)/ΔS_N-D(T)RT². (B) k_u(T) vs the Gibbs energy of unfolding at equilibrium. The Marcus-inverted-regimes which occur at very low and high temperatures are towards the extreme left. The slope of this plot is given by −k_u(T) ΔH_TS-D(T)/ΔS_TS-D(T)/ΔS_N-D(T)RT². (C) Natural logarithm of k_f(T) vs the Gibbs energy of folding at equilibrium (slope = -ΔH_TS-D(T)/ΔS_N-D(T)RT²). (D) Natural logarithm of k_u(T) vs the Gibbs energy of unfolding at equilibrium (slope = -ΔH_TS-D(T)/ΔS_N-D(T)RT²). The abscissae for plots A and C, and plots B and D are identical. The colour-code is identical for all the four plots.

Figure 21 Temperature-dependence of Φ values when the protein at T_S is defined as the wild type.

(A) Temperature-dependence of Φ_{F(internal)(T)}. (B) Temperature-dependence of Φ_{U(internal)(T)}. The red pointers indicate the extrema of the functions. The discontinuities in the curves which must occur at T_S have been removed by mathematically manipulating Eqs. (42) and (45) (manipulated equations not shown). Nevertheless, Φ_{F(internal)(T)} and Φ_{U(internal)(T)} are undefined at T_S (i.e., the curves have holes at T_S which is not obvious). Note that the mathematical stipulation that Φ_{F(internal)(T)} + Φ_{U(internal)(T)} = 1 for a two-state system is satisfied for all temperatures.

To further illuminate the difficulty in rationalizing the Φ-value procedure, it is instructive to apply Eqs. (42) and (45) to treat enthalpies. Thus, for the partial folding (D ⇌ [TS]) and unfolding (N ⇌ [TS]) reactions we have Where the parameters Φ_{H_F(internal)(T)} and Φ_{H_U(internal)(T)} are the “enthalpic analogues” of Φ_{F(internal)(T)} and Φ_{U(internal)(T)}, respectively (the subscript “H” indicates we are using enthalpy instead of Gibbs energy), when the protein at the temperature T_S is defined as the wild type. Now, if we apply an analogous version of the canonical interpretation given by Fersht and coworkers, it implies that when Φ_{H_F(internal)(T)} = 0, the enthalpy of the TSE is perturbed to the same extent as that of the DSE upon a perturbation in temperature; and when Φ_{H_F(internal)(T)} = 1, it implies that the enthalpy of the TSE is perturbed to the same extent as that of the NSE. It is easy to see that just as Φ_{F(internal)(T)} and Φ_{U(internal)(T)} are the Fersht-analogues of the Leffler β_G(fold)(T) and β_G(unfold)(T), respectively (see entropic RC), the parameters Φ_{H_F(internal)(T)} and Φ_{H_U(internal)(T)} are similarly the Fersht-analogues of the Leffler β_H(fold)(T) and β_H(unfold)(T), respectively (see heat capacity RC).

Inspection of Figure 22 and its supplements immediately demonstrates that the same anomalies that prevent a straightforward structural interpretation of Φ_{F(internal)(T)} and Φ_{U(internal)(T)} are also emerge if we try to assign structure to their enthalpic analogues, Φ_{H_F(internal)(T)} and Φ_{H_U(internal)(T)}. First, although the algebraic sum of Φ_{H_F(internal)(T)} and Φ_{H_U(internal)(T)} is unity for all temperatures, they need not independently be restricted to a canonical range of 0 ≤ Φ ≤ 1 (Figure 22). Second, the magnitude of Φ_{HF(internal)(T)} and Φ_{H_U(internal)(T)} are dependent on the definition of the wild type (Figure 22−figure supplement 1). Third, changing the definition of the wild type has a dramatic effect on the relationship between the Leffler β_H(T) and its analogue, the Fersht Φ_{H(internal)(T)}. Consequently, the question of whether Leffler β_H(T) underestimates or overestimates structure is dependent on how we analyse the system (Figure 22−figure supplements 2 and 3). Fourth, just as the temperature-dependent position of the TSE relative to the ground states depends on the choice of the RC (Figure 20), we see that Φ_{(internal)(T)} and its enthalpic analogue, Φ_{H(internal)(T)}, change at different rates upon a perturbation in temperature (Figure 22−figure supplement 4). The difficulty in rationalizing how subtracting reference values from the numerator and the denominator of Eqs. (42) and (49) can yield residue-level information is once again apparent from the complex dependence of the ratios ∂ ln k_f(T)/∂ ln K_N-D(T) = ΔH_TS-D(T)/ΔH_N-D(T) and ∂ ln k_u(T)/∂ ln K_D-N(T) = ΔH_TS-N(T)/ΔH_D-N(T) on temperature (Figure 22−figure supplement 5).

Figure 22-figure supplement 1 The magnitude of Φ_{H(internal)(T)} is dependent on the definition of the wild type.

(A) A comparison of Φ_{H_F(internal)(T)} calculated using proteins at T_S and T_m as the wild types. (B) A comparison of Φ_{H_U(internal)(T)} calculated using proteins at T_S and T_m as the wild types. For the blue curves we have ΔH_(wt) ≡ ΔH_(TS), and for the red curves, we have ΔH_(wt) ≡ ΔH_{(T_m)}. This notation applies to both equilibrium and activation enthalpies. The blue curves are undefined (0/0) at T_S, and the red curves are undefined at T_c and T_m. This figure is the enthalpic equivalent of Figure 21−figure supplement 1.

Figure 22-figure supplement 2 Comparison of the Leffler β_H(T) and Φ_{H(internal)(T)} when the protein at T_S is defined as the wild type.

(A) β_H(fold)(T) is almost identical to Φ_{H_F(internal)(T)} around the temperature of maximum stability, but as the temperature deviates from T_S, β_H(fold)(T) increases far more steeply than Φ_{H_F(internal)(T)}, such that for T ≠T_S we have β_H(fold)(T) > Φ_{H_F(internal)(T)}. (B) β_H(unfold)(T) is almost identical to Φ_{H_U(internal)(T)} around the temperature of maximum stability, but as the temperature deviates from T_S, β_H(unfold)(T) decreases far more steeply than Φ_{H_U(internal)(T)}, such that for T ≠T_S we have β_H(unfold)(T) < Φ_{H_U(internal)(T)}. Note that the parameters Φ_{H_F(internal)(T)} and Φ_{H_U(internal)(T)} are the Fersht-analogues of the Leffler β_H(fold)(T) and β_H(unfold)(T), respectively (see heat capacity RC). Consequently, this figure is analogous to a comparison of Leffler β_G(fold)(T) and Fersht Φ_{F(internal)(T)}, and Leffler β_G(unfold)(T) and Fersht Φ_{U(internal)(T)} (see Figure 21−figure supplement 3).

Figure 22-figure supplement 3 Comparison of the Leffler β_H(T) and Φ_{H(internal)(T)} when the protein at T_m is defined as the wild type.

Changing the definition of the wild type from T_S (see previous figure) to T_m has a dramatic effect on the relationship between the Leffler-β_H(T) and the Fersht-Φ_{H(internal)(T)}. (A) β_H(fold)(T) < Φ_{H_F(internal)(T)} for ∽248 K < T < T_m, β_H(fold)(T) > Φ_{H_F(internal)(T)} for T < ∽248 K and T > T_m, and identical when T = ∽248 K. (B) β_H(unfold)(T) > Φ_{H_U(internal)(T)} for ∽248 K < T < T_m, β_H(unfold)(T) < Φ_{H_U(internal)(T)} for T < ∽248 K and T > T_m, and identical when T = ∽248 K. Note that Φ_{H_F(internal)(T)} and Φ_{H_U(internal)(T)} are undefined at T_m and the discontinuity in the functions is apparent upon close inspection. This figure is the enthalpic equivalent of Figure 21−figure supplement 4.

Figure 22-figure supplement 4 Comparison of Φ_{(internal)(T)} and Φ_{H(internal)(T)} when the protein at T_S is defined as the wild type.

(A) The normalized Gibbs parameter Φ_{F(internal)(T)} is almost identical to the normalized enthalpic parameter Φ_{H_F(internal)(T)} around the temperature of maximum stability, but as the temperature deviates from T_S, Φ_{H_F(internal)(T)} increases far more steeply than Φ_{F(internal)(T)}, such that for T ≠T_S we have Φ_{H_F(internal)(T)} > Φ_{F(internal)(T)}. (B) The normalized Gibbs parameter Φ_{U(internal)(T)} is almost identical to the normalized enthalpic parameter Φ_{H_U(internal)(T)} around the temperature of maximum stability, but as the temperature deviates from T_S, Φ_{H_U(internal)(T)} decreases far more steeply than Φ_{U(internal)(T)}, such that for T ≠T_S we have Φ_{H_F(internal)(T)} < Φ_{U(internal)(T)}. Since Φ_{(internal)(T)} and Φ_{H(internal)(T)} are the Fersht-analogues of Leffler β_G(T) and β_H(T), respectively, this figure is analogous to comparing the temperature-dependent position of the TSE along the entropic and heat capacity RCs (i.e., a comparison of the temperature-dependence of β_G(T) and β_H(T); see Figure 17).

Figure 22-figure supplement 5 Temperature-dependence of the ratio of the activation enthalpies to equilibrium enthalpies.

(A) The ratio ΔH_TS-D(T)/ΔH_N-D(T) is positive for T_α ≤ T < T_H and T_H(TS-D) < T ≤ T_ω, negative for T_H < T < T_H(TS-D) and zero at T_H(TS-D). (B) The ratio ΔH_TS-N(T)/ΔH_D-N(T) is positive for T_S(α) < T < T_H(TS-N) and T_H < T < T_S(ω), negative for T_α ≤ T < T_S(α), T_H(TS-N) < T < T_H, and T_S(ω) < T ≤ T_ω, and zero at T_S(α), T_H(TS-N), and T_S(α). The vertical asymptotes are a consequence of ΔH_D-N(T) = −ΔH_N-D(T) approaching zero as T→T_H. Note that the ordinate is dimensionless, and that (ΔH_TS-D(T)/ΔH_N-D(T))+(ΔH_TS-N(T)/ΔH_D-N(T))=1 for a two-state system.

Figure 22 Temperature-dependence of Φ_{H(internal)(T)} when the protein at T_S is defined as the wild type.

(A) Temperature-dependence of Φ_{H_F(internal)(T)}. (B) Temperature-dependence of Φ_{H_U(internal)(T)}. Note that both these curves are undefined at T_S. Although the algebraic sum of Φ_{H_F(internal)(T)} and Φ_{H_U(internal)(T)} is unity for all temperatures, they need not necessarily be are not restricted to a canonical range of 0 ≤ Φ ≤ 1. The parameters Φ_{H_F(internal)(T)} and Φ_{H_U(internal)(T)} are the “enthalpic analogues” of Φ_F(internal)(T) and Φ_U(internal)(T), respectively (the subscript “H” indicates we are using enthalpy instead of Gibbs energy). Consequently, this figure is the enthalpic equivalent of Figure 21.

Comparison of theoretical and experimental Φ-values obtained from structural perturbation across 31 two-state systems

Given that the framework of Φ-value analysis was primarily developed to be used in conjunction with structural rather than temperature perturbation, and despite its anomalies has been used extensively for more than twenty years to divine the structures of the TSEs of not just globular but also membrane proteins, it is imperative to demonstrate that the notion that the structure of the TSE cannot be inferred from Φ-values is also valid for structural perturbation.^97-101 Although a detailed reappraisal is beyond the scope of this article and will be presented elsewhere, because we have questioned the validity of Φ analysis, one is compelled to provide some justification in this article.

Consider the wild type of a hypothetical two-state folder whose equilibrium stability and the mean length of the RC at constant temperature, pressure and solvent conditions are given by ΔG_D-N(T) = 6 kcal.mol^-1 and m_D-N = 2 kcal.mol^-1.M^-1, respectively. Although not necessarily true and addressed elsewhere, to limit the number of hypothetical scenarios to a manageable number, we will assume that the force constants of the DSE and the NSE-parabolas of the wild type and all its mutants are given by α = 1 M².mol.kcal^-1 and ω = 30 M².mol.kcal^-1. The effect of single point mutations on the wild type may be classified into a total of five unique scenarios (Figure 23A).

Case I (Quadrant x2): The introduced mutation causes a concomitant decrease in both the stability and the mean length of the RC (i.e., ΔG_D-N(T)(wt) > ΔG_D-N(T)(mut) and m_D-N(wt) > m_D-N(mut)). This is equivalent to the introduced mutation causing the separation between the vertices of the DSE and the NSE-parabolas along the abscissa and ordinate to decrease (Figure 23−figure supplement 1A).

Figure 23-figure supplement 1 Parabolic Gibbs energy curves to illustrate the effect of concomitant changes in ΔG_D-N(T) and m_D-N on the position of the TSE along the abscissa and ordinate.

The parabolas corresponding to the DSE and NSE of the wild type are shown in black in all the four plots. The mutant DSE-parabolas are shown in blue and red while the mutant NSEparabolas are shown in magenta. (A) The introduced mutation causes a concomitant decrease in ΔG_D-N(T) and the mean-length of the RC. (B) The introduced mutation causes a decrease in ΔG_D-N(T) but an increase in the mean-length of the RC. (C) The introduced mutation stabilizes the mutant but causes a concomitant decrease in the mean-length of the RC. (D) The introduced mutation stabilizes the protein but concomitantly causes an increase in the mean-length of the RC. The curvatures of all the DSE-parabolas (α = 1 M².mol.kcal^-1) and all the NSE-parabolas (ɷ = 30 M².mol.kcal^-1) are identical.

Case II (Quadrant y1): The introduced mutation causes a decrease in stability but concomitantly causes an increase in the mean length of the RC (i.e., ΔG_D-N(T)(wt) > ΔG_D-N(T)(mut) and m_D-N(wt) < m_D-N(mut)). This is equivalent to the mutation causing a decrease in the separation between the vertices of the DSE and the NSE-parabolas along the ordinate, but an increase along the abscissa (Figure 23−figure supplement 1B).

Case III (Quadrant x1): The introduced mutation leads to an increase in stability but concomitantly causes a decrease in the mean length of the RC (i.e., ΔG_D-N(T)(wt) < ΔG_D-N(T)(mut) and m_D-N(wt) > m_D-N(mut)). This is equivalent to the mutation causing an increase in the separation between the vertices of the DSE and the NSE-parabolas along the ordinate, but a decrease along the abscissa (Figure 23−figure supplement 1C).

Case IV (Quadrant y2): The introduced mutation leads to a concomitant increase in both the stability and the mean length of the RC (i.e., ΔG_D-N(T)(wt) < ΔG_D-N(T)(mut) and m_D-N(wt) < m_D-N(mut)). This is equivalent to the mutation causing an increase in the separation between the vertices of the DSE and the NSE-parabolas along the ordinate and the abscissa (Figure 23−figure supplement 1D).

Case V: The introduced mutation leads to a change in stability but has no effect on the mean length of the RC (m_D-N(wt) = m_D-N(mut)). This is equivalent to the mutation causing an increase or a decrease in the separation between the vertices of the DSE and the NSE-parabolas along the ordinate, but the separation along the abscissa is invariant (Figure 23−figure supplement 2).

Figure 23-figure supplement 2 Parabolic Gibbs energy curves to illustrate the effect of a change in ΔG_D-N(T) on the position of the TSE along the abscissa and ordinate.

The wild type DSE and NSE-parabolas are shown in black, the destabilized mutants are shown in red and blue, and the stabilized mutants are shown in green and magenta. As the protein is increasingly destabilized the curve-crossing along the RC shifts closer to the vertex of the NSE-parabola, and can be due to a stabilized DSE or a destabilized NSE, or both. Conversely, as the protein is increasingly stabilized, the curve-crossing along the RC shifts away from the vertex of NSE-parabola and this can be due to a destabilized DSE or a stabilized NSE, or both. The force constant for the DSE-parabola is α = 1 M².mol.kcal^-1. The curvatures of all the NSE-parabolas are identical (ɷ = 30 M².mol.kcal^-1).

Figure 23 Comparison of the theoretical and experimental Φ_F(T) values (structural perturbation).

(A) Theoretical limits of Φ_F(T) values according to parabolic approximation where all the 1463 theoretical mutants have the following common parameters: α = 1 M².mol.kcal^-1 (DSE-parabola); ω = 30 M².mol.kcal^-1 (NSE-parabola). The wild type was arbitrarily chosen to be the one with parameters ΔG_D-N(T)= 6 kcal.mol^-1 and m_D-N = 2 kcal.mol^-1.M^-1. The legend indicates the variation in m_D-N values in kcal.mol^-1.M^-1. The quadrants labelled x1 and x2 are for mutants whose m_D-N < m_D-N(wt) (i.e., a contraction of the RC) and the quadrants labelled y1 and y2 are for those mutants whose m_D-N > m_D-N(wt) (i.e., an expansion of the RC). Close inspection shows that for those mutants whose stabilities have changed but not their m_D-N values, the Φ_F(T) values are positive but very close to zero (shown in cyan). Theoretical Φ_F values corresponding to ΔΔG_{D-N(wt-mut)(T)} = 0.0 ± 0.4 kcal.mol^-1 have been excluded for clarity. This corresponds to about 6.7% error on the wild type ΔG_D-N(wt)(T). (B) An overlay of theoretical Φ_F(T) and experimental Φ_F(T) values in water for 1035 mutants from 31 two-state systems. Data used to calculate the experimental Φ_F(T) values were taken from published literature (detailed information is given elsewhere). The vertical asymptotes are a consequence of ΔΔG_{D-N(wt-mut)(T)} approaching zero.

In summary, what we done is taken a pair of intersecting parabolas of differing curvature (ω > α), and systematically varied the separation between their vertices along the abscissa (m_D-N) and ordinate (ΔG_D-N(T)) without changing the curvature of the parabolas. Once this is done, we can calculate a priori the position of the curve-crossings relative to the vertex of the DSE-parabola along the abscissa (i.e., m_TS-D(T); Eq. (1)) and ordinate (i.e., ΔG_TS-D(T); Eq. (3)). Once the ΔG_TS-D(T) values for all combinations of ΔG_D-N(T) and m_D-N are obtained (each combination is equivalent to a point mutation), Φ_F(T) values can be readily calculated using Eq. (50) by arbitrarily choosing one particular combination of ΔG_D-N(T) (= 6 kcal.mol^-1) and m_D-N (= 2 kcal.mol^-1.M^-1) as the wild type.

Figure 23A which has been generated by plotting the theoretical Φ_F(T) values as a function of ΔΔG_{D-N(wt-mut)(T)} leads to two important conclusions: (i) Φ_F(T) values are not restricted to 0 ≤ Φ ≤ 1, and that the perceived unusualness of anomalous or non-classical Φ values is a consequence of flawed canonical limits; and (ii) the magnitude of Φ_F(T) values decrease as the difference in stability between the wild type and the mutant proteins increase, and at once debunks the idea that one must use an arbitrary ΔΔG_{D-N(wt-mut)(T)} cut-off (± 0.6 kcal.mol^-1 according to the Fersht lab, and ± 1.7 kcal.mol^-1 according to Sanchez and Kiefhaber) for Φ_F(T) values to be interpretable.^98,102 While it is true that Φ values would be error prone when |ΔΔG_{D-N(wt-mut)(T)}| is less than the error with which one can determine ΔG_D-N(T) of both the wild type and the mutant proteins (typically about ± 5-10% of ΔG_D-N(T)),¹⁰³ the increase in the magnitude of Φ_F(T) values when ΔΔG_{D-N(wt-mut)(T)} approaches zero (the vertical asymptotes) is a mathematical certainty and not because of error as is commonly argued. Nevertheless, because these conclusions are based on the results of a model that is purely hypothetical, they would naturally be meaningless without experimental validation. Thus, as a test of the hypothesis, experimental Φ_F(T) values in water were calculated according to Eq. (51) using published kinetic data of a total of 1064 proteins (1035 mutants + 29 wild types) from 31 two-state systems (details of the systems analysed will be provided elsewhere).

The remarkable agreement between theoretical prediction and experimental Φ_F(T) values is immediately apparent from an overlay of the said datasets (Figure 23B), and serves as arguably one of the most rigorous tests of the hypothesis for the following reasons: (1) The space enclosed by the curves in Figure 23A is complex and restricted. Therefore, if the experimental Φ_F(T) values fall within this restricted theoretical space it would be highly unlikely for it to be purely due to some dramatic coincidence. (2) The sample size of experimental dataset is sufficiently large (1035 mutations), and the two-state systems investigated include α, β, and α/β proteins (note that α and β refer to secondary structure in this context and not to the force constant of the DSE or the Tanford beta value, respectively), with size ranging from 37 to 107 residues. (3) The published kinetic data used to calculate experimental Φ_F(T) values were acquired by various labs under varying solvent conditions (buffers, co-solvents and pH; denaturant is either guanidine hydrochloride or urea) and temperature (as low as 278 K to as high as 301.16 K), over a period of about two decades using a variety of experimental methods, including infrared laser-induced and electrical discharge temperature-jump relaxation measurements, stopped flow and manual mixing experiments, and lineshape analysis of exchange-broadened NMR resonances. These results, including those on the temperature-dependence of Φ_F(T) values lead to an important conclusion: Because the canonical scale itself has no basis, Φ-value-based interpretation of the structure of the transiently populated protein reaction-states is dubious.

Concluding Remarks

Although the temperature-dependent behaviour of FBP28 WW was analysed in great detail using the theory developed in the Papers I and II, and novel conclusions have been drawn, this is by no means sufficient since we have barely addressed the physical chemistry underlying the effect of temperature on the Gibbs energies, the enthalpies, the entropies, and the heat capacities of activation for folding and unfolding. These aspects will be dealt with in the accompanying articles. Further, there is a good reason why we have given little importance to the actual values of the reference temperatures and instead focussed on what they actually mean and how they relate to each other. Although the remarks in Table 1 are valid for all reference temperatures, except for the values of the equilibrium reference temperatures (T_c, T_H, T_S, and T_m), the values for the rest of them can change depending on the values of the force constants. However, what will not change is the inter-relationship between them. The nature of this limitation will be addressed when the mechanism of action of denaturants is investigated.

Methods

The temperature-dependence of ΔG_D-N(T) of FBP28 WW wild type (Figure 1) was simulated according to Eq. (A1) using T_m = 337.2 K, ΔH_{D-N(T_m)} = 26.9 kcal.mol^-1 and ΔC_pD-N = 417 cal.mol^-1.K^-1 (Table 1 in Petrovich et al., 2006).⁴ The values of k⁰ = 2180965 s^-1, α = 7.594 M².mol.kcal^-1, ω = 85.595 M².mol.kcal^-1, and m_D-N = 0.82 kcal.mol^-1.M^-1 were extracted from the chevron of FBP28 WW (acquired at 283.16 K in 20 mM 3-[morpholino] propanesulfonic acid, ionic strength adjusted to150 mM with Na₂SO₄, pH 6.5) by fitting it to a modified chevron-equation using non-linear regression as described in Paper-I. The data required to simulate the chevron (k_f(H₂O)(T), k_u(H₂O)(T), m_TS-D(T) and m_TS-N(T)) were taken from Table 4 in Petrovich et al., 2006.⁴ Once the parameters ΔH_{D-N(T_m)}, T_m, ΔC_pD-N, m_D-N, the force constants α and ω, and k⁰ are known, the left-hand side of all the equations in this article may be readily calculated for any temperature. Note that the spring constants, k⁰, m_D-N, and ΔC_pD-N are temperature-invariant.

Competing Financial Interests

The author declares no competing financial interests.

Appendix

The temperature-dependence of ΔG_D-N(T), ΔH_D-N(T), and ΔS_D-N(T) functions

The temperature-dependence of the change in Gibbs energy, enthalpy and entropy of two-state systems upon unfolding at equilibrium are given by⁶

Where ΔH_D-N(T), ΔH_{D-N(T_m)} and ΔS_D-N(T), ΔS_{D-N(T_m)} denote the equilibrium enthalpies and entropies of unfolding, respectively, at any given temperature, and at the midpoint of thermal denaturation (T_m), respectively, for a given two-state folder under defined solvent conditions. The temperature-invariant and the temperature-dependent difference in heat capacity between the DSE and NSE are denoted by ΔC_pD-N and ΔC_pD-N(T), respectively.

The first derivatives of m_TS-D(T), m_TS-N(T), β_T(fold)(T) and β_T(unfold)(T) with respect to temperature

The first derivative of m_TS-D(T) is given by

Because β_T(fold)(T) = m_TS-D(T)/m_D-N, we also have

Since ∂m_TS-D(T)/∂T and ∂β_T(fold)(T)/∂T are physically undefined for φ < 0, their algebraic sign at any given temperature is determined by the ln(T/T_S) term. This leads to three scenarios: (i) for T < T_S we have ∂m_TS-D(T)/∂T > 0 and ∂β_T(fold)(T)/∂T > 0; and (iii) for T = T_S we have ∂m_TS-D(T)/∂T = 0 and ∂β_T(fold)(T)/∂T = 0.

Because m_TS-N(T) = (m_D-N − m_TS-D(T)) for a two-state system, and β_T(unfold)(T) = m_TS-N(T)/m_D-N, we have

Eqs. (A6) and (A7) once again lead to three scenarios: (i) for T < T_S we have ∂m_TS-N(T)/∂T > 0 and ∂β_T(unfold)(T)/∂T > 0; (ii) for T > T_S we have ∂m_TS-N(T)/∂T < 0 and ∂β_T(unfold)(T)/∂T > 0; and (iii) for T = T_S we have ∂m_TS-N(T)/∂T = 0 and ∂β_T(unfold)(T)/∂T = 0.

The second derivatives of m_TS-D(T) and m_TS-N(T) with respect to temperature

Differentiating Eq. (A4) with respect to temperature gives

Simplifying Eq. (A8) yields

Similarly, we may show that

Expression for the temperature-dependence of the observed rate constant

The observed rate constant k_obs(T) for a two-state system is the sum of k_f(T) and k_u(T).¹⁰⁴ Therefore, we can write

Substituting Eqs. (5) and (6) in (A11) gives

Expressions to demonstrate why the extrema of Φ_{F(internal)(T)} and Φ_{U(internal)(T)} must occur at T_S

Differentiating Eq. (44) with respect to temperature gives where the protein at the temperature T_Ref is by definition the wild type protein. Because ΔS_N-D(T) and ΔS_TS-D(T) are both zero at T_S, irrespective of T_Ref, the derivative of Φ_{F(internal)(T)} will be zero at T_S. Similarly, we can show by differentiating Eq. (45) that

Once again, since ΔS_D-N(T) and ΔS_TS-N(T) are both zero at T_S, irrespective of T_Ref, the derivative of Φ_{U(internal)(T)} will be zero at T_S.

Footnotes

Vinkensteynstraat 128, 2562 TV, Den Haag, Netherlands, robert.sade{at}gmail.com

REFERENCES

↵
Marcus, R. A. Chemical and Electrochemical Electron-Transfer Theory. Annu. Rev. Phys. Chem. 15, 155, doi:10.1146/annurev.pc.15.100164.001103 (1964).
OpenUrl CrossRef Web of Science
Sade, R. S. Analysis of Two-State Folding Using Parabolic Approximation I: Hypothesis. bioRxiv, doi:10.1101/036491 (2016).
OpenUrl Abstract/FREE Full Text
↵
Sade, R. S. Analysis of Two-State Folding Using Parabolic Approximation II: Temperature-Dependence. bioRxiv, doi:10.1101/037341 (2016).
OpenUrl CrossRef
↵
Petrovich, M., Jonsson, A. L., Ferguson, N., Daggett, V. & Fersht, A. R. Φ-Analysis at the Experimental Limits: Mechanism of Beta-Hairpin Formation. J. Mol. Biol. 360, 865–881, doi:10.1016/j.jmb.2006.05.050 (2006).
OpenUrl CrossRef PubMed Web of Science
↵
Tanford, C. Protein Denaturation. C. Theoretical Models for the Mechanism of Denaturation. Adv. Protein Chem. 24, 1–95, doi:10.1016/S0065-3233(08)60241-7 (1970).
OpenUrl CrossRef PubMed
↵
Becktel, W. J. & Schellman, J. A. Protein Stability Curves. Biopolymers 26, 1859–1877, doi:10.1002/bip.360261104 (1987).
OpenUrl CrossRef PubMed Web of Science
↵
Privalov, P. L. Thermodynamic Problems of Protein Structure. Annu. Rev. Biophys. Biophys. Chem. 18, 47–69, doi:10.1146/annurev.bb.18.060189.000403 (1989).
OpenUrl CrossRef PubMed Web of Science
↵
Otzen, D. E. & Oliveberg, M. Correspondence between anomalous m- and ΔC_p-values in protein folding. Protein Sci. 13, 3253–3263, doi:10.1110/ps.04991004 (2004).
OpenUrl CrossRef PubMed Web of Science
↵
Dimitriadis, G. et al. Microsecond folding dynamics of the F13W G29A mutant of the B domain of staphylococcal protein A by laser-induced temperature jump. Proc. Natl. Acad. Sci. U S A 101, 3809–3814, doi:10.1073/pnas.0306433101 (2004).
OpenUrl Abstract/FREE Full Text
↵
Taskent, H., Cho, J. H. & Raleigh, D. P. Temperature-Dependent Hammond Behaviour in a Protein-Folding Reaction: Analysis of Transition-State Movement and Ground-State Effects. J. Mol. Biol. 378, 699–706, doi:10.1016/j.jmb.2008.02.024 (2008).
OpenUrl CrossRef PubMed
Cellmer, T., Henry, E. R., Hofrichter, J. & Eaton, W. A. Measuring internal friction of an ultrafast-folding protein. Proc. Natl. Acad. Sci. U S A 105, 18320–18325, doi:10.1073/pnas.0806154105 (2008).
OpenUrl Abstract/FREE Full Text
↵
Sun, L., Noel, J. K., Sulkowska, J. I., Levine, H. & Onuchic, J. N. Connecting Thermal and Mechanical Protein (Un)folding Landscapes. Biophys. J. 107, 2941–2952, doi:10.1016/j.bpj.2014.10.021 (2014).
OpenUrl CrossRef PubMed
↵
Marcus, R. A. Theoretical Relations among Rate Constants, Barriers, and Brønsted Slopes of Chemical Reactions. J. Phys. Chem. 72, 891–899, doi:10.1021/j100849a019 (1968).
OpenUrl CrossRef
↵
Cohen, A. O. & Marcus, R. A. On the Slope of Free Energy Plots in Chemical Kinetics. J. Phys. Chem. 72, 4249–4256, doi:10.1021/j100858a052 (1968).
OpenUrl CrossRef
↵
Koeppl, G. W. & Kresge, A. J. Marcus Rate Theory and the Relationship between Brønsted Exponents and Energy of Reaction. Chem. Commun., 371–373, doi:10.1039/C39730000371 (1973).
OpenUrl CrossRef
↵
Rasmussen, B. F., Stock, A. M., Ringe, D. & Petsko, G. A. Crystalline ribonuclease A loses function below the dynamical transition at 220 K. Nature 357, 423–424, doi:10.1038/357423a0 (1992).
OpenUrl CrossRef PubMed Web of Science
↵
Kauzmann, W. The Nature of the Glassy State and the Behavior of Liquids at Low Temperatures. Chem. Rev. 43, 219–256, doi:10.1021/cr60135a002 (1948).
OpenUrl CrossRef Web of Science
Adam, G. & Gibbs, J. H. On the Temperature Dependence of Cooperative Relaxation Properties in Glass Forming Liquids. J. Chem. Phys. 43, 139–146, doi:10.1063/1.1696442 (1965).
OpenUrl CrossRef Web of Science
Goldstein, M. Viscous Liquids and the Glass Transition: A Potential Energy Barrier Picture. J. Chem. Phys. 51, 3728–3739, doi:10.1063/1.1672587 (1969).
OpenUrl CrossRef Web of Science
Debenedetti, P. G. & Stillinger, F. H. Supercooled liquids and the glass transition. Nature 410, 259–267, doi:10.1038/35065704 (2001).
OpenUrl CrossRef PubMed Web of Science
Dill, K. A. & Bromberg, S. Molecular Driving Forces – Statistical Thermodynamics in Chemistry and Biology. (Garland Science, 2003).
Dyre, J. Mysteries of the Glass Transition. Physics Today 61, 15, doi:10.1063/1.2835137 (2008).
OpenUrl CrossRef
↵
Bauer, T., Lunkenheimer, P. & Loidl, A. Cooperativity and the Freezing of Molecular Motion at the Glass Transition. Phys. Rev. Lett. 111, 225702, doi:10.1103/PhysRevLett.111.225702 (2013).
OpenUrl CrossRef PubMed
↵
Kresge, A. J. The Brønsted Relation - Recent Developments. Chem. Soc. Rev. 2, 475–503, doi:10.1039/CS9730200475 (1973).
OpenUrl CrossRef
↵
Nguyen, H., Jäger, M., Moretto, A., Gruebele, M. & Kelly, J. W. Tuning the free-energy landscape of a WW domain by temperature, mutation, and truncation. Proc. Natl. Acad. Sci. U S A 100, 3948–3953, doi:10.1073/pnas.0538054100 (2003).
OpenUrl Abstract/FREE Full Text
↵
Ferguson, N. et al. Rapid amyloid fiber formation from the fast-folding WW domain FBP28. Proc. Natl. Acad. Sci. U S A 100, 9814–9819, doi:10.1073/pnas.1333907100 (2003).
OpenUrl Abstract/FREE Full Text
↵
Thomsen, J. S. Logical Relations among the Principles of Statistical Mechanics and Thermodynamics. Phys. Rev. 91, 1263–1266, doi:10.1103/PhysRev.91.1263 (1953).
OpenUrl CrossRef
↵
Bryngelson, J. D., Onuchic, J. N., Socci, N. D. & Wolynes, P. G. Funnels, Pathways and the Energy Landscape of Protein Folding: A Synthesis. Proteins: Struct. Funct. Genet. 21, 167, doi:10.1002/prot.340210302 (1995).
OpenUrl CrossRef PubMed Web of Science
Oliveberg, M., Tan, Y. J. & Fersht, A. R. Negative activation enthalpies in the kinetics of protein folding. Proc. Natl. Acad. Sci. U S A 92, 8926–8929, doi:10.1073/pnas.92.19.8926 (1995).
OpenUrl Abstract/FREE Full Text
Scalley, M. L. & Baker, D. Protein folding kinetics exhibit an Arrhenius temperature dependence when corrected for the temperature dependence of protein stability. Proc. Natl. Acad. Sci. U S A 94, 10636–10640, doi:10.1073/pnas.94.20.10636 (1997).
OpenUrl Abstract/FREE Full Text
Chan, H. S. & Dill, K. A. Protein Folding in the Landscape Perspective: Chevron Plots and Non-Arrhenius Kinetics. Proteins: Struct. Funct. Genet. 30, 2–33, doi:10.1002/(SICI)1097-0134(19980101)30:1<2::AID-PROT2>3.0.CO;2-R (1998).
OpenUrl CrossRef PubMed Web of Science
↵
Ghosh, K., Ozkan, S. B. & Dill, K. A. The Ultimate Speed Limit to Protein Folding Is Conformational Searching. J. Am. Chem. Soc. 129, 11920–11927, doi:10.1021/ja066785b (2007).
OpenUrl CrossRef PubMed Web of Science
↵
Myers, J. K., Pace, C. N. & Scholtz, J. M. Denaturant m values and heat capacity changes: Relation to changes in accessible surface areas of protein unfolding. Protein Sci. 4, 2138–2148, doi:10.1002/pro.5560041020 (1995).
OpenUrl CrossRef PubMed Web of Science
↵
Robertson, A. D. & Murphy, K. P. Protein Structure and the Energetics of Protein Stability. Chem. Rev. 97, 1251–1268, doi:10.1021/cr960383c (1997).
OpenUrl CrossRef PubMed Web of Science
↵
Schellman, J. A. Temperature, Stability, and the Hydrophobic Interaction. Biophys. J. 73, 2960–2964, doi:10.1016/S0006-3495(97)78324-3 (1997).
OpenUrl CrossRef PubMed Web of Science
↵
Baldwin, R. L. Dynamic hydration shell restores Kauzmann’s 1959 explanation of how the hydrophobic factor drives protein folding. Proc. Natl. Acad. Sci. U S A 111, 13052–13056, doi:10.1073/pnas.1414556111 (2014).
OpenUrl Abstract/FREE Full Text
↵
Privalov, P. L. & Gill, S. J. Stability of Protein Structure and Hydrophobic Interaction. Adv. Protein Chem. 39, 191–234, doi:10.1016/S0065-3233(08)60377-0 (1988).
OpenUrl CrossRef PubMed Web of Science
↵
Gómez, J., Hilser, V. J., Xie, D. & Freire, E. The Heat Capacity of Proteins. Proteins: Struct. Funct. Bioinf. 22, 404–412, doi:10.1002/prot.340220410 (1995).
OpenUrl CrossRef PubMed Web of Science
↵
Shortle, D. The Expanded Denatured State: An Ensemble of Conformations Trapped in a Locally Encoded Topological Space. Adv. Protein Chem. 62, 1–23, doi:10.1016/S0065-3233(02)62003-0 (2002).
OpenUrl CrossRef PubMed Web of Science
↵
Bowler, B. E. Residual structure in unfolded proteins. Curr. Opin. Struct. Biol. 22, 4–13, doi:10.1016/j.sbi.2011.09.002 (2012).
OpenUrl CrossRef PubMed
↵
Richards, F. M. Areas, Volumes, Packing, and Protein Structure. Annu. Rev. Biophys. Bioeng. 6, 151–176, doi:10.1146/annurev.bb.06.060177.001055 (1977).
OpenUrl CrossRef PubMed Web of Science
Bernadó, P., Blackledge, M. & Sancho, J. Sequence-Specific Solvent Accessibilities of Protein Residues in Unfolded Protein Ensembles. Biophys. J. 91, 4536–4543, doi:10.1529/biophysj.106.087528 (2006).
OpenUrl CrossRef PubMed
↵
Gong, H. & Rose, G. D. Assessing the solvent-dependent surface area of unfolded proteins using an ensemble model. Proc. Natl. Acad. Sci. U S A 105, 3321–3326, doi:10.1073/pnas.0712240105 (2008).
OpenUrl Abstract/FREE Full Text
↵
Behe, M. J., Lattman, E. E. & Rose, G. D. The protein-folding problem: The native fold determines packing, but does packing determine the native fold? Proc. Natl. Acad. Sci. U S A 88, 4195–4199, doi:10.1073/pnas.88.10.4195 (1991).
OpenUrl Abstract/FREE Full Text
↵
Chothia, C. Principles that Determine the Structure of Proteins. Annu. Rev. Biochem. 53, 537–572, doi:10.1146/annurev.bi.53.070184.002541 (1984).
OpenUrl CrossRef PubMed Web of Science
↵
Miller, S., Janin, J., Lesk, A. M. & Chothia, C. Interior and Surface of Monomeric Proteins. J. Mol. Biol. 196, 641–656, doi:10.1016/0022-2836(87)90038-6 (1987).
OpenUrl CrossRef PubMed Web of Science
↵
Tan, Y. J., Oliveberg, M. & Fersht, A. R. Titration Properties and Thermodynamics of the Transition State for Folding: Comparison of Two-state and Multi-state Folding Pathways. J. Mol. Biol. 264, 377–389, doi:10.1006/jmbi.1996.0647 (1996).
OpenUrl CrossRef PubMed Web of Science
↵
Leffler, J. Parameters for the Description of Transition States. Science 117, 340–341, doi:10.1126/science.117.3039.340 (1953).
OpenUrl FREE Full Text
↵
Sanchez, I. E. & Kiefhaber, T. Non-linear rate-equilibrium free energy relationships and Hammond behavior in protein folding. Biophys. Chem. 100, 397–407, doi:10.1016/S0301-4622(02)00294-6 (2003).
OpenUrl CrossRef PubMed Web of Science
↵
Bilsel, O. & Matthews, C. R. Barriers in Protein Folding Reactions. Adv. Protein Chem. 53, 153–207, doi:10.1016/S0065-3233(00)53004-6 (2000).
OpenUrl CrossRef PubMed Web of Science
↵
Gloss, L. M. & Matthews, C. R. The Barriers in the Bimolecular and Unimolecular Folding Reactions of the Dimeric Core Domain of Escherichia Coli Trp Repressor are Dominated by Enthalpic Contributions. Biochemistry 37, 16000–16010, doi:10.1021/bi981694f (1998).
OpenUrl CrossRef PubMed
↵
Lee, A. L. & Wand, A. J. Microscopic origins of entropy, heat capacity and the glass transition in proteins. Nature 411, 501–504, doi:10.1038/35078119 (2001).
OpenUrl CrossRef PubMed Web of Science
Chandler, D. Interfaces and the driving force of hydrophobic assembly. Nature 437, 640–647, doi:10.1038/nature04162 (2005).
OpenUrl CrossRef PubMed Web of Science
↵
Ben-Naim, A. Myths and verities in protein folding theories: From Frank and Evans iceberg-conjecture to explanation of the hydrophobic effect. J. Chem. Phys. 139, 165105, doi:10.1063/1.4827086 (2013).
OpenUrl CrossRef
↵
Sturtevant, J. M. Heat capacity and entropy changes in processes involving proteins. Proc. Natl. Acad. Sci. U S A 74, 2236–2240, doi:10.1073/pnas.74.6.2236 (1977).
OpenUrl Abstract/FREE Full Text
↵
Lazaridis, T. & Karplus, M. Heat capacity and compactness of denatured proteins. Biophys. Chem. 78, 207–217, doi:10.1016/S0301-4622(99)00022-8 (1999).
OpenUrl CrossRef
↵
Frauenfelder, H. et al. A unified model of protein dynamics. Proc. Natl. Acad. Sci. U S A 106, 5129–5134, doi:10.1073/pnas.0900336106 (2009).
OpenUrl Abstract/FREE Full Text
Lewandowski, J. R., Halse, M. E., Blackledge, M. & Emsley, L. Direct observation of hierarchical protein dynamics. Science 348, 578–581, doi:10.1126/science.aaa6111 (2015).
OpenUrl Abstract/FREE Full Text
Cooper, A. Heat capacity of hydrogen-bonded networks: an alternative view of protein folding thermodynamics. Biophys. Chem. 85, 25–39, doi:10.1016/S0301-4622(00)00136-8 (2000).
OpenUrl CrossRef PubMed Web of Science
Cooper, A. Heat capacity effects in protein folding and ligand binding: a re-evaluation of the role of water in biomolecular thermodynamics. Biophys. Chem. 115, 89–97, doi:10.1016/j.bpc.2004.12.011 (2005).
OpenUrl CrossRef PubMed Web of Science
↵
Cooper, A. Protein Heat Capacity: An Anomaly that Maybe Never Was. J. Phys. Chem. Lett. 1, 3298–3304, doi:10.1021/jz1012142 (2010).
OpenUrl CrossRef
↵
Karplus, M., Ichiye, T. & Pettitt, B. M. Configurational Entropy of Native Proteins. Biophys. J. 52, 1083–1085, doi:10.1016/S0006-3495(87)83303-9 (1987).
OpenUrl CrossRef PubMed Web of Science
↵
Finkelstein, A. V. & Ptitsyn, O. B. Protein Physics: A Course of Lectures. (Academic Press, 2002).
↵
Dill, K. A. Dominant Forces in Protein Folding. Biochemistry 29, 7133–7155, doi:10.1021/bi00483a001 (1990).
OpenUrl CrossRef PubMed
↵
Privalov, P. L. & Makhatadze, G. I. Contribution of Hydration and Non-covalent Interactions to the Heat Capacity Effect on Protein Unfolding. J. Mol. Biol. 224, 715–723, doi:10.1016/0022-2836(92)90555-X (1992).
OpenUrl CrossRef PubMed Web of Science
Privalov, P. L. & Makhatadze, G. I. Contribution of Hydration to Protein Folding Thermodynamics: II. The Entropy and Gibbs Energy of Hydration. J. Mol. Biol. 232, 660–679, doi:10.1006/jmbi.1993.1417 (1993).
OpenUrl CrossRef PubMed Web of Science
↵
Makhatadze, G. I. & Privalov, P. L. Contribution of Hydration to Protein Folding Thermodynamics I. The Enthalpy of Hydration. J. Mol. Biol. 232, 639–659, doi:10.1006/jmbi.1993.1416 (1993).
OpenUrl CrossRef PubMed Web of Science
↵
Hammond, G. S. A Correlation of Reaction Rates. J. Am. Chem. Soc. 77, 334–338, doi:10.1021/ja01607a027 (1955).
OpenUrl CrossRef
↵
Hammett, L. P. Linear free energy relationships in rate and equilibrium phenomena. Trans. Faraday Soc. 34, 156–165, doi:10.1039/TF9383400156 (1938).
OpenUrl CrossRef
↵
Wells, P. R. Linear Free Energy Relationships. Chem. Rev. 63, 171–219, doi:10.1021/cr60222a005 (1963).
OpenUrl CrossRef
Farcasiu, D. The use and misuse of the Hammond Postulate. J. Chem. Edu. 52, 76, doi:10.1021/ed052p76 (1975).
OpenUrl CrossRef
↵
Agmon, N. Quantitative Hammond postulate. J. Chem. Soc., Faraday Trans. 2 74, 388–404, doi:10.1039/f29787400388 (1978).
OpenUrl CrossRef
Albery, W. J. The Application of the Marcus Relation to Reactions in Solution. Annu. Rev. Phys. Chem. 31, 227–263, doi:10.1146/annurev.pc.31.100180.001303 (1980).
OpenUrl CrossRef Web of Science
Formosinho, S. J. On the Validity of Linear Free Energy Relationships in Chemical Kinetics. Rev. Port. Quim. 27, 521 (1985).
OpenUrl
Estell, D. A. Artifacts in the application of linear free energy analysis. Protein Eng. 1, 441–442, doi:10.1093/protein/1.6.441 (1987).
OpenUrl CrossRef PubMed
Fersht, A. R. Linear free energy relationships are valid! Protein Eng. 1, 442–445, doi:10.1093/protein/1.6.442 (1987).
OpenUrl CrossRef PubMed
↵
Sanchez, I. E. & Kiefhaber, T. Hammond behavior versus ground state effects in protein folding: evidence for narrow free energy barriers and residual structure in unfolded states. J. Mol. Biol. 327, 867–884, doi:10.1016/S0022-2836(03)00171-2 (2003).
OpenUrl CrossRef PubMed Web of Science
↵
Straub, J. E. & Karplus, M. The interpretation of site-directed mutagenesis experiments by linear free energy relations. Protein Eng. 3, 673–675, doi:10.1093/protein/3.8.673 (1990).
OpenUrl CrossRef PubMed
↵
Fersht, A. R., Matouschek, A. & Serrano, L. The Folding of an Enzyme I. Theory of Protein Engineering Analysis of Stability and Pathway of Protein Folding. J. Mol. Biol. 224, 771–782, doi:10.1016/0022-2836(92)90561-W (1992).
OpenUrl CrossRef PubMed Web of Science
↵
Fersht, A. R. & Daggett, V. Protein Folding and Unfolding at Atomic Resolution. Cell 108, 573–582, doi:10.1016/S0092-8674(02)00620-7 (2002).
OpenUrl CrossRef PubMed Web of Science
↵
Kresge, A. J. in Proton-Transfer Reactions (eds Edward Caldin & Victor Gold) Ch. 7: The Brønsted Relation: Significance of the Exponent, 179–199 (Springer US, 1975).
↵
Crane, J. C., Koepf, E. K., Kelly, J. W. & Gruebele, M. Mapping the Transition State of the WW Domain Beta-Sheet. J. Mol. Biol. 298, 283–292, doi:10.1006/jmbi.2000.3665 (2000).
OpenUrl CrossRef PubMed Web of Science
Jager, M., Nguyen, H., Crane, J. C., Kelly, J. W. & Gruebele, M. The Folding Mechanism of a Beta-Sheet: The WW Domain. J. Mol. Biol. 311, 373–393, doi:10.1006/jmbi.2001.4873 (2001).
OpenUrl CrossRef PubMed Web of Science
↵
Ervin, J. & Gruebele, M. Quantifying protein folding transition States with Φ_T. J. Biol. Phys. 28, 115–128, doi:10.1023/A:1019930203777 (2002).
OpenUrl CrossRef
↵
Chung, H. S. & Tokmakoff, A. Temperature-dependent downhill unfolding of ubiquitin. I. Nanosecond-to-millisecond resolved nonlinear infrared spectroscopy. Proteins: Struct. Funct. Bioinf. 72, 474–487, doi:10.1002/prot.22043 (2008).
OpenUrl CrossRef PubMed Web of Science
↵
Johnson, C. D. Linear Free Energy Relations and the Reactivity-Selectivity Principle. Chem. Rev. 75, 755–765, doi:10.1021/cr60298a004 (1975).
OpenUrl CrossRef
↵
Bordwell, F. G., Boyle, W. J., Hautala, J. A. & Yee, K. C. Brønsted Coefficients Larger Than 1 and less than 0 for Proton Removal from Carbon Acids. J. Am. Chem. Soc. 91, 4002–4003, doi:10.1021/ja01042a082 (1969).
OpenUrl CrossRef
Marcus, R. A. Unusual Slopes of Free Energy Plots in Kinetics. J. Am. Chem. Soc. 91, 7224–7225, doi:10.1021/ja01054a003 (1969).
OpenUrl CrossRef
↵
Kresge, A. J. The Nitroalkane Anomaly. Can. J. Chem. 52, 1897–1903, doi:10.1139/v74-270 (1974).
OpenUrl CrossRef Web of Science
Shapiro, I. O., Zharova, N. G., Ranneva, Y. I., Terekhova, M. I. & Shatenshtein, A. I. Reasons for the “Alpha” Coefficient Anomaly in the Brønsted Relation for the Ionization of CH Acids. Theor. Expt. Chem. 22, 425–430, doi:10.1007/BF00523820 (1987).
OpenUrl CrossRef
↵
Würthwein, E.-U., Lang, G., Schappele, L. H. & Mayr, H. Rate-Equilibrium Relationships in Hydride Transfer Reactions: The Role of Intrinsic Barriers. J. Am. Chem. Soc. 124, 4084–4092, doi:10.1021/ja0121540 (2002).
OpenUrl CrossRef PubMed
↵
Fersht, A. R. Relationship of Leffler (Brønsted) alpha values and protein folding Phi values to position of transition-state structures on reaction coordinates. Proc. Natl. Acad. Sci. U S A 101, 14338–14342, doi:10.1073/pnas.0406091101 (2004).
OpenUrl Abstract/FREE Full Text
↵
Gillespie, J. R. & Shortle, D. Characterization of Long-range Structure in the Denatured State of Staphylococcal Nuclease. I. Paramagnetic Relaxation Enhancement by Nitroxide Spin Labels. J. Mol. Biol. 268, 158–169, doi:10.1006/jmbi.1997.0954 (1997).
OpenUrl CrossRef PubMed Web of Science
Shortle, D. & Ackerman, M. S. Persistence of Native-Like Topology in a Denatured Protein in 8 M Urea. Science 293, 487–489, doi:10.1126/science.1060438 (2001).
OpenUrl Abstract/FREE Full Text
Klein-Seetharaman, J. et al. Long-range Interactions Within a Nonnative Protein. Science 295, 1719–1722, doi:10.1126/science.1067680 (2002).
OpenUrl Abstract/FREE Full Text
↵
Bernadó, P. et al. A structural model for unfolded proteins from residual dipolar couplings and small-angle x-ray scattering. Proc. Natl. Acad. Sci. U S A 102, 17002–17007, doi:10.1073/pnas.0506202102 (2005).
OpenUrl Abstract/FREE Full Text
↵
Goldenberg, D. P. Finding the right fold. Nat. Struct. Biol. 6, 987–990, doi:10.1038/14866 (1999).
OpenUrl CrossRef PubMed Web of Science
↵
Sanchez, I. E. & Kiefhaber, T. Origin of Unusual Phi-Values in Protein Folding: Evidence Against Specific Nucleation Sites. J. Mol. Biol. 334, 1077–1085, doi:10.1016/j.jmb.2003.10.016 (2003).
OpenUrl CrossRef PubMed Web of Science
Weikl, T. R. & Dill, K. A. Transition-States in Protein Folding Kinetics: The Structural Interpretation of Φ values. J. Mol. Biol. 365, 1578–1586, doi:10.1016/j.jmb.2006.10.082 (2007).
OpenUrl CrossRef PubMed
Naganathan, A. N. & Munoz, V. Insights into protein folding mechanisms from large scale analysis of mutational effects. Proc. Natl. Acad. Sci. U S A 107, 8611–8616, doi:10.1073/pnas.1000988107 (2010).
OpenUrl Abstract/FREE Full Text
↵
Booth, P. J. & Clarke, J. Membrane protein folding makes the transition. Proc. Natl. Acad. Sci. U S A 107, 3947–3948, doi:10.1073/pnas.0914478107 (2010).
OpenUrl FREE Full Text
↵
Fersht, A. R. & Sato, S. Phi-value analysis and the nature of protein-folding transition states. Proc. Natl. Acad. Sci. U S A 101, 7976–7981, doi:10.1073/pnas.0402684101 (2004).
OpenUrl Abstract/FREE Full Text
LiCata, V. J. & Liu, C.-C. Analysis of Free Energy Versus Temperature Curves in Protein Folding and Macromolecular Interactions. Methods Enzymol. 488, 219–238, doi:10.1016/B978-0-12-381268-1.00009-4 (2011).
OpenUrl CrossRef PubMed
↵
Fersht, A. Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding. (W.H. Freeman and Company, 1999).

View the discussion thread.

Posted February 01, 2016.

Download PDF

Citation Tools

Subject Area

Biophysics

Subject Areas

All Articles

Animal Behavior and Cognition (5204)
Biochemistry (11725)
Bioengineering (8728)
Bioinformatics (29135)
Biophysics (14940)
Cancer Biology (12052)
Cell Biology (17363)
Clinical Trials (138)
Developmental Biology (9408)
Ecology (14148)
Epidemiology (2067)
Evolutionary Biology (18273)
Genetics (12223)
Genomics (16773)
Immunology (11844)
Microbiology (28027)
Molecular Biology (11564)
Neuroscience (60843)
Paleontology (451)
Pathology (1864)
Pharmacology and Toxicology (3232)
Physiology (4940)
Plant Biology (10405)
Scientific Communication and Education (1681)
Synthetic Biology (2878)
Systems Biology (7335)
Zoology (1642)

[1] ↵
Marcus, R. A. Chemical and Electrochemical Electron-Transfer Theory. Annu. Rev. Phys. Chem. 15, 155, doi:10.1146/annurev.pc.15.100164.001103 (1964).
OpenUrl CrossRef Web of Science

[2] Sade, R. S. Analysis of Two-State Folding Using Parabolic Approximation I: Hypothesis. bioRxiv, doi:10.1101/036491 (2016).
OpenUrl Abstract/FREE Full Text

[3] ↵
Sade, R. S. Analysis of Two-State Folding Using Parabolic Approximation II: Temperature-Dependence. bioRxiv, doi:10.1101/037341 (2016).
OpenUrl CrossRef

[4] ↵
Petrovich, M., Jonsson, A. L., Ferguson, N., Daggett, V. & Fersht, A. R. Φ-Analysis at the Experimental Limits: Mechanism of Beta-Hairpin Formation. J. Mol. Biol. 360, 865–881, doi:10.1016/j.jmb.2006.05.050 (2006).
OpenUrl CrossRef PubMed Web of Science

[5] ↵
Tanford, C. Protein Denaturation. C. Theoretical Models for the Mechanism of Denaturation. Adv. Protein Chem. 24, 1–95, doi:10.1016/S0065-3233(08)60241-7 (1970).
OpenUrl CrossRef PubMed

[6] ↵
Becktel, W. J. & Schellman, J. A. Protein Stability Curves. Biopolymers 26, 1859–1877, doi:10.1002/bip.360261104 (1987).
OpenUrl CrossRef PubMed Web of Science

[7] ↵
Privalov, P. L. Thermodynamic Problems of Protein Structure. Annu. Rev. Biophys. Biophys. Chem. 18, 47–69, doi:10.1146/annurev.bb.18.060189.000403 (1989).
OpenUrl CrossRef PubMed Web of Science

[8] ↵
Otzen, D. E. & Oliveberg, M. Correspondence between anomalous m- and ΔC_p-values in protein folding. Protein Sci. 13, 3253–3263, doi:10.1110/ps.04991004 (2004).
OpenUrl CrossRef PubMed Web of Science

[9] ↵
Dimitriadis, G. et al. Microsecond folding dynamics of the F13W G29A mutant of the B domain of staphylococcal protein A by laser-induced temperature jump. Proc. Natl. Acad. Sci. U S A 101, 3809–3814, doi:10.1073/pnas.0306433101 (2004).
OpenUrl Abstract/FREE Full Text

[10] ↵
Taskent, H., Cho, J. H. & Raleigh, D. P. Temperature-Dependent Hammond Behaviour in a Protein-Folding Reaction: Analysis of Transition-State Movement and Ground-State Effects. J. Mol. Biol. 378, 699–706, doi:10.1016/j.jmb.2008.02.024 (2008).
OpenUrl CrossRef PubMed

[11] Cellmer, T., Henry, E. R., Hofrichter, J. & Eaton, W. A. Measuring internal friction of an ultrafast-folding protein. Proc. Natl. Acad. Sci. U S A 105, 18320–18325, doi:10.1073/pnas.0806154105 (2008).
OpenUrl Abstract/FREE Full Text

[12] ↵
Sun, L., Noel, J. K., Sulkowska, J. I., Levine, H. & Onuchic, J. N. Connecting Thermal and Mechanical Protein (Un)folding Landscapes. Biophys. J. 107, 2941–2952, doi:10.1016/j.bpj.2014.10.021 (2014).
OpenUrl CrossRef PubMed

[13] ↵
Marcus, R. A. Theoretical Relations among Rate Constants, Barriers, and Brønsted Slopes of Chemical Reactions. J. Phys. Chem. 72, 891–899, doi:10.1021/j100849a019 (1968).
OpenUrl CrossRef

[14] ↵
Cohen, A. O. & Marcus, R. A. On the Slope of Free Energy Plots in Chemical Kinetics. J. Phys. Chem. 72, 4249–4256, doi:10.1021/j100858a052 (1968).
OpenUrl CrossRef

[15] ↵
Koeppl, G. W. & Kresge, A. J. Marcus Rate Theory and the Relationship between Brønsted Exponents and Energy of Reaction. Chem. Commun., 371–373, doi:10.1039/C39730000371 (1973).
OpenUrl CrossRef

[16] ↵
Rasmussen, B. F., Stock, A. M., Ringe, D. & Petsko, G. A. Crystalline ribonuclease A loses function below the dynamical transition at 220 K. Nature 357, 423–424, doi:10.1038/357423a0 (1992).
OpenUrl CrossRef PubMed Web of Science

[17] ↵
Kauzmann, W. The Nature of the Glassy State and the Behavior of Liquids at Low Temperatures. Chem. Rev. 43, 219–256, doi:10.1021/cr60135a002 (1948).
OpenUrl CrossRef Web of Science

[18] Adam, G. & Gibbs, J. H. On the Temperature Dependence of Cooperative Relaxation Properties in Glass Forming Liquids. J. Chem. Phys. 43, 139–146, doi:10.1063/1.1696442 (1965).
OpenUrl CrossRef Web of Science

[19] Goldstein, M. Viscous Liquids and the Glass Transition: A Potential Energy Barrier Picture. J. Chem. Phys. 51, 3728–3739, doi:10.1063/1.1672587 (1969).
OpenUrl CrossRef Web of Science

[20] Debenedetti, P. G. & Stillinger, F. H. Supercooled liquids and the glass transition. Nature 410, 259–267, doi:10.1038/35065704 (2001).
OpenUrl CrossRef PubMed Web of Science

[21] Dill, K. A. & Bromberg, S. Molecular Driving Forces – Statistical Thermodynamics in Chemistry and Biology. (Garland Science, 2003).

[22] Dyre, J. Mysteries of the Glass Transition. Physics Today 61, 15, doi:10.1063/1.2835137 (2008).
OpenUrl CrossRef

[23] ↵
Bauer, T., Lunkenheimer, P. & Loidl, A. Cooperativity and the Freezing of Molecular Motion at the Glass Transition. Phys. Rev. Lett. 111, 225702, doi:10.1103/PhysRevLett.111.225702 (2013).
OpenUrl CrossRef PubMed

[24] ↵
Kresge, A. J. The Brønsted Relation - Recent Developments. Chem. Soc. Rev. 2, 475–503, doi:10.1039/CS9730200475 (1973).
OpenUrl CrossRef

[25] ↵
Nguyen, H., Jäger, M., Moretto, A., Gruebele, M. & Kelly, J. W. Tuning the free-energy landscape of a WW domain by temperature, mutation, and truncation. Proc. Natl. Acad. Sci. U S A 100, 3948–3953, doi:10.1073/pnas.0538054100 (2003).
OpenUrl Abstract/FREE Full Text

[26] ↵
Ferguson, N. et al. Rapid amyloid fiber formation from the fast-folding WW domain FBP28. Proc. Natl. Acad. Sci. U S A 100, 9814–9819, doi:10.1073/pnas.1333907100 (2003).
OpenUrl Abstract/FREE Full Text

[27] ↵
Thomsen, J. S. Logical Relations among the Principles of Statistical Mechanics and Thermodynamics. Phys. Rev. 91, 1263–1266, doi:10.1103/PhysRev.91.1263 (1953).
OpenUrl CrossRef

[28] ↵
Bryngelson, J. D., Onuchic, J. N., Socci, N. D. & Wolynes, P. G. Funnels, Pathways and the Energy Landscape of Protein Folding: A Synthesis. Proteins: Struct. Funct. Genet. 21, 167, doi:10.1002/prot.340210302 (1995).
OpenUrl CrossRef PubMed Web of Science

[29] Oliveberg, M., Tan, Y. J. & Fersht, A. R. Negative activation enthalpies in the kinetics of protein folding. Proc. Natl. Acad. Sci. U S A 92, 8926–8929, doi:10.1073/pnas.92.19.8926 (1995).
OpenUrl Abstract/FREE Full Text

[30] Scalley, M. L. & Baker, D. Protein folding kinetics exhibit an Arrhenius temperature dependence when corrected for the temperature dependence of protein stability. Proc. Natl. Acad. Sci. U S A 94, 10636–10640, doi:10.1073/pnas.94.20.10636 (1997).
OpenUrl Abstract/FREE Full Text

[31] Chan, H. S. & Dill, K. A. Protein Folding in the Landscape Perspective: Chevron Plots and Non-Arrhenius Kinetics. Proteins: Struct. Funct. Genet. 30, 2–33, doi:10.1002/(SICI)1097-0134(19980101)30:1<2::AID-PROT2>3.0.CO;2-R (1998).
OpenUrl CrossRef PubMed Web of Science

[32] ↵
Ghosh, K., Ozkan, S. B. & Dill, K. A. The Ultimate Speed Limit to Protein Folding Is Conformational Searching. J. Am. Chem. Soc. 129, 11920–11927, doi:10.1021/ja066785b (2007).
OpenUrl CrossRef PubMed Web of Science

[33] ↵
Myers, J. K., Pace, C. N. & Scholtz, J. M. Denaturant m values and heat capacity changes: Relation to changes in accessible surface areas of protein unfolding. Protein Sci. 4, 2138–2148, doi:10.1002/pro.5560041020 (1995).
OpenUrl CrossRef PubMed Web of Science

[34] ↵
Robertson, A. D. & Murphy, K. P. Protein Structure and the Energetics of Protein Stability. Chem. Rev. 97, 1251–1268, doi:10.1021/cr960383c (1997).
OpenUrl CrossRef PubMed Web of Science

[35] ↵
Schellman, J. A. Temperature, Stability, and the Hydrophobic Interaction. Biophys. J. 73, 2960–2964, doi:10.1016/S0006-3495(97)78324-3 (1997).
OpenUrl CrossRef PubMed Web of Science

[36] ↵
Baldwin, R. L. Dynamic hydration shell restores Kauzmann’s 1959 explanation of how the hydrophobic factor drives protein folding. Proc. Natl. Acad. Sci. U S A 111, 13052–13056, doi:10.1073/pnas.1414556111 (2014).
OpenUrl Abstract/FREE Full Text

[37] ↵
Privalov, P. L. & Gill, S. J. Stability of Protein Structure and Hydrophobic Interaction. Adv. Protein Chem. 39, 191–234, doi:10.1016/S0065-3233(08)60377-0 (1988).
OpenUrl CrossRef PubMed Web of Science

[38] ↵
Gómez, J., Hilser, V. J., Xie, D. & Freire, E. The Heat Capacity of Proteins. Proteins: Struct. Funct. Bioinf. 22, 404–412, doi:10.1002/prot.340220410 (1995).
OpenUrl CrossRef PubMed Web of Science

[39] ↵
Shortle, D. The Expanded Denatured State: An Ensemble of Conformations Trapped in a Locally Encoded Topological Space. Adv. Protein Chem. 62, 1–23, doi:10.1016/S0065-3233(02)62003-0 (2002).
OpenUrl CrossRef PubMed Web of Science

[40] ↵
Bowler, B. E. Residual structure in unfolded proteins. Curr. Opin. Struct. Biol. 22, 4–13, doi:10.1016/j.sbi.2011.09.002 (2012).
OpenUrl CrossRef PubMed

[41] ↵
Richards, F. M. Areas, Volumes, Packing, and Protein Structure. Annu. Rev. Biophys. Bioeng. 6, 151–176, doi:10.1146/annurev.bb.06.060177.001055 (1977).
OpenUrl CrossRef PubMed Web of Science

[42] Bernadó, P., Blackledge, M. & Sancho, J. Sequence-Specific Solvent Accessibilities of Protein Residues in Unfolded Protein Ensembles. Biophys. J. 91, 4536–4543, doi:10.1529/biophysj.106.087528 (2006).
OpenUrl CrossRef PubMed

[43] ↵
Gong, H. & Rose, G. D. Assessing the solvent-dependent surface area of unfolded proteins using an ensemble model. Proc. Natl. Acad. Sci. U S A 105, 3321–3326, doi:10.1073/pnas.0712240105 (2008).
OpenUrl Abstract/FREE Full Text

[44] ↵
Behe, M. J., Lattman, E. E. & Rose, G. D. The protein-folding problem: The native fold determines packing, but does packing determine the native fold? Proc. Natl. Acad. Sci. U S A 88, 4195–4199, doi:10.1073/pnas.88.10.4195 (1991).
OpenUrl Abstract/FREE Full Text

[45] ↵
Chothia, C. Principles that Determine the Structure of Proteins. Annu. Rev. Biochem. 53, 537–572, doi:10.1146/annurev.bi.53.070184.002541 (1984).
OpenUrl CrossRef PubMed Web of Science

[46] ↵
Miller, S., Janin, J., Lesk, A. M. & Chothia, C. Interior and Surface of Monomeric Proteins. J. Mol. Biol. 196, 641–656, doi:10.1016/0022-2836(87)90038-6 (1987).
OpenUrl CrossRef PubMed Web of Science

[47] ↵
Tan, Y. J., Oliveberg, M. & Fersht, A. R. Titration Properties and Thermodynamics of the Transition State for Folding: Comparison of Two-state and Multi-state Folding Pathways. J. Mol. Biol. 264, 377–389, doi:10.1006/jmbi.1996.0647 (1996).
OpenUrl CrossRef PubMed Web of Science

[48] ↵
Leffler, J. Parameters for the Description of Transition States. Science 117, 340–341, doi:10.1126/science.117.3039.340 (1953).
OpenUrl FREE Full Text

[49] ↵
Sanchez, I. E. & Kiefhaber, T. Non-linear rate-equilibrium free energy relationships and Hammond behavior in protein folding. Biophys. Chem. 100, 397–407, doi:10.1016/S0301-4622(02)00294-6 (2003).
OpenUrl CrossRef PubMed Web of Science

[50] ↵
Bilsel, O. & Matthews, C. R. Barriers in Protein Folding Reactions. Adv. Protein Chem. 53, 153–207, doi:10.1016/S0065-3233(00)53004-6 (2000).
OpenUrl CrossRef PubMed Web of Science

[51] ↵
Gloss, L. M. & Matthews, C. R. The Barriers in the Bimolecular and Unimolecular Folding Reactions of the Dimeric Core Domain of Escherichia Coli Trp Repressor are Dominated by Enthalpic Contributions. Biochemistry 37, 16000–16010, doi:10.1021/bi981694f (1998).
OpenUrl CrossRef PubMed

[52] ↵
Lee, A. L. & Wand, A. J. Microscopic origins of entropy, heat capacity and the glass transition in proteins. Nature 411, 501–504, doi:10.1038/35078119 (2001).
OpenUrl CrossRef PubMed Web of Science

[53] Chandler, D. Interfaces and the driving force of hydrophobic assembly. Nature 437, 640–647, doi:10.1038/nature04162 (2005).
OpenUrl CrossRef PubMed Web of Science

[54] ↵
Ben-Naim, A. Myths and verities in protein folding theories: From Frank and Evans iceberg-conjecture to explanation of the hydrophobic effect. J. Chem. Phys. 139, 165105, doi:10.1063/1.4827086 (2013).
OpenUrl CrossRef

[55] ↵
Sturtevant, J. M. Heat capacity and entropy changes in processes involving proteins. Proc. Natl. Acad. Sci. U S A 74, 2236–2240, doi:10.1073/pnas.74.6.2236 (1977).
OpenUrl Abstract/FREE Full Text

[56] ↵
Lazaridis, T. & Karplus, M. Heat capacity and compactness of denatured proteins. Biophys. Chem. 78, 207–217, doi:10.1016/S0301-4622(99)00022-8 (1999).
OpenUrl CrossRef

[57] ↵
Frauenfelder, H. et al. A unified model of protein dynamics. Proc. Natl. Acad. Sci. U S A 106, 5129–5134, doi:10.1073/pnas.0900336106 (2009).
OpenUrl Abstract/FREE Full Text

[58] Lewandowski, J. R., Halse, M. E., Blackledge, M. & Emsley, L. Direct observation of hierarchical protein dynamics. Science 348, 578–581, doi:10.1126/science.aaa6111 (2015).
OpenUrl Abstract/FREE Full Text

[59] Cooper, A. Heat capacity of hydrogen-bonded networks: an alternative view of protein folding thermodynamics. Biophys. Chem. 85, 25–39, doi:10.1016/S0301-4622(00)00136-8 (2000).
OpenUrl CrossRef PubMed Web of Science

[60] Cooper, A. Heat capacity effects in protein folding and ligand binding: a re-evaluation of the role of water in biomolecular thermodynamics. Biophys. Chem. 115, 89–97, doi:10.1016/j.bpc.2004.12.011 (2005).
OpenUrl CrossRef PubMed Web of Science

[61] ↵
Cooper, A. Protein Heat Capacity: An Anomaly that Maybe Never Was. J. Phys. Chem. Lett. 1, 3298–3304, doi:10.1021/jz1012142 (2010).
OpenUrl CrossRef

[62] ↵
Karplus, M., Ichiye, T. & Pettitt, B. M. Configurational Entropy of Native Proteins. Biophys. J. 52, 1083–1085, doi:10.1016/S0006-3495(87)83303-9 (1987).
OpenUrl CrossRef PubMed Web of Science

[63] ↵
Finkelstein, A. V. & Ptitsyn, O. B. Protein Physics: A Course of Lectures. (Academic Press, 2002).

[64] ↵
Dill, K. A. Dominant Forces in Protein Folding. Biochemistry 29, 7133–7155, doi:10.1021/bi00483a001 (1990).
OpenUrl CrossRef PubMed

[65] ↵
Privalov, P. L. & Makhatadze, G. I. Contribution of Hydration and Non-covalent Interactions to the Heat Capacity Effect on Protein Unfolding. J. Mol. Biol. 224, 715–723, doi:10.1016/0022-2836(92)90555-X (1992).
OpenUrl CrossRef PubMed Web of Science

[66] Privalov, P. L. & Makhatadze, G. I. Contribution of Hydration to Protein Folding Thermodynamics: II. The Entropy and Gibbs Energy of Hydration. J. Mol. Biol. 232, 660–679, doi:10.1006/jmbi.1993.1417 (1993).
OpenUrl CrossRef PubMed Web of Science

[67] ↵
Makhatadze, G. I. & Privalov, P. L. Contribution of Hydration to Protein Folding Thermodynamics I. The Enthalpy of Hydration. J. Mol. Biol. 232, 639–659, doi:10.1006/jmbi.1993.1416 (1993).
OpenUrl CrossRef PubMed Web of Science

[68] ↵
Hammond, G. S. A Correlation of Reaction Rates. J. Am. Chem. Soc. 77, 334–338, doi:10.1021/ja01607a027 (1955).
OpenUrl CrossRef

[69] ↵
Hammett, L. P. Linear free energy relationships in rate and equilibrium phenomena. Trans. Faraday Soc. 34, 156–165, doi:10.1039/TF9383400156 (1938).
OpenUrl CrossRef

[70] ↵
Wells, P. R. Linear Free Energy Relationships. Chem. Rev. 63, 171–219, doi:10.1021/cr60222a005 (1963).
OpenUrl CrossRef

[71] Farcasiu, D. The use and misuse of the Hammond Postulate. J. Chem. Edu. 52, 76, doi:10.1021/ed052p76 (1975).
OpenUrl CrossRef

[72] ↵
Agmon, N. Quantitative Hammond postulate. J. Chem. Soc., Faraday Trans. 2 74, 388–404, doi:10.1039/f29787400388 (1978).
OpenUrl CrossRef

[73] Albery, W. J. The Application of the Marcus Relation to Reactions in Solution. Annu. Rev. Phys. Chem. 31, 227–263, doi:10.1146/annurev.pc.31.100180.001303 (1980).
OpenUrl CrossRef Web of Science

[74] Formosinho, S. J. On the Validity of Linear Free Energy Relationships in Chemical Kinetics. Rev. Port. Quim. 27, 521 (1985).
OpenUrl

[75] Estell, D. A. Artifacts in the application of linear free energy analysis. Protein Eng. 1, 441–442, doi:10.1093/protein/1.6.441 (1987).
OpenUrl CrossRef PubMed

[76] Fersht, A. R. Linear free energy relationships are valid! Protein Eng. 1, 442–445, doi:10.1093/protein/1.6.442 (1987).
OpenUrl CrossRef PubMed

[77] ↵
Sanchez, I. E. & Kiefhaber, T. Hammond behavior versus ground state effects in protein folding: evidence for narrow free energy barriers and residual structure in unfolded states. J. Mol. Biol. 327, 867–884, doi:10.1016/S0022-2836(03)00171-2 (2003).
OpenUrl CrossRef PubMed Web of Science

[78] ↵
Straub, J. E. & Karplus, M. The interpretation of site-directed mutagenesis experiments by linear free energy relations. Protein Eng. 3, 673–675, doi:10.1093/protein/3.8.673 (1990).
OpenUrl CrossRef PubMed

[79] ↵
Fersht, A. R., Matouschek, A. & Serrano, L. The Folding of an Enzyme I. Theory of Protein Engineering Analysis of Stability and Pathway of Protein Folding. J. Mol. Biol. 224, 771–782, doi:10.1016/0022-2836(92)90561-W (1992).
OpenUrl CrossRef PubMed Web of Science

[80] ↵
Fersht, A. R. & Daggett, V. Protein Folding and Unfolding at Atomic Resolution. Cell 108, 573–582, doi:10.1016/S0092-8674(02)00620-7 (2002).
OpenUrl CrossRef PubMed Web of Science

[81] ↵
Kresge, A. J. in Proton-Transfer Reactions (eds Edward Caldin & Victor Gold) Ch. 7: The Brønsted Relation: Significance of the Exponent, 179–199 (Springer US, 1975).

[82] ↵
Crane, J. C., Koepf, E. K., Kelly, J. W. & Gruebele, M. Mapping the Transition State of the WW Domain Beta-Sheet. J. Mol. Biol. 298, 283–292, doi:10.1006/jmbi.2000.3665 (2000).
OpenUrl CrossRef PubMed Web of Science

[83] Jager, M., Nguyen, H., Crane, J. C., Kelly, J. W. & Gruebele, M. The Folding Mechanism of a Beta-Sheet: The WW Domain. J. Mol. Biol. 311, 373–393, doi:10.1006/jmbi.2001.4873 (2001).
OpenUrl CrossRef PubMed Web of Science

[84] ↵
Ervin, J. & Gruebele, M. Quantifying protein folding transition States with Φ_T. J. Biol. Phys. 28, 115–128, doi:10.1023/A:1019930203777 (2002).
OpenUrl CrossRef

[85] ↵
Chung, H. S. & Tokmakoff, A. Temperature-dependent downhill unfolding of ubiquitin. I. Nanosecond-to-millisecond resolved nonlinear infrared spectroscopy. Proteins: Struct. Funct. Bioinf. 72, 474–487, doi:10.1002/prot.22043 (2008).
OpenUrl CrossRef PubMed Web of Science

[86] ↵
Johnson, C. D. Linear Free Energy Relations and the Reactivity-Selectivity Principle. Chem. Rev. 75, 755–765, doi:10.1021/cr60298a004 (1975).
OpenUrl CrossRef

[87] ↵
Bordwell, F. G., Boyle, W. J., Hautala, J. A. & Yee, K. C. Brønsted Coefficients Larger Than 1 and less than 0 for Proton Removal from Carbon Acids. J. Am. Chem. Soc. 91, 4002–4003, doi:10.1021/ja01042a082 (1969).
OpenUrl CrossRef

[88] Marcus, R. A. Unusual Slopes of Free Energy Plots in Kinetics. J. Am. Chem. Soc. 91, 7224–7225, doi:10.1021/ja01054a003 (1969).
OpenUrl CrossRef

[89] ↵
Kresge, A. J. The Nitroalkane Anomaly. Can. J. Chem. 52, 1897–1903, doi:10.1139/v74-270 (1974).
OpenUrl CrossRef Web of Science

[90] Shapiro, I. O., Zharova, N. G., Ranneva, Y. I., Terekhova, M. I. & Shatenshtein, A. I. Reasons for the “Alpha” Coefficient Anomaly in the Brønsted Relation for the Ionization of CH Acids. Theor. Expt. Chem. 22, 425–430, doi:10.1007/BF00523820 (1987).
OpenUrl CrossRef

[91] ↵
Würthwein, E.-U., Lang, G., Schappele, L. H. & Mayr, H. Rate-Equilibrium Relationships in Hydride Transfer Reactions: The Role of Intrinsic Barriers. J. Am. Chem. Soc. 124, 4084–4092, doi:10.1021/ja0121540 (2002).
OpenUrl CrossRef PubMed

[92] ↵
Fersht, A. R. Relationship of Leffler (Brønsted) alpha values and protein folding Phi values to position of transition-state structures on reaction coordinates. Proc. Natl. Acad. Sci. U S A 101, 14338–14342, doi:10.1073/pnas.0406091101 (2004).
OpenUrl Abstract/FREE Full Text

[93] ↵
Gillespie, J. R. & Shortle, D. Characterization of Long-range Structure in the Denatured State of Staphylococcal Nuclease. I. Paramagnetic Relaxation Enhancement by Nitroxide Spin Labels. J. Mol. Biol. 268, 158–169, doi:10.1006/jmbi.1997.0954 (1997).
OpenUrl CrossRef PubMed Web of Science

[94] Shortle, D. & Ackerman, M. S. Persistence of Native-Like Topology in a Denatured Protein in 8 M Urea. Science 293, 487–489, doi:10.1126/science.1060438 (2001).
OpenUrl Abstract/FREE Full Text

[95] Klein-Seetharaman, J. et al. Long-range Interactions Within a Nonnative Protein. Science 295, 1719–1722, doi:10.1126/science.1067680 (2002).
OpenUrl Abstract/FREE Full Text

[96] ↵
Bernadó, P. et al. A structural model for unfolded proteins from residual dipolar couplings and small-angle x-ray scattering. Proc. Natl. Acad. Sci. U S A 102, 17002–17007, doi:10.1073/pnas.0506202102 (2005).
OpenUrl Abstract/FREE Full Text

[97] ↵
Goldenberg, D. P. Finding the right fold. Nat. Struct. Biol. 6, 987–990, doi:10.1038/14866 (1999).
OpenUrl CrossRef PubMed Web of Science

[98] ↵
Sanchez, I. E. & Kiefhaber, T. Origin of Unusual Phi-Values in Protein Folding: Evidence Against Specific Nucleation Sites. J. Mol. Biol. 334, 1077–1085, doi:10.1016/j.jmb.2003.10.016 (2003).
OpenUrl CrossRef PubMed Web of Science

[99] Weikl, T. R. & Dill, K. A. Transition-States in Protein Folding Kinetics: The Structural Interpretation of Φ values. J. Mol. Biol. 365, 1578–1586, doi:10.1016/j.jmb.2006.10.082 (2007).
OpenUrl CrossRef PubMed

[100] Naganathan, A. N. & Munoz, V. Insights into protein folding mechanisms from large scale analysis of mutational effects. Proc. Natl. Acad. Sci. U S A 107, 8611–8616, doi:10.1073/pnas.1000988107 (2010).
OpenUrl Abstract/FREE Full Text

[101] ↵
Booth, P. J. & Clarke, J. Membrane protein folding makes the transition. Proc. Natl. Acad. Sci. U S A 107, 3947–3948, doi:10.1073/pnas.0914478107 (2010).
OpenUrl FREE Full Text

[102] ↵
Fersht, A. R. & Sato, S. Phi-value analysis and the nature of protein-folding transition states. Proc. Natl. Acad. Sci. U S A 101, 7976–7981, doi:10.1073/pnas.0402684101 (2004).
OpenUrl Abstract/FREE Full Text

[103] LiCata, V. J. & Liu, C.-C. Analysis of Free Energy Versus Temperature Curves in Protein Folding and Macromolecular Interactions. Methods Enzymol. 488, 219–238, doi:10.1016/B978-0-12-381268-1.00009-4 (2011).
OpenUrl CrossRef PubMed

[104] ↵
Fersht, A. Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding. (W.H. Freeman and Company, 1999).