Substrate conformational dynamics drive structure-specific recognition of gapped DNA by DNA polymerase

Timothy D. Craggs; Marko Sustarsic; Anne Plochowietz; Majid Mosayebi; Hendrik Kaju; Andrew Cuthbert; Johannes Hohlbein; Laura Domicevica; Philip C. Biggin; Jonathan P. K. Doye; Achillefs N. Kapanidis

doi:10.1101/263038

Abstract

DNA-binding proteins utilise different recognition mechanisms to locate their DNA targets. Some proteins recognise specific nucleotide sequences, while many DNA repair proteins interact with specific (often bent) DNA structures. While sequence-specific DNA binding mechanisms have been studied extensively, structure-specific mechanisms remain unclear. Here, we study structure-specific DNA recognition by examining the structure and dynamics of DNA polymerase I (Pol) substrates both alone and in Pol-DNA complexes. Using a rigid-body docking approach based on a network of 73 distance restraints collected using single-molecule FRET, we determined a novel solution structure of the singlenucleotide-gapped DNA-Pol binary complex. The structure was highly consistent with previous crystal structures with regards to the downstream primer-template DNA substrate; further, our structure showed a previously unobserved sharp bend (~120°) in the DNA substrate; we also showed that this pronounced bending of the substrate is present in living bacteria. All-atom molecular dynamics simulations and single-molecule quenching assays revealed that 4-5 nt of downstream gap-proximal DNA are unwound in the binary complex. Coarsegrained simulations on free gapped substrates reproduced our experimental FRET values with remarkable accuracy (<ΔFRET> = -0.0025 across 34 independent distances) and revealed that the one-nucleotide-gapped DNA frequently adopted highly bent conformations similar to those in the Pol-bound state (ΔG < 4 kT); such conformations were much less accessible to nicked (> 7 kT) or duplex (>> 10 kT) DNA. Our results suggest a mechanism by which Pol and other structure-specific DNA-binding proteins locate their DNA targets through sensing of the conformational dynamics of DNA substrates.

Significance Statement Most genetic processes, including DNA replication, repair and transcription, rely on DNA-binding proteins locating specific sites on DNA; some sites contain a specific sequence, whereas others present a specific structure. While sequence-specific recognition has a clear physical basis, structure-specific recognition mechanisms remain obscure. Here, we use single-molecule FRET and computer simulations to show that the conformational dynamics of an important repair intermediate (1nt-gapped DNA) act as central recognition signals for structure-specific binding by DNA polymerase I (Pol). Our conclusion is strongly supported by a novel solution structure of the Pol-DNA complex wherein the gapped-DNA is significantly bent. Our iterative approach combining precise single-molecule measurements with molecular modelling is general and can elucidate the structure and dynamics for many large biomachines.

Protein machines functioning on chromosomes and plasmids utilise different mechanisms to locate their targets on DNA. Sequence-specific DNA-binding proteins, such as restriction enzymes and transcription factors, recognize a particular nucleotide sequence via a combination of direct and indirect readouts (1). Whereas direct readout involves specific interactions between the DNA bases and protein amino acid side chains, indirect readout senses sequence-dependent structural and mechanical features, such as major or minor groove width, and conformational flexibility (2). In contrast, structure-specific proteins have no sequence specificity; instead, they interact with particular DNA structures (e.g., gapped duplexes, and 5’ or 3’ overhangs). While sequence-specific DNA binding mechanisms have been studied extensively, structure-specific mechanisms remain unclear.

Many enzymes involved in DNA repair and replication are necessarily structure-specific, and have been shown to interact with bent DNA substrates; examples include Flap endonuclease 1 (3-5), DNA Polymerase β (6), XPF (7), and MutS (8, 9). Although catalytic reasons for substrate distortion have been suggested for individual systems, it is unclear whether the structure and dynamics of bent states serve as general recognition signals for binding or substrate selectivity. A key related question is whether these proteins induce DNA bending upon binding (via an “induced” fit mechanism) or they recognise a pre-bent state adopted by the DNA prior to protein binding (a “conformational selection” mechanism), or a combination of both.

Here we studied the E. coli DNA polymerase I (Klenow Fragment; Pol), a structure-specific protein responsible for Okazaki fragment processing in lagging-strand DNA replication, as well as for DNA synthesis during DNA repair. In both roles, the polymerase recognizes and binds to a gapped DNA substrate and polymerizes across the gap. After gap filling, strand-displacement synthesis may follow; this is important for Okazaki fragment processing, as the polymerase continues to synthesize DNA whilst displacing an RNA primer, which is subsequently excised.

Attempts to understand the DNA binding and recognition mechanism for Pol are complicated by the absence of crystal structures of DNA-Pol binary complexes containing downstream duplex DNA, by the heterogeneity of Pol-DNA complexes (10, 11), and by the conformational mobility of the free DNA substrate. As a result, there are many open questions regarding the mechanisms of strand-displacement DNA synthesis and substrate recognition.

We investigated the mechanism of structure-specific recognition by Pol via a combination of single-molecule Forster resonance energy transfer (smFRET) and molecular modelling. The single-molecule nature of our work addressed the issues of conformational and compositional heterogeneity, and led to a FRET-restrained solution structure of the binary complex, Pol bound to 1-nt gapped DNA. This structure revealed a substantial bend in the DNA substrate (which was also supported by complementary FRET experiments in living bacteria), and provided insight into protein and DNA structural features crucial for strand-displacement synthesis. The structure also served as the starting point for atomistic molecular dynamics simulations, which revealed the dynamic nature of the binary complex and the specific interactions between the protein and DNA. Experimental smFRET measurements and coarse-grained modelling allowed us to characterise the conformational ensemble of the free substrate and propose a mechanism for substrate recognition and binding by DNA polymerase I, which is likely to apply to many other structure-specific DNA-binding proteins.

Results

Structural analysis of multiple species in dynamic equilibrium

To analyse the structure of the binary complex of Pol with a 1-nt gapped DNA in solution, we determined numerous DNA-DNA and protein-DNA distance restraints within freely diffusing Pol-DNA complexes using single-molecule confocal fluorescence microscopy combined with alternating-laser excitation (12-14). We measured DNA-DNA distances between labelled sites in the upstream and downstream duplex regions of gapped DNA containing a 3’-dideoxy nucleotide (to prevent any chemistry occurring; Fig 1A). We also measured DNA-protein distances between a FRET donor dye attached to one of three Pol residues (K550C, L744C, C907; Fig 1B) and a FRET acceptor dye attached to one of 13 labelling sites on gapped DNA. Pol activities were not significantly affected by the dye presence (10, 15).

Figure 1 Measuring distances within single polymerase-DNA complexes in heterogeneous mixtures with dynamic species.

(a) Schematic of a 1-nt gapped DNA substrate showing the template (red lettering) and non-template strands (black). Red stars represent acceptor labelled dT bases. Split red/green stars indicate positions labelled with donor for DNA-DNA FRET, or acceptor for DNA-protein FRET.

(b) DNA Polymerase I (Klenow Fragment; Pol) structural schematic (grey - pdb 1KLN) and donor labelling positions (green stars)

(c) Apparent FRET histograms for the doubly-labelled substrate T(−12) B(+11) at increasing concentrations of Pol. The data (grey bars) were fitted with up to three Gaussians (black, red and blue dashed lines), yielding apparent FRET efficiencies, E* of 0.35, 0.55 and 0.75.

(d) Corrected ES histogram for a DNA-DNA FRET measurement (here, for T(−12)B(+11) in the presence of 3 nM Pol). Data (grey bars) were fitted by the sum of three Gaussians (solid black lines) centered on E=0.41 (black dash), E=0.63 (red dash) and E=0.90 (blue dash) respectively.

(e) Corrected ES histogram for a protein-DNA FRET measurement (here, for C907-Cy3B B6-Atto647N). Data were fitted with a single Gaussian function, centered on E=0.48.

The DNA substrate is bent by 120° in the Pol-DNA binary structure

To obtain structural models of the Pol-DNA complex, we used our 73 distance restraints to perform rigid-body docking using Pol and two shortened DNA helices representing the upstream and downstream DNA (Fig 2A). We generated 32 refined structures and ranked them according to their fit to the measured distances (see Methods). A single model (Fig 2B) emerged with a significantly better fit than the other structures (Fig S2A). In this model, the position of the upstream DNA agreed very well with the position of a DNA fragment in a crystal structure of a Pol-DNA binary complex (16), (RMSD = 2.9 Å; Fig S2B), demonstrating the accuracy of our structural model. To test the robustness of our model, we generated 100 ‘bootstrapped’ structures by randomly perturbing the 73 distance restraints in proportion to their experimental errors, repeating the docking calculations, and calculating the RMSD for each DNA backbone phosphate atom across all bootstrapped structures (average RMSD = 3.8 Å, Fig S2E).

Figure 2 Pol-DNA binary structure from rigid-body docking

(a) Pol structure showing the fingers (blue), thumb (purple) and palm (wheat) subdomains and the proof-reading exonuclease domain (grey). Example DNA-DNA and protein DNA-distances (black dashed lines) are shown between mean dye positions (green and red spheres). Example accessible volumes of a donor (pale green cloud) and an acceptor (pale red cloud) dye are also shown, along with the full sequences of the docked DNAs; the shaded region indicating the DNA not used for the docking.

(b) Results of the rigid-body docking: template DNA (red), non-template DNA (black), subdomains coloured as in (A).

(d) Clash between full-length downstream DNA and the fingers subdomain (cyan). See also Fig S2 and Table S1.

Having established the accuracy and precision of our model, we inspected it for insights into the DNA-binding and strand-displacement mechanisms. The most striking feature was the significant kink in the DNA substrate, a ~120° bend compared to straight duplex (Fig 2B). Further, the downstream DNA is positioned close to the fingers subdomain (Fig 2C), with the helical axis aligned with Y719 (Bst numbering used as default; this corresponds to residue F771 in E. coli). Substitution of this residue with alanine was previously shown to significantly impair strand displacement synthesis by Pol (17). Extending the downstream DNA to its full length by modelling the previously deleted base pairs proximal to the gap, resulted in a clash between the additional DNA and the Pol fingers (Fig 2D) indicating that the gap-proximal downstream DNA may be partially melted in the binary complex.

We also used 21 FRET restraints (Table S1) to obtain the relative orientation of the upstream and downstream DNA in the high-FRET (Pol)2-DNA ternary complex and generated a low-resolution model for this complex in which the DNA was more severely bent (by ~140°; Fig S2F; RMSD = 12 Å, Fig S2G).

All-atom MD simulations give structural insights into the strand displacement mechanism

To probe the exact position of DNA in the binary complex, its dynamics and any specific contacts with Pol, we carried out all-atom MD simulations. We generated five different starting models by combining the DNA from our FRET-restrained structure with the short DNA fragment present in the 4BDP Bst X-ray structure (18) (see Methods and Fig S3B), and performed two unconstrained 100-ns simulations from each model.

Whereas the DNA fragment present in the X-ray structure remained stably bound to Pol (RMSD 2.8 ± 0.8 A), the upstream and downstream segments flanking this DNA were much more mobile (RMSD 11.1 ± 4.4 Å and 19.2 ± 8.8 Å, respectively; Fig S3C), with the end-to-end DNA distance ranging from 24 to 144 Å (Fig 3A and Fig S3D). The first six nucleotides of the downstream, nontemplate DNA (nucleotides T(+1) to T(+6), termed the non-template flap) also displayed appreciable dynamics (RMSD 8.7 ± 3.7 Å, Fig 3B and Fig S3E), and did not dock in a particular conformation. To study the extent of DNA melting in this region, we counted the hydrogen bonds formed between the six nucleotides of the non-template flap and the template strand: for most of the simulation time, 2 or 5 hydrogen bonds were present (Fig 3B), corresponding either to a single AT or to an A-T plus a G-C pair, respectively, consistent with base-pairing of the two nucleotides at the base of the flap. Hence, in most conformations, 4 or 5 nucleotides of the flap were melted. Contacts between the flap and Pol were transient and diverse in terms of the residues involved; the most consistent interactions were sequence-unspecific, being formed between DNA phosphates and positively charged residues (mainly R729 and K730).

Figure 3 Binary complex structure and dynamics

(a) Representative snapshot of the DNA-Pol binary complex from a 100-ns MD simulation, showing the volume accessed by the DNA over the simulation (pale pink). The plot shows the DNA end-to-end distance fluctuations over the same simulation, with the ends taken as the terminal non-hydrogen atoms of the template strand. The time point corresponding to the snapshot is indicated with an arrowhead.

(b) Representative snapshot of the conformation of the 6-nt non-template flap, with its volumetric map during a 100-ns simulation (pale orange). The plot shows the frequency of the number of hydrogen bonds formed between the flap and the template strand of downstream DNA during the entire 1-μs (10× 100 ns) simulation time.

(c) Overview of Pol residues involved in strand separation or interactions with downstream DNA. See also panels (D) to (F).

(d) Involvement of Y719 in strand separation of downstream DNA. Top - a representative snapshot of the position of Y719 relative to the template DNA strand. The three DNA residues positioned closest to Y719 during the time course of the simulation are highlighted in CPK colouring. The position of the three-helix bundle is shown for reference; the rest of the protein is omitted for clarity. Lower panels, two different views of the volumetric maps of Y719 (yellow), template (red) and non-template DNA strands (black) during a 100-ns simulation.

(e) A representative snapshot of the interactions between R779 (green) and R784 (cyan) with phosphate groups (orange spheres) in the non-template strand of downstream DNA. The plot shows the minimum distance between the side-chain nitrogen atoms of R779 (green) or R784 (cyan) to any phosphorous atom in the non-template strand of downstream DNA during a 100 ns simulation. Arrowhead denotes the time point corresponding to the snapshot, and dashed lines indicate the distance corresponding to an interaction.

(f) A representative snapshot of the interaction between residue K549 (purple) with phosphate groups in the template strand of downstream DNA.

See also Fig S3, Movie M1 and PBD File P1.

Many of the protein-DNA interactions in the active site (Y714, S717, Y719 and R789, all contacting the template strand) were similar to those observed in the X-ray structure (18). Further, in our simulations, the conserved three-helix bundle (O, O1 and O2 helices in the fingers subdomain), and especially residue Y719 (F771 in E. coli) were consistently positioned between the downstream template and non-template strands (Fig 3D); Y719 was typically positioned perpendicular to bases B(+1) and B(+2) of the template strand (Fig 3D, upper panel), and occasionally stacked against them (Fig S3F). The position of Y719 is consistent with a previously suggested mechanism in which Y719 acts as a “wedge”, separating the non-template strand from its template counterpart (17). Despite the intrinsic dynamics of the non-template strand, the stable positioning of Y719 against the template strand likely prevents re-pairing during catalysis.

Finally, we observed interactions between downstream DNA and the polymerase, which consistently involved positively charged residues on the Pol surface and the negatively charged phosphate groups of the DNA backbone. These interactions occurred in two regions: the first involved R779 (S831) and R784 (R836) that contact the duplex region of downstream DNA (Fig 3E), and the second featured K549 (K601) of the thumb region interacting with the unpaired template strand (Fig 3F). Whilst any individual nitrogen-phosphate interaction was transient, each residue contacted up to 6 phosphate groups, resulting in Pol-downstream DNA interactions persisting for most of the simulation time. The dynamic nature of these interactions likely reflects the need for rapid Pol movement along its DNA substrate during DNA synthesis.

Downstream DNA is melted in the DNA-Pol binary complex. To study the melting of the downstream non-template strand predicted by both our docked binary complex model (Fig 2D) and MD simulations (Fig 3B), we used quenchable FRET (quFRET), a single-molecule assay able to detect local DNA unwinding (19-21). In quFRET, when the donor (Cy3B) and acceptor (Atto647N) are in close proximity (< 2 nm), their emission is quenched, yielding only few events with intermediate stoichiometry (0.4 < S < 0.8) (see Methods). Upon local DNA melting, the two dyes move further apart and the quenching is reduced, leading to a large increase in both the number, and proportion of events with intermediate stoichiometry (mostly occurring at high FRET efficiencies, as the inter-dye distance remains short).

We studied a 1-nt gapped DNA substrate labelled with donor and acceptor dyes at positions T(+1) and B(+4), respectively. In the absence of Pol, the dyes are in very close proximity; as a result, we detected few intermediate-S events (Fig 4), comprising only ~25% of all acceptor-containing molecules (Fig S4A). On addition of Pol, we observed a ~4.5-fold increase in the number of such events per measurement, with a peak at high FRET (E* > 0.9; Fig 4), now comprising ~75% of all acceptor-containing molecules (Fig S4B). These results demonstrate an increase in dye separation and reduced quenching, consistent with the presence of local melting at the 5’-end of the downstream non-template strand in the binary complex.

Figure 4 Downstream DNA is melted in the binary complex

Results from quenchable FRET experiments. The plot shows the number of events with mid-stoichiometry (0.4 < S < 0.8) vs the apparent FRET efficiency, for DNA substrate T(+1)B(+4) alone (grey bars) and in the presence of 3 nM Pol (red bars). Inset: Schematics of the labeling positions and DNA structures for the unbound (left; B-DNA) and bound conformations (right; snapshot from MD simulations, atomic coordinates provided as SI), and the related accessible volumes of the donor (green) and acceptor (red) dyes, quoting the percentage overlap between them (see main text).

Discussion

The combination of single-molecule FRET with both coarse-grained and all-atom molecular simulations has provided substantial mechanistic and structural insight into the recognition and binding of DNA substrates by Pol. We have characterised the structure and dynamics of multiple species present in solution: the substrate alone, the binary complex and the high-FRET ternary complex. Further, we have obtained evidence for the in vivo relevance of the bent binary complex, detecting its FRET signature in live cells.

We obtained a unique, solution-based, high-precision structure (RMSD = 3.8 Å) of Pol bound to a gapped-DNA substrate, containing upstream and downstream duplex DNA flanking a 1-nt gap (Fig 2B and 3A). Previous structural efforts lacked any downstream duplex DNA and so its position and the conformation of the substrate were unknown. Gapped DNA in the binary complex structure adopted a 120° bend (discussed further below).

The location of the upstream DNA in the docked structure agrees very well with existing co-crystal structures containing primer-template substrates. This supports our rigid-body docking approach, and the accuracy of our positioning of the downstream DNA on the fingers subdomain. This positioning conclusively rejects early propositions that the DNA might be channelled through the cleft formed by the fingers and thumb subdomains (33, 34). Our structure served as a starting point for all-atom MD simulations, which showed DNA dynamics in the binary complex, and identified transient DNA interactions with specific Pol residues. Some of these interactions involved residues implicated in previous biochemical studies, e.g. Y719 (17), providing a structural and mechanistic explanation for the experimental data; other residues (e.g. K549) revealed novel interactions that will merit further study.

Our docked structure showed that the downstream DNA was positioned very close to Y719 (Fig 3D), confirming its involvement in strand displacement. DNA Pol I shares a three-helix bundle (O, O1 and O2) structural motif with T7 RNA polymerase (35). This motif participates in DNA binding and strand separation (36), and includes conserved residues Y719, S717 and R789 in Bst (F771, S769 and R841 in E. coli), which have been shown to be important for strand-displacement by Pol (17). This role for Y719 was further supported in our simulations, which showed the three-helix bundle (and particularly Y719) to be positioned between the template and non-template strands of the downstream DNA. The exact position of Y719 close to bases B(+1) and B(+2) on the downstream-template DNA is consistent with cross-linking data (37, 38).

We also identified residues that interacted with the downstream DNA (R779 and R784; Fig 3E). These residues are highly conserved, with published sequence alignments showing 29 and 48 out of 50 bacterial polymerase sequences containing a homologous residue at positions 779 and 784, respectively (37). The two residues are likely to be functionally complementary given their proximity in the structure and the similar interactions they form with downstream DNA in our simulations. Whereas our simulations indicate that R779 is more important for contacting DNA in the Bst Pol I, R784 may be the key residue in other bacterial polymerases that lack a positively charged residue at position 779, such as E. coli Pol. Interestingly, mutation of R784 to alanine (R836A in E. coli) has been shown to increase the binding of downstream DNA to the polymerase site (37, 39), possibly due to R784 contributing to the bending and distortion of downstream DNA, or reflecting an unfavourable orientation of the side chain in the DNA-Pol binary complex.

K549 is part of a conserved motif (K)KT present in 33 out of 50 bacterial polymerase sequences analysed (37). In our simulations, interactions with K549 appear to keep the template strand away from its non-template counterpart, which may facilitate strand separation. Radioactive competition assays and cross-linking experiments have shown that Pol forms contacts with the first 4 nucleotides of the downstream template strand (37), which are beyond the reach of the active-site residues (Y714, S717, Y719 and R789), but could be accounted for by interactions with K549. The identity of the amino acid(s) cross-linking to base +4 could not be identified in these studies, likely due to the dynamics of the template strand and the transiency of interactions with K549, both features being apparent in our simulations.

The binary complex structure from rigid-body docking suggested that the downstream DNA cannot be fully base-paired proximal to the Pol fingers (Fig 2D). This idea was supported by our MD simulations, in which 4-5 nt of the downstream DNA remained single-stranded for the majority of the simulation time (Fig 3B). Our quenchable FRET assay confirmed that the downstream DNA is indeed melted when bound by Pol (Fig 4). When carrying out Okazaki fragment processing or long-patch base excision repair, Pol must perform strand-displacement DNA synthesis, replacing the RNA primer / damaged DNA with newly polymerized DNA. Our data suggest that the strand-displacement process starts before any DNA synthesis, with up to seven nucleotides being melted upon Pol binding to the substrate.

Our in vivo single-molecule experiments unequivocally show that non-extendible gapped-DNA constructs are bent in live E. coli, unlike duplex DNA. The close agreement between the FRET signatures of the bent species in cells and in vitro suggests that bending is likely mediated by the endogenous full-length Pol binding, although the effect of other DNA binding proteins cannot be excluded. For both internalized labelled DNAs, we observed a higher proportion of the lowest FRET species (corresponding to unbound DNA) than expected from our in vitro binding data and the expected cellular Pol concentration (~400 nM (40)). The high abundance of the low-FRET molecules in cells may reflect the effect of intracellular conditions (e.g., the presence of free nucleotides that can transiently occupy the 1-nt gap), the involvement of other proteins that could compete with Pol for gapped-DNA binding, or a lower affinity of Pol for gapped substrates in vivo.

Previous in vitro studies observed the presence of two molecules of Pol bound to DNA substrates (41-43). We also observed Pol₂-DNA species in our in vitro titrations (Figs S1 and S6), but not in vivo, suggesting that these complexes are unlikely to be important in the cellular context, where the presence of the 5’-nuclease domain in the full-length protein may inhibit dimer formation.

Gapped DNA in the binary complex structure exhibited a 120° bend (Fig 2B and 3A). DNA bending was also observed in the crystal structure of the mammalian gap-filling DNA polymerase β, where the ~90° bend observed was suggested to be important for the mechanisms of polymerisation and fidelity (6). Our data support the idea that bending may be a necessary mechanistic step for gap-filling polymerases, exposing more of the template base for interrogation by the incoming nucleotide. However, we propose bending may also play a role in substrate recognition and selectivity.

Our coarse-grained simulations on the free gapped DNA showed remarkable agreement with the smFRET data (Fig 5B) and have important implications for the binding mechanism of Pol. Since the breaking of the stacking interactions opposite the gap increases DNA bendability, unstacking will likely occur as a step on the path to Pol binding. In addition, the high flexibility of the unstacked DNA suggests that the substrate can adopt a close-to-final bent conformation even prior to Pol complex formation. The simulations also provide an explanation for Pol substrate specificity, specifically its increasing binding preference for gapped over nicked DNA, previously observed by gel shift assays (41) and ensemble anisotropy (44). This preference appears to arise from the increased flexibility of the gap over the nicked DNA, reflected in the different energy cost required for their bending. In this way, the substrate specificity is encoded in the structure and dynamics of the DNA substrate itself, allowing sequence-unspecific recognition of gapped DNA by Pol.

Interestingly, other forms of DNA modification can affect DNA flexibility; cytosine methylation reduces flexibility, while 5-formylcytosine (a substrate for base excision repair) was shown to increase flexibility (Ngo et al., 2016). Thus, it is likely that increased DNA flexibility may act as a general recognition signal for a variety of DNA repair processes.

Based on our results, we propose the following model for recognition and binding of a gapped DNA substrate by Pol involving conformational capture followed by an ‘on-protein’ rearrangement (Fig 7). The DNA substrate rapidly interconverts between stacked and unstacked states; the unstacked conformations are generally more bent and show increased fraying 1-2 nt around the gap. The Pol initially interacts with the upstream DNA while the substrate is in an unstacked state (conformational capture). This upstream region of the substrate resembles a primer-template structure, which is known to bind tightly to Pol (K_D < 1 nM; Turner, Grindley and Joyce, 2003) forming a sufficiently stable complex for crystallization (16, 18). This conformational selection step does not necessarily require the substrate to adopt the precise 120° bend angle seen in the binary complex; rather, the DNA conformational flexibility helps to avoid blocking binding through steric clashes. Having bound the upstream duplex, the downstream duplex is free to sample conformational space (as seen in the MD simulations on the binary complex; Fig 3A), docking to the protein, and fraying the additional 3-4 nts, resulting in the complete binding of the gapped DNA (K_D = 0.4 nM; Fig S1A). This proposed two-step binding mechanism comprises an initial conformational selection step in which the substrate is bound, followed by an ‘on-protein’ conformational search, in which the DNA and the protein both search conformational space

Other structure-specific DNA binding proteins which have been shown to interact with bent DNA (e.g. FEN1, Pol β) are also likely to exploit the conformational dynamics of their substrates for recognition and binding. Thus, the mechanism we propose of an initial conformational selection step, sensing the increased flexibility of the substrate DNA, followed by an ‘on-protein’ rearrangement, may be generally applicable to many structure-specific DNA binding enzymes. It is an attractive model for how these enzymes operate during DNA repair, where vast regions of undamaged DNA are searched rapidly to identify sites that need to be repaired to stop accumulation of toxic intermediates and mutations, and ensure normal cellular function.

Figure 7 Gapped DNA recognition: conformational capture followed by an ‘on-protein’ rearrangement.

Gapped DNA is dynamic adopting bent and frayed states (orange haze). Pol can bind to the upstream DNA when the downstream DNA conformation is not impeding the Pol (conformational capture of slightly bent states). Following binding of the upstream DNA, the downstream DNA now docks and is further melted, beginning the process of strand-displacement.

AUTHOR CONTRIBUTIONS

Conceptualization, T.D.C., J.H. and A.N.K.; Software, T.D.C. and J.H.; Investigation, T.D.C., M.S., A.P., M.M., H.K., A.C. and J.H.; Resources J.P.K.D., P.C.B. and A.N.K.; Writing - Original Draft, T.D.C., M.S., and M.M.; Writing - Review and Editing, T.D.C., M.S., A.P., M.M., J.H., J.P.K.D., P.C.B. and A.N.K.; Supervision, T.D.C., J.P.K.D., P.C.B. and A.N.K. Funding acquisition, A.N.K.

METHODS

Protein Expression, purification and labelling. Pol variants were expressed from an N-terminal-His6, D424A construct and purified as described (10). The D424A mutation inhibits the proof-reading exonuclease activity. Briefly, a plasmid carrying the gene encoding Pol was transformed into HMS174 (DE3) cells, and single colonies inoculated in 25 ml LB, supplemented with 50 μg/ml carbenicillin. The cultures were grown overnight at 220 rpm and 37°C, and were used to inoculate 1 liter of LB supplemented with carbenicillin. The culture was grown to an ÜD600 of 0.6, at which point expression was induced with 0.5 mM Isopropyl β-D-1-thiogalactopyranoside (IPTG). After 2 hours of expression, the cells were harvested by centrifugation (20 min at 3000 rpm and 4 °C; GS-6R Beckman), resuspended in cold 50 mM Tris pH 7.5, and spun down in an ultracentrifuge (15 min at 8,000 rpm at 4 °C; Sigma 3K30, rotor 12150-H). Finally, the pellet was resuspended in lysis buffer (50 mM Tris pH 7.2, 300 mM NaCl, 1 mM β-mercaptoethanol, 10 mM imidazole, 2 mg/ml lysozyme and 0.02 mM Phenylmethane sulfonyl fluoride; PMSF). The cells were stored in the lysis buffer overnight at -80°C.

The frozen cells were thawed and fresh PMSF (25 μ!) was added. Cells were lysed by sonication (6 cycles of 5-sec ON and 10-sec OFF time) and the cell debris was spun down (20 min at 15,000 rpm at 4 °C; Sigma 3K30). The supernatant containing the cell lysate was combined with Ni-NTA resin (pre-equilibrated in buffer A: 50 mM Tris pH 7.2, 300 mM NaCl, 1 mM β-mercaptoethanol and 10 mM imidazole), and the protein allowed to batch-bind (1 hour at 4°C). The resin was spun down, resuspended in buffer A, applied to a plastic column and washed with buffer A containing increasing concentrations of imidazole (10, 20, and 27 mM). The protein was eluted in buffer A containing 100 mM imidazole, and the fractions analyzed by absorbance and SDS-PAGE. The concentrated fractions were pooled, and dialyzed into 50 mM Tris pH 7.2, 1 mM dithiothreitol (DTT) overnight at 4 °C. The dialyzed samples were combined in a 1:1 ratio with 2x glycerol storage buffer (80 % glycerol, 50 mM Tris pH 7.2, 2 mM DTT) and stored at -20°C.

Pol variants containing a single cysteine (C907+, C907S / K550C and C907S / L744C) were labelled using a maleimide derivative of Cy3B (GE Healthcare) as described (10). Briefly, purified Pol samples were reduced (5 mM DTT, 1 hr at 22°C), and DTT was removed by dialysis into 50 mM Tris pH 7.1, 0.12 mM tris(2-carboxyethyl)phosphine (TCEP). A two-fold excess of the Cy3B maleimide dissolved in DMSO was added to the protein sample and the reaction allowed to proceed overnight at 4 °C with gentle rocking. The reaction was quenched with 1 mM DTT, and applied to a heparin column (preequilibrated in heparin buffer containing 20 mM Tris pH 7.4, 1 mM ethylenediaminetetraacetic acid (EDTA), 2 % glycerol and 1 mM β-mercaptoethanol), washed with heparin buffer containing 50 mM NaCl, and the protein eluted in buffer containing 400 mM NaCl. Samples were dialyzed first into 1 liter of 50 mM Tris pH 7.4, 25 mM NaCl, 1 mM DTT, for 3x 1 hour, and then into 500 ml of the same buffer containing 40 % glycerol, overnight, before storing at -20 °C. Labelling efficiencies (typically ~80 %) were determined by UV-Vis absorbance, using extinction coefficients for Pol (58,790 M⁻¹ cm⁻¹ at 280 nm) and Cy3B (130,000 M⁻¹ cm⁻¹ at 570 nm), and taking into account the Cy3B absorbance at 280 nm.

DNA labelling and annealing. DNA oligonucleotides (oligos; Table S3) were prepared using automated synthesis (IBA GmbH), and labelled with NHS-ester derivatives of Cy3B (GE Healthcare) and Atto647N (Atto-tec) via dT-C6-amino linkers at selected positions according to the manufacturers’ protocols. Labelled oligos were purified by 20% polyacrylamide gel electrophoresis. Bands were visualized by UV-shadowing, cut, and extracted from the gel using an overnight crush and soak protocol at 4°C. The sample volume was reduced (by centrifugal evaporation) and buffer exchanged into TE buffer (Microbiospin6 columns, BioRad). Gapped-DNA substrates were assembled by annealing three single-stranded oligos (one from each group - DNA1, DNA2 and DNA3; Table S3) in annealing buffer, 20 mM Tris-HCl pH 8.0, 100 mM NaCl, and 1 mM EDTA. Samples were heated to 94°C and subsequently cooled to 4°C, in steps of 10°C over 45 min. Annealed substrates were stored at -20°C. For DNAs prepared for electroporation, the oligonucleotides were annealed in a low-salt annealing buffer (20 mM Tris-HCl (pH 8.0), 10 mM NaCl, 1 mM EDTA).

Single-molecule FRET measurements. Single-molecule FRET measurements were performed at room temperature using a home-built confocal microscope with 20 kHz alternating-laser excitation between a 532-nm (Samba, Cobolt, operated at 240 μW) and a 638-nm laser (Cube, Coherent, operated at 60 μW), coupled to a 60x, 1.35 numerical aperture (NA), UPLSAPO 60X0 objective (Olympus). For DNA-DNA measurements, labelled DNA was present at < 100 pM and unlabelled Pol (when present) at 3 nM concentration. For Pol-DNA measurements, both Pol and DNA were present at 100 pM concentration. Measurements were taken in ‘Pol buffer’, consisting of 40 mM 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES)-NaOH, pH 7.3, 10 mM MgCl₂, 1 mM DTT, 100μgml⁻¹ bovine serum albumin, 5% (vol/vol) glycerol, 1 mM mercaptoethylamine. We recorded 3-6 datasets of 10 min for each distance measurement and combined for analysis. Photon streams in DD, DA and AA channels were recorded and processed using custom-written software (LabVIEW). Bursts were filtered for the correct labelling stoichiometry (45), and accurate FRET was calculated as described below. FRET histograms were fitted to single, double or triple Gaussian functions.

Derivation of the multistate equilibrium model. A three-species model, in which the observed low-, mid-, and high-FRET states correspond to DNA alone, DNA-Pol binary complex, and a dimer DNA-Pol₂ respectively, could not account for the persistence of the mid-FRET signal at high Pol concentrations (Fig S1A). The simplest model that could account for all the data involved a second dimer species with a FRET efficiency indistinguishable from the DNA-Pol binary complex (Fig S1B). In this model, binding of Pol to the DNA, forming the binary complex (Pol:DNA*), is described by the association constant, K₁=[Pol:DNA*]/[DNA]. Binding of a second Pol to the binary complex yields a mid-FRET dimer species (Pol₂:DNA*) governed by the association constant K₂=[ Pol₂:DNA*]/[ Pol:DNA*]. This mid-FRET dimer can isomerize to the high-FRET dimer (Pol₂:DNA**), a process described by the equilibrium constant K₃=[ Pol₂:DNA**]/[ Pol₂:DNA*]. The corresponding dissociation constants are defined as K_D1=1/K₁ for the formation of the binary complex and K_D2=1/(K₁K₂) for the formation of the mid-FRET dimer species.

The total DNA substrate concentration is:

The fractions of each species as a function of Pol concentration are:

The fractional populations of the low-mid and high-FRET states are:

We used a global fitting approach to fit the variation in the fractional populations of the three FRET states simultaneously as a function of Pol concentration, and to determine the equilibrium constants K₁, K₂, and K₃. From the equilibrium constants and their standard errors, we calculated the dissociation constants.

Accurate FRET corrections. The apparent FRET efficiency, E* was calculated from the DA and DD photon streams:

Similarly, the apparent stoichiometry, S* was calculated using:

To obtain the accurate FRET efficiency, the raw photon streams were sequentially corrected for background counts, cross talk, and gamma / beta factors (which take into account the different detection efficiencies, quantum yields and excitation cross sections of the two dyes), as described (12, 13).

First, the three photon streams were corrected for background, which arises from impurities, Raman scattering from the solvent and dark counts in the detectors. For each burst, the corrected counts were calculated from the raw counts by subtracting the background count rate, multiplied by the length of the burst. Typical background count rates were 1-3 photons per ms.

After the background correction, the leakage fraction of the donor emission into the acceptor detection channel and the direct excitation of the acceptor by the donor-excitation laser were obtained. The correction factor for leakage (lk) was determined from the FRET efficiency of the donor-only population, E_don-only:

The correction factor for direct excitation (dir) was determined from the apparent stoichiometry value of the acceptor-only population, S_acc-only:

The DA intensities and the FRET efficiency and stoichiometry were then corrected as follows:

Finally, the gamma and beta parameters were obtained from a linear fit to a plot of 1/S_PR vs E_PR:

The fully-corrected accurate FRET efficiencies and stoichiometries, E and S are given by:

Accurate determination and application of all correction parameters was checked visually on the ES histograms, as all FRET populations should be located at S~0.5. Gamma and beta factors were determined separately for DNA-DNA and DNA-Pol measurements. This was necessary because of the significant difference in the quantum yield of the donor when attached to DNA or protein (see below and Table S4).

Conversion of accurate FRET to distance. Accurate FRET efficiency E, was converted to distance R, according to the equation: using experimentally determined values for the Forster radius, R₀, which were calculated according to the equation: where Q_D is the quantum yield of the donor (which must be measured; see below), N_A is Avogadro’s number, and n is the refractive index of the medium. The term κ² describes the relative orientation of the transition dipoles of the donor and acceptor. Its value lies in the range of 0-4, and it is often assumed to be equal to 2/3, which is the case when both fluorophores have unrestricted rotational freedom (46).The overlap integral J is a measure of the degree of overlap between the donor emission and acceptor excitation spectra (47), and can be calculated according to: where F_D is the corrected donor fluorescence intensity at a particular wavelength λ, with the total intensity normalized to unity, and ε_Α is the extinction coefficient of the acceptor at the same wavelength.

Quantum yields were measured according to established methods (48, 49) for the following donor samples: free Cy3b-maleimide dye, Cy3b attached to gapped DNA (in the presence and absence of unlabelled Pol in a 1:1 molar ratio), and Cy3b attached to different positions of Pol (K550, L744, C907). Each sample was diluted from a glycerol stock to 5 μΜ final concentration in Pol buffer. Free Cy3b-maleimide dye was reduced with 10 mM DTT for 10 min prior to dilution. Absorbance at 490 nm was recorded for each sample, using a UV-visible spectrophotometer (Cary 50 Bio, Varian). An emission scan was taken of the same sample using a steady-state fluorimeter (PTI), exciting at 490 nm and recording at 510-700 nm. Samples were diluted and recordings repeated 5 times, to populate absorbance in the 0 to 0.1 region, where absorbance and emission are linearly related. The same procedure was applied to the reference dye, rhodamine 6G, dissolved in ethanol. The quantum yield of the donor dye was then calculated according to equation: where Q is quantum yield, E is integrated emission across the whole spectrum, A is absorption at 490 nm, n is the refractive index of the medium, and D and R refer to the donor and the reference dyes, respectively. Established values were taken for the quantum yield of rhodamine 6G in ethanol (0.95; Magde, Wong and Seybold, 2002) and for the refractive index of ethanol (1.361). To calculate the overlap integrals, absorption spectra of the following acceptor samples were also measured: Atto647 free dye, Atto647N attached to DNA (in the presence and absence of Pol in a 1:1 ratio), and Atto647N attached to Pol. Samples were diluted in Pol buffer to 2 μΜ concentration, and absorption recorded at 400-710 nm. Both the absorption spectra of the acceptor, and the emission spectra of the donor (see above) were corrected for background, normalized, and the overlap integral calculated as in equation 3.3. The extinction coefficient of Atto647N at A_max was taken as 150,000 M⁻¹cm² (from manufacturer’s website; http://www.atto-tec.com). This allowed isotropic R0 values to be calculated (Equation S22), assuming orientational averaging (κ² = 2/3) and the refractive index of water (n=1.333).

To test if orientational averaging is justified, steady-state anisotropies were measured (48). Samples were diluted to 100 nM in Pol buffer and excited with vertically polarized light at 532 nm (donor) or 638 nm (acceptor samples) in a steady-state fluorimeter (PTI). Fluorescence was measured through horizontally and perpendicularly oriented emission filters at 570 nm (donor) or 669 nm (acceptor samples) over 1 minute. Anisotropy values (Table S4) were calculated from the difference of vertically and horizontally polarized emission intensities, corrected for background and for the different sensitivities of the emission channel for vertically and horizontally polarized light.

Distance calculations and accessible volume modelling of dye positions. Accurate FRET efficiencies were converted to their corresponding distances using a FRET error of +/− 0.025 (determined from the standard error of the mean in three independent FRET measurements of the same sample), and using experimentally determined R₀ values (Table S4). The R₀ values used were 64.5 Å for DNA-DNA and for 59.0 Å for DNA-Pol distances, and the error in R0 was assumed to be the error that was propagated from the uncertainty in quantum yield determination of +/* 0.10. The experimentally determined distances correspond to FRET-averaged distances, <R_DA>_E in the accessible volume model established by the Siedel laboratory (28). In this model, dye rotation occurs faster than the FRET process, but the position of the dye is fixed on the timescale of FRET. Other dye modelling methods based on Bayesian statistics can also be used (Muschielok et al., 2008).

The FRET-averaged distance is calculated by averaging the distances between individual dye positions in the donor and acceptor accessible volumes. For rigid body docking, these FRET-averaged distances were converted to distances between mean dye positions R_mp, using a third-order polynomial function: that was established by calculating R_mp and <R_DA>_E values for pairs of dyes at different positions along a double-stranded DNA using, as described (29). The radii, linker lengths and linker widths of Cy3B and Atto647N dyes were estimated from their structures in silico using ChemDraw (Perkin Elmer) and are given here (in Å):

View this table:

We used the accessible volume (AV) algorithm of the FPS software (29) to model the mean positions of the dyes for each Pol and DNA attachment site. The attachment points were taken to be the S atoms of Cys residues, and the C7 atoms of dTTP residues. For quFRET (see below) we calculated the percentage of the donor accessible volume that overlapped with the acceptor accessible volume using custom written code in MATLAB (Mathworks). Accessible volume elements were counted as overlapping if the distance between them was smaller than the lattice spacing used to calculate the initial AV.

Rigid body docking. The polymerase structure was obtained from the Bst X-ray crystal structure (PDB code 1L3U; Johnson et al., 2003). DNA was removed and Cys substitutions were introduced at positions K498, V692 and A855 (corresponding to E. coli residues K550, L744 and C907) using the PyMOL Molecular Graphics System, Version 1.8 Schrödinger, LLC. B-DNA models of the upstream and downstream DNA were made using 3D-DART modelling server (51) and were truncated at the gap-proximal ends by 3 base-pairs each for the purposes of rigid-body docking. Three-body rigid-body docking with Pol, upstream and downstream DNA structures was performed in the FPS software (29) using the calculated R_mp distances. Docking was repeated 1000 times from different starting configurations of the binding partners, using a clash tolerance of 6 Å. This treatment generated several clusters of structures, which were distinguished by the different RMSD values relative to each other, and the different goodness of fit to experimental data assessed by the reduced chi-squared parameter (). Structures with ; values above 6 were rejected, and one structure from each of the remaining clusters was further refined using a clash tolerance of 2 Å, and then again using a tolerance of 1 Å, during which steps the AV clouds were recalculated. The structure with the lowest ; was taken, and the R_mp distances from the model back-converted to <R_da>_e and FRET efficiency values, to compare with the experimental FRET data. For precision estimation, 100 bootstrapped structures were generated from the best model, using a clash tolerance of 1 Å. The coordinates of each P atom were extracted, and the RMSD of each P atom calculated across the 100 bootstrapped structures.

To compare the position of the upstream DNA in the docked structure with the crystal structure, the protein components of the FRET-restrained and crystal structures were aligned in PyMol. The RMSD of the upstream DNA fragment between the two structures was calculated as for bootstrapped structures, but across all P atoms. The DNA structure in complex with the Pol dimer was obtained using the same procedure as the Pol-DNA structure, but with no polymerase present in the docking. In the case of the DNA structure in the absence of Pol, the DNA fragments were at their full length, and with an additional distance restraint of 5 +/− 2.5 Å imposed between the C atoms in the template strand opposite the gap, to account for the covalent link between the two.

All-atom molecular dynamics simulation - Model preparation. The protein atoms and the catalytic magnesium ion were extracted from the Bst X-ray structure PDB file (code 4BDP; Kiefer et al. 1998). The online server ‘WHAT IF’ was used to check for errors in the PDB, and build the missing side chains into the structure (52, 53). PyMol was used to align the FRET-restrained structure with the X-ray structure, based on the protein component only. The downstream DNA in the docked structure was extended to its full length, and its template strand linked to the template strand of a 5-nucleotide fragment of upstream DNA from the X-ray structure (which also includes the templating nucleotide and one nucleotide downstream). Basic molecular sculpting was performed such that the conformation of the DNA backbone was not significantly disturbed, and no steric clashes occurred with the polymerase, which resulted in 6 base-pairs of downstream DNA being unpaired. The upstream DNA fragment was then extended using the sequence of upstream DNA from the docked structure. This step was justified by the excellent agreement in the position of the upstream DNA between the X-ray and docked structures (Fig S2B). For DNA-only simulations, DNA models were generated using the 3D-DART server (51). DNA atoms were extracted from the PDBs and terminal phosphate groups removed using PyMol. In the case of gapped DNA, the central nucleotide was removed and a 5’ phosphate group generated instead.

Force fields and parameters. All complex simulations and high-temperature DNA simulations were run using Amber ff99sb force field (54) with modified nucleic acid parameters (parmbsc0; (55, 56). The crystal structure control and the DNA-only simulations, were run using Amber ff99sb-ILDN with Amber94 nucleic acid parameters (57). No parameters were available in either force field for the 5’ phosphate groups of DNA, as these are usually missing in crystal structures due to their high flexibility. The phosphate group had to be modelled at the 5’ end of the gap in our DNA substrate, as it is both physiologically relevant and present in our single-molecule experiments. Therefore, the force fields were modified by assuming that the parameters of the β-phosphate of free ADP available online (http://research.bmh.manchester.ac.uk/bryce/amber/), are a reasonable approximation for the α-phosphate of the gap-proximal dDTP.

Simulation conditions. All simulations were carried out using Gromacs 4.6 (58). The X-ray structure control simulations and complex simulations were done using explicit solvent (TIP3P) in a triclinic box, with a minimum 10-Å solvent edge, in the presence of 10 mM MgCl₂. The system was neutralized with addition of magnesium ions, and energy-minimized using steepest descent minimization. In order to stabilize the temperature of the system, equilibration was performed in the NVT ensemble for 100 ps, with the temperature of 298 K maintained using a Berendsen thermostat (59). Next, the pressure of the system was stabilized by equilibration in the NPT ensemble for 1 ns, with the temperature of 298 K and the pressure of 1 bar retained using a V-rescale thermostat (60) and a Berendsen barostat (59), respectively. During equilibration, DNA, protein heavy atoms and the catalytic magnesium ion were position-restrained with a force constant of 1,000 kJmol⁻¹nm⁻². DNA was equilibrated for an additional 10 ns with protein heavy atoms restrained, under the NPT conditions. Atom velocities were preserved between the equilibration steps, and between equilibration and production steps. Unrestrained production was finally allowed to run for 100 ns, with the temperature of 298 K and the pressure of 1 bar maintained by the V-rescale thermostat and a Parrinello-Rahman barostat (61). Periodic boundary conditions and the Verlet cutoff scheme were used, and long-range electrostatic interactions were accounted for by the Particle-Mesh Ewald method (62). All bonds were treated as constraints with the LINCS algorithm, resulting in a time step of 2 fs. Coordinates were saved to an output trajectory every 5 ps. Repeat simulations were carried out using different randomly numbered seeds, generating different initial atom velocities each time.

In the case of full-length DNA-only simulations, the conditions were the same except that a square box was used, with dimensions equal to the length of the DNA plus a 10-Â solvent edge. The NVT and NPT equilibration steps were performed, and the production times were 20 ns. In the case of high-temperature DNA simulations carried out as part of model preparation, the conditions were the same as for the complex simulations except that the temperature during the equilibration and production runs was 400 K, and the production times were 2 ns. All DNA heavy atoms were position-restrained during the production runs, except for the 6 base pairs in the protein-proximal, downstream part of the DNA, which were unpaired in the starting configuration.

Analysis. All analysis was carried out using Gromacs 4.6 or 5.0, and VMD (63). Trajectories were repaired for periodic boundary conditions, and processed to include only every 10th frame, corresponding to 50-ps steps. Maps of occupancy of DNA and of polymerase residues during the simulation were created with VMD’s volmap density function, using an isovalue of 0.001. RMSD and end-to-end distance measurements were done using standard functions in Gromacs. The flap-to-tem plate H-bonds were quantified by measuring the number of bonds at any one time in the simulation, using a distance cut-off of 0.33 nm, and an angle cut-off of 30°. The position of residue Y719 relative to the DNA was calculated by measuring the distance between the centers of mass of Y719 side chain and individual DNA residue base moieties. Pol-DNA interactions were detected by measuring the minimum distance between any nitrogen atom of a specific Pol residue and a specific phosphorous atom in DNA, during the entire simulation. Distances below 0.4 nm were taken as indicating an interaction.

Coarse-grained molecular dynamics simulation of DNA substrates using oxDNA. DNA substrate systems were simulated using oxDNA, a nucleotide-level coarse-grained model of DNA in which each nucleotide is modelled as a rigid body. The oxDNA model has been described in detail (24, 64) and is implemented in a simulation package which is available for download (http://dna.physics.ox.ac.uk/). It was designed to reproduce the thermodynamic and mechanical properties of both single- and double-stranded DNA (24, 25), and has proven powerful in predicting the kinetics of the basic dynamical processes in DNA systems (27, 65, 66). Therefore, it is particularly suited for probing the structure and dynamics of the DNA substrates in this study. The oxDNA interaction potential consists of terms representing the backbone connectivity (modelled as a finitely-extensible nonlinear elastic spring), excluded volume, hydrogen bonding between Watson-Crick (WC) complementary base pairs, stacking between adjacent bases along the chain, coaxial stacking between non-adjacent bases, and cross stacking (Fig S5E). Aside from backbone connectivity and excluded volume, all interactions are anisotropic, depending on the relative orientation of the nucleotides. Orientational modulations of the stacking potential favors the bases to form coplanar stacks, and hydrogen bonding can occur between complementary WC base pairs when they are anti-aligned, leading to the formation of double-helical structures for which the helical twist arises from the different length scales of the backbone separation and the optimal stacking separation. Within oxDNA, the bases in the single-stranded DNA can stack/unstack and the strengths of hydrogen bonding and stacking interactions depend on the identities of the interacting bases (64). It has been parameterized for a NaCl concentration of 0.5 M, similar to the experimental buffer conditions in this study.

We performed 100 simulations of 10⁸ steps each for each of the gapped, nicked and duplex DNAs, with interaction energies and configurations sampled every 10³ steps. The time step was 0.005 simulation units, where one simulation unit implies a time of 3.03 ×10⁻¹² s. The temperature was set to 295 K, and an Andersen-like thermostat was used (67). Particle velocities were refreshed every 10³ steps from the Maxwell distribution corresponding to the simulation temperature, with fixed probabilities of 0.02 and 0.0067 for the linear and angular velocities, respectively.

The bend angle was calculated from the vectors placed along the midlines of the two helical segments, as described previously (68). The relative free energies were calculated from the MD trajectories, as follows: where A(|θ|) is the free energy, k_B is the Boltzmann constant, p(|θ|) is the observed probability density for the DNA adopting a bend angle |θ|, and | θ₀| is the reference bend angle, for which A(|θ₀|) = 0.

FRET efficiencies were calculated from the molecular dynamics trajectories, by adapting the accessible volume (AV) model for dye positions detailed above. Briefly, a grid of points was produced around the DNA base attached to the dye, with the spacing between grid points set to half the smallest dye dimension (see table above). Points were excluded if their distance to a base or backbone site, was smaller than the sum of the dye radius and the excluded volume radius of the base or backbone, respectively. This overlap check was repeated with the three different dye radii and the resulting AV clouds were combined (Fig S5G). A position that could accommodate all three dye radii was therefore weighted three times more than a position that could only accommodate one. The FRET efficiencies were averaged over all dye distances for each configuration and then again over all configurations in our molecular dynamics trajectories (of length ~15μs).

Single-molecule FRET measurements of DNAs in living bacteria. Gapped and duplex DNAs were internalized into electro-competent DH5α E. coli cells (Invitrogen) using electroporation (30). Cells were diluted 1:1 with sterile milli-Q water and stored at -80°C. For each electroporation experiment, 20 μL of electrocompetent cells were used. DNAs were stored in 2μΜ stocks in low-salt annealing buffer at -20 °C. For each experiment 0.25 μL of DNA and 0.2 μL of 50 mM EDTA were added to 20 μL electrocompetent cells and incubated on ice. The mixture of electro-competent cells and labeled DNAs was transferred into a pre-chilled electroporation cuvette (0.1 cm gap cuvette, Bio-Rad) and placed into an electroporator (MicroPulser, Bio-Rad). An electric field of 1.4 kV/cm was applied for electroporation. About 500 μL of super optimal broth with catabolite repression (SOC) was added immediately after electroporation. Cells were recovered for 3 min at 37°C. After recovery, cells were harvested by centrifugation at 3300 g for 1 min at 4°C and washed 5 times with 500 μL phosphate buffered saline (PBS). Cells were resuspended in 150 μL PBS and placed on 1% agarose pads before imaging. The agarose pads were made from ~300 μL of M9 medium containing 1% (v:w) BioRad Certified Molecular Biology Agarose on a coverslip. About 3 μL of cells were pipetted onto the agarose pad, and another coverslip was added on top. The slide/agar/slide sandwich was inverted and placed on the microscope with the side containing the cells closest to the objective.

Live-cell imaging was performed on a customized inverted Olympus IX-71 microscope equipped with a 532 nm DPSS laser (MGL_III-532-100mW, CNI). Laser light was collected into a single-mode optical fiber (Thorlabs, Newton, NJ, USA) and collimated before focusing on the objective. Cells were imaged using highly inclined thin illumination (HILO, Tokunaga et al., 2008) by adjusting the position of the focused excitation light on the back focal plane of the objective. Cellular fluorescence was collected through the same objective, filtered to remove excitation light through a long-pass filter (HQ545LP, Chroma) and a notch filter (NF02-633S, Semrock), and spectrally separated by a dichroic mirror (630DRLP, Omega). Donor and FRET channels were imaged onto separate halves of the chip of an electron-multiplying charge-coupled device camera (iXon+, BI-887, Andor). The illumination for brightfield images comprised a white-light lamp (IX2-ILL100, Olympus) and condenser (IX2-LWUCD, Olympus) attached to the microscope. Movies and images were recorded using manufacturer’s software. Measurements were performed in green continuous-wave mode using an excitation power density of 38 W/cm² and 20 ms exposure time.

Custom-written MATLAB software was used to analyze single-molecule tracking and diffusion in live E. coli as previously described (30, 40). Briefly, the PSFs in donor and FRET channels in each movie frame were fitted by a 2D elliptical Gaussian (free fit parameters: x/y position, x/y width, elliptical rotation angle, amplitude, background) using initial position guesses from applying a fixed localization-intensity threshold on the bandpass-filtered fluorescence image (69). Tracking was performed in the FRET channel by adapting the MATLAB script based on a published algorithm (70). Localized PSFs were linked to a trajectory if they appeared in consecutive frames within a window of 7 pixels (~ 0.69 μm). This window size ensures 98% of steps are correctly linked for an apparent diffusion coefficient of 1.0 μm²/s at 20 ms exposure time. To account for transient disappearance of the PSF within a trajectory due to blinking or missed localization, we used a memory parameter of 1 frame. To eliminate noise, only molecules appearing in 5 consecutive frames were included in the analysis. The donor channel was mapped onto the FRET channel using a transformation matrix. FRET values, E*, were obtained from co-localized PSFs by calculating the ratio of photon counts in the FRET channel over the sum of photon counts in both channels for each single-molecule (71); E* = pc_FRET/( pc_FRET +pc_Donor), pc_FRET/Donor: photon counts in FRET and donor channel, respectively.

Fluorescence overlay images were obtained by overlaying the donor and FRET fluorescence channels colored green and red, respectively. The green fluorescence channel was transformed onto the red fluorescence channel. All transformation matrices were based on a calibration matrix generated each day where fluorescent beads were mapped from the green onto the red fluorescence channel.

Quenchable FRET (quFRET) experiments. quFRET experiments were performed as per our standard smFRET confocal experiments (see above). DD, DA and AA photon streams were recorded and used to calculate the uncorrected FRET efficiencies (E* - equation S9) and stoichiometries (S* - equation S10) of filtered bursts, which correspond to individual molecules. These values were plotted as a two-dimensional histogram, and the number of bursts in the mid-S (0.4 < S* < 0.8) and low-S (S* < 0.4) regimes counted. One dimensional histograms of E* were produced from projections of the mid-S data onto the E* axis. The quFRET assay offers two related readouts for DNA melting: an increase in the absolute number of mid-S bursts, and an increase in the relative proportion of mid-S bursts compared with low-S bursts; the latter is a more robust measure, being independent of sample concentration and measurement time.

ACKNOWLEDGEMENTS

We thank Maria Musgaard for assistance with all-atom MD simulations, and are grateful for funding from: Lindemann Trust Fellowship (T.D.C.); Wellcome Trust (M.S.); German National Academic Foundation (Studienstiftung) and Phizackerley Senior Scholarship in Medical Sciences by Balliol College (A.P.); Marie Curie Career Integration Grant [#630992] (J.H.), UK EPSRC (A.P., M.M and J.P.K.D.); European Research Council (261227) and the UK BBSRC (BB/J00054X/1) (A.N.K.).

Footnotes

↵1 Co-first authors
Present addresses: TDC: Department of Chemistry, University of Sheffield, S3 7HF, United Kingdom, MM: School of Mathematics, University of Bristol, University Walk, Bristol BS8 1TW, United Kingdom. AP: Palo Alto Research Center, 3333 Coyote Hill Rd, Palo Alto, CA 94304, U. S. A.

REFERENCES

1.↵
von Hippel PH (1994) Protein-DNA Recognition: New Perspectives and Underlying Themes. Science 263(5148):769–770.
OpenUrl FREE Full Text
2.↵
Rohs R, et al. (2010) Origins of specificity in protein-DNA recognition. Annu Rev Biochem 79:233–69.
OpenUrl CrossRef PubMed Web of Science
3.↵
Tsutakawa SE, et al. (2011) Human flap endonuclease structures, DNA double-base flipping, and a unified understanding of the FEN1 superfamily. Cell 145(2):198–211.
OpenUrl CrossRef PubMed Web of Science
4.
Craggs TD, Hutton RD, Brenlla A, White MF, Penedo JC (2014) Single-molecule characterization of Fen1 and Fen1/PCNA complexes acting on flap substrates. Nucleic Acids Res 42(3):1857–1872.
OpenUrl CrossRef PubMed Web of Science
5.↵
Algasaier SI, et al. (2016) DNA and protein requirements for substrate conformational changes necessary for human flap endonuclease-1-catalyzed reaction. J Biol Chem 291(15):8258–8268.
OpenUrl Abstract/FREE Full Text
6.↵
Sawaya MR, Prasad R, Wilson SH, Kraut J, Pelletier H (1997) Crystal structures of human DNA polymerase beta complexed with gapped and nicked DNA: evidence for an induced fit mechanism. Biochemistry 36(37):11205–15.
OpenUrl CrossRef PubMed Web of Science
7.↵
Hutton RD, Craggs TD, White MF, Penedo JC (2010) PCNA and XPF cooperate to distort DNA substrates. Nucleic Acids Res 38(5):1664–75.
OpenUrl CrossRef PubMed Web of Science
8.↵
Sass LE, Lanyi C, Weninger K, Erie DA (2010) Single-molecule FRET TACKLE reveals highly dynamic mismatched DNA-MutS complexes. Biochemistry 49(14):3174–3190.
OpenUrl CrossRef PubMed Web of Science
9.↵
Cristóváo M, et al. (2012) Single-molecule multiparameter fluorescence spectroscopy reveals directional MutS binding to mismatched bases in DNA. Nucleic Acids Res:1–17.
10.↵
Santoso Y, et al. (2010) Conformational transitions in DNA polymerase I revealed by single-molecule FRET. Proc Natl Acad Sci U S A 107(2):715–20.
OpenUrl Abstract/FREE Full Text
11.↵
Hohlbein J, et al. (2013) Conformational landscapes of DNA polymerase I and mutator derivatives establish fidelity checkpoints for nucleotide insertion. Nat Commun 4:2131.
OpenUrl PubMed
12.↵
Lee NK, et al. (2005) Accurate FRET measurements within single diffusing biomolecules using alternating-laser excitation. Biophys J 88(4):2939–53.
OpenUrl CrossRef PubMed Web of Science
13.↵
Hohlbein J, Craggs TD, Cordes T (2014) Alternating-laser excitation: singlemolecule FRET and beyond. Chem Soc Rev 43(4):1156–71.
OpenUrl CrossRef PubMed
14.↵
Santoso Y, Hwang LC, Le Reste L, Kapanidis AN (2008) Red light, green light: probing single molecules using alternating-laser excitation. Biochem Soc Trans 36(Pt 4):738–44.
OpenUrl Abstract/FREE Full Text
15.↵
Markiewicz RP, Vrtis KB, Rueda D, Romano LJ (2012) Single-molecule microscopy reveals new insights into nucleotide selection by DNA polymerase I. Nucleic Acids Res: 1–10.
16.↵
Johnson SJ, Taylor JS, Beese LS (2003) Processive DNA synthesis observed in a polymerase crystal suggests a mechanism for the prevention of frameshift mutations. Proc Natl Acad Sci U S A 100(7):3895–900.
OpenUrl Abstract/FREE Full Text
17.↵
Singh K, Srivastava A, Patel SS, Modak MJ (2007) Participation of the fingers subdomain of Escherichia coli DNA polymerase I in the strand displacement synthesis of DNA. J Biol Chem 282(14):10594–604.
OpenUrl Abstract/FREE Full Text
18.↵
Kiefer JR, Mao C, Braman JC, Beese LS (1998) Visualizing DNA replication in a catalytically active Bacillus DNA polymerase crystal. Nature 391(6664):304–7.
OpenUrl CrossRef PubMed
19.↵
Cordes T, et al. (2010) Sensing DNA opening in transcription using quenchable Forster resonance energy transfer. Biochemistry 49(43):9171–80.
OpenUrl CrossRef PubMed Web of Science
20.
Robb NC, et al. (2013) The transcription bubble of the RNA polymerase-promoter open complex exhibits conformational heterogeneity and millisecond-scale dynamics: Implications for transcription start-site selection. J Mol Biol 425(5): 875–885.
OpenUrl CrossRef PubMed
21.↵
Robb NC, et al. (2016) Single-molecule FRET reveals the pre-initiation and initiation conformations of influenza virus promoter RNA. Nucleic Acids Res:gkw884.
22.↵
Lin S, Horning DP, Szostak JW, Chaput JC (2009) Conformational Analysis of DNA Repair Intermediates by Time-Resolved Fluorescence Spectroscopy. J Phys Chem A 113(35):9585–9587.
OpenUrl CrossRef PubMed
23.↵
Mills JB, Cooper JP, Hagerman PJ (1994) Electrophoretic evidence that singlestranded regions of one or more nucleotides dramatically increase the flexibility of DNA. Biochemistry 33(7):1797–803.
OpenUrl CrossRef PubMed Web of Science
24.↵
Ouldridge TE, Louis AA, Doye JPK (2011) Structural, mechanical, and thermodynamic properties of a coarse-grained DNA model. J Chem Phys 134(8):0–22.
OpenUrl
25.↵
Romano F, Chakraborty D, Doye JPK, Ouldridge TE, Louis AA (2013) Coarsegrained simulations of DNA overstretching. J Chem Phys 138(8):0–10.
OpenUrl
26.
Doye JPK, et al. (2013) Coarse-graining DNA for simulations of DNA nanotechnology. Phys Chem Chem Phys 15(47):20395–414.
OpenUrl CrossRef PubMed
27.↵
Srinivas N, et al. (2013) On the biophysics and kinetics of toehold-mediated DNA strand displacement. Nucleic Acids Res 41(22):10641–10658.
OpenUrl CrossRef PubMed Web of Science
28.↵
Sindbert S, et al. (2011) Accurate distance determination of nucleic acids via Forster resonance energy transfer: implications of dye linker length and rigidity. J Am Chem Soc 133(8):2463–80.
OpenUrl CrossRef PubMed Web of Science
29.↵
Kalinin S, et al. (2012) A toolkit and benchmark study for FRET-restrained high-precision structural modeling. Nat Methods 9(12):1218–25.
OpenUrl CrossRef PubMed
30.↵
Crawford R, et al. (2013) Long-lived intracellular single-molecule fluorescence using electroporated molecules. Biophys J 105(11):2439–2450.
OpenUrl CrossRef PubMed
31.
Plochowietz a, Crawford R, Kapanidis a N (2014) Characterization of organic fluorophores for in vivo FRET studies based on electroporated molecules. Phys Chem Chem Phys:1–3.
32.↵
Plochowietz A, Farrell I, Smilansky Z, Cooperman BS, Kapanidis AN (2016) In vivo single-RNA tracking shows that most tRNA diffuses freely in live bacteria. Nucleic Acids Res:gkw787.
33.↵
Ollis DL, Brick P, Hamlin R, Xuong NG, Steitz TA (1985) Structure of large fragment of Escherichia coli DNA polymerase I complexed with dTMP. Nature 313:762–766.
OpenUrl CrossRef PubMed Web of Science
34.↵
Beese LS, Derbyshire V, Steitz TA (1993) Structure of DNA polymerase I Klenow fragment bound to duplex DNA. Science (80-) 260(5106):352–355.
OpenUrl Abstract/FREE Full Text
35.↵
Yuan YC, Whitson RH, Liu Q, Itakura K, Chen Y (1998) A novel DNA-binding motif shares structural homology to DNA replication and repair nucleases and polymerases. Nat Struct Biol 5(11):959–64.
OpenUrl CrossRef PubMed Web of Science
36.↵
Yin YW, Steitz TA (2004) The Structural Mechanism of Translocation and Helicase Activity in T7 RNA Polymerase. Cell 116(3):393–404.
OpenUrl CrossRef PubMed Web of Science
37.↵
Turner RM, Grindley NDF, Joyce CM (2003) Interaction of DNA polymerase I (Klenow fragment) with the single-stranded template beyond the site of synthesis. Biochemistry 42(8):2373–85.
OpenUrl CrossRef PubMed
38.↵
Srivastava A, Singh K, Modak MJ (2003) Phe 771 of Escherichia coli DNA polymerase I (Klenow fragment) is the major site for the interaction with the template overhang and the stabilization of the pre-polymerase ternary complex. Biochemistry 42(13): 3645–3654.
OpenUrl CrossRef PubMed
39.↵
Thompson EHZ, Bailey MF, Van der Schans EJC, Joyce CM, Millar DP (2002) Determinants of DNA mismatch recognition within the polymerase domain of the Klenow fragment. Biochemistry 41(3):713–722.
OpenUrl CrossRef PubMed Web of Science
40.↵
Uphoff S, Reyes-Lamothe R, Garza de Leon F, Sherratt DJ, Kapanidis AN (2013) Single-molecule DNA repair in live bacteria. Proc Natl Acad Sci U S A 110(20):8063–8068.
OpenUrl Abstract/FREE Full Text
41.↵
Xu Y, Grindley ND, Joyce CM (2000) Coordination between the polymerase and 5’-nuclease components of DNA polymerase I of Escherichia coli. J Biol Chem 275(27):20949–55.
OpenUrl Abstract/FREE Full Text
42.
Bailey MF, Van Der Schans EJC, Millar DP (2007) Dimerization of the Klenow fragment of Escherichia coli DNA polymerase I is linked to its mode of DNA binding. Biochemistry 46(27):8085–8099.
OpenUrl CrossRef PubMed Web of Science
43.↵
Evans GW, Hohlbein J, Craggs T, Aigrain L, Kapanidis AN (2015) Real-time single-molecule studies of the motions of DNA polymerase fingers illuminate DNA synthesis mechanisms. Nucleic Acids Res 43(12):5998–6008.
OpenUrl CrossRef PubMed
44.↵
Yang Y, LiCata VJ (2011) Interactions of replication versus repair DNA substrates with the Pol I DNA polymerases from Escherichia coli and Thermus aquaticus. Biophys Chem 159(1):188–93.
OpenUrl PubMed
45.↵
Kapanidis AN, et al. (2004) Fluorescence-aided molecule sorting: analysis of structure and interactions by alternating-laser excitation of single molecules. Proc Natl Acad Sci U S A 101(24):8936–41.
OpenUrl Abstract/FREE Full Text
46.↵
Stryer L (1978) Fluorescence Energy Transfer as a Spectroscopic Ruler. Ann Rev Biochem 47(2):819–846.
OpenUrl CrossRef PubMed Web of Science
47.↵
Clegg RM (1992) Fluorescence resonance energy transfer and nucleic acids. Methods Enzymol 211:353–388.
OpenUrl CrossRef PubMed Web of Science
48.↵
1. ed
2. Lakowicz JR
Lakowicz JR (2006) Principles of Fluorescence Spectroscopy ed Lakowicz JR (Springer US, Boston, MA) doi:10.1007/978-0-387-46312-4.
OpenUrl CrossRef
49.↵
Würth C, Grabolle M, Pauli J, Spieles M, Resch-genger U (2013) Relative and absolute determination of fluorescence quantum yields of transparent samples. Nat Protoc 8(8):1535–50.
OpenUrl CrossRef PubMed
50.
Magde D, Wong R, Seybold PG (2002) Fluorescence Quantum Yields and Their Relation to Lifetimes of Rhodamine 6G and Fluorescein in Nine Solvents: Improved Absolute Standards for Quantum Yields. Photochem Photobiol 75(4):327–334.
OpenUrl CrossRef PubMed Web of Science
51.↵
van Dijk M, Bonvin AMJJ (2009) 3D-DART: A DNA structure modelling server. Nucleic Acids Res 37(SUPPL. 2). doi:10.1093/nar/gkp287.
OpenUrl CrossRef PubMed Web of Science
52.↵
Chinea G, Padron G, Hooft RWW, Sander C, Vriend G (1995) The Use of Position-Specific Rotamers in Model-Building by Homology. Proteins-Structure Funct Genet 23(3):415–421.
OpenUrl
53.↵
Vriend G (1990) WHAT IF: A molecular modeling and drug design program. J Mol Graph 8(1):52–56.
OpenUrl CrossRef PubMed Web of Science
54.↵
Hornak V, et al. (2006) Comparison of multiple amber force fields and development of improved protein backbone parameters. Proteins Struct Funct Genet 65(3):712–725.
OpenUrl CrossRef PubMed Web of Science
55.↵
Pérez A, et al. (2007) Refinement of the AMBER force field for nucleic acids: improving the description of alpha/gamma conformers. Biophys J 92(11):3817–29.
OpenUrl CrossRef PubMed Web of Science
56.↵
Guy AT, Piggot TJ, Khalid S (2012) Single-stranded DNA within nanopores: Conformational dynamics and implications for sequencing; A molecular dynamics simulation study. Biophys J 103(5):1028–1036.
OpenUrl CrossRef PubMed
57.↵
Lindorff-Larsen K, et al. (2010) Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins Struct Funct Bioinforma 78(8):1950–1958.
OpenUrl
58.↵
Hess B, Kutzner C, Van Der Spoel D, Lindahl E (2008) GRGMACS 4: Algorithms for highly efficient, load-balanced, and scalable molecular simulation. J Chem Theory Comput 4(3):435–447.
OpenUrl CrossRef PubMed Web of Science
59.↵
Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola a, Haak JR (1984) Molecular dynamics with coupling to an external bath. J Chem Phys 81:3684–3690.
OpenUrl CrossRef Web of Science
60.↵
Bussi G, Donadio D, Parrinello M (2007) Canonical sampling through velocity rescaling. J Chem Phys 126(1):14101.
OpenUrl CrossRef
61.↵
Parrinello M (1981) Polymorphic transitions in single crystals: A new molecular dynamics method. J Appl Phys 52(12):7182.
OpenUrl CrossRef PubMed Web of Science
62.↵
Darden T, York D, Pedersen L (1993) Particle mesh Ewald: An N-log(N) method for Ewald sums in large systems. J Chem Phys 98(12):10089.
OpenUrl CrossRef PubMed Web of Science
63.↵
Humphrey W, Dalke A, Schulten K (1996) VMD: Visual molecular dynamics. J Mol Graph 14(1):33–38.
OpenUrl CrossRef PubMed Web of Science
64.↵
Sulc P, et al. (2012) Sequence-dependent thermodynamics of a coarse-grained DNA model. J Chem Phys 137(13). doi:10.1063/1.4754132.
OpenUrl CrossRef PubMed
65.↵
Mosayebi M, Romano F, Ouldridge TE, Louis AA, Doye JPK (2014) The role of loop stacking in the dynamics of DNA hairpin formation. J Phys Chem B 118(49):14326–14335.
OpenUrl
66.↵
Ouldridge TE, Sulc P, Romano F, Doye JPK, Louis AA (2013) DNA hybridization kinetics: Zippering, internal displacement and sequence dependence. Nucleic Acids Res 41(19):8886–8895.
OpenUrl CrossRef PubMed Web of Science
67.↵
Russo J, Tartaglia P, Sciortino F (2009) Reversible gels of patchy particles: Role of the valence. J Chem Phys 131(1). doi:10.1063/1.3153843.
OpenUrl CrossRef PubMed
68.↵
Schreck JS, Ouldridge TE, Romano F, Louis AA, Doye JPK (2015) Characterizing the bending and flexibility induced by bulges in DNA duplexes. J Chem Phys 142(16). doi:10.1063/1.4917199.
OpenUrl CrossRef
69.↵
Holden SJ, et al. (2010) Defining the limits of single-molecule FRET resolution in TIRF microscopy. Biophys J 99(9):3102–11.
OpenUrl CrossRef PubMed Web of Science
70.↵
Crocker J, Grier D (1996) Methods of Digital Video Microscopy for Colloidal Studies. J Colloid Interface Sci 179(1):298–310.
OpenUrl CrossRef Web of Science
71.↵
Plochowietz A, El-Sagheer AH, Brown T, Kapanidis A. N (2016) Stable end-sealed DNA as robust nano-rulers for in vivo single-molecule fluorescence. Chem Sci 7:4418–4422.
OpenUrl CrossRef

View the discussion thread.

Posted February 10, 2018.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Biophysics

Subject Areas

All Articles

Animal Behavior and Cognition (5214)
Biochemistry (11745)
Bioengineering (8751)
Bioinformatics (29195)
Biophysics (14971)
Cancer Biology (12095)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14178)
Epidemiology (2067)
Evolutionary Biology (18306)
Genetics (12245)
Genomics (16801)
Immunology (11867)
Microbiology (28083)
Molecular Biology (11592)
Neuroscience (60965)
Paleontology (451)
Pathology (1870)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2885)
Systems Biology (7339)
Zoology (1651)

[1] 1.↵
von Hippel PH (1994) Protein-DNA Recognition: New Perspectives and Underlying Themes. Science 263(5148):769–770.
OpenUrl FREE Full Text

[2] 2.↵
Rohs R, et al. (2010) Origins of specificity in protein-DNA recognition. Annu Rev Biochem 79:233–69.
OpenUrl CrossRef PubMed Web of Science

[3] 3.↵
Tsutakawa SE, et al. (2011) Human flap endonuclease structures, DNA double-base flipping, and a unified understanding of the FEN1 superfamily. Cell 145(2):198–211.
OpenUrl CrossRef PubMed Web of Science

[4] 4.
Craggs TD, Hutton RD, Brenlla A, White MF, Penedo JC (2014) Single-molecule characterization of Fen1 and Fen1/PCNA complexes acting on flap substrates. Nucleic Acids Res 42(3):1857–1872.
OpenUrl CrossRef PubMed Web of Science

[5] 5.↵
Algasaier SI, et al. (2016) DNA and protein requirements for substrate conformational changes necessary for human flap endonuclease-1-catalyzed reaction. J Biol Chem 291(15):8258–8268.
OpenUrl Abstract/FREE Full Text

[6] 6.↵
Sawaya MR, Prasad R, Wilson SH, Kraut J, Pelletier H (1997) Crystal structures of human DNA polymerase beta complexed with gapped and nicked DNA: evidence for an induced fit mechanism. Biochemistry 36(37):11205–15.
OpenUrl CrossRef PubMed Web of Science

[7] 7.↵
Hutton RD, Craggs TD, White MF, Penedo JC (2010) PCNA and XPF cooperate to distort DNA substrates. Nucleic Acids Res 38(5):1664–75.
OpenUrl CrossRef PubMed Web of Science

[8] 8.↵
Sass LE, Lanyi C, Weninger K, Erie DA (2010) Single-molecule FRET TACKLE reveals highly dynamic mismatched DNA-MutS complexes. Biochemistry 49(14):3174–3190.
OpenUrl CrossRef PubMed Web of Science

[9] 9.↵
Cristóváo M, et al. (2012) Single-molecule multiparameter fluorescence spectroscopy reveals directional MutS binding to mismatched bases in DNA. Nucleic Acids Res:1–17.

[10] 10.↵
Santoso Y, et al. (2010) Conformational transitions in DNA polymerase I revealed by single-molecule FRET. Proc Natl Acad Sci U S A 107(2):715–20.
OpenUrl Abstract/FREE Full Text

[11] 11.↵
Hohlbein J, et al. (2013) Conformational landscapes of DNA polymerase I and mutator derivatives establish fidelity checkpoints for nucleotide insertion. Nat Commun 4:2131.
OpenUrl PubMed

[12] 12.↵
Lee NK, et al. (2005) Accurate FRET measurements within single diffusing biomolecules using alternating-laser excitation. Biophys J 88(4):2939–53.
OpenUrl CrossRef PubMed Web of Science

[13] 13.↵
Hohlbein J, Craggs TD, Cordes T (2014) Alternating-laser excitation: singlemolecule FRET and beyond. Chem Soc Rev 43(4):1156–71.
OpenUrl CrossRef PubMed

[14] 14.↵
Santoso Y, Hwang LC, Le Reste L, Kapanidis AN (2008) Red light, green light: probing single molecules using alternating-laser excitation. Biochem Soc Trans 36(Pt 4):738–44.
OpenUrl Abstract/FREE Full Text

[15] 15.↵
Markiewicz RP, Vrtis KB, Rueda D, Romano LJ (2012) Single-molecule microscopy reveals new insights into nucleotide selection by DNA polymerase I. Nucleic Acids Res: 1–10.

[16] 16.↵
Johnson SJ, Taylor JS, Beese LS (2003) Processive DNA synthesis observed in a polymerase crystal suggests a mechanism for the prevention of frameshift mutations. Proc Natl Acad Sci U S A 100(7):3895–900.
OpenUrl Abstract/FREE Full Text

[17] 17.↵
Singh K, Srivastava A, Patel SS, Modak MJ (2007) Participation of the fingers subdomain of Escherichia coli DNA polymerase I in the strand displacement synthesis of DNA. J Biol Chem 282(14):10594–604.
OpenUrl Abstract/FREE Full Text

[18] 18.↵
Kiefer JR, Mao C, Braman JC, Beese LS (1998) Visualizing DNA replication in a catalytically active Bacillus DNA polymerase crystal. Nature 391(6664):304–7.
OpenUrl CrossRef PubMed

[19] 19.↵
Cordes T, et al. (2010) Sensing DNA opening in transcription using quenchable Forster resonance energy transfer. Biochemistry 49(43):9171–80.
OpenUrl CrossRef PubMed Web of Science

[20] 20.
Robb NC, et al. (2013) The transcription bubble of the RNA polymerase-promoter open complex exhibits conformational heterogeneity and millisecond-scale dynamics: Implications for transcription start-site selection. J Mol Biol 425(5): 875–885.
OpenUrl CrossRef PubMed

[21] 21.↵
Robb NC, et al. (2016) Single-molecule FRET reveals the pre-initiation and initiation conformations of influenza virus promoter RNA. Nucleic Acids Res:gkw884.

[22] 22.↵
Lin S, Horning DP, Szostak JW, Chaput JC (2009) Conformational Analysis of DNA Repair Intermediates by Time-Resolved Fluorescence Spectroscopy. J Phys Chem A 113(35):9585–9587.
OpenUrl CrossRef PubMed

[23] 23.↵
Mills JB, Cooper JP, Hagerman PJ (1994) Electrophoretic evidence that singlestranded regions of one or more nucleotides dramatically increase the flexibility of DNA. Biochemistry 33(7):1797–803.
OpenUrl CrossRef PubMed Web of Science

[24] 24.↵
Ouldridge TE, Louis AA, Doye JPK (2011) Structural, mechanical, and thermodynamic properties of a coarse-grained DNA model. J Chem Phys 134(8):0–22.
OpenUrl

[25] 25.↵
Romano F, Chakraborty D, Doye JPK, Ouldridge TE, Louis AA (2013) Coarsegrained simulations of DNA overstretching. J Chem Phys 138(8):0–10.
OpenUrl

[26] 26.
Doye JPK, et al. (2013) Coarse-graining DNA for simulations of DNA nanotechnology. Phys Chem Chem Phys 15(47):20395–414.
OpenUrl CrossRef PubMed

[27] 27.↵
Srinivas N, et al. (2013) On the biophysics and kinetics of toehold-mediated DNA strand displacement. Nucleic Acids Res 41(22):10641–10658.
OpenUrl CrossRef PubMed Web of Science

[28] 28.↵
Sindbert S, et al. (2011) Accurate distance determination of nucleic acids via Forster resonance energy transfer: implications of dye linker length and rigidity. J Am Chem Soc 133(8):2463–80.
OpenUrl CrossRef PubMed Web of Science

[29] 29.↵
Kalinin S, et al. (2012) A toolkit and benchmark study for FRET-restrained high-precision structural modeling. Nat Methods 9(12):1218–25.
OpenUrl CrossRef PubMed

[30] 30.↵
Crawford R, et al. (2013) Long-lived intracellular single-molecule fluorescence using electroporated molecules. Biophys J 105(11):2439–2450.
OpenUrl CrossRef PubMed

[31] 31.
Plochowietz a, Crawford R, Kapanidis a N (2014) Characterization of organic fluorophores for in vivo FRET studies based on electroporated molecules. Phys Chem Chem Phys:1–3.

[32] 32.↵
Plochowietz A, Farrell I, Smilansky Z, Cooperman BS, Kapanidis AN (2016) In vivo single-RNA tracking shows that most tRNA diffuses freely in live bacteria. Nucleic Acids Res:gkw787.

[33] 33.↵
Ollis DL, Brick P, Hamlin R, Xuong NG, Steitz TA (1985) Structure of large fragment of Escherichia coli DNA polymerase I complexed with dTMP. Nature 313:762–766.
OpenUrl CrossRef PubMed Web of Science

[34] 34.↵
Beese LS, Derbyshire V, Steitz TA (1993) Structure of DNA polymerase I Klenow fragment bound to duplex DNA. Science (80-) 260(5106):352–355.
OpenUrl Abstract/FREE Full Text

[35] 35.↵
Yuan YC, Whitson RH, Liu Q, Itakura K, Chen Y (1998) A novel DNA-binding motif shares structural homology to DNA replication and repair nucleases and polymerases. Nat Struct Biol 5(11):959–64.
OpenUrl CrossRef PubMed Web of Science

[36] 36.↵
Yin YW, Steitz TA (2004) The Structural Mechanism of Translocation and Helicase Activity in T7 RNA Polymerase. Cell 116(3):393–404.
OpenUrl CrossRef PubMed Web of Science

[37] 37.↵
Turner RM, Grindley NDF, Joyce CM (2003) Interaction of DNA polymerase I (Klenow fragment) with the single-stranded template beyond the site of synthesis. Biochemistry 42(8):2373–85.
OpenUrl CrossRef PubMed

[38] 38.↵
Srivastava A, Singh K, Modak MJ (2003) Phe 771 of Escherichia coli DNA polymerase I (Klenow fragment) is the major site for the interaction with the template overhang and the stabilization of the pre-polymerase ternary complex. Biochemistry 42(13): 3645–3654.
OpenUrl CrossRef PubMed

[39] 39.↵
Thompson EHZ, Bailey MF, Van der Schans EJC, Joyce CM, Millar DP (2002) Determinants of DNA mismatch recognition within the polymerase domain of the Klenow fragment. Biochemistry 41(3):713–722.
OpenUrl CrossRef PubMed Web of Science

[40] 40.↵
Uphoff S, Reyes-Lamothe R, Garza de Leon F, Sherratt DJ, Kapanidis AN (2013) Single-molecule DNA repair in live bacteria. Proc Natl Acad Sci U S A 110(20):8063–8068.
OpenUrl Abstract/FREE Full Text

[41] 41.↵
Xu Y, Grindley ND, Joyce CM (2000) Coordination between the polymerase and 5’-nuclease components of DNA polymerase I of Escherichia coli. J Biol Chem 275(27):20949–55.
OpenUrl Abstract/FREE Full Text

[42] 42.
Bailey MF, Van Der Schans EJC, Millar DP (2007) Dimerization of the Klenow fragment of Escherichia coli DNA polymerase I is linked to its mode of DNA binding. Biochemistry 46(27):8085–8099.
OpenUrl CrossRef PubMed Web of Science

[43] 43.↵
Evans GW, Hohlbein J, Craggs T, Aigrain L, Kapanidis AN (2015) Real-time single-molecule studies of the motions of DNA polymerase fingers illuminate DNA synthesis mechanisms. Nucleic Acids Res 43(12):5998–6008.
OpenUrl CrossRef PubMed

[44] 44.↵
Yang Y, LiCata VJ (2011) Interactions of replication versus repair DNA substrates with the Pol I DNA polymerases from Escherichia coli and Thermus aquaticus. Biophys Chem 159(1):188–93.
OpenUrl PubMed

[45] 45.↵
Kapanidis AN, et al. (2004) Fluorescence-aided molecule sorting: analysis of structure and interactions by alternating-laser excitation of single molecules. Proc Natl Acad Sci U S A 101(24):8936–41.
OpenUrl Abstract/FREE Full Text

[46] 46.↵
Stryer L (1978) Fluorescence Energy Transfer as a Spectroscopic Ruler. Ann Rev Biochem 47(2):819–846.
OpenUrl CrossRef PubMed Web of Science

[47] 47.↵
Clegg RM (1992) Fluorescence resonance energy transfer and nucleic acids. Methods Enzymol 211:353–388.
OpenUrl CrossRef PubMed Web of Science

[48] 48.↵
ed
Lakowicz JR
Lakowicz JR (2006) Principles of Fluorescence Spectroscopy ed Lakowicz JR (Springer US, Boston, MA) doi:10.1007/978-0-387-46312-4.
OpenUrl CrossRef

[49] ed

[50] Lakowicz JR

[51] 49.↵
Würth C, Grabolle M, Pauli J, Spieles M, Resch-genger U (2013) Relative and absolute determination of fluorescence quantum yields of transparent samples. Nat Protoc 8(8):1535–50.
OpenUrl CrossRef PubMed

[52] 50.
Magde D, Wong R, Seybold PG (2002) Fluorescence Quantum Yields and Their Relation to Lifetimes of Rhodamine 6G and Fluorescein in Nine Solvents: Improved Absolute Standards for Quantum Yields. Photochem Photobiol 75(4):327–334.
OpenUrl CrossRef PubMed Web of Science

[53] 51.↵
van Dijk M, Bonvin AMJJ (2009) 3D-DART: A DNA structure modelling server. Nucleic Acids Res 37(SUPPL. 2). doi:10.1093/nar/gkp287.
OpenUrl CrossRef PubMed Web of Science

[54] 52.↵
Chinea G, Padron G, Hooft RWW, Sander C, Vriend G (1995) The Use of Position-Specific Rotamers in Model-Building by Homology. Proteins-Structure Funct Genet 23(3):415–421.
OpenUrl

[55] 53.↵
Vriend G (1990) WHAT IF: A molecular modeling and drug design program. J Mol Graph 8(1):52–56.
OpenUrl CrossRef PubMed Web of Science

[56] 54.↵
Hornak V, et al. (2006) Comparison of multiple amber force fields and development of improved protein backbone parameters. Proteins Struct Funct Genet 65(3):712–725.
OpenUrl CrossRef PubMed Web of Science

[57] 55.↵
Pérez A, et al. (2007) Refinement of the AMBER force field for nucleic acids: improving the description of alpha/gamma conformers. Biophys J 92(11):3817–29.
OpenUrl CrossRef PubMed Web of Science

[58] 56.↵
Guy AT, Piggot TJ, Khalid S (2012) Single-stranded DNA within nanopores: Conformational dynamics and implications for sequencing; A molecular dynamics simulation study. Biophys J 103(5):1028–1036.
OpenUrl CrossRef PubMed

[59] 57.↵
Lindorff-Larsen K, et al. (2010) Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins Struct Funct Bioinforma 78(8):1950–1958.
OpenUrl

[60] 58.↵
Hess B, Kutzner C, Van Der Spoel D, Lindahl E (2008) GRGMACS 4: Algorithms for highly efficient, load-balanced, and scalable molecular simulation. J Chem Theory Comput 4(3):435–447.
OpenUrl CrossRef PubMed Web of Science

[61] 59.↵
Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola a, Haak JR (1984) Molecular dynamics with coupling to an external bath. J Chem Phys 81:3684–3690.
OpenUrl CrossRef Web of Science

[62] 60.↵
Bussi G, Donadio D, Parrinello M (2007) Canonical sampling through velocity rescaling. J Chem Phys 126(1):14101.
OpenUrl CrossRef

[63] 61.↵
Parrinello M (1981) Polymorphic transitions in single crystals: A new molecular dynamics method. J Appl Phys 52(12):7182.
OpenUrl CrossRef PubMed Web of Science

[64] 62.↵
Darden T, York D, Pedersen L (1993) Particle mesh Ewald: An N-log(N) method for Ewald sums in large systems. J Chem Phys 98(12):10089.
OpenUrl CrossRef PubMed Web of Science

[65] 63.↵
Humphrey W, Dalke A, Schulten K (1996) VMD: Visual molecular dynamics. J Mol Graph 14(1):33–38.
OpenUrl CrossRef PubMed Web of Science

[66] 64.↵
Sulc P, et al. (2012) Sequence-dependent thermodynamics of a coarse-grained DNA model. J Chem Phys 137(13). doi:10.1063/1.4754132.
OpenUrl CrossRef PubMed

[67] 65.↵
Mosayebi M, Romano F, Ouldridge TE, Louis AA, Doye JPK (2014) The role of loop stacking in the dynamics of DNA hairpin formation. J Phys Chem B 118(49):14326–14335.
OpenUrl

[68] 66.↵
Ouldridge TE, Sulc P, Romano F, Doye JPK, Louis AA (2013) DNA hybridization kinetics: Zippering, internal displacement and sequence dependence. Nucleic Acids Res 41(19):8886–8895.
OpenUrl CrossRef PubMed Web of Science

[69] 67.↵
Russo J, Tartaglia P, Sciortino F (2009) Reversible gels of patchy particles: Role of the valence. J Chem Phys 131(1). doi:10.1063/1.3153843.
OpenUrl CrossRef PubMed

[70] 68.↵
Schreck JS, Ouldridge TE, Romano F, Louis AA, Doye JPK (2015) Characterizing the bending and flexibility induced by bulges in DNA duplexes. J Chem Phys 142(16). doi:10.1063/1.4917199.
OpenUrl CrossRef

[71] 69.↵
Holden SJ, et al. (2010) Defining the limits of single-molecule FRET resolution in TIRF microscopy. Biophys J 99(9):3102–11.
OpenUrl CrossRef PubMed Web of Science

[72] 70.↵
Crocker J, Grier D (1996) Methods of Digital Video Microscopy for Colloidal Studies. J Colloid Interface Sci 179(1):298–310.
OpenUrl CrossRef Web of Science

[73] 71.↵
Plochowietz A, El-Sagheer AH, Brown T, Kapanidis A. N (2016) Stable end-sealed DNA as robust nano-rulers for in vivo single-molecule fluorescence. Chem Sci 7:4418–4422.
OpenUrl CrossRef