Content area

Abstract

Recent single molecule experiments have determined the probability of loop formation in DNA as a function of the DNA contour length for different types of looping proteins. The optimal contour length for loop formation as well as the probability density functions have been found to be strongly dependent on the type of looping protein used. We show, using Monte Carlo simulations and analytical calculations, that these observations can be replicated using the wormlike-chain model for double-stranded DNA if we account for the nonzero size of the looping protein. The simulations have been performed in two dimensions so that bending is the only mode of deformation available to the DNA while the geometry of the looping protein enters through a single variable which is representative of its size. We observe two important effects that seem to directly depend on the size of the enzyme: 1), the overall propensity of loop formation at any given value of the DNA contour length increases with the size of the enzyme; and 2), the contour length corresponding to the first peak as well as the first well in the probability density functions increases with the size of the enzyme. Additionally, the eigenmodes of the fluctuating shape of the looped DNA calculated from simulations and theory are in excellent agreement, and reveal that most of the fluctuations in the DNA occur in regions of low curvature. [PUBLICATION ABSTRACT]

Full text

Turn on search term navigation
 
Headnote

ABSTRACT

Recent single molecule experiments have determined the probability of loop formation in DNA as a function of the DNA contour length for different types of looping proteins. The optimal contour length for loop formation as well as the probability density functions have been found to be strongly dependent on the type of looping protein used. We show, using Monte Carlo simulations and analytical calculations, that these observations can be replicated using the wormlike-chain model for double-stranded DNA if we account for the nonzero size of the looping protein. The simulations have been performed in two dimensions so that bending is the only mode of deformation available to the DNA while the geometry of the looping protein enters through a single variable which is representative of its size. We observe two important effects that seem to directly depend on the size of the enzyme: 1), the overall propensity of loop formation at any given value of the DNA contour length increases with the size of the enzyme; and 2), the contour length corresponding to the first peak as well as the first well in the probability density functions increases with the size of the enzyme. Additionally, the eigenmodes of the fluctuating shape of the looped DNA calculated from simulations and theory are in excellent agreement, and reveal that most of the fluctuations in the DNA occur in regions of low curvature.

INTRODUCTION

Since its discovery in the 1980s, enzyme-mediated DNA looping has been implicated as the key to many important biological processes. For example, the activity of the lac, gal, and λ-operons in E. coli is known to be regulated by the formation of DNA loops mediated by their respective repressor proteins (1). Similarly, the functioning of many restriction enzymes is known to be controlled by the formation of loops in DNA (2). A subclass of these enzymes called two-site restriction endonucleases efficiently cleave double-stranded DNA only if they interact with the DNA at two distant sites. In fact, a majority of reactions on DNA that include transcription, replication and repair, site-specific recombination etc., are mediated by multimeric proteins that interact with DNA at multiple sites (2). As a result, the biochemistry and biophysics of these reactions have been the subject of many experimental, computational, and theoretical investigations. A key question in this context is, "What molecular machinery or mechanism governs the rate at which two distant sites on the DNA are brought close to each other?"

The quest to address this question has produced several studies (3), through which a reasonably clear picture has emerged for the related process of DNA cyclization in which two sticky ends (short regions of single-stranded DNA with complementary basepairs) of a piece of linear double-stranded DNA are juxtaposed to produce a circular DNA loop in the absence of any mediating protein. The equilibrium constant for the cyclization reaction is governed by the length of the DNA involved (4). For DNA lengths longer than 300 basepairs (bp), this has been proved by the remarkable agreement of bulk biochemical experiments (5), Monte Carlo (MC) simulations (6), and theories based on the wormlikechain (WLC) models of DNA (4,7). There is still some debate (5,6) about the cyclization propensity of short (~100 bp) DNA fragments-the data from some bulk biochemical experiments have been explained on the basis of nonlinear models that require the formation of flexible hinges (or kinks) in the DNA (7,8) while those from another set of bulk experiments seem to agree quite well with the traditional WLC model of DNA, without any need for nonlinearities such as kinks or hinges (6).

On the other hand, enzyme-mediated DNA loops have been studied primarily by single molecule techniques that burst onto the scene approximately two decades ago. The majority of experiments involving DNA looping are carried out using the tethered particle assay in which one end of the DNA is immobilized by attaching it to a coverslip or to an optically trapped bead while the Brownian motion of the other end, also attached to a bead, reports on the formation/ breakage of enzyme-mediated loops (9). The bead at the other end can be trapped optically or magnetically (10), allowing for the possibility of exerting forces and moments on the DNA that can attenuate the rate of the looping reaction. This technique has been used to study the kinetics of formation/breakage of loops formed by the lac, gal, and λ-repressors (9-11) as well those by the restriction enzymes Nael and NarI (12). The constant formation/breakage of the loops (over timescales of ~10 s for Nael ( 12), for instance) in these experiments, which typically span several minutes or hours, ensures that this process is well described by equilibrium binding statistics. Once again, an important question that arises in this context concerns the effect of the length of the DNA loop on the rates of the forward/backward reaction or equivalently, on the equilibrium constant of looping. This question of length dependence was addressed in a recent single molecule experiment in which the probability of loop formation was measured as a function of DNA length for several two-site restriction enzymes (13). The key results of this experiment were that, 1), the probability of forming short DNA loops (~100 bp or less) is much higher than predicted by a theory based on the WLC theory of DNA mechanics alone; 2), the data agree better with theories of DNA with kinks and hinges; and 3), the probabilily density as well as the optimal loop length is highly dependent on the looping prolein. In this set of experiments, large forces were required to accelerate the rate of the loop breaking reaction for some proteins, implying that the results report on the probability of loop formation alone and not on the equilibrium constant of the loop formation/breakage reaction.

It is our goal in this article to explore a possible explanation for these observations by accounting for the geometry of the looping protein. We do not invoke nonlinear theories of DNA involving kinks or hinges. We also assume that the protein acts as a coupler and has no elasticity of its own. The calculations presented here have been carried out in two dimensions so that the only mode of deformation available to the DNA is bending in a plane. As a result, other sources of nonlinearities such as coupling between twisting and bending modes (14,15) are not considered in this model. In contrast to the work of Merlitz et al. (16). we also do not account for the electrostatic interaction and the stretching energy of the DNA. These calculations are a precursor to more comprehensive three-dimensional calculations where the DNA can bend and twist (15). An advantage of two-dimensional calculations is that the analytical theory remains tractable while not sacrificing the important concept of the competition between elasticity and entropy that governs the physics of DNA cyclization and looping reactions at equilibrium. For example, the peak in the Jacobson-Stockmayer factor (17) for DNA cyclization can be seen both in two- as well as threedimensional MC simulations although it is shifted to longer DNA lengths in the two-dimensional setting since entropie forces are relatively weaker in this case ( 18). We show in this article that the mere introduction of the span of the protein complex (denoted by the length scale a throughout this article) together with the competition of elastic and entropie forces results in probability density functions (probability of loop formation as function of length) that can vary significantly with protein geometry. A battery of MC methods have been employed to arrive at the probability density functions presented in this article. The details are explained in Simulation Methods. In some cases, we have also verified our MC calculations by comparison with analytical calculations based on the treatment of DNA as a fluctuating elastic rod.

We observe two important effects that seem to directly depend on the size of the protein complex: 1), the overall propensity of loop formation at any given value of the DNA contour length increases with the size of the protein complex; and 2), the contour length corresponding to the first peak as well as the first well in the probability density functions increases with the size of the protein complex. Another interesting outcome of the MC simulations of DNA loops presented in this article is the visualization of the fluctuating shape. For loop lengths which are small multiples of the DNA persistence length, we find that the shape fluctuates close to an equilibrium shape that can be calculated from the Kirchhoff theory of rods. The fluctuations around the equilibrium shape contribute to the configurational entropy. If the fluctuations are small enough, we can expand the elastic energy functional up to quadratic order in the fluctuations around equilibrium and obtain a fluctuation operator. The eigenmodes of this operator show us the collective motions of the DNA molecule. We have analytically calculated the slowest eigenmode of this fluctuation operator and compared our expressions with the results of a numerical eigenfunction analysis of the MC data. Remarkably, we find good agreement between the two methods. To our knowledge, this is the first time the shape fluctuations have been computed using analytical techniques for this problem. We note that a similar computation of eigenfunctions for boundary conditions involving a given force and zero moments at the ends was performed by Kulic et al. (19). Such shape fluctuations in macromolecules are now known to play a key role in determining the free energy change associated with binding two species (20).

View Image -

View Image -

View Image -

View Image - FIGURE 1 Schematic of prote in-mediated two-dimensional DNA loop; a is the size of the protein holding the loop.

FIGURE 1 Schematic of prote in-mediated two-dimensional DNA loop; a is the size of the protein holding the loop.

View Image -

View Image -

Figs. 2 and 3 report the equilibrium probability of loop formation P(L;a) for different values of L and a while Fig. 4 reports the equilibrium value of average opening angle (defined as π - 2θ^sub a^) over all conformations recorded as nils as a function of L/a.

View Image - FIGURE 2 Probability of loop formation P(L;a) plotted as a function of nondimensionalized length L/ξ^sub p^ for various values of the end-to-end distance a. The probability is peaked at L/ξ^sub p^ [asymptotically =] 5. There is also a second peak at much smaller values of L/ξ^sub p^, which is depicted in Fig. 3. A peak at L/ξ^sub p^ [asymptotically =] 5 is expected from the classical WLC model of DNA, which does not account for the presence of the prolein. The location of this peak shows only a weak dependence on a. Link length = 2.5 nm; tolerance in a 0.5 nm. Coefficient of variation of P(L;a) (not shown in the figure) is <1%.

FIGURE 2 Probability of loop formation P(L;a) plotted as a function of nondimensionalized length L/ξ^sub p^ for various values of the end-to-end distance a. The probability is peaked at L/ξ^sub p^ [asymptotically =] 5. There is also a second peak at much smaller values of L/ξ^sub p^, which is depicted in Fig. 3. A peak at L/ξ^sub p^ [asymptotically =] 5 is expected from the classical WLC model of DNA, which does not account for the presence of the prolein. The location of this peak shows only a weak dependence on a. Link length = 2.5 nm; tolerance in a 0.5 nm. Coefficient of variation of P(L;a) (not shown in the figure) is <1%.

Eigenmode calculation

Eigenmodes of the DNA thermal fluctuations can be extracted based upon the knowledge of various loop configurations. In our model, we sample DNA loop configurations from a constant length-constant separation-constant temperature ensemble. New loop conformations are generated from the existing one by crankshaft rotation (26). A subchain containing a random number of links is flipped about an axis joining the end points of this segment. This new conformation is selected with a probability of acceptance min[1, exp(-(E^sub new^ -E^sub old^)//k^sub B^T)] to satisfy the Metropolis criterion (27), where E^sub new^ and E^sub old^ are the energies of the new and old conformations, respectively, and the min function selects the minimum of the two terms in parenthesis. In our model, overlap of DNA segments is not allowed and therefore, trial moves generating loop-segment overlap (E^sub new^ = ∞) are automatically discarded by the acceptance criteria. The eigenmode calculations can be performed by either imposing fixed end-angles or variable end-angles in the simulation. However, the theoretical calculation of the first eigenmode (see Appendix) is performed for the case when the end-angles are fixed. Therefore, to make the explicil comparison with the theoretical result, we impose that the end-angles are fixed in our Metropolis MC simulations. Rigid body translation and rotation are removed by holding the end-points of the DNA loop fixed. Each MC run is carried out one billion times to ensure that the system reaches equilibrium and the properties (average energy) converge.

View Image -

To calculate the eigenmodes of DNA loop fluctuations from the MC data, a covariance matrix C^sub ij^ = [left angle bracket](r^sub i^ - [left angle bracket]r^sub i^[right angle bracket])(r^sub j^ - [left angle bracket]r^sub j^[right angle bracket])[right angle bracket] is constructed (29), where r^sub i^ is the position vector of each link, and [left angle bracket].[right angle bracket] represents average over conformations sampled from the MC run. Eigenvectors of this matrix represent the principal modes of loop fluctuations, while each eigenvalue indicates the squared amplitude of the fluctuations along each eigenmode. Because the eigenvectors are orthogonal, they represent independent modes (basis functions) for describing the collective DNA loop fluctuations in the equilibrium ensemble of the con formal ions.

View Image - FIGURE 3 Prob1ability of loop formation P(L;a) plotted as a function of nondimensionalized length L/ξ^sub p^ for various values of the end-to-end distance a. The presence of a new length-scale a imposed by the protein results in a second peak at small values of L. The WLC theory for cyclization does not predict this peak. The wells in the probability distributions correspond 10 lengths at which the elastic energy required to bend a short fragment of DNA to satisfy the constraint on end-to-end distance is a local maximum. The inset on the top shows that there is good correlation between the locations of the well, determined from the MC simulations versus the locations of maximum bending energy. The disagreement between these two calculations increases with increasing length due to the increasing effects of fluctuations. The inset in the bottom depicts the shape of a DNA loop when L [approximate] a. Link length = 1.0 nm; tolerance in a = 0.5 nm. Coefficient of variation of P(L;a) (not shown in the figure) is <1%.FIGURE 4 Mosf probable angle θ^sub a^ plotted as function of L/a. Error bars represent standard error in the reported values. As a [arrow right] 0, we see that θ^sub a^ [arrow right] 49.5°. which corresponds to a loop opening angle of 81° predicted by Shimada and Yamakawa (4). The most probable angle was obtained from the probability distribution of the end angles of the loops generated by the MC simulations. The line is the result of a calculation based on a minimization of elastic bending energy which predicts thai the optimal loop is the one whose curvatures are zero at the ends. This condition corresponds to a situation in which the protein exerts no moments on the DNA. The inset shows the energy of an elastic rod plotted as a function of θ^sub a^ for L = 5ξ^sub p^ and two different values of L/a. In both the panels we also plot -log(P(θ^sub a^:L/a)) + C. where C is an arbitrary constant using data from MC simulations and find good agreement. We note that the energy wells in both the panels are shallow (which implies that we should expect a large variance), which explains why the MC data for most probable θ^sub a^ for large values of L/a does not agree too well with the curve.

FIGURE 3 Prob1ability of loop formation P(L;a) plotted as a function of nondimensionalized length L/ξ^sub p^ for various values of the end-to-end distance a. The presence of a new length-scale a imposed by the protein results in a second peak at small values of L. The WLC theory for cyclization does not predict this peak. The wells in the probability distributions correspond 10 lengths at which the elastic energy required to bend a short fragment of DNA to satisfy the constraint on end-to-end distance is a local maximum. The inset on the top shows that there is good correlation between the locations of the well, determined from the MC simulations versus the locations of maximum bending energy. The disagreement between these two calculations increases with increasing length due to the increasing effects of fluctuations. The inset in the bottom depicts the shape of a DNA loop when L [approximate] a. Link length = 1.0 nm; tolerance in a = 0.5 nm. Coefficient of variation of P(L;a) (not shown in the figure) is <1%.FIGURE 4 Mosf probable angle θ^sub a^ plotted as function of L/a. Error bars represent standard error in the reported values. As a [arrow right] 0, we see that θ^sub a^ [arrow right] 49.5°. which corresponds to a loop opening angle of 81° predicted by Shimada and Yamakawa (4). The most probable angle was obtained from the probability distribution of the end angles of the loops generated by the MC simulations. The line is the result of a calculation based on a minimization of elastic bending energy which predicts thai the optimal loop is the one whose curvatures are zero at the ends. This condition corresponds to a situation in which the protein exerts no moments on the DNA. The inset shows the energy of an elastic rod plotted as a function of θ^sub a^ for L = 5ξ^sub p^ and two different values of L/a. In both the panels we also plot -log(P(θ^sub a^:L/a)) + C. where C is an arbitrary constant using data from MC simulations and find good agreement. We note that the energy wells in both the panels are shallow (which implies that we should expect a large variance), which explains why the MC data for most probable θ^sub a^ for large values of L/a does not agree too well with the curve.

Fig. 5 reports the calculated shape of the first (slowest) eigenmode resulting from the covariance analysis (see above).

Validation of the quasiharmonic assumption

To calculate the eigenfunctions of the fluctuation operator. T (see Eigenmode Calculation), we expanded the potential energy functional to quadratic order in δθ, thus treating the DNA loop as a quasihamionic system. In this section, we describe a method to validate this assumption by comparing the configuralional density of stales (DOS) of the DNA loop against that of n-independent harmonic oscillators. To this end, we use the DOSMC method, developed by Wang and Landau (30), to calculate DOS of the DNA loop. DOSMC is an enhancement over conventional MC techniques since it directly produces the DOS, g(E) instead of the canonical distribution g(E)e^sup -(E/kBT)^ generated by conventional techniques. DOSMC achieves this task by performing a random walk in energy space instead of random walk in the conformational space. Starting from g(E) = 1 and energy histogram, h(E) = 0, random walks in the energy space are performed by generating new loop conformations by crankshaft rotation (see Validation of the Quasiharmonic Assumplion). The new conformation is accepted with a probability min[(g(E^sub old^)/g(E^sub new^)), 1]. Each time an energy state is visited, the corresponding DOS and energy histogram are updated according to g(E) = g(E) × f and H(E) = h(E) + 1, where f is a modification factor > 1 (in our simulations, we take f = e^sup 1^). The random walk in energy space is continued until the accumulated energy histogram is flat within a predefined tolerance (we define a histogram to be flat when h(E) is within ±5% of average h(E)). To increase the accuracy of g(E) (which is proportional to ln f), f is reduced according to the rule f^sub new^ = [radical]f^sub old^. and the histogram is reset to zero, i.e., h(E) = 0. These steps are performed until the desired accuracy in g(E) is obtained. In this work, simulations are performed until/reduces to 10^sup -7^. To speed up the simulations, the energy space is divided into overlapping energy windows. Any walk outside the corresponding energy window is rejected. To satisfy the boundary condition imposed by Eq. 3, the energy cost to change the terminal angle that the last/first link makes with the positive ? axis is set to zero. At the end. resultant pieces of g(E) m the respective windows are merged together so as to minimize the error between g(E) in the overlapping regions. The obtained g(E') is an accurate estimate of the configurational DOS of the system up to a constant multiplicative factor.

View Image -

View Image -

View Image - FIGURE 5 The first eigenmode of the fluctuating loop obtained from MC simulations. The solid line represents the mean configuration and the dashed line represents the deformation due to the fluctuations along the tirst eigenniode. The end-to-end distance of the loop is fixed and so are the angles made by the cangenis (to the x axis) at the ends. The insel shows the corresponding change in lhe tangenl angle off as a function of the arc-length s calculated using theory (solid tine plotted using Eq. 23) and using MC simulations (dotted line) calculated as described in Eigenmode Calculation.FIGURE 6 DOS for the 200-nm fluctuating DNA loop plotted as a function of the energy. The inset shows the DOS exponent as a function of the nondimensionalized length L/ξ^sub p^. The excellent agreement heiween the slope predicted from quasihurmonic theory of independent oscillators with that from DOSMC simulations shows that expanding the energy up to quadratic order in the fluctuations in θ (s) is a good approximation for the lengths of the DNA considered in this article.

FIGURE 5 The first eigenmode of the fluctuating loop obtained from MC simulations. The solid line represents the mean configuration and the dashed line represents the deformation due to the fluctuations along the tirst eigenniode. The end-to-end distance of the loop is fixed and so are the angles made by the cangenis (to the x axis) at the ends. The insel shows the corresponding change in lhe tangenl angle off as a function of the arc-length s calculated using theory (solid tine plotted using Eq. 23) and using MC simulations (dotted line) calculated as described in Eigenmode Calculation.FIGURE 6 DOS for the 200-nm fluctuating DNA loop plotted as a function of the energy. The inset shows the DOS exponent as a function of the nondimensionalized length L/ξ^sub p^. The excellent agreement heiween the slope predicted from quasihurmonic theory of independent oscillators with that from DOSMC simulations shows that expanding the energy up to quadratic order in the fluctuations in θ (s) is a good approximation for the lengths of the DNA considered in this article.

RESULTS AND DISCUSSION

The main message of this article is that the probability of loop formation in DNA is affected by the geometry of the looping protein. This result is manifest in Figs. 2-4. Fig. 2 shows the probability of loop formation P(L;a) as a function of the length L of the loop and the size of the protein complex a. As expected from the classical WLC model (33) of DNA there is a peak in the probability of loop formation for L/ξ^sub p^ [approximate] 5. This is a result of the competition between elastic bending and entropy. The probability is not much affected by the protein size a at these lengths, since a [much less than] L. Similar conclusions were reported also by Merlitz et al. (16), who showed (using a Brownian dynamic simulation) that the effect of the finite size of the looping protein is most dramatic for contour lengths <300 bp and small for lengths >500 bp. This does not imply, however, that the size of the protein complex is irrelevant for these loop lengths. This can be better appreciated from Fig. 4. which summarizes the effect of protein size on the value of the loop opening angle. For example, the optimal opening angle of a DNA loop is known to be 81° when a [arrow right] 0 (4), but for a = 10 nm at L [approximate] 250 nm we find an optimal opening angle of 75°. Fig. 4 also suggests that the most probable shape of the loop corresponds to the case in which the curvatures at the ends are zero. Evidence for this assertion comes from the strong correlation between the continuous line obtained from an argument resting on the minimization of elastic energy of the loop and the data obtained from MC simulations, and the fact that an opening angle of 81 ° for a = 0 calculated by Shimada and Yamakawa (4) does actually correspond to the zero end-curvature condition. This observation implies that the most probable loop shape is one in which the protein exerts no moments on the DNA at their points of contact. The agreement between the curve obtained from the elastic calculation and the data obtained from MC simulations seems to get poorer as L [arrow right] ∞. The reason for this can be understood by looking at the insets of Fig. 4. The continuous lines in the inset were obtained by calculating (following (14)) the elastic energy of the loop as a function of the end-angle θ^sub a^ for L = 5ξ^sub p^ and two different values of a. The open circles are data from MC simulations for the same values of L and a. The probabilities were converted into energies (up to an additive constant) through the Boltzmann law. It is remarkable that the data from the MC simulations agree so well with the elasticity calculation. This suggests that the shapes of the loop corresponding to different values of the fluctuating variable θ^sub a^ are such that the corresponding energies are not too different from the equilibrium shape for those boundary conditions. We also see that for large values of L/a the probability of having an end-angle θ^sub a^ is peaked at the value of θ^sub a^ corresponding to zero end moments. However, the energy well is shallow, implying that the variance is large. This is the reason behind the relatively poorer agreement between the two methods used for determining the most probable value of the end angles. One has to do an impractically large MC calculation to obtain better agreement.

The most significant effects of the size of the protein complex are felt at small values of the length L. The probability of loop formation is peaked at values of L that are comparable to a as seen from Fig. 3. This peak is significantly higher than the peak observed at L/ξ^sub p^ [asymptotically =] 5 and has not been predicted by the classical WLC model of DNA. Some researchers have suggested that looping probabilities will necessarily be high when the DNA contour length is comparable to the span of the protein complex, but a quantitative prediction is still lacking (34). In fact, most studies which predict high probability of loop formation at short DNA lengths do so only after the introduction of defects, such as, kinks or hinges in the DNA, thus deviating from the WLC model (7,34-36). A notable exception is a study by Merlitz et al. (16) which shows, through Brownian Dynamics simulations based on the classical WLC model of DNA, that the probability of loop formation is enhanced > 10-fold at L [asymptotically =] 40 nm when we go from a = 0 to a = 10 nm. They also analyzed the effects of nonlinearities such as permanent bends in the DNA, and showed how these defects can greatly enhance looping probabilities and rate constants for contour lengths L in the interval 40 nm < L < 100 nm for various values of the span a. Merlitz et al. do not report results for lengths shorter than 40 nm, but it would not be unreasonable to expect that to obtain high looping probabilities in this regime would require introduction of non linearities in the DNA. However, this is exactly the regime where we have obtained a second peak and valley in the looping probabilities. In the light of this observation the significance of the results summarized in Fig. 3 is that high looping probabilities for short DNA contour lengths (L < 40 nm) can be explained with the classical WLC model of DNA (without nonlinearities such as kinks or permanent bends) if we account for the geometry of the looping protein. At these short contour lengths, shape fluctuations make only a small contribution to the free energy so that the peak in probability is simply a result of the low elastic bending energy required to satisfy the constraint on the end-to-end distance placed by the looping protein. In fact, the location of the well in the probability distribution between the two peaks (at L [asymptotically =] a and L [asymptotically =] 5ξ^sub p^) is strongly correlated with the length at which the elastic bending energy has a local maximum (see Fig. 3, inset).

The results summarized in Figs. 2 and 3 could also provide an alternative interpretation for the experimental results of Smith et al. (13). In this experiment, the probability of loop formation was measured as a function of the length of the loop for several enzymes which interact with DNA at two separate sites (13). The main results of these experiments were that the probability distribution was different for different proteins and that looping at short contour lengths was far more probable than predicted by the WLC theory alone. The authors had also found two peaks in the probability distribution for looping by some proteins. Qualitatively similar observations in bulk experiments were made by Reuter et al. (37), who found that the propensity of cutting by certain two-site restriction enzymes (EcoRII) was peaked at two different contour lengths with the highest propensity occurring at the peak at short lengths. They had suggested that at short contour lengths the DNA is slightly bent to meet the constraints placed by the enzyme while at longer lengths it was looped. All of these observations are replicated in our model which accounts for the effects of protein size. A direct comparison of our results with those of Smith et al. (13) is not possible, since our calculations have been carried out only in two dimensions, whereas the experiments are fully three-dimensional. Also, despite our results which rely solely on an elastic rod model of DNA, the possibility of kink or hinge formation at high curvatures still remains open.

An important by-product of our MC simulations is that we have decomposed the fluctuating shapes of the loop into eigenmodes. Such a decomposition is possible when the fluctuations around equilibrium are small so that the energy of an arbitrary shape can be expressed as the sum of the energy of the equilibrium shape and a term that is quadratic in the small fluctuations. For the case of the DNA loop, the shape can be written in terms of the angle θ (s), which is the angle made by the tangent to the loop to the positive x axis. Fig. 5 shows the deviations in the shape of the loop and the angle δθ (s) as a function of the arc-length s. The first eigenmode (corresponding to the largest eigenvalue of covariance matrix) is shown together with comparison to an analytical result. The analytical calculation is performed in a slightly different context in which the force at the ends (as opposed to the end-to-end distance) as well as the angles made by the tangents at the ends are held fixed. Despite this difference in the boundary condition, the theory and simulations yield similar variation for the change in the tangent angle along the arc length of the DNA (see Fig. 5, inset). Movies showing the projection of the MC data on the two slowest eigenmodes are available as Supplementary Material data. Both the results show that the shape fluctuations are large in the regions of the loop which are nearly straight (low curvature) and small in the highly curved regions. This would imply that the entropic contributions to the free energy of the loop have their origin in the low curvature regions. A similar conclusion was also reached by Fain et al. (38) in their analysis of plectonemes in DNA where it was determined that most of the free energy of the plectonemes was elastic bending and twisting energy while the entropic part was always negligible. To the best of the authors' knowledge, this is the first report on the fluctuating modes of a DNA loop subjected to clamped boundary conditions. Calculations such as these could be important building blocks for determining the free energies of binding/unbinding reactions of biological entities which have only recently been shown to depend strongly on configurational entropy.

Finally, from our DOSMC simulations, we have confirmed that expanding the potential energy of the DNA loop to quadratic order in fluctuations is a good approximation (see Fig. 6). The assumption of quasiharmonicity simplifies a variety of thermodynamic property calculations, the most prominent example being the entropy. Based on the conformational sampling of metropolis MC and its subsequent eigenvector decomposition, we can calculate the quasiharmonic configurational entropy of the DNA loop (29). Furthermore, the DOS can be directly used to compute the free energy and entropy, quantities which are not directly available in conventional MC methods.

CONCLUSIONS

In this article, we have summarized the effects of the size of the mediating protein on the propensity of loop formation in DNA. Many of the qualilative features observed in recent single molecule experiments on enzyme-mediated DNA looping are reproduced by the WLC theory if we take into account the nonzero size of the looping enzyme. Two important effects that seem to directly depend on the size of the enzyme complex are that, 1), the overall propensity of loop formation at any given value of the DNA contour length increases with the size of the enzyme complex; and 2), the contour length corresponding to the first peak as well as the first well in the probability density functions increases with the size of the enzyme complex. These qualitative features of the results can be readily tested by performing the looping experiments with looping proteins of known sizes. Also, of special interest are the eigenmodes of DNA fluctuations. Our theoretical calculations and MC simulations have shown that the fluctuations in the DNA are large where the curvature is small. Perhaps this observation can also be verified from experiments where realtime motions of DNA are recorded (39).

SUPPLEMENTARY MATERIAL

To view all of the supplemental files associated with this article, visit www.biophysj.org.

We thank Philip Nelson for some insightful discussions and Douglas E. Smith for explaining some details about the experiments on DNA looping by two-site restriction enzymes.

R.R. acknowledges funding from National Science Foundation grant No. CBET-0730955 and supercompuiing resource allocution grant No. MCB06W06 from NPACI.

References

REFERENCES

1. Schleif, R. 1992. DNA looping. Annu. Rev. Biochem. 61:199-223.

2. Halford, S. E., A. J. Welsh, and M. D. Szczelkun. 2004. Enzyme-mediated DNA looping. Annu. Rev. Biophys. Biomol. Struct. 33:1-24.

3. Garcia, H. G., P. Grayson, L. Han, M. Inamdar. J. Kondev. P. C. Nelson. R. Phillips. J. Widom, and P. A. Wiggins. 2007. Biological consequences of tightly bent DNA: the other life of a macromolecular celebrity. Biopolymers. 85:115-130.

4. Shimada, J., and H. Yamakawa. 1984. Ring-closure probabilities for twisted wormlike chains-application to DNA. Macromolecules. 17:689-698.

5. Cloutier, T. E., and J. Widom. 2004. Spontaneous sharp bending of double-stranded DNA. Mol. Cell. 14:355-362.

6. Du, Q., C. Smith, N. Shiffeldrim, M. Vologodskaia, and A. Vologodskii. 2005. Cyclization of short DNA fragments and bending fluctuations of the double helix. Proc. Natl. Acad. Sci. USA. 102:5397-5402.

7. Wiggins, P. A., R. Phillips, and P. C. Nelson. 2005. Exact theory of kinkable elastic polymers. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 71:021909.

8. Lankas, F., R. Lavery, and J. H. Maddocks. 2006. Kinking occurs during molecular dynamics simulations of small DNA minicircles. Structure. 14:1527-1534.

9. Finzi, L., and J. Gelles. 1995. Measurement of lactose repressor-mediated loop formation and breakdown in single DNA molecules. Science. 267:378-380.

10. Lia, G., D. Bensimon, V. Croquette, J.-F. Allemand, D. Dunlap, D. E. A. Lewis, S. Adhya, and L. Finzi. 2003. Supercoiling and denaturation in Gal repressor/heat unstable nucleoid protein (HU)-mediated DNA looping. Proc. Natl. Acad. Sci. USA. 100:11373-11377.

11. Zurla, C., A. Franzini, G. Galli. D. D. Dunlap. D. E. A. Lewis. S. Adhya. and L. Finzi. 2006. Novel tethered particle motion analysis of Cl protein-mediated DNA looping in the regulation of bacteriophage-λ. J. Phys. Condens. Matter. 18:8225-8234.

12. van den Broek, B., F. Vanzi, D. Normanno, F. S. Pavone, and G. J. L. Wuite. 2006. Real-time observation of DNA looping dynamics of type HE restriction enzymes NaeI and NarI. Nucleic Acids Res. 34:167-174.

13. Gemmen, G. J., R. Millin, and D. E. Smith. 2006. DNA looping by two-site restriction endonucleases: heterogeneous probability distributions for loop size and unbinding force. Nucleic Acids Res. 34:2864-2877.

14. Purohit, P. K., and P. C. Nelson. 2006. Effect of supercoiling on formation of protein-mediated DNA loops. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 74:061906.

15. Czapla, L., D. Swigon, and W. K. Olson. 2006. Sequence-dependent effects in the cyclization of short DNA. J. Chem. Theory Comput. 2:685-695.

16. Merlitz, H., K. Rippe, K. V. Klenin, and J. Langowski. 1998. Looping dynamics of linear DNA molecules and the effect of DNA curvature: a study by Brownian dynamics simulation. Biophys. J. 74:773-779.

17. Jacobson, H., and W. H. Stockmayer. 1950. Intramolecular reaction in polycondensations. 1. The theory of linear systems. J. Chem. Phys. 18:1600-1606.

18. Kindt, J. T. 2002. Pivot-coupled grand canonical Monte Carlo method for ring simulations. J. Chem. Phys. 116:6817-6825.

19. Kulic, I. M., H. Mohrbach, V. Lobaskin, R. Thaokar, and H. Schiesset. 2005. Apparent persistence length renormalization of bent DNA. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 72:041905.

20. Frederick, K. K., M. S. Marlow, K. G. Valentine, and A. J. Wand. 2007. Conformational entropy in molecular recognition by proteins. Nature. 448:325-330.

21. Marko, J. F., and E. D. Siggia. 1995. Stretching DNA. Macromolecules. 28:8759-8770.

22. Friedman, A. M., T. O. Fischmann, and T. A. Steitz. 1995. Crystalstructure of Lac repressor core tetramer and its implications for DNA looping. Science. 268:1721-1727.

23. Mehta, R. A., and J. D. Kahn. 1999. Designed hyperstable Lac repressor center dot DNA loop topologies suggest alternative loop geometries. J. Mol. Biol. 294:67-77.

24. Harmer, T., M. Wu. and R. Schleif. 2001. The role of rigidity in DNA looping-unlooping by AraC. Proc. Natl. Acad. Sci. USA. 98:427-431.

25. Klenin, K., H. Merlitz, and J. Langowski. 1998. A Brownian dynamics program for the simulation of linear and circular DNA and other wormlike chain polyelectrolytes. Biophys. J. 74:780-788.

26. Vologodskii, A. V., S. D. Levene, K. V. Klenin, M. Frank-Kamenetskii, and N. R. Cozzarelli. 1992. Conformational and mermodynamic properties of supercoiled DNA. J. Mol. Biol. 227:1224-1243.

27. Allen, M. P., and D. J. Tildesley. 1987. Computer Simulation of Liquids. Oxford Science, Oxford.

28. Hoffman, J. D. 1992. Numerical Methods for Engineers and Scientists. McGraw-Hill, New York.

29. Andricioaei, I., and M. Karplus. 2001. On the calculation of entropy from covariance matrices of the atomic fluctuations. J Chem. Phys. 115:6289-6292.

30. Wang, F. G., and D. P. Landau. 2001. Efficient, multiple-range random walk algorithm to calculate the density of states. Phys. Rev. Lett. 86:2050-2053.

31. Reif, F. 1965. Fundamentals of Statistical and Thermal Physics, Ch. 2. McGraw-Hill, Singapore.

32. Pathria, R. K. 1996. Statistical Mechanics. Butterworth Heinemann, Oxford.

33. Yamakawa, H. 1971. Modern Theory of Polymer Solutions. Harper and Row, New York.

34. Douarche, N., and S. Cocco. 2005. Protein-mediated DNA loops: effects of protein bridge size and kinks. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 72:061902.

35. Sankararaman, S., and J. F. Marko. 2005. Formation of loops in DNA under tension. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 71:021911.

36. Rippe, K. 2001. Making contacts on a nucleic acid polymer. Trends Biochem. Sci. 26:733-740.

37. Reuter, M., D. Kupper, A. Meisel, C. Schroeder, and D. H. Kruger. 1998. Cooperative binding properties of restriction endonuclease EcoRII with DNA recognition sites. J. Biol. Chem. 273:8294-8300.

38. Fain, B., J. Rudnick, and S. Ostlund. 1997. Conformations of linear DNA. Phys. Rev. E Stat. Phys. Plasmas Fluids Relat. Interdiscip. Topics. 55:7364-7368.

39. Perkins, T. T., S. R. Quake, D. E. Smith, and S. Chu. 1994. Relaxation of a single DNA molecule observed by optical microscopy. Science. 264:822-826.

40. Sutherland, B. 1973. Some exact results for one-dimensional models of solids. Phys. Rev. A. 8:2514-2516.

AuthorAffiliation

Neeraj J. Agrawal,* Ravi Radhakrishnan,[dagger] and Prashant K. Purohit[double dagger]

* Chemical and Biomolecular Engineering, [dagger] Bioengineering, and [double dagger] Mechanical Engineering and Applied Mechanics, University of Pennsylvania, Philadelphia, Pennsylvania

AuthorAffiliation

Submitted September 28, 2007, and accepted for publication November 30, 2007.

Address reprint requests to Prashant K. Purohit, E-mail: purohit@seas. upenn.edu.

Editor: Taekjip Ha.

View Image - APPENDIX: FLUCTUATION OPERATOR

APPENDIX: FLUCTUATION OPERATOR

View Image - APPENDIX: FLUCTUATION OPERATOR

APPENDIX: FLUCTUATION OPERATOR

View Image - APPENDIX: FLUCTUATION OPERATOR

APPENDIX: FLUCTUATION OPERATOR

Copyright Biophysical Society Apr 15, 2008