1. Introduction
Entropy is often described as a measure of disorder. Its importance is that the free energy of a system involves a trade-off between the entropy and the energy. The simplest example of this is the Helmholtz free energy , which applies to a system with a fixed number of particles and a fixed volume at constant temperature. Because of the complexity of biological systems, there are many instances in which the term of the free energy either dominates the energy term, or at least plays an important role in the free energy minimization that determines equilibrium and drives the dynamics of the system.
As a consequence, the determination of the optimal structure of many biologically important systems requires minimization of the free energy, and hence this competition between the internal energy and entropy. Often in biological systems the changes in entropy are greater than the changes in internal energy. For instance, in the hydrophobic effect, an unfolded protein lowers the entropy by ordering the water molecules, and so the protein prefers to be in the folded state[
1,
2]. In the chloroplast stroma, it has been shown that there is an entropy-driven attraction that determines chloroplast ultrastructure through spontaneous Mg
-induced stacking of membranes[
3].
In sickle hemoglobin, the aggregation of monomers into polymers[
4] is also entropy driven, with the internal energy and entropy in a delicate balance. In fact, Cao and Ferrone [
5] measured the chemical potential
, which is the "glue" that holds the polymer together, and found that the entropic contribution to this chemical potential is around
kcal/mol, which is more than 100% of the chemical potential itself. Another example is F-actin, whose fibrils stack in cross-linked rafts when positive alkaline earth ions are in the solution[
6]. In F-actin counter-ions form one-dimensional charge density waves that have a periodicity equal to twice the actin monomer spacing, coupling to twist distortions of the oppositely-charged actin filaments[
7]. These phenomena and others owe their existence to Coulomb interactions between the constituent parts, and the geometry of the underlying structure can also play a vital role.
Thermodynamically, the distribution of charge in the solution is governed by a competition between energy and entropy. The bound state in which the ions are condensed on the surface of the macroion/polyelectrolyte is energetically-favored, and the continuum state, in which the ions are free to drift in any direction, is entropically-favored. The balance between these two factors in minimizing the free energy has been shown to vary significantly based on the geometry of the macroion [
8]. This interplay between electrostatics and geometry is particularly important for polyelectrolytes in solution, and they have shown to produce a wide array of intriguing phenomena in many different systems. One example is DNA, which in the presence of a physiological salt solution (a 0.1 molar solution of NaCl) is usually negatively charged, with a double helix DNA molecule strongly repelling another. However, if DNA is in a dilute salt solution in which a positively charged polyelectrolyte, such as spermine or sperimidine, has been added to the solution, it has been shown to roll up into a tightly-packed torus[
8,
9,
10,
11]. In fact, DNA is usually in a very compact state in cells and viruses.
Historically, the most common description of charged solutions has been Poisson-Boltzmann theory, but continued research has indicated that charged species in solution can have far more complex and counterintuitive effects than simple charge screening [
8]. In 1969 Manning proposed [
12,
13] that a portion of the ambient ions condense onto (i.e., attach to) the surface of a charged macro-ion, partially neutralizing the bare charge. This occurs up to the point at which the energy required to condense another ion equals the available thermal energy
at temperature
T, where
is the Boltzmann constant. The proposed effect, later termed “Manning condensation," marked a significant departure from linearized Poisson-Boltzmann theory that predicted only exponential screening. In Manning’s treatment of polyelectrolytes, which are long chains of charged subunits, the ion condensation was addressed [
8] separately from the ions that remain in solution, which were treated using linearized Poisson-Boltzmann theory [
12,
13].
Further refinements in the treatment of these ambient-ion solutions have been motivated in part by surprising effects observed in DNA that cannot be accounted for using Poisson-Boltzmann theory. By using multivalent cations, modifying the salts in the solutions, and even using alcohol solvents, it is possible to cause the DNA to undergo a radical structural transition into a variety of new geometries, including rod-like bundles and toroids [
14]. The toroidal structures, in particular, have received considerable attention in the biological community because such toroidal packing motifs are employed by spermidine and other molecules to contain their own DNA in small volumes [
15]. A variety of techniques, including cryo-transmission electron microscopy [
15] and UV spectroscopy [
16], have enabled direct observation of the formation of these toroidal condensates, and similar studies on other biological polyelectrolytes like F-actin [
6] indicate that condensation is a general phenomenon. If the electrical interactions between the polyelectrolyte, such as DNA, and the ambient-ion solution are purely those predicted by the Poisson-Boltzmann description, then two like-charged polyelectrolytes always repel one another, and such condensates will not form. For these condensates to be stable, the net electrical force between like-charged polyelectrolytes must be
, which is incompatible with the Poisson-Boltzmann theory of ions in solution.
Many of the approaches to understanding the role of polyelectrolytes in biological systems have adopted models of continuum electrostatics, and some of these have focused on solving the Poisson-Boltzmann equation for a cylindrical model of DNA in the presence of multivalent ions[
8,
17]. However, it has been shown that including charge correlations is essential to understanding these systems[
18,
19]. Recognizing the need for including fluctuations beyond the mean field, Ha and Liu used a field theoretic approach that produces a systematic expansion taking these effects into account[
20,
21,
22,
23]. They employed a simplified model of a DNA particle as a charged cylindrical rod composed of cylindrical charged segments and considered these rodlike particles in the presence of polyvalent counterions. Their first model consisted of two rodlike DNA molecules[
20], and they found attraction between the rods. When Podgornik and Parsegian suggested that the fluctuation forces between such rodlike polylectrolytes might not be pairwise additive[
24], Ha and Liu extended their calculation to include bundles of rodlike particles and included a one-dimensional form factor depending on the ionic size in order to incorporate short-ranged correlations along the rod length approximately[
21,
22]. They found that the breakdown of pairwise additivity can lead to qualitatively new behavior, although they still found that that the rods attracted one another under some conditions. Shklovskii then pointed out that the previous results seemed to be independent of radius, and that, in a highly correlated system of negatively-charged cylindrical rods in the presence of positive counterion charges, those positive charges can form a Wigner crystal, which will be three-dimensional and closely packed for small rod radii and form two-dimensional Wigner crystals on the surfaces of rods for large radii[
25]. Ha and Liu saw merit in the arguments for the large radius limit but disagreed with the the two-dimensional crystal lattice limit, which they said was due to Shklovskii incorrectly assuming pairwise additive interactions between various surfaces, which would make that limit invalid[
22].
To show how the radius of a DNA chain could affect multivalent-counterion-induced attraction between negatively-charged DNA chains, Ha and Liu used a modified model of DNA[
23]. They assumed, as before, that a DNA rod is a stack of short cylinders that have a finite radius and length, but to treat two-dimensional charge fluctuations on the surface of the rod, they divided the counterions into two classes, free and condensed, with the condensed counterions modifying the local charge at the surface of the cylinder, giving rise to charge fluctuations on the two-dimensional surface of each cylinder. With this model, they found that the competition between attractive and repulsive interactions tended to balance one another, resulting in no attraction at all for thick rods. They also found that that for valence of the polyvalent cations greater that around 3, the spacing of chains in a bundle and the size of bundles appears nearly independent of the nature of the bundling agent. This is because the increased valence leads to a stronger screening as well as a stronger attraction.
Recognizing that geometry was playing a crucial role in the attraction, Nguyen and Shklovskii [
26,
27] proposed a simple theory of charge inversion that considers the structure of the polyelectrolyte together with its electrostatic interaction with the substrate. The idea was that there is fractionalization of charge in which a polyelectrolyte can either neutralize the charge or reverse it depending on how it attaches. In that model, a single double-helix strand of DNA is represented by a rigid cylinder with two one-dimensional lattices of negative charges
in helices around the surface to represent DNA’s double helix of negatively-charged phosphates. In order to model the polyelectrolyte solution in which the DNA is immersed, they take the positively-charged species to be freely-jointed chains. These adsorbing species have multiple charges that may partially attach to the surface, with excess charge protruding into the solution, as shown in
Figure 1. They assumed that all the negative charges on the DNA are neutralized by the polyelectrolytes, so that there will be no vacancies, and because there is excess charge dangling into the solution, the charge on the surface can be not only neutralized but reversed. They also assume that the only role of the background salt solution is to provide screening for the interaction between the polyelectrolyte chains. Since they assumed no vacancies on the one-dimensional chains, they could easily count the possible configurations. In calculating the electrical potential, they took the negative charges of the DNA to be spread uniformly on the surface of the cylindrical DNA surface and used simple electrostatic arguments with screened potentials to calculate the energies involved. For polyelectrolytes in a 0.1M NaCl solution with DNA, Nguyen and Shklovskii[
26] give in their Eq. (6) an estimate of the net charge inversion, the charge density on DNA divided by its bare charge density. The largest value of that estimate is about 0.07.
Since the simple calculation of Nguyen and Shklovskii[
26] did not include the discrete nature of the phosphate sites in calculating the electrostatics and used a simple version of the entropy, Bishop and McMullen[
28] decided to use a more rigorous formalism to calculate the charge reversal in DNA. Starting with the simplifying assumption that the polyelectrolytes could be considered as dimers (two units in length) on DNA, Bishop and McMullen allowed for vacancies and used a model of the discrete location of phosphates on the DNA surface, with interactions between charges located on the various sites. In that model, doubly-charged polyelectrolyte molecules attach to the DNA either parallel or perpendicular to the DNA surface and confirmed that under the right conditions, the charge on the surface could be inverted. The use of the lattice model allowed them to accomdodate vacancies, which is not possible if the charge is treated in a continuum manner. Inspired by the work of Ha and Liu[
20,
21,
22,
23], they used a field-theoretic approach that employs a loop expansion, where the mean field contains polyelectolyte ions adsorbed on the surface of the DNA, similar to their approach. While the general formalism for the one-loop expansion was presented in that paper, the numerical calculations stopped at the mean field level, and the current work extends that to include fluctuations at the one-loop method. The emphasis here is on the entropy and the role it plays in the configuration of charged species attached to the lattice. The details of this model will be explained in detail in the next section.
The results of these theories indicate that the correlation effects in the solution are also strongly-dependent on the system geometry. To understand this, consider the electric potential outside spherical, cylindrical, and planar positively-charged surfaces in vacuum. For spheres, the potential decays as the inverse of the distance; for the cylinder, the potential decreases logarithmically with the distance; and for the plane, the potential decreases linearly with the distance. Gelbart et al. [
8] assert that, in an ambient-ion solution, the entropy of the point ions varies logarithmically with their concentration, and therefore with the distance from the cylinder. Thus they conclude that, in a crude comparison, for spherical geometry, the
potential is dominated by the logarithmic entropy; for planar geometry, the entropy is dominated by the linear potential; and for cylindrical geometry, both the energy and the entropy vary logarithmically.[
29] In this paper, we will incorporate the impact of the ions in solution on the free energy through the chemical potential of these ions, as detailed in
Appendix B.
The helical geometry of DNA, then, sits precisely balanced on the fulcrum between energy- and entropy-driven processes under physiological conditions. The geometry of the biomolecule plays an even more crucial role when the highly-charged ions in solution are not point charges but have geometries of their own. Such is the case for DNA immersed in a solution of polyelectrolytes. These polyelectrolytes can be proteins or other fragments of biomolecules which are routinely found in the nucleus [
30], so that, again, the central biological processes occur in precisely the most difficult regimes to model.
For charge inversion on DNA with polyelectrolytes, “physiological conditions" require incorporating the combined effects of charge correlation, thermodynamic fluctuations, crowding, and geometrical considerations all at once. As we have discussed in this introduction, there has been considerable work in addressing all of these issues. Some approaches treat only the geometry, with no interactions[
31]. Others include both geometry and interactions, but use a continuum model for electrostatics that neglects discreteness effects[
32]. The lattice gas dimer model is unique in its simultaneous consideration of all these effects, and the thermodynamic and geometrical idealizations it makes can be systematically improved.
While the formalism of Bishop and McMullen[
28] included both a mean field theory and corrections due to fluctuations or charge correlations, the computed results were obtained only at the mean field level. In this paper, this simple model is extended to calculate the fluctuation corrections and the entropy of the system. The purpose is to determine the importance of the fluctuation terms for inverting the charge and to see whether these terms have a significant impact on the entropy of the system. A preliminary version of this work was in Sievert’s master’s thesis[
33]. In
Section 2 of this paper, we outline the model of the charged binding sites on DNA and present the computation of the entropy due only to the hard-core repulsion that prevents multiple binding on the same site. In
Section 3 we explain the geometry of the double helix and show how it can be represented as a one-dimensional lattice and in
Section 4 derive the form of the potential and determine the orthogonal transformation that diagonalizes it. In
Section 5, we use functional integral techniques to derive the partition function, which uses a Gaussian integral identity to perform the sum over configurations exactly, at the expense of an integral over a new auxiliary field. In Sec
Section 6, we show how the grand canonical potential can be computed order by order, in which the first two terms are the mean field and the one-loop correction to the mean field. In
Section 7, we examine the saddle point equation that defines the mean field, and this is used to calculate the entropy, the number densities of all species, and the charge density. In
Section 8, we find the explicit form for the inverse-Hartree-field-fluctuation propagator. In
Section 9, we include the one-loop order terms in the calculations to reveal the effects of fluctuations. Finally, in
Section 10, we compare the results of the various approximations. Details concerning the Gaussian integral identity and the chemical potential of dimers in solution are given in
Appendix A and
Appendix B.
2. The Noninteracting DNA Lattice Gas
Our model system starts with a simplified version of the DNA molecule itself, as shown in
Figure 2, with double helical chains of phosphate ions connected by base pairs. The phosphate ions are represented by red balls and blue balls, with the base pairs represented by yellow lines. The two different chains are labeled "up chain" and "down chain", which correspond to the direction of the carbon atoms in the sugar backbone. We will assume that each phosphate site has charge
, where
e is magnitude of the charge of an electron, and that there are no charges either between the phosphate sites along the helical chain or in the vicinity of the base pairs. In solution are polyelectrolyte ions of charge
, which we call dimers. These dimers are assumed to be the length of the spacing between two phosphate ions along a helical chain, which is
nm
Å, which is longer than a diatomic molecule. Possible candidates for these species are the diamines, 1,3-diaminopropane (DAP
), putrescine (Put
), and cadaverine (Cad
). An extension of this model could include more highly charged spermidine(Spd
) or spermine (Spn
). These polyamines are shown in
Figure 3, with the lattice spacing
shown as the dashed line for comparison. Although we will only consider the doubly charged species in our model, the model and analysis could be extended to these more highly-charged species in the future.
Deng and Bloomfield have shown using Raman difference spectra, for the systems they studied, that in the presence of spermidine or spermidine these polyvalent cations bind electrostatically near the DNA phosphates[
34]. van Dam et al. agree with this conclusion and suggest that the A form of DNA is stabilized by polyamines fitting perfectly between the phosphate ions[
35]. They conclude that DAP, which has charge
probably has the best fit to the phosphate lattice of all the polyamines, and that longer species, like spermine and sperimidine are longer and probably also interact with base pairs. In our model, we are assuming that there is no interaction with the base pairs.
The choice of Bishop and McMullen[
28] to model dimers was motivated by the wealth of studies (refs. [
36] and [
37], among others) in the literature about dimer models in other branches of physics, as well as for geometrical simplicity. We will continue to use this same model. Dimers, the shortest polyelectrolytes, have only two possible orientations when adsorbed on DNA, assuming that the length of the dimer is comparable to the helical spacing between charged sites on DNA, as shown in
Figure 4. As we stated earlier, this should not be confused with a diatomic molecule, which would not be long enough to span the space distance between two phosphate ions on one of the helical chains of DNA.
In our model, a dimer lying on the cylindrical surface of the molecule must lie parallel to the helical strands, neutralizing the charge on two adjacent sites. Otherwise, the dimer must adsorb perpendicular to the cylindrical surface, extending one end radially out from the central axis of the cylinder and inverting the charge on a single site. Charge inversion by dimers, then, is quite similar to charge inversion due to two species of point ions: one monovalent species representing parallel-adsorbed dimers and one divalent species representing perpendicular-adsorbed dimers. In the previous work, Bishop and McMullen[
28] modeled the adsorption of dimers in a lattice-gas model as a two-component solution of point ions, allowing the possibility of vacancies, and used field-theoretic methods to describe the thermodynamics of the system. They carried their calculations to a mean-field level of approximation of the inverted charge on DNA, but did not calculate the entropy. Their work confirmed the possibility of charge inversion within this model.
Those computations yielded the charge per site on the DNA helix as a function of the chemical potential, or equivalently of the concentration of the polyelectrolyte in solution. While the Nguyen-Shklovskii[
26] calculation assumed complete filling of the lattice, the Bishop-McMullen approach allowed for vacancies, represented as negatively-charged sites, in addition to the neutral or positively-charged sites arising from dimer adsorption. However, the lattice-gas model, which assumes all sites are equivalent, does not take into account that the parallel dimer occupies two sites. In this model, the binding energy of the parallel dimer is taken to be
and for the perpendicular dimer
, and these energies can be adjusted to account for the difference in occupancy. In practice, the occupation of two sites is mimicked by making the binding energy of the parallel dimer twice as large as that of the perpendicular dimer, and we will use the same approach here in the numerical calculations. The assignment of binding energies to these species is a simple way of including the complicated electrostatics that allows the polyelectrolyte ions to bind to the surface of the DNA. This is analogous to the "fermion" model of Ha and Liu[
22], which assumes that each site can either be empty or occupied by one counterion. Here, as in the earlier Bishop and McMullen work, we have three species, parallel dimers, perpendicular dimers, and vacancies.
This approach allows us to describe adsorption of either species independently for each site. Any configuration of the DNA-dimer complex in this model can then be described by identifying the type of dimer (parallel, perpendicular, or none) adsorbed on each site on the DNA molecule. Such a model also resembles the lattice gas model of condensed matter physics [
38], which treats the ways of distributing particles of different types onto a regular lattice of sites. In this section, we assume that the three different species on the lattice do not interact with one another, and in this case, the structure of the lattice does not matter. We call this the noninteracting model. This does not actually mean that there are no interactions. The interaction of the polyelectrolytes with the DNA can actually be quite strong, depending on the values of
and
, and their attachment to the lattice is analogous to what Ha and Liu call condensed counterions[
22]. Also, we will assume that there is only one species per site, which corresponds a hard-core repulsion between sites.
In developing the formalism, it is convenient to consider a vacancy as a third species of particle, because then we can impose the constraint that each site is singly occupied, either by a parallel dimer , a perpendicular dimer (), or a vacancy (). For each our three species of “particles" that can reside on our lattice sites, we define a relative charge in units of the magnitude e of the charge of an electron. These relative charges are then for vacancies, for parallel dimers, and for perpendicular dimers. We will assume that the binding energy of species to the lattice depends only on the type of species and not the location. We will be specifying each lattice point by its location ℓ along helix b, with the pair specifying that lattice position. Then the quantity will be the number of particles of species on the lattice site at ℓ on chain b. For each chain, ℓ extends from to , such that the total number of sites is , and we employ periodic boundary conditions. There will also be free polyelectrolyte ions in solution, and their influence on the stability of the adsorbed dimers will be through their chemical potential.
We begin by considering the Hamiltonian
for this noninteracting system (that is, with no interactions between different sites aside from the hard-core repulsion that blocks double-occupancy), which is
Because we have a system that exchanges particles with its surroundings, specifically the ions in the solution surrounding the DNA, which attach to the surface, we work in the grand canonical ensemble. If the system contains
particles of type
in equilibrium with its surroundings, with an average internal energy
E, the grand canonical potential
, which is a function of the temperature
T, volume
V, and chemical potentials
for each species
, is written as[
38,
39],
where the number of particles of type
is given by
Then the entropy can be written in terms this potential as
where
and
is Boltzmann’s constant. In this partial derivative, the volume and all the chemical potentials
of all species
are held constant. It is convenient to define the grand canonical partition function
in terms of the grand canonical potential
as
where the sum over configurations includes all the accessible states of the system. The grand canonical potential can alternatively be written in terms of the partition function as
and this allows us to write an expression for the entropy in terms of
as:
For the noninteracting lattice, the average internal energy is represented here by the Hamiltonian (
1),
, and the noninteracting grand canonical partition function
becomes
where we have suppressed the
G subscript for simplicity.
Since a vacant site does not really correspond to a particle, we recognize that energy and chemical potential of the vacancy must be related such that
. In addition, because the dimers adsorbed on the surface and those in solution are in equilibrium,
is the chemical potential of dimers in solution at the appropriate concentration. We thus need an estimate for
. Approximating the dimer as a uniformly-charged cylinder, this value is shown in
Appendix B to be
at the physiological temperature of
K.
The average occupancy
for species
per site is found by taking the derivative of this partition function with respect to
This can be verified by taking the derivative of Eq. (
8).
which is by definition the average number per site of species
.
Continuing with the evaluation of the partition function, we note that the exponential of a sum can be written as the product of exponentials, enabling us to rewrite the partition function (
8) as
In order to simplify and to appreciate the physical meaning of this expression, it is useful to define the relative activity [
40] of species
in the noninteracting lattice-gas model, given by
Note that this is independent of the lattice site
ℓ or chain
b. The partition function can then be written in the simple form
Because we have assumed that there can be only one species per site, parallel dimer, perpendicular dimer, or vacancy, the sum over configurations can now be done over each site separately, where there are three possible configurations
This is the same as saying that
for one and only one of
, or
v and is zero otherwise. This means that
and so the grand partition function is
Since every term in the product is the same, the expression in parentheses is simply raised to the power
, giving
At this point, we can easily see that we can obtain the average number per site using our derivative form from Eq. (
9) as
Taking the derivative and substituting the expression for
in the denominator, we have
where the derivative of the activity is
Using this in the expression for the mean site occupancy in Eq. (
19) and cancelling
factors of the sum over
, the mean occupancy for species
is given by
In
Figure 5, we show the mean occupancies for the three species versus
, with
. Negative
corresponds to an attraction of dimers to the lattice, and we see that parallel adsorption of dimers dominates because of the stronger binding, followed by perpendicular adsorption, with vacancies becoming nonexistent. Positive
corresponds to a repulsion of dimers from the lattice, so that at large
, the ordering is reversed, for the same reasons. At small positive
, the dimers are repelled from the lattice, but this competes with the chemical potential, which tries to put dimers back onto the lattice. In this low-coverage regime, perpendicular adsorption dominates, while vacancies dominate for large
because the dimers would prefer to stay in solution. For
, the parallel and perpendicular mean occupancies become the same. Similarly, where
,
because, as mentioned earlier,
is always zero. Also,
where
because that is where
. It is at this point where
becomes a maximum.
The average charge per site can now be determined by multiplying
, the charge of species
, by
the occupation of species
on site
ℓ of chain
b,
Since we assume that every site has either a parallel dimer, a perpendicular dimer, or a vacancy, with single occupancy, then for a given site and species,
is either 0 or 1. The average charge density is then the sum the averages of the individual terms,
This charge density is plotted as the solid blue curve in
Figure 6. Because a parallel dimer has no charge,
, a perpendicular dimer has a positive unit charge,
, and a vacancy has a negative unit charge,
, the total mean field charge per site
in
Figure 6 is the difference between the curves
and
in
Figure 5 and goes to zero where those two curves cross.
A measure of the magnitude of the charge fluctuations is given by the average of the square of the charge on a site minus the square of its average, which is the charge variance
, given by
where
and
The product
describes a double occupancy of site
by species
and
. Since we have required single occupancy, then we must have
. Also, since
can only take the values 0 or 1,
Then the charge density per site reduces to
Taking the thermal average of both sides, we have
In
Figure 6, we have plotted the standard deviation in the charge density,
, which is the square root of the charge variance, for the noninteracting model. As we saw in
Figure 5, there are three distinct regions,
,
and
near
. These three regions are also reflected in
Figure 6. At large negative
, parallel adsorption dominates, which leads to
. At large positive
, vacancies dominate, leading to
, and for
near
, there is a small window where perpendicular adsorption of dimers dominates, leading to positive values of
. Correspondingly, the fluctuations, represented by the standard deviation
, become small when
becomes large, and the fluctuations are largest at small
, when there are comparable numbers of all species. The phenomenon of charge inversion is demonstrated in
Figure 6 because the average charge is positive, indicating that sufficiently many dimers adsorb in a perpendicular configuration to invert the charge on the molecule from negative to positive. The magnitude of charge inversion increases in the weak-binding limit
. The maximum of this inversion in
Figure 6 is about 0.25, which is much larger than the estimate of 0.07 from the work of Nguyen and Shklovskii[
26] , as discussed in the Introduction. That work treated the charge on DNA as a continuum, while we have used the discrete lattice of phosphates in a lattice-gas model. We have not included interactions between sites yet, although we have assumed that there can only be one species per site, either a parallel dimer, a perpendicular dimer, or a vacancy. We will see later in this paper that interactions reduce this value somewhat, but it will still be much larger than 0.07.
The noninteracting entropy
can be obtained from the partition function using Eq. (
7) as
Because the derivative of the activity with respect to
can be written in terms of its logarithm as
the entropy can be written in the simple form
Substituting the mean occupancy for the noninteracting model from Eq. (
21) allows us to write the dimensionless entropy per site for the noninteracting model in a simplified form as
which agrees with the standard result for the entropy of mixing of an ideal solution with species
[
40].
In
Figure 7, we show the entropy
of the noninteracting model as a function of the binding energy
, assuming
. We also show the individual contributions of Eq. (
33) to the entropy. The entropy is a maximum when disorder is greatest, and this occurs when the numbers of each of the species are as close to equal as possible, which occurs at
, the maximum of
in
Figure 5.
3. Geometry of the Charged Double Helix of DNA
While stored in the nucleus of a cell, DNA is wrapped compactly both around histone protein complexes and around itself, but on sufficently small scales (≈ 150 base pairs [
41], or 15 turns [
30], or 50 nm [
8]), DNA’s dominant geometrical structure is the familiar double-helix structure shown in
Figure 2. Each strand of the DNA takes the shape of a helix with a characteristic radius
nm and pitch angle
, as shown in
Figure 8. Here the pitch angle
denotes the angle with respect to the
-plane that gives the appropriate altitude per unit circumferential winding; in cylindrical coordinates,
, where
is the distance along the
z axis. Each strand of DNA has a “direction", identified by a particular carbon on the backbone structure, corresponding to the chirality of the helix. In the DNA double-helix, the two strands are antiparallel and therefore have opposite chiralities. As a consequence, the azimuthal angle between the two helices is always a constant,
. Because this phase shift is not exactly
, the chains have unequal separation in the clockwise and counterclockwise azimuthal directions. The larger gap is referred to as the
major groove, and the smaller as the
minor groove.
When the hydrogen atoms dissociate under physiological conditions, the resulting negative charges (
, where
e is the magnitude of the charge of the electron), can be regarded as located at the mean position of the oxygen atoms on the backbone, as shown in
Figure 2. This figure was constructed using geometrical data taken from Bishop and McMullen [
28], which is based on experimental x-ray diffraction data [
42,
43] as input to the SYBYL molecular modeling program[
44]. The oxygen atoms occur at regular intervals along each strand, separated by a helix segment of arc length
nm. It is these sites located at regular intervals along the helix to which dimers will adsorb. These negatively-charged sites do not occur at the same altitudes on both strands, however. Rather, there is a vertical separation
nm between corresponding sites on the two strands. With the relative phase of the strands and the vertical separation between sites on those strands taken together, corresponding sites on the two strands may be viewed as connected by a helical segment of arc length
nm at a pitch angle of
. This geometry is shown in
Figure 8.
When positively-charged dimers approach the DNA molecule, they will be attracted to the negative charges at the sites on the double-helix. We consider dimers with a length comparable to the spacing
between sites on a strand and having positive charges
at either end. The dimers can then adsorb onto the surface in two possible orientations, either parallel to the strand or perpendicular to the helix axis. (See
Figure 4.) If the dimer adsorbs parallel to the strand, the positive charges from the dimer lie directly over the negative charges on the strand, neutralizing the charge on two adjacent sites. If the dimer adsorbs perpendicular to the surface of the bounding cylinder, one end of the dimer sits atop the site, while the other extends radially outward. This perpendicular adsorption effectively inverts the charge on the site from
to
. In order to use a lattice-gas model, this geometric constraint is loosened by having the parallel dimer block only a single site. This deficiency can be somewhat compensated by making binding energy of the parallel dimer twice as large,
.
Note that because the length of the dimer (equal to the same-chain site spacing nm) is much smaller than both the cross-chain site spacing nm and the vertical separation nm between turns of the helix, other orientations of the dimer are not possible.
The problem thus described is a complex one, but the similarities with the lattice gas models of condensed matter physics provide guidelines for how to proceed. These prescriptions, however, are aimed at the treatment of a periodic crystalline lattice, and, although the DNA sites exhibit helical symmetry, they do not constitute a periodic lattice in the strict sense of the term. However, an appropriate choice of coordinates can take advantage of the helical symmetry, so that, in these new coordinates, the positions of the sites will fall on a regular, one-dimensional lattice.
We will define these coordinates on a cylinder of radius
, as shown in
Figure 8 and
Figure 9. The first coordinate
traces out a path with pitch angle
along a single helical strand, and the other coordinate
traces out a path with pitch angle
that connects corresponding sites on the two strands. Geometrically, a cylinder can be regarded as flat in the sense that it has no curvature. In
Figure 9, we show the way that the cylinder can be cut with scissors and unwrapped so that this lattice can be mapped on a flat surface as shown in
Figure 10. If we define the origin of coordinates
and
to be halfway along the helical path between the partners on the two chains, as shown in
Figure 9 and
Figure 10, then the positions of the sites on both strands form a one-dimensional lattice in the coordinates
.
These coordinates can be written simply in terms of cylindrical coordinates
in matrix form as
Inverting this gives definitions of the two coordinates as
and
With these definitions, the difference in coordinates between adjacent sites on the same strand is
, and the difference in coordinates between corresponding sites on the two strands is
. That is,
is the distance along the helical path of a single chain from one phosphate ion to the next, and
is the distance along a helical path from a phosphate ion on one chain to its partner phosphate ion on the other chain.
Next we define a lattice index
ℓ, which specifies the cell (altitude on the double-helix) and chain index
b, which specifies the basis site, where
for the “down"(↓) chain and
for the “up"(↑) chain, as shown in
Figure 11. Using these variables, the coordinates can be written as
where
Thus, although the sites on the DNA molecule do not constitute a periodic lattice in real space, they do constitute a lattice in an appropriately-defined coordinate space (see
Figure 11). As we will see, however, this choice of coordinates will make the form of the interaction potential more complicated as a result.
The use of
instead of the cylindrical coordinates
indicates a more fundamental shift in our description of the DNA double-helix. The two-dimensional surface on which the helices lie is a cylinder of radius
, and the helices inherit the cylinder’s geometric properties. The geometry of the cylinder, however, is locally indistinguishable from the geometry of the flat plane. One common consequence of this is that it is possible to smoothly wrap a flat sheet of paper around a cylinder. In contrast, it is not possible to smoothly wrap a sheet of paper around a sphere; this problem is well-known because of the geometrical distortions that occur in flat maps of a spherical Earth. Maps of a cylindrical surface, however, have no such distortions. This geometrical difference is quantified by the Riemannian curvature tensor, which vanishes for both the cylinder and the plane, but not for the sphere[
45]. This means that the local geometry of the cylinder behaves in exactly the same way as the local geometry of the plane, so that a helix on a cylinder is geometrically equivalent to a line on a plane.
Our choice of coordinates is simply a map of the cylindrical surface that reduces the double-helix to two parallel lines, as shown in
Figure 11. The result of this map is that we have a linear lattice with two phosphate sites per cell, with the cells labeled from
to
. All
cells are identical, and the lattice can be imagined to satisfy periodic boundary conditions.
The analogy describing DNA as a ladder wrapped around a cylinder of radius then has a true mathematical basis, because the local structure of the double-helix is equivalent to the structure of a “ladder" — a one-dimensional chain with a unit cell containing one site from each strand. If interactions are ignored, then the problem is described simply by this one-dimensional lattice.
4. DNA Interaction Potential
The geometrical structure of the double helix is important in determining the electrostatic energy
of the sites (with and without dimers) interacting with one another. The full Hamiltonian must contain these contributions and can be written as
where the interaction energy
is the total interaction energy between all the charges on all the sites. If
is the screened Coulomb energy between a charge
at site
on chain
with another charge
at site
on chain
, then
can be written as
where
is the number of particles of species
on the lattice site at
and
is the charge of that species, in units of the magnitude
e of the charge of an electron. The factor of
is to ensure that we are not double counting when we sum over all lattice sites, and we implicitly exclude same-site interactions
. The total charge on a given site is given by summing all the charges on that site, so that the total charge on the site
is
The interaction potential can then be written in terms the the total charge on each site as
where
is the electrostatic interaction energy between the sites. This screened electrostatic energy between charges
and
at positions
and
, (in SI units) is
where
is the electric permittivity of the medium between the two charges, and
is the straight-line distance between the two charges. Distances along the chain are invariant under translations by a lattice spacing, and so
depends only on the difference between the lattice site indices
. As in the noninteracting lattice-gas model, we have three species of “particles" on the lattice sites, vacancies of charge
, parallel (‖) dimers of charge
, and perpendicular (⊥) dimers of charge
.
Under physiological conditions, the presence of monovalent salt ions such as Na
leads to screening of the bare charges. Traditionally, screening is treated by modeling the ions as a continuous density, resulting in the nonlinear Poisson-Boltzmann equation [
19,
46]. The Poisson-Boltzmann equation is not analytically solvable, so it is often further approximated by linearization. The resulting Thomas-Fermi model is analytically solvable and gives the screened Coulomb (or Yukawa) potential between two charges given in Eq. (
43) where
is the permittivity of water (with
the permittivity of free space) and
is the magnitude of the screening wave vector [
47]. Various names are ascribed to both the nonlinear and linearized equations, including Debye-Hückel, Thomas-Fermi, and Poisson-Boltzmann, but we will always refer to the Poisson-Boltzmann equation when we mean the nonlinear form and the (linearized) Thomas-Fermi equation when we mean the linearized form.
One must be cautious in using the screened Coulomb potential for screened interactions. The continuous density approximation from which the nonlinear Poisson-Boltzmann equation was derived constitutes a form of mean-field theory[
19], which fails to describe any of the effects due to correlations between the ions such as charge inversion and condensation. Thus we cannot model screening of the DNA molecule by the dimers using the Thomas-Fermi model, or even the Poisson-Boltzmann equation. Instead, we must treat the dimers as individual particles and compute their interactions so that we do not ignore correlation effects.
For the screening due to the monovalent salt, however, the valence involved is small enough and the concentration is low enough that a mean-field treatment is generally believed to be a good approximation [
19,
48]. Thus, we will treat the interaction energy
in Eq. (
42) using the screened Coulomb potential for the dimer-dimer interactions, with the screening vector
given in the Thomas-Fermi model as [
47]
where
is the inverse temperature,
n is the concentration of ions, and
is the Bjerrum length [
32]. The Bjerrum length is a characteristic length scale equal to the distance at which two proton charges interact with energy
. It should be noted that these expressions differ slightly in the literature because of various authors’ choices of units for the electrostatics. Here, we use SI units.
By choosing a coordinate system based on the arc lengths around the surface of the cylinder, we have managed to align the sites into a regular lattice, but the potential
depends on the
straight-line distance between the sites rather than the surface arc-length connecting them. We must then express Eq. (
43) in terms of the straight-line distance function
between the sites at
and
. The straight-line distance in cylindrical coordinates between two points on the surface of a cylinder of radius
is
and applying our change of variables (
35,
36) gives the straight-line distance between sites as
where
and
. Because of the translational invariance of the lattice, the distance in Eq. (
46) depends only on the differences between the lattice and basis indices, and not their values individually. From this relation, we see the symmetry property of the distance,
Since
and
are either
or
,
takes the three values
for interactions of one site on a single chain with all other sites on the same chain,
for interactions between a site on the “up" chain with with all the sites on the “down" chain, and
for interactions of one site on the “down" chain with all the sites on the “up" chain. We will write an up arrow ↑ will indicate
, and a down arrow ↓ will indicate
. Plots of
and
are shown in
Figure 12a. There are wiggles in the curve because, as the chain wraps around the cylinder (see
Figure 4), the distances between lattice sites can become larger or smaller than they would be on a linear chain.
In order to understand the consequences of the screened Coulomb potential (
43) depending on the straight-line distances
d between sites, we define a new notation for the potential, which is expressed in terms of the difference in lattice site indices
,
This potential decays exponentially as a function of the straight-line distance
d (inset of
Figure 12(b)). However, as a function of the lattice index
ℓ as in
Figure 12(b), there are corresponding “wiggles" in the decay. This occurs because the distance between the sites decreases slightly as the helix completes a turn. These wiggles are present for every turn the helix makes, but the effect on the potential becomes negligibly small after about the first turn. Thus, by using the indices
ℓ and
b to describe the relationships between sites, we have reduced the DNA double-helix structure to a regular one-dimensional lattice with two sites per cell at the cost of an irregular potential due to the geometrical structure of the DNA molecule. The potentials for other combinations of
and
can be related to those in
Figure 12(b) through the symmetry relations
Fourier transforming in the site index
ℓ block diagonalizes this matrix into
blocks, corresponding to the chain indices
and
. Explicitly, the Fourier transform is
where the elements of
are independent of
and
and are given by
and the dagger denotes the adjoint (complex conjugate transpose). We use periodic boundary conditions, so that the wave vector
k appearing here, which is dimensionless, takes the
values
where
If
is large, the range of
k is essentially continuous from
to
.
The matrix elements of this Fourier transform are
Substituting for
from Eq. (
48), we have
Since we regard the chain as having periodic boundary conditions, changing summation indices to
and
gives
where the sum over
becomes
We can now write the Fourier transform of
as
Substituting this back into the matrix, we have
Thus, the Fourier transform
of
is diagonal in the
k indices and contains
blocks
The symmetries (49) of the screened Coulomb potential on the DNA lattice are reflected in the Fourier transforms (
58) as well:
Also, because
is an even function of
ℓ,
is real. The Fourier transforms (
58) are plotted in
Figure 13.
As can be seen in
Figure 13, the Fourier transforms
have regions of
k that are negative, which is a result of transforming with respect to our helical coordinate system. When the screened Coulomb potential (
43) is Fourier transformed with respect to the straight-line distance
d, the function is positive definite. We have instead Fourier transformed the potential a function of the lattice index
ℓ, resulting in the “wiggles" in
Figure 12(b) caused by the turns of the helix. These deviations from the exponential decay of
result in the deviations here from the positive-definite Fourier transform.
Using the symmetry operations in Eqs. (
61a) and (61b) obeyed by the Fourier transforms allows us to simplify the
blocks
in Eq. (
60) to
This matrix is easily diagonalized, yielding the real eigenvalues
which are plotted in
Figure 14. The transformation that diagonalizes
is a unitary matrix
whose columns are the eigenvectors of
. Choosing the
eigenvector to be first, this transformation matrix is given by
This diagonalizes the
matrix
according to
We can write the combined process of Fourier transforming
into block-diagonal form and then diagonalizing the
blocks
as a single unitary transformation
given by
which completely diagonalizes
such that
Here
is a diagonal matrix, with each element along the diagonal given by:
where
is either + or −.
Although we have diagonalized the potential with a unitary transformation, the fact that the transformation matrix is complex means that the charge density in the diagonal basis could also be complex, which is physically undesirable. However, the potential matrix in the position basis is real and symmetric, which means that it should be diagonalizable by a real orthogonal transformation matrix
. In fact, the transformation matrix is not unique, since there is a degeneracy in the eigenvalues, since
. This means that we can choose eigenvectors that are real by taking two orthogonal linear combinations of the eigenvectors of the unitary transformation
for
k and
. Those two new eigenvectors, which represent the columns
for
and
of the new transformation matrix
, are given by
and
Since
, these can be written in terms of real and imaginary parts as
and
When
, the orthogonal transformation is the same as the unitary transformation, that is
where
Here the columns are labeled by the eigenvalue
, and the rows are labeled by the chain
for the first row and
for the second row.
This transformation can be written in a more compact form by introducing the unitary transformation matrix
, whose elements are defined by
Then the orthogonal transformation is
In deriving the thermodynamic quantities for the interacting lattice gas model, it will be convenient to be able to use this orthogonal transformation to diagonalize the potential.
5. Partition Function for Interacting DNA Chains
For the interacting lattice gas, the partition function is similar to the noninteracting one in Eqs. (
5) and (
8). However, here the full Hamiltonian
from Eq. (
39) is used, and then the partition function is given by
The full Hamiltonian, including the interaction term from Eq. (
42) written in matrix form, is
In
Section 4, we showed that the orthogonal transformation matrix
diagonalizes
. In order to streamline the calculation, it will be useful to insert the identity matrix in the form
, to the left and right of of
in the interaction term of the Hamiltonian, which gives
where we have grouped factors to show the transformed quantities.
The Hamiltonian, with the interaction term written in the diagonal basis, can then be written as
where
is the transformed charge density. We have not transformed the first term in the Hamiltonian to the diagonal basis because it is more convenient later in the calculation to have it in the position basis. Substituting this form into the partition function and writing the resulting expression as two separate exponentials, we have
We can write the exponential of the sum in the interaction term as a product of exponentials as
From this expression, we can see that the mean site occupancy for the interacting lattice-gas model can be obtained in the same way as in the noninteracting case (
9),
by differentiating the partition function with respect to the chemical potential.
In the noninteracting case, since all the information about the configuration was contained in , we were able to decouple the sum over configurations from the sum over lattice sites because the activity was independent of position, and only appeared in its exponent. This is because only linear factors of were in the exponent in the partition function. While it is still true that contains the configuration dependence, it now appears quadratically in the exponent of the partition function, since contains linear factors of the , and the exponent in the interaction term contains . This makes it impossible to decouple the sum over configurations from the sum over lattice sites. However, we can use an integral identity, known as the Hubbard-Stratonovich transformation, to replace the term quadratic in in the exponential with a term linear in , meaning that there will only be linear terms in . This requires introducing the integral over an auxiliary field . Then the sum over configurations is possible to do exactly, although at the cost of having to integrate over the auxiliary fields.
The integral identity, which is a complicated way of writing unity, as is shown in
Appendix A, is given by
where the integration path is over the real axis from
to
∞ when
and over the imaginary axis from
to
when
. In the previous work by Bishop and McMullen[
28], the problem was done in the position basis, and this integral identity, known as the Hubbard-Stratonovich transformation, was written with the matrix version of the potential in the exponent, as given in Negele and Orland[
48]. This form creates a dilemma when there are negative and possibly zero eigenvalues of the potential, and it is difficult to determine the path of integration, since it changes depending on the sign of the eigenvalues. Zero eigenvalues would make the determinant of the matrix in that formula zero, and one then finds that there is a division by zero in the formula. By transforming to the diagonal basis, all these difficulties are avoided, since there is a separate integral for each eigenvalue, and if a particular eigenvalue is zero, the identity is not used at all.
To use the identity in Eq. (
84), we make the identifications that
,
, and
. Substituting this “1" into the partition function, we have
Now we see that the two exponentials containing
cancel, as we planned. Of course, we have gained this convenience by introducing the auxiliary field
, and we will have to do the integral over this variable at a later stage. However, since
does not depend on the configuration of the system, we can bring the sum over configurations and the leading exponential factors in the partition function inside the integral, and the expression for the partition function becomes
It will be useful to define an action
such that the partition function can be written in the form
where
and
Note that the factors of
are included here in the grand differential
, rather than incorporating them in the action
, as was done in Bishop and McMullen[
28] and in Sievert[
33]. When these factors appear in the action, they introduce a term of the form
. This adds a large constant value to the mean field results, which is subtracted out when the fluctuation terms are included. It actually has no real physical meaning and is part of the normalization of the integral[
49].
We see that by diagonalizing first, our auxiliary fields are in the diagonal basis of the potential. If we transform the terms containing
and
to this basis, they will no longer be diagonal. Therefore, we will leave those terms in the position basis. It will also be convenient to have the term containing
in the position basis also. Therefore, the expression for the transformed charge density is
in terms of the quantities in real space.
Substituting this into the action, we have
It is convenient to identify the last term in this expression as a self energy of species
,
By transforming
to the position basis as
this self-energy can also be written in the position basis as
This will be a useful form to use when we discuss the saddle point, or mean-field, approximation.
Keeping the first term in the position basis and the second term in the diagonal basis, we now write the action in a compact form, combining the self energy term with the first term in the action to obtain
Analogous to the noninteracting case, we define the activity as
With this definition, the partition function becomes
The trace over configurations is now done over each site separately exactly as we did for the noninteracting case, where
for one and only one of
, or
v, and zero otherwise. Thus, performing the trace gives
and the grand partition function becomes
where the effective action is
This is the general result, which is in principle exact. To understand what the Hubbard-Stratonovich transformation has accomplished for us, recall that the interaction energy
posed two difficulties: the additional configuration dependence and the coupling between the sites. By introducing auxiliary fields through the Hubbard-Stratonovich transformation (
84), we managed to separate these two complications so that one factor
contains interactions between field fluctuations but no configuration dependence, and another factor
that is configuration-dependent (through
) but decoupled. This allowed us to define a modified activity (
95) that incorporated all the configuration dependence, making it possible to evaluate the sum over configurations directly, as in the noninteracting case.
What we have done in using the Hubbard-Stratonovich transformation is replace the interaction part of the partition function with its functional integral representation. From the form of Eq. (
96), we see that the largest contribution to the partition function comes from the values of
for which the effective action
is stationary,
i.e., at a saddle point. Thus a Taylor series expansion of the effective action (
99) about the saddle point will yield an order-by-order approximation to the exact partition function (
98).
6. Expansion in Powers of the Auxiliary Field
Since it is not possible to evaluate the integral in the partition function of Eq. (
98) exactly, we will approximate it by expanding the action in Eq. (
99) about a mean field
, and include fluctuations to lowest order about this mean field. This mean field is determined by the saddle point of the integrand in Eq. (
98), which is the largest contribution to the integral.
We can write the effective action then as an expansion about this uniform saddle point. As an aid to keeping track of the order of the expansions, we add a multiplicative factor
m in front of the effective action. as
Here the expansion parameter
is scaled by
by writing
. The factors of
m cancel in the quadratic term, so that it is of order
.
In evaluating the integral in the partition function, the largest contribution comes from the region near a saddle point, and the terms beyond the saddle point give corrections to the integral. Physically, the saddle point solution corresponds to a mean field approximation, and we begin with that. The saddle point is found by setting the first derivative in the expansion to zero, that is
The second derivatives evaluated at the saddle point are proportional to the components of a matrix
, which is the inverse propagator for the fluctuations of the auxiliary Hartree field
that we introduced to decouple the interparticle interactions. Because we want to absorb the normalization factor in the grand differential
, we have defined each of the elements of
by the second derivative of
divided by
as
We will show in
Section 8 that, for a spatially uniform saddle point, this matrix is diagonal, that is,
is diagonal in the
’s, that is,
Although the Gaussian integral can still be done if
is not diagonal, the argument becomes much easier to follow if we assume at this point that
is diagonal because then the Gaussian integral can be done straightforwardly.
With this assumption that
is diagonal, the expansion of the effective action reduces to
The grand canonical partition function can then be written as
This can be written as a product of Gaussian integrals as
and each Gaussian integral can be evaluated using the same change of variable and path of integration as was used in the Hubbard-Stratonovich transformation in Eq. (
84) that originally was used to define the auxiliary fields, and as is discussed in
Appendix A. Each Gaussian integral produces a result of the same form, regardless of whether
is positive or negative. This quantity can never be zero, since we would not have defined an auxiliary field for that case, which is the same as the case in which an eigenvalue of the potential is zero. Therefore, the integral can always be evaluated, and the partition function can be written as
Note that all the factors of
have cancelled out due to our choices of the form of the effective action
and the inverse Hartree field fluctuation propagator
. Choosing these quantities carefully allows for the correct normalization of the integral[
49]. The product can now be brought inside the square root, and since
is diagonal, this gives the determinant of the matrix inside the square root, and so the partition function assumes the form
and the partition function can be written as a single exponential,
The grand canonical potential is then
If we formally write the expansion in terms of orders in
m, we have
The leading term in the expansion is therefore given by the saddle point value
and what is known as the “one-loop" correction term is
7. The Saddle-Point or Mean Field Approximation
The first task in this section is to find an equation that determines the value of the auxiliary field at the saddle-point or mean-field value of the the effective action. In order to find this saddle point, we must set the first derivative of the effective action in Eq. (
99) with respect to the auxiliary field
to zero. That derivative is given by
The activity
for species
is given by Eq. (
95), and its derivative is
where from Eq. (
91), the derivative of the self-energy is
Substituting this into the derivative of the action, we have
Setting this first derivative to zero, we obtain an equation whose solution gives the auxiliary field in the saddle point approximation,
We see that on the right-hand side of the equation is the transformation to the diagonal basis of the potential, as given in Eq. (
92). We could therefore write the auxiliary field in the position basis as
where
is the activity with the self energy in the diagonal basis
, which by Eq. (
93) has been replaced by the self energy in the position basis,
. Either of these two equations, Eq. (
117) or (
118), may be considered the equation giving the saddle-point, or mean-field, value of the auxiliary field, and we will refer to this as the mean field equation. In this expression
is a function of all the
elements, and so, for
, which we have used in our calculations, there are
coupled nonlinear equations. Therefore, for simplicity, we make the assumption that the mean field is spatially uniform, which means that
where
is constant. This is still a nonlinear equation in
, and so this must be solved numerically with an iterative approach.
Before solving the mean field equation, it is useful to have a physical interpretation of the mean auxiliary field
. The first step in this direction is the second task of this section, which is to find the mean site occupancies of species
at the mean field level. This is done as in the noninteracting case, by taking the derivative of the partition function with respect to
. At this mean field level, the
dependence is contained in
in Eq. (
108), neglecting the term containing the inverse propagator
. The expression for the mean occupancy can be gotten using Eq. (
83) as
Taking the derivative of the exponential, we can cancel out the partition function in the denominator and we are left with the derivative of the effective action as
Substituting the effective action from Eq. (
99), we have
The derivative of the activity is
and so the mean occupancy becomes
The third task of this section is to relate the mean charge density per site to the mean occupancy. The charge density has the same form as in Eq. (
22) for the noninteracting model, so that at the mean field level we obtain
which, like the mean site occupancy, is independent of the site index because of our choice of spatially uniform mean field. This is equivalent to the equation for the saddle-point value of the auxiliary field because the right-hand side of the equation shows that
on comparison with Eq. (
118)
In the mean field or saddle point equation for the spatially uniform saddle point, the auxiliary field is the same for all lattice sites, and the self energy becomes spatially uniform also and can be written as
where
is a lattice sum that is independent of the choice of lattice site
for a long DNA double helix, and is given by
For the parameters used in the figures,
.
Consequently, at the mean field level, the self-energy simply represents the electrostatic energy of charge
interacting with the rest of the DNA lattice. This term plays the role of the self-energy
in many-body quantum mechanics [
28,
48,
49]. Since the self-energy is independent of lattice site, so is the activity, which can be written as
The simplifications made possible by the assumption of a spatially uniform saddle point allow us to rewrite the mean field equation given by Eq. (
118) in an explicit form as
Again, recall that
, identically,
, and
. Therefore, when
, we can see from Eq. (
130) that
, and since
, and
, it follows that
. Therefore, this is the point at which the charge density
goes to zero. When
, the charge density goes to zero at
, which is the same place it went to zero for the noninteracting model, as shown in
Figure 6.
Returning to the mean site occupancy in Eq. (
124), the quantity inside the sum over
ℓ and
b is constant and the sum yields
, and so the average occupancy per species becomes
Using the mean-field value
of the charge density, calculated from Eq. (
130), we have calculated the average occupation numbers
via Eq. (
131) using Eq. (
129) for the activities. In
Figure 15, we have plotted
, which are the differences in the mean-field occupancies from the occupancies in noninteracting case shown in
Figure 5. The corresponding change in the charge density
is shown as the solid blue curve in
Figure 16.
The shifts in occupancy plotted in
Figure 15 can be primarily understood as a consequence of the electromagnetic “self energy”
of each species
interacting with the total charge
of the lattice. Immediately, this implies that the parallel dimers are hardly modified at all, since they are electrically neutral and not directly affected by the total charge
of the lattice. For the vacancies and perpendicular dimers, which are charged, the mean field corrections plotted in
Figure 15 and
Figure 16 act to reduce the overall charge of the lattice. Below
, the total charge of the lattice is positive, so the mean field corrections penalize the positively-charged perpendicular dimers and enhance the negatively-charged vacancies. Above
, the total charge of the lattice is negative, and this effect is reversed. Thus, the mean-field corrections tend to reduce (but not eliminate) the charge inversion at weak binding seen in
Figure 5. Note that the biggest effect of this electrostatic self energy is to heavily penalize the “naked lattice” of negatively-charged vacancies which was the favored ground state at large positive
for the non-interacting Hamiltonian.
The fourth task of this section is to determine variance of the charge density from the mean field, similar to Eq. (
24) for the noninteracting model, which gives a measure of the importance of spatial fluctuations in the charge density, can be written in the form
where the average
of the square of the charge density, similar to Eq. (
29) for the noninteracting model, is
The variance
will be useful in knowing the size of the charge fluctuations about the mean field, and this quantity will appear in the next level of approximation. The difference in the standard deviation
is plotted as the red dashed curve in
Figure 16, where
for the noninteracting model was plotted as the red dashed curve in
Figure 6.
Using a spatially uniform saddle point, which yields a charge density
that is equal at the mean field level to the auxiliary field
in real space, it is interesting to write down the auxiliary field in the diagonal basis, given by
From Eqs. (
71) and (
72), we see that when
,
has the
ℓ dependence of the form
, which by Eq. (
57) sums to zero. The only surviving term is for
which when summed over
ℓ gives
, and we have from Eqs. (
73) and (
134)
There are two different results corresponding to the two different eigenvalue labels
and
in Eq. (
63). Summing over
b, these are
where
. The interpretation of Eq. (
136) can be understood from Eq. (
74). The sum over
b in Eq. (
135) is a sum over the elements rows in Eq. (
74), which for
sums the mean charge densities on the two chains and for
subtracts the charge densities on the two chains.
The fifth task of this section is to calculate the entropy for the spatially uniform saddle point, which is our mean field value of the entropy. This requires the effective action at this saddle point, which becomes
The grand canonical potential is then
We can now compute the entropy from the derivative of the grand thermodynamic potential with respect to the inverse temperature
, as in
Section 2. This derivative becomes much more complicated with the introduction of
because the screened Coulomb potential (
43) depends on
through the screening vector
(
44), where
This introduces a
-dependence into both terms of Eq. (
137), so that the mean field entropy is given by the more complex expression
where
is the mixing entropy of the three species using the mean field occupation numbers and
arises directly from the self energy
. The occupation numbers
used in this expression are those of the mean-field level, given by Eq. (
131) and shown in
Figure 15. The mean-field entropy
, given in Eq. (
140), is plotted in
Figure 17, together with
for each species. Also shown are the two constituents
and
of the total entropy. This plot shows that, as with
in
Figure 7, the mean-field entropy is greatest for
near
and decreases outside this region. The difference
between this mean-field entropy for the interacting model and the entropy for the noninteracting model is plotted in
Figure 18. Note that this curve has roughly the same shape as that of
in
Figure 16, with a large increase in disorder at large positive
and small changes elsewhere.
By employing field-theoretic techniques borrowed from quantum mechanics, we have included the interaction energy
in the partition function and found a mean-field approximation to the exact partition function. However, these mean-field level calculations assume the same average charge
for all DNA molecules in the same solution. Thus, mean-field calculations would predict a repulsive interaction between two DNA molecules, which would not lead to DNA condensation. Indeed, this repulsive interaction is all that this zeroth-order calculation can predict. It is the fluctuations of the charge from its mean-field value that enable the net attraction observed experimentally; to understand the attractive forces between DNA molecules in solution, it is necessary to extend this treatment by at least one additional order. The next order in the expansion (
100) will yield Gaussian-type integrals, which are analytically solvable for the lowest-order fluctuations in the average charge and other parameters. Furthermore, once lowest-order fluctuations have been included, it will be possible to calculate thermodynamic properties of the system from the partition function to meaningful levels, such as the entropy changes due to small fluctuations in the local charge.
9. Inclusion of Lowest-Order Fluctuations
Going beyond mean field to one-loop order requires use of the full partition function given by Eqs. (
106) and (
108). This has consequences for the site occupancies, the average charge per site, the charge variance, and the entropy. The expression for site occupancy has the same form in terms of the partition function as for the noninteracting and mean field cases in Eqs. (
9) and (
83), and can be written explicitly for the one-loop order of approximation as
which is obtained from Eq. (
111) with
m set equal to one. In that expression,
m was the parameter that was introduced in
Section 6 as an aid to identifying the various orders in the expansion. This can be written as the sum of two terms,
where
is the occupancy of species
at the mean field level, given by Eq. (
131) ,
and
is the correction to that mean site occupancy due to fluctuations, which is given by
with the inverse propagator
from Eq. (
148) written in matrix form as
where
I is the identity matrix and
is the (diagonal) matrix of eigenvalues of the potential. Since
is diagonal, its determinant is given by the product of its diagonal elements as
The logarithm of this determinant then gives the sum over logarithms of diagonal elements as
The only
dependence is contained in
, and its derivative is given by
so that the derivative of
is given by the coefficient
Then the mean fluctuation in occupancy
becomes
These changes in the mean field occupancies at this one loop order are plotted in
Figure 21. These changes are not large, and the fluctuation corrections amount to no more than about 10% of the total. The most dramatic effects are seen in the region of large positive
, with the corrections to the vacancies and perpendicular dimers going in the opposite direction to the corrections due to the mean field in
Figure 15. This can be interpreted as a relaxation of the rigid penalties imposed by the electrostatic self energies at the mean field level. An enhancement of parallel dimers is also seen across the whole range of
that peaks around
.
The mean charge density at this one-loop-correction level of approximation is given by the same form as Eqs. (
22) and (
125) for the noninteracting and mean field approximations and is given by
where now
is taken from Eqs. (
150) and (
158). and the charge density becomes
where
The change in the charge density
is shown as the solid blue curve in
Figure 22. The most notable features are a negative contribution to the total charge above
, reflecting the addition of more negatively charged vacancies due to the fluctuations, and a positive contribution to the total charge around
. Interestingly, this positive contribution is not associated with the increase of parallel dimer occupancy, since parallel dimers are electrically neutral. Rather, it is due to the fact that, although both the negatively charged vacancies and positively charged perpendicular dimers are both reduced around
, the vacancies are decreased more.
The variance in the charge is given by
The average of the square of the charge density is given by
and is written explicitly as
where
is given by Eq. (
133), and
works out to be
The charge variance can then be written as
where the fluctuation correction is
The standard deviation can then be written as
The change
in the standard deviation of the charge densities due to fluctuations are given in
Figure 22, where the fluctuations account for changes typically of the order of 10% of the mean field value.
Having obtained results for the effects of fluctuations on the mean occupancies and charge densities, we now consider the corresponding corrections to the entropy. For that, we need the grand thermodynamic potential, which was expanded in
Section 6, as Eqs. (
110)-(
112), with the mean field and lowest order fluctuation terms given by
The corresponding expression for the interacting entropy is
where the mean field entropy
was derived and the results presented in
Section 7 in Eqs. (
140)-(
142). The fluctuation contribution to the entropy
is given by
which requires performing another derivative.
To evaluate this contribution to the entropy
in Eq. (
171), we return to Eq. (
155) and substitute into the fluctuation contribution to the entropy from Eq. (
171), which after performing derivatives gives
where
and
The derivative of the eigenvalue
of the potential is given by
where
is the Fourier transform of the interaction potential of a chain with itself and
is the Fourier transform of the interaction potential between the two chains. Both of these potentials depend on the screening vector
, and the only
dependence of the potential is contained in
, which is proportional to
. The derivative we need is then given by Eq. (
139). As a consequence, the derivative of the Fourier transform of the potential is
where the derivative of the potential in the position basis is given by
Substituting and using the symmetry relations in Eqs. (47) and (49), the derivative of the Fourier transform of the potential is
Using these relations, the derivative of the eigenvalues becomes
where
and
are the real and imaginary parts of
.
Eq. (
172) also requires the derivative of the variance
, where
is given by Eq. (
132). This contains the average charge density from Eq. (
125) and the average of the square of the charge density from Eq. (
133). This derivative is
where
and
Here
is the weighted average of the specified quantity over the distribution given by the mean-field occupancies
of the three species.
It is interesting to note that the combinations
and
can be written in form similar to that of the zeroth-order entropy. These forms are
and
The fluctuation entropy contribution
is easy to calculate, given the eigenvalues
. Eqs. (
174)-(
185) simply collect the results that allow us to calculate
. The results for the total fluctuation entropy
and the two contributions to it are shown in
Figure 23 for
. Like the mean field entropy, the fluctuation entropy becomes larger in magnitude as the binding becomes weak. However, the fluctuation contribution is actually negative in this region, suggesting that the fluctuations are leading to increasing order. These corrections to the entropy are typically an order of magnitude larger than the difference between the mean field and noninteracting values of the entropy.
10. Discussion
Entropy forms an important contribution to the free energies of many biological systems. Here we summarize the results for the entropies and particle densities for a lattice gas model representing the adsorption of molecules on double-stranded DNA. The entropies and particle densities are presented at three successive levels of approximation, first, for the noninteracting system then at the mean-field level, and finally with the inclusion of fluctuations to one-loop order. Those fluctuation corrections result from correlations induced by the electrostatic forces among the dimer molecules themselves and with the electrically charged DNA substrate.
Perhaps the most transparent descriptors of the DNA system’s behavior are the average occupation numbers
,
, and
representing the thermal average of the numbers of parallel-adsorbed dimers, perpendicular-adsorbed dimers, and vacancies, respectively, on each site. These particle densities are proportional to the probability that a given site would be found to be occupied by a particular species. These site-occupation numbers are shown in
Figure 24 for the three levels of approximation. The short-dashed curves are for the noninteracting level of approximation
, the long-dashed curves for the mean field level
, and the solid curves for the inclusion of fluctuations
. The corresponding charge densities and standard deviations of the charge density are shown in
Figure 25.
The site occupancies exhibit similar qualitative behavior in all three levels of approximation. Due to the energetics of binding , parallel dimers occupy the greatest fraction of sites for all negative binding energies , followed the perpendicular dimers and then the vacancies, at least for the parameters used in these calculations. At weak binding () in the noninteracting system, the difference between these energies becomes negligible, and both and approach the same value of . When the binding energy or increases and reaches the chemical potential of the dimers in solution, they are repelled from the surface of the DNA and go into solution, leaving a “naked” lattice of negatively-charged vacancies, with the parallel dimers being more repelled than perpendicular ones.
In contrast to the case of noninteracting sites, the inclusion of interactions between sites gives a pronounced gap of about
by which parallel adsorption exceeds perpendicular adsorption in the weak binding limit. This enhancement of parallel adsorption in the mean-field approximation results from the inclusion of the charge on the perpendicular dimers in an average or mean field sense, which penalizes the addition of positively-charged perpendicular dimers when the total charge on the DNA lattice is positive. Fluctuations reduce this effect slightly. Recall that the perpendicular dimers are positively charged because one end is not attached to a negative binding site, and this means that it is energetically unfavorable to have a second one nearby. Parallel binding, on the other hand, neutralizes the negative charge on the sites, so that a DNA double strand completely covered in parallel-adsorbed dimers would be electrically neutral. The decrease in perpendicular-dimer occupancy caused by this electrostatic repulsion of like charges leads to an increase in sites occupied by vacancies. Vacancies are negatively charged and are energetically favored when the perpendicular dimers provide a positively-charged region. There may, in fact, be correlations resulting from the lower electrostatic energy of a configuration in which sites alternate between perpendicular dimers and vacancies, i.e., between positively and negatively charged sites, in a Wigner-lattice-like state similar to those described in the literature [
8,
19,
26].
The average charge
per site depends only on the perpendicular dimers and vacancies since the parallel dimers neutralize sites, so that the charge density is written simply as
and this connection can be seen in the plots of charge density
in
Figure 25, when compared with
Figure 24. For all approximations, the average charge is positive at negative and slightly positive binding energies, indicating charge inversion. This is a result of the occupation numbers shown in
Figure 24, because parallel binding, which neutralizes the charge, increasingly dominates in the strong binding limit (
). Similarly, the decrease in the magnitude of the charge inversion from the noninteracting to the mean-field approximations can be explained by the enhancement of parallel adsorption and reduction of perpendicular adsorption due to like-charge repulsion. The fluctuation corrections to the site occupancies are not large when compared with the mean-field values. They only seem significant at positive binding energies where the site occupancies are shifted in the direction of the noninteracting system. Of course, the site occupancies are single-particle properties and do not measure spatial correlations.
The behavior of the entropy is shown in
Figure 26, in which the entropy per site is plotted versus
for all three levels of approximation. The three curves are practically indistinguishable for strong binding (negative values of
) until the binding energy becomes quite weak (approaching zero), indicating little effect of correlations induced by electrostatic repulsion on the entropy in that region. This is because the dimers are closely packed, leaving little freedom for disorder. The pronounced differences are seen in the region of
and more positive binding energies (
), because the system becomes more disordered. There, the dimers tend to leave the surface and enter the solution, and the fluctuation corrections to the entropy are greater in that region and act to reduce the entropy. This shows that the electrostatic correlations increase the order and reduce the entropy. Surprisingly, this decrease in entropy at large positive
due to fluctuations is so significant that it
overcompensates for the entropy generated at the mean field level, leading to a total entropy
that is even lower than in the noninteracting case
.
The form of the noninteracting entropy comes from the lattice gas model, in which parallel adsorption is treated as “local," only occupying one site. When the geometrical blocking effect is included, in which parallel-adsorbed dimers occupy two sites, we would expect the entropy to differ significantly from the result even at the noninteracting level. It is unclear how large a contribution to the full entropy the blocking effect provides, but it is certainly lower for dimers than for longer polymers, for which the nonlocality is greater. Because of this, it will be important in future work to develop a way of incorporating the blocking effect into these types of field-theoretic models.
Dimers are certainly the simplest example of polymers, although biological polymers are typically considerably longer. There is extensive work done on the dimer model, beginning with the work of Fisher [
50,
51]. Some of this work has been motivated by the analogy between dimer models and quantum spin systems [
36]. In this context, Fisher has derived expressions for the contributions of hard-core crowding effects to the partition function, including the geometrical blocking effect. Fisher’s results include an expression of the noninteracting partition function for dimers on a 1D linear lattice. In a future publication, we plan to adapt Fisher’s approach to study the statistical consequences of these blocking effects[
52].
In addition, it has been suggested that methods can be adapted from analogous problems in particle physics[
37] and condensed matter physics[
53]. This involves using numerical codes like those of Adams and Chandrasekharan[
37] that are used for simulations in lattice quantum chromodynamics.
Modeling the behavior of DNA and other biologically active polymeric molecules under physiological conditions is an area in which quantitative calculation is particularly difficult. This is because it is necessary to accommodate simultaneously the influences of geometry, electrostatic forces, and charge correlations. All of these play roles of varying significance in determining the properties of these molecules. Despite these challenges, considerable progress has been made in extending our physical understanding of these systems, and some surprising new physics, such as charge inversion and condensation, have been discovered in the process.
The work presented here focuses on electrostatic effects and especially on the role they play in the entropy, with the overall goal of determining the entropy as a contribution to the free energy and as a driving force in biochemical reactions. For the DNA-based model that we consider, we find that, particularly in the low-coverage regime, a uniform mean field theory gives changes from the usual noninteracting entropy term. We also find, though, that the fluctuation contributions are both considerably larger and in the opposite direction. This establishes that the fluctuations are significant, and the opposite sign of their contribution suggests that they lead to additional order in the system.
Several aspects of the formalism and calculations presented here have been done in such a way as to be accessible to students and others wishing to extend these techniques to more realistic models of polyelectrolytes condensing on the surfaces of DNA. We have shown in
Section 3 why the DNA helices can be treated as a pair of one-dimensional lattices (
Figure 9), albeit with unusual interactions due to the three-dimensional nature of screened Coulomb potential, as shown in
Figure 12. We show that by writing the interaction term in the diagonal basis, as in Eq. (
68) and
Figure 14, the problem can be greatly simplified. In addition, we have chosen a basis in which the eigenvalues and eigenvectors are real, in Eqs. (
71) and (
72), which makes the physical interpretation much more transparent. Then we show in detail in Eq. (
84) and
Appendix A how to perform a Hubbard-Stratanovich transformation of the partition function to decouple the sum over configurations from the sum over lattice sites, which allows one to perform the sum over all configurations of the system exactly. This introduces an integral over an auxiliary field
. Often such an auxiliary field emerges as a complicated complex matrix, but in our diagonal representation it is real and represents the charge density in the diagonal basis, which was shown explicitly for the mean field by comparing Eqs. (
118) and (
125). Also in this diagonal basis, it becomes clear how to choose a proper integration path to have a converging integral. One must then expand the effective action in a power series in the fluctuations of auxiliary field
, as in Eq. (
100), which physically is in powers of the fluctuations of the charge density about the mean field. Setting the first derivative term in this expansion to zero yields the mean field, which itself includes terms of the interaction to all orders, as can be seen in Eq. (
117) or (
118). Whether the expansion is useful depends on the size of the charge fluctuations. The series is an asymptotic one, which means that the infinite series does not necessarily converge. However, as is true for all asymptotic expansions, it is useful as long as the last term kept in the expansion is smaller than the term before it, which is true in our case. This indicates that the mean field, which is the saddle point, is a reasonable first approximation of the system. The fluctuations about this mean field then give insight into how the actual solution differs from the mean field.
In conclusion, we have shown with this simple model the importance of including fluctuation contributions beyond the mean field. The method we have used has allowed us to include vacancies, and we have found that the inclusion of the fluctuation term decreases the entropy by ∼50% in the weak binding regime. This is because, due to the presence of vacancies, the bound dimer concentration is low because of the competition between dimers being repelled from the DNA molecule and the the chemical potential driving them from the solution onto the DNA surface. It is surprising that this decrease in entropy due to correlations is so significant that it overcompensates for the entropy increase at the mean field level, so that the total entropy is even lower than in the absence of interactions between lattice sites. This effect of fluctuations could be important for other biological systems as well, and this approach can be useful where correlations are significant.
Figure 1.
Charge fractionalization occurs when only a fraction of the positive charges of the polyelectrolyte attach to the negatively-charged surface. The charges that are not attached then cause the surface to become positively charged. This shows a freely jointed polyelectrolyte chain of charge
is adapted from
Figure 1b of Nguyen and Shklovskii [
26].
Figure 1.
Charge fractionalization occurs when only a fraction of the positive charges of the polyelectrolyte attach to the negatively-charged surface. The charges that are not attached then cause the surface to become positively charged. This shows a freely jointed polyelectrolyte chain of charge
is adapted from
Figure 1b of Nguyen and Shklovskii [
26].
Figure 2.
Model of DNA as two helical chains of phosphates, shown as red and blue balls, with the yellow lines representing the base pairs. These have been labeled “up" and “down" chains, corresponding to the direction of the carbon atoms in the sugar backbone.
Figure 2.
Model of DNA as two helical chains of phosphates, shown as red and blue balls, with the yellow lines representing the base pairs. These have been labeled “up" and “down" chains, corresponding to the direction of the carbon atoms in the sugar backbone.
Figure 3.
Polyamines as candidates for attachment to DNA. The diamines, including 1,3-diaminopropane (DAP), putrescine (Put), and cadaverine (Cad) have charge , with sufficient length to attach to adjacent phosphates along the DNA chain, with DAP the best fit. These are the most likely polyelectrolytes that the dimers in our model represent. Higher-charged polyamines spermidine(Spd) and spermine (Spn), shown for comparison, are considerably longer. The lattice spacing between phosphates along a helix is shown as a dashed line for comparison.
Figure 3.
Polyamines as candidates for attachment to DNA. The diamines, including 1,3-diaminopropane (DAP), putrescine (Put), and cadaverine (Cad) have charge , with sufficient length to attach to adjacent phosphates along the DNA chain, with DAP the best fit. These are the most likely polyelectrolytes that the dimers in our model represent. Higher-charged polyamines spermidine(Spd) and spermine (Spn), shown for comparison, are considerably longer. The lattice spacing between phosphates along a helix is shown as a dashed line for comparison.
Figure 4.
Double helix of negatively-charged phosphate chains of red and blue balls are wrapped around a cylinder. A parallel dimer attaching to one of the chains would neutralize two sites, while a perpendicular dimer would have one excess charge dangling into the solution, causing that site to have a net positive charge.
Figure 4.
Double helix of negatively-charged phosphate chains of red and blue balls are wrapped around a cylinder. A parallel dimer attaching to one of the chains would neutralize two sites, while a perpendicular dimer would have one excess charge dangling into the solution, causing that site to have a net positive charge.
Figure 5.
Plots of the mean occupation numbers in the noninteracting lattice gas model. Curves are shown for (blue,solid), ⊥ (red,long-dashed), and v (green,short-dashed). The parameters used in the calculation are and . For negative , dimers are attracted to the lattice, while for positive values of this quantity, they are repelled.
Figure 5.
Plots of the mean occupation numbers in the noninteracting lattice gas model. Curves are shown for (blue,solid), ⊥ (red,long-dashed), and v (green,short-dashed). The parameters used in the calculation are and . For negative , dimers are attracted to the lattice, while for positive values of this quantity, they are repelled.
Figure 6.
Plots of the charge densities for the noninteracting lattice gas model. The charge density
is shown as the solid blue curve, and the standard deviation of the charge density,
, is shown as the dashed red curve. The parameters used in the calculation are
and
. As in
Figure 5, dimers are attracted to the lattice when
, and dimers are repelled from the lattice when
.
Figure 6.
Plots of the charge densities for the noninteracting lattice gas model. The charge density
is shown as the solid blue curve, and the standard deviation of the charge density,
, is shown as the dashed red curve. The parameters used in the calculation are
and
. As in
Figure 5, dimers are attracted to the lattice when
, and dimers are repelled from the lattice when
.
Figure 7.
The entropy
per site in the noninteracting lattice gas model. The parameters used in the calculation are
and
. The dashed curves are the individual contributions in Eq. (
33).
Figure 7.
The entropy
per site in the noninteracting lattice gas model. The parameters used in the calculation are
and
. The dashed curves are the individual contributions in Eq. (
33).
Figure 8.
A segment of a single DNA strand. The helix winds along the cylindrical surface with a pitch angle , and a helix of pitch angle connects a site with its neighbor on the other chain. The helix has been stretched in the z direction so that the angle , which is only , can be clearly labeled on the diagram.
Figure 8.
A segment of a single DNA strand. The helix winds along the cylindrical surface with a pitch angle , and a helix of pitch angle connects a site with its neighbor on the other chain. The helix has been stretched in the z direction so that the angle , which is only , can be clearly labeled on the diagram.
Figure 9.
DNA double helix wrapped around a cylindrical tube. If the scissors cut along the solid green line, which is the center of the minor groove, then the paper can be put on a flat surface, with the double helix forming a one-dimensional chain. The coordinate lies along the dashed brown line in the major groove, midway between the up and down chains, and lies along a path on the surface of the cylinder that connects the two phosphates on the two sub-chains that are connected by base pairs.
Figure 9.
DNA double helix wrapped around a cylindrical tube. If the scissors cut along the solid green line, which is the center of the minor groove, then the paper can be put on a flat surface, with the double helix forming a one-dimensional chain. The coordinate lies along the dashed brown line in the major groove, midway between the up and down chains, and lies along a path on the surface of the cylinder that connects the two phosphates on the two sub-chains that are connected by base pairs.
Figure 10.
Diagram of the new coordinates and in terms of the cylindrical coordinates .
Figure 10.
Diagram of the new coordinates and in terms of the cylindrical coordinates .
Figure 11.
In the new coordinates and , the positions of the sites constitute a one-dimensional lattice with two sites per unit cell. Here for the “down"(↓) chain and for the “up"(↑) chain.
Figure 11.
In the new coordinates and , the positions of the sites constitute a one-dimensional lattice with two sites per unit cell. Here for the “down"(↓) chain and for the “up"(↑) chain.
Figure 12.
(a) Distance between sites and (b) screened Coulomb potential as a function of lattice index ℓ, within the same chain () and between the two chains (). The red curve (), representing same-chain repulsion, is an even function, and the blue curve (), representing opposite-chain repulsion, is neither even nor odd. The point for the same chain repulsion is omitted because it would correspond to a site interacting with itself. The inset in (b) is the screened Coulomb interaction as a function of the straight-line distance d between charges. At the physiological temperature of C (300.15 K), 1/(27 meV).
Figure 12.
(a) Distance between sites and (b) screened Coulomb potential as a function of lattice index ℓ, within the same chain () and between the two chains (). The red curve (), representing same-chain repulsion, is an even function, and the blue curve (), representing opposite-chain repulsion, is neither even nor odd. The point for the same chain repulsion is omitted because it would correspond to a site interacting with itself. The inset in (b) is the screened Coulomb interaction as a function of the straight-line distance d between charges. At the physiological temperature of C (300.15 K), 1/(27 meV).
Figure 13.
Plots of the Fourier transforms (solid red curve) and against the dimensionless wavevector k. The symbols and correspond to the real (blue dashed curve) and imaginary parts (green dot-dashed curve) of the complex Fourier transform, respectively.
Figure 13.
Plots of the Fourier transforms (solid red curve) and against the dimensionless wavevector k. The symbols and correspond to the real (blue dashed curve) and imaginary parts (green dot-dashed curve) of the complex Fourier transform, respectively.
Figure 14.
Plots of the eigenvalues of the blocks of .
Figure 14.
Plots of the eigenvalues of the blocks of .
Figure 15.
A plot of the differences
in mean-field average occupation numbers from their noninteracting values for ‖ (blue, solid),
(red, long-dashed), and
v (green, short-dashed). The parameters used in the calculation are
and
. The noninteracting results were shown in
Figure 5.
Figure 15.
A plot of the differences
in mean-field average occupation numbers from their noninteracting values for ‖ (blue, solid),
(red, long-dashed), and
v (green, short-dashed). The parameters used in the calculation are
and
. The noninteracting results were shown in
Figure 5.
Figure 16.
Plots of the differences in the charge densities and standard deviations of the charge density for the mean field of the interacting lattice gas model and the noninteracting model.
is shown as the solid blue curve, and
is shown as the dashed red curve. The parameters used in the calculation are
and
. The noninteracting results were shown in
Figure 6.
Figure 16.
Plots of the differences in the charge densities and standard deviations of the charge density for the mean field of the interacting lattice gas model and the noninteracting model.
is shown as the solid blue curve, and
is shown as the dashed red curve. The parameters used in the calculation are
and
. The noninteracting results were shown in
Figure 6.
Figure 17.
The functional form of mean-field entropy for the interacting lattice gas model at the mean field level. The constituent parts of the entropy are shown as well.
Figure 17.
The functional form of mean-field entropy for the interacting lattice gas model at the mean field level. The constituent parts of the entropy are shown as well.
Figure 18.
The difference between the mean-field entropy for the interacting lattice gas model and the entropy for the noninteracting model.
Figure 18.
The difference between the mean-field entropy for the interacting lattice gas model and the entropy for the noninteracting model.
Figure 19.
The inverse propagator for the fluctuations of the auxiliary Hartree field .
Figure 19.
The inverse propagator for the fluctuations of the auxiliary Hartree field .
Figure 20.
The propagator for the fluctuations of the auxiliary Hartree field .
Figure 20.
The propagator for the fluctuations of the auxiliary Hartree field .
Figure 21.
The changes in the occupancies
due to fluctuations. These should be compared with the changes between occupancies of the mean-field and noninteracting systems shown in
Figure 15.
Figure 21.
The changes in the occupancies
due to fluctuations. These should be compared with the changes between occupancies of the mean-field and noninteracting systems shown in
Figure 15.
Figure 22.
The changes in the charge density
and in the standard deviation
of the charge density due to fluctuations. These are further corrections to the mean-field values of
Figure 16.
Figure 22.
The changes in the charge density
and in the standard deviation
of the charge density due to fluctuations. These are further corrections to the mean-field values of
Figure 16.
Figure 23.
The fluctuation contribution to the entropy per site. The two contributions, and , to the total entropy are also shown, and they are defined similarly.
Figure 23.
The fluctuation contribution to the entropy per site. The two contributions, and , to the total entropy are also shown, and they are defined similarly.
Figure 24.
The average occupation numbers in the noninteracting approximation (), shown as dotted curves, the mean-field approximation (), shown as dashed curves, and the model including the lowest order corrections to mean field, shown as solid curves, plotted for (blue), ⊥ (red), and v(green). Parameters used in the plot are and .
Figure 24.
The average occupation numbers in the noninteracting approximation (), shown as dotted curves, the mean-field approximation (), shown as dashed curves, and the model including the lowest order corrections to mean field, shown as solid curves, plotted for (blue), ⊥ (red), and v(green). Parameters used in the plot are and .
Figure 25.
The average charge per site in the noninteracting (green, short dashes), mean-field (blue, long dashes) and inclusion of fluctuations (red, solid). This plot assumes and .
Figure 25.
The average charge per site in the noninteracting (green, short dashes), mean-field (blue, long dashes) and inclusion of fluctuations (red, solid). This plot assumes and .
Figure 26.
The entropy S per site, in units of the Boltzmann constant , in the noninteracting (green, short dashes), mean-field (blue, long dashes), and and inclusion of fluctuations (red, solid) approximations. These curves assume and .
Figure 26.
The entropy S per site, in units of the Boltzmann constant , in the noninteracting (green, short dashes), mean-field (blue, long dashes), and and inclusion of fluctuations (red, solid) approximations. These curves assume and .