Preprint

Article

Entropy of Charge Inversion in DNA Including One-Loop Fluctuations

Altmetrics

Downloads

114

Views

Comments

A peer-reviewed article of this preprint also exists.

Matthew D. Sievert^*,

Marilyn F. Bishop^*

,Tom McMullen

Matthew D. Sievert^*,

Marilyn F. Bishop^*

,Tom McMullen

This version is not peer-reviewed

Submitted:

13 July 2023

Posted:

14 July 2023

You are already at the latest version

Alerts

Abstract

The entropy and charge distributions have been calculated for a simple model of polyelectrolytes attached to the surface of DNA using a field-theoretic method that includes fluctuations to the lowest one-loop order beyond mean-field theory. Experiments have revealed correlation-driven behavior of DNA in charged solutions, including charge inversion and condensation. In our model, the condensed polyelectrolytes are taken to be doubly charged dimers of length comparable to the distance between sites along the phosphate chains. Within this lattice-gas model, each adsorption site is assumed to have either a vacancy or a positively-charged dimer attached with the dimer oriented either parallel or perpendicular to the double helix DNA chain. We find that the inclusion of the fluctuation terms decreases the entropy by ∼50% in the weak binding regime. There, the bound dimer concentration is low because the dimers are repelled from the DNA molecule, which competes with the chemical potential driving them from the solution to the DNA surface. Surprisingly, this decrease in entropy due to correlations is so significant that it overcompensates for the entropy increase at the mean field level, so that the total entropy is even lower than in the absence of interactions between lattice sites. As a bonus, we present a transparent exposition of the methods used that could be useful to students and others wishing to use this formulation to extend this calculation to more realistic models.

Keywords:

Subject: Physical Sciences - Biophysics

1. Introduction

Entropy is often described as a measure of disorder. Its importance is that the free energy of a system involves a trade-off between the entropy and the energy. The simplest example of this is the Helmholtz free energy

F = E - T S

, which applies to a system with a fixed number of particles and a fixed volume at constant temperature. Because of the complexity of biological systems, there are many instances in which the

T S

term of the free energy either dominates the energy term, or at least plays an important role in the free energy minimization that determines equilibrium and drives the dynamics of the system.

As a consequence, the determination of the optimal structure of many biologically important systems requires minimization of the free energy, and hence this competition between the internal energy and entropy. Often in biological systems the changes in entropy are greater than the changes in internal energy. For instance, in the hydrophobic effect, an unfolded protein lowers the entropy by ordering the water molecules, and so the protein prefers to be in the folded state[1,2]. In the chloroplast stroma, it has been shown that there is an entropy-driven attraction that determines chloroplast ultrastructure through spontaneous Mg

^{2 +}

-induced stacking of membranes[3].

In sickle hemoglobin, the aggregation of monomers into polymers[4] is also entropy driven, with the internal energy and entropy in a delicate balance. In fact, Cao and Ferrone [5] measured the chemical potential

μ_{P C}

, which is the "glue" that holds the polymer together, and found that the entropic contribution to this chemical potential is around

- 11

kcal/mol, which is more than 100% of the chemical potential itself. Another example is F-actin, whose fibrils stack in cross-linked rafts when positive alkaline earth ions are in the solution[6]. In F-actin counter-ions form one-dimensional charge density waves that have a periodicity equal to twice the actin monomer spacing, coupling to twist distortions of the oppositely-charged actin filaments[7]. These phenomena and others owe their existence to Coulomb interactions between the constituent parts, and the geometry of the underlying structure can also play a vital role.

Thermodynamically, the distribution of charge in the solution is governed by a competition between energy and entropy. The bound state in which the ions are condensed on the surface of the macroion/polyelectrolyte is energetically-favored, and the continuum state, in which the ions are free to drift in any direction, is entropically-favored. The balance between these two factors in minimizing the free energy has been shown to vary significantly based on the geometry of the macroion [8]. This interplay between electrostatics and geometry is particularly important for polyelectrolytes in solution, and they have shown to produce a wide array of intriguing phenomena in many different systems. One example is DNA, which in the presence of a physiological salt solution (a 0.1 molar solution of NaCl) is usually negatively charged, with a double helix DNA molecule strongly repelling another. However, if DNA is in a dilute salt solution in which a positively charged polyelectrolyte, such as spermine or sperimidine, has been added to the solution, it has been shown to roll up into a tightly-packed torus[8,9,10,11]. In fact, DNA is usually in a very compact state in cells and viruses.

Historically, the most common description of charged solutions has been Poisson-Boltzmann theory, but continued research has indicated that charged species in solution can have far more complex and counterintuitive effects than simple charge screening [8]. In 1969 Manning proposed [12,13] that a portion of the ambient ions condense onto (i.e., attach to) the surface of a charged macro-ion, partially neutralizing the bare charge. This occurs up to the point at which the energy required to condense another ion equals the available thermal energy

k_{B} T

at temperature T, where

k_{B}

is the Boltzmann constant. The proposed effect, later termed “Manning condensation," marked a significant departure from linearized Poisson-Boltzmann theory that predicted only exponential screening. In Manning’s treatment of polyelectrolytes, which are long chains of charged subunits, the ion condensation was addressed [8] separately from the ions that remain in solution, which were treated using linearized Poisson-Boltzmann theory [12,13].

Further refinements in the treatment of these ambient-ion solutions have been motivated in part by surprising effects observed in DNA that cannot be accounted for using Poisson-Boltzmann theory. By using multivalent cations, modifying the salts in the solutions, and even using alcohol solvents, it is possible to cause the DNA to undergo a radical structural transition into a variety of new geometries, including rod-like bundles and toroids [14]. The toroidal structures, in particular, have received considerable attention in the biological community because such toroidal packing motifs are employed by spermidine and other molecules to contain their own DNA in small volumes [15]. A variety of techniques, including cryo-transmission electron microscopy [15] and UV spectroscopy [16], have enabled direct observation of the formation of these toroidal condensates, and similar studies on other biological polyelectrolytes like F-actin [6] indicate that condensation is a general phenomenon. If the electrical interactions between the polyelectrolyte, such as DNA, and the ambient-ion solution are purely those predicted by the Poisson-Boltzmann description, then two like-charged polyelectrolytes always repel one another, and such condensates will not form. For these condensates to be stable, the net electrical force between like-charged polyelectrolytes must be

attractive

, which is incompatible with the Poisson-Boltzmann theory of ions in solution.

Many of the approaches to understanding the role of polyelectrolytes in biological systems have adopted models of continuum electrostatics, and some of these have focused on solving the Poisson-Boltzmann equation for a cylindrical model of DNA in the presence of multivalent ions[8,17]. However, it has been shown that including charge correlations is essential to understanding these systems[18,19]. Recognizing the need for including fluctuations beyond the mean field, Ha and Liu used a field theoretic approach that produces a systematic expansion taking these effects into account[20,21,22,23]. They employed a simplified model of a DNA particle as a charged cylindrical rod composed of cylindrical charged segments and considered these rodlike particles in the presence of polyvalent counterions. Their first model consisted of two rodlike DNA molecules[20], and they found attraction between the rods. When Podgornik and Parsegian suggested that the fluctuation forces between such rodlike polylectrolytes might not be pairwise additive[24], Ha and Liu extended their calculation to include bundles of rodlike particles and included a one-dimensional form factor depending on the ionic size in order to incorporate short-ranged correlations along the rod length approximately[21,22]. They found that the breakdown of pairwise additivity can lead to qualitatively new behavior, although they still found that that the rods attracted one another under some conditions. Shklovskii then pointed out that the previous results seemed to be independent of radius, and that, in a highly correlated system of negatively-charged cylindrical rods in the presence of positive counterion charges, those positive charges can form a Wigner crystal, which will be three-dimensional and closely packed for small rod radii and form two-dimensional Wigner crystals on the surfaces of rods for large radii[25]. Ha and Liu saw merit in the arguments for the large radius limit but disagreed with the the two-dimensional crystal lattice limit, which they said was due to Shklovskii incorrectly assuming pairwise additive interactions between various surfaces, which would make that limit invalid[22].

To show how the radius of a DNA chain could affect multivalent-counterion-induced attraction between negatively-charged DNA chains, Ha and Liu used a modified model of DNA[23]. They assumed, as before, that a DNA rod is a stack of short cylinders that have a finite radius and length, but to treat two-dimensional charge fluctuations on the surface of the rod, they divided the counterions into two classes, free and condensed, with the condensed counterions modifying the local charge at the surface of the cylinder, giving rise to charge fluctuations on the two-dimensional surface of each cylinder. With this model, they found that the competition between attractive and repulsive interactions tended to balance one another, resulting in no attraction at all for thick rods. They also found that that for valence of the polyvalent cations greater that around 3, the spacing of chains in a bundle and the size of bundles appears nearly independent of the nature of the bundling agent. This is because the increased valence leads to a stronger screening as well as a stronger attraction.

Recognizing that geometry was playing a crucial role in the attraction, Nguyen and Shklovskii [26,27] proposed a simple theory of charge inversion that considers the structure of the polyelectrolyte together with its electrostatic interaction with the substrate. The idea was that there is fractionalization of charge in which a polyelectrolyte can either neutralize the charge or reverse it depending on how it attaches. In that model, a single double-helix strand of DNA is represented by a rigid cylinder with two one-dimensional lattices of negative charges

- e

in helices around the surface to represent DNA’s double helix of negatively-charged phosphates. In order to model the polyelectrolyte solution in which the DNA is immersed, they take the positively-charged species to be freely-jointed chains. These adsorbing species have multiple charges that may partially attach to the surface, with excess charge protruding into the solution, as shown in Figure 1. They assumed that all the negative charges on the DNA are neutralized by the polyelectrolytes, so that there will be no vacancies, and because there is excess charge dangling into the solution, the charge on the surface can be not only neutralized but reversed. They also assume that the only role of the background salt solution is to provide screening for the interaction between the polyelectrolyte chains. Since they assumed no vacancies on the one-dimensional chains, they could easily count the possible configurations. In calculating the electrical potential, they took the negative charges of the DNA to be spread uniformly on the surface of the cylindrical DNA surface and used simple electrostatic arguments with screened potentials to calculate the energies involved. For polyelectrolytes in a 0.1M NaCl solution with DNA, Nguyen and Shklovskii[26] give in their Eq. (6) an estimate of the net charge inversion, the charge density on DNA divided by its bare charge density. The largest value of that estimate is about 0.07.

Since the simple calculation of Nguyen and Shklovskii[26] did not include the discrete nature of the phosphate sites in calculating the electrostatics and used a simple version of the entropy, Bishop and McMullen[28] decided to use a more rigorous formalism to calculate the charge reversal in DNA. Starting with the simplifying assumption that the polyelectrolytes could be considered as dimers (two units in length) on DNA, Bishop and McMullen allowed for vacancies and used a model of the discrete location of phosphates on the DNA surface, with interactions between charges located on the various sites. In that model, doubly-charged polyelectrolyte molecules attach to the DNA either parallel or perpendicular to the DNA surface and confirmed that under the right conditions, the charge on the surface could be inverted. The use of the lattice model allowed them to accomdodate vacancies, which is not possible if the charge is treated in a continuum manner. Inspired by the work of Ha and Liu[20,21,22,23], they used a field-theoretic approach that employs a loop expansion, where the mean field contains polyelectolyte ions adsorbed on the surface of the DNA, similar to their approach. While the general formalism for the one-loop expansion was presented in that paper, the numerical calculations stopped at the mean field level, and the current work extends that to include fluctuations at the one-loop method. The emphasis here is on the entropy and the role it plays in the configuration of charged species attached to the lattice. The details of this model will be explained in detail in the next section.

The results of these theories indicate that the correlation effects in the solution are also strongly-dependent on the system geometry. To understand this, consider the electric potential outside spherical, cylindrical, and planar positively-charged surfaces in vacuum. For spheres, the potential decays as the inverse of the distance; for the cylinder, the potential decreases logarithmically with the distance; and for the plane, the potential decreases linearly with the distance. Gelbart et al. [8] assert that, in an ambient-ion solution, the entropy of the point ions varies logarithmically with their concentration, and therefore with the distance from the cylinder. Thus they conclude that, in a crude comparison, for spherical geometry, the

r^{- 1}

potential is dominated by the logarithmic entropy; for planar geometry, the entropy is dominated by the linear potential; and for cylindrical geometry, both the energy and the entropy vary logarithmically.[29] In this paper, we will incorporate the impact of the ions in solution on the free energy through the chemical potential of these ions, as detailed in Appendix B.

The helical geometry of DNA, then, sits precisely balanced on the fulcrum between energy- and entropy-driven processes under physiological conditions. The geometry of the biomolecule plays an even more crucial role when the highly-charged ions in solution are not point charges but have geometries of their own. Such is the case for DNA immersed in a solution of polyelectrolytes. These polyelectrolytes can be proteins or other fragments of biomolecules which are routinely found in the nucleus [30], so that, again, the central biological processes occur in precisely the most difficult regimes to model.

For charge inversion on DNA with polyelectrolytes, “physiological conditions" require incorporating the combined effects of charge correlation, thermodynamic fluctuations, crowding, and geometrical considerations all at once. As we have discussed in this introduction, there has been considerable work in addressing all of these issues. Some approaches treat only the geometry, with no interactions[31]. Others include both geometry and interactions, but use a continuum model for electrostatics that neglects discreteness effects[32]. The lattice gas dimer model is unique in its simultaneous consideration of all these effects, and the thermodynamic and geometrical idealizations it makes can be systematically improved.

While the formalism of Bishop and McMullen[28] included both a mean field theory and corrections due to fluctuations or charge correlations, the computed results were obtained only at the mean field level. In this paper, this simple model is extended to calculate the fluctuation corrections and the entropy of the system. The purpose is to determine the importance of the fluctuation terms for inverting the charge and to see whether these terms have a significant impact on the entropy of the system. A preliminary version of this work was in Sievert’s master’s thesis[33]. In Section 2 of this paper, we outline the model of the charged binding sites on DNA and present the computation of the entropy due only to the hard-core repulsion that prevents multiple binding on the same site. In Section 3 we explain the geometry of the double helix and show how it can be represented as a one-dimensional lattice and in Section 4 derive the form of the potential and determine the orthogonal transformation that diagonalizes it. In Section 5, we use functional integral techniques to derive the partition function, which uses a Gaussian integral identity to perform the sum over configurations exactly, at the expense of an integral over a new auxiliary field. In Sec Section 6, we show how the grand canonical potential can be computed order by order, in which the first two terms are the mean field and the one-loop correction to the mean field. In Section 7, we examine the saddle point equation that defines the mean field, and this is used to calculate the entropy, the number densities of all species, and the charge density. In Section 8, we find the explicit form for the inverse-Hartree-field-fluctuation propagator. In Section 9, we include the one-loop order terms in the calculations to reveal the effects of fluctuations. Finally, in Section 10, we compare the results of the various approximations. Details concerning the Gaussian integral identity and the chemical potential of dimers in solution are given in Appendix A and Appendix B.

2. The Noninteracting DNA Lattice Gas

Our model system starts with a simplified version of the DNA molecule itself, as shown in Figure 2, with double helical chains of phosphate ions connected by base pairs. The phosphate ions are represented by red balls and blue balls, with the base pairs represented by yellow lines. The two different chains are labeled "up chain" and "down chain", which correspond to the direction of the carbon atoms in the sugar backbone. We will assume that each phosphate site has charge

- e

, where e is magnitude of the charge of an electron, and that there are no charges either between the phosphate sites along the helical chain or in the vicinity of the base pairs. In solution are polyelectrolyte ions of charge

+ 2 e

, which we call dimers. These dimers are assumed to be the length of the spacing between two phosphate ions along a helical chain, which is

a^{(1)} = 0.684

= 6.84

Å, which is longer than a diatomic molecule. Possible candidates for these species are the diamines, 1,3-diaminopropane (DAP

^{2 +}

), putrescine (Put

^{2 +}

), and cadaverine (Cad

^{2 +}

). An extension of this model could include more highly charged spermidine(Spd

^{3 +}

) or spermine (Spn

^{4 +}

). These polyamines are shown in Figure 3, with the lattice spacing

a^{(1)}

shown as the dashed line for comparison. Although we will only consider the doubly charged species in our model, the model and analysis could be extended to these more highly-charged species in the future.

Deng and Bloomfield have shown using Raman difference spectra, for the systems they studied, that in the presence of spermidine or spermidine these polyvalent cations bind electrostatically near the DNA phosphates[34]. van Dam et al. agree with this conclusion and suggest that the A form of DNA is stabilized by polyamines fitting perfectly between the phosphate ions[35]. They conclude that DAP, which has charge

+ 2 e

probably has the best fit to the phosphate lattice of all the polyamines, and that longer species, like spermine and sperimidine are longer and probably also interact with base pairs. In our model, we are assuming that there is no interaction with the base pairs.

The choice of Bishop and McMullen[28] to model dimers was motivated by the wealth of studies (refs. [36] and [37], among others) in the literature about dimer models in other branches of physics, as well as for geometrical simplicity. We will continue to use this same model. Dimers, the shortest polyelectrolytes, have only two possible orientations when adsorbed on DNA, assuming that the length of the dimer is comparable to the helical spacing between charged sites on DNA, as shown in Figure 4. As we stated earlier, this should not be confused with a diatomic molecule, which would not be long enough to span the space distance between two phosphate ions on one of the helical chains of DNA.

In our model, a dimer lying on the cylindrical surface of the molecule must lie parallel to the helical strands, neutralizing the charge on two adjacent sites. Otherwise, the dimer must adsorb perpendicular to the cylindrical surface, extending one end radially out from the central axis of the cylinder and inverting the charge on a single site. Charge inversion by dimers, then, is quite similar to charge inversion due to two species of point ions: one monovalent species representing parallel-adsorbed dimers and one divalent species representing perpendicular-adsorbed dimers. In the previous work, Bishop and McMullen[28] modeled the adsorption of dimers in a lattice-gas model as a two-component solution of point ions, allowing the possibility of vacancies, and used field-theoretic methods to describe the thermodynamics of the system. They carried their calculations to a mean-field level of approximation of the inverted charge on DNA, but did not calculate the entropy. Their work confirmed the possibility of charge inversion within this model.

Those computations yielded the charge per site on the DNA helix as a function of the chemical potential, or equivalently of the concentration of the polyelectrolyte in solution. While the Nguyen-Shklovskii[26] calculation assumed complete filling of the lattice, the Bishop-McMullen approach allowed for vacancies, represented as negatively-charged sites, in addition to the neutral or positively-charged sites arising from dimer adsorption. However, the lattice-gas model, which assumes all sites are equivalent, does not take into account that the parallel dimer occupies two sites. In this model, the binding energy of the parallel dimer is taken to be

ε^{(‖)}

and for the perpendicular dimer

ε^{(⊥)}

, and these energies can be adjusted to account for the difference in occupancy. In practice, the occupation of two sites is mimicked by making the binding energy of the parallel dimer twice as large as that of the perpendicular dimer, and we will use the same approach here in the numerical calculations. The assignment of binding energies to these species is a simple way of including the complicated electrostatics that allows the polyelectrolyte ions to bind to the surface of the DNA. This is analogous to the "fermion" model of Ha and Liu[22], which assumes that each site can either be empty or occupied by one counterion. Here, as in the earlier Bishop and McMullen work, we have three species, parallel dimers, perpendicular dimers, and vacancies.

This approach allows us to describe adsorption of either species independently for each site. Any configuration of the DNA-dimer complex in this model can then be described by identifying the type of dimer (parallel, perpendicular, or none) adsorbed on each site on the DNA molecule. Such a model also resembles the lattice gas model of condensed matter physics [38], which treats the ways of distributing particles of different types onto a regular lattice of sites. In this section, we assume that the three different species on the lattice do not interact with one another, and in this case, the structure of the lattice does not matter. We call this the noninteracting model. This does not actually mean that there are no interactions. The interaction of the polyelectrolytes with the DNA can actually be quite strong, depending on the values of

ε^{(‖)}

and

ε^{(⊥)}

, and their attachment to the lattice is analogous to what Ha and Liu call condensed counterions[22]. Also, we will assume that there is only one species per site, which corresponds a hard-core repulsion between sites.

In developing the formalism, it is convenient to consider a vacancy as a third species

γ

of particle, because then we can impose the constraint that each site is singly occupied, either by a parallel dimer

(γ = ‖)

, a perpendicular dimer (

γ = ⊥

), or a vacancy (

γ = v

). For each our three species of “particles" that can reside on our lattice sites, we define a relative charge

q_{γ}

in units of the magnitude e of the charge of an electron. These relative charges are then

q_{v} = - 1

for vacancies,

q_{‖} = 0

for parallel dimers, and

q_{⊥} = + 1

for perpendicular dimers. We will assume that the binding energy

ε_{γ}

of species

γ

to the lattice depends only on the type of species and not the location. We will be specifying each lattice point by its location ℓ along helix b, with the pair

(ℓ, b)

specifying that lattice position. Then the quantity

n_{ℓ, b}^{γ}

will be the number of particles of species

γ

on the lattice site at ℓ on chain b. For each chain, ℓ extends from

- N

N

, such that the total number of sites is

N_{sites} = 2 (2 N + 1)

, and we employ periodic boundary conditions. There will also be free polyelectrolyte ions in solution, and their influence on the stability of the adsorbed dimers will be through their chemical potential.

We begin by considering the Hamiltonian

H_{NI}

for this noninteracting system (that is, with no interactions between different sites aside from the hard-core repulsion that blocks double-occupancy), which is

H_{NI} = \sum_{ℓ, b} \sum_{γ} ε_{γ} n_{ℓ, b}^{(γ)} .

(1)

Because we have a system that exchanges particles with its surroundings, specifically the ions in the solution surrounding the DNA, which attach to the surface, we work in the grand canonical ensemble. If the system contains

N_{γ}

particles of type

γ

in equilibrium with its surroundings, with an average internal energy E, the grand canonical potential

Ω_{G}

, which is a function of the temperature T, volume V, and chemical potentials

μ_{γ}

for each species

γ

, is written as[38,39],

Ω_{G} (T, V, μ) = E - T S - \sum_{γ} μ_{γ} N_{γ},

(2)

where the number of particles of type

γ

is given by

N_{γ} = \sum_{ℓ, b} n_{ℓ, b}^{(γ)} .

(3)

Then the entropy can be written in terms this potential as

S = - {(\frac{\partial Ω_{G}}{\partial T})}_{V, {μ_{γ}}} = k_{B} β^{2} {(\frac{\partial Ω_{G}}{\partial β})}_{V, {μ_{γ}}},

(4)

where

β = \frac{1}{k_{B} T}

and

k_{B}

is Boltzmann’s constant. In this partial derivative, the volume and all the chemical potentials

μ_{γ}

of all species

γ

are held constant. It is convenient to define the grand canonical partition function

Z_{G}

in terms of the grand canonical potential

Ω_{G}

Z_{G} = \sum_{configurations} e^{- β (E - \sum_{γ} μ_{γ} N_{γ})} = e^{- β Ω_{G}},

(5)

where the sum over configurations includes all the accessible states of the system. The grand canonical potential can alternatively be written in terms of the partition function as

Ω_{G} = - \frac{ln Z_{G}}{β},

(6)

and this allows us to write an expression for the entropy in terms of

Z_{G}

as:

S = k_{B} ln Z_{G} - \frac{k_{B} β}{Z_{G}} {(\frac{\partial Z_{G}}{\partial β})}_{V, {μ_{γ}}} .

(7)

For the noninteracting lattice, the average internal energy is represented here by the Hamiltonian (1),

E = H_{N I}

, and the noninteracting grand canonical partition function

Z_{NI}

becomes

Z_{NI} = \sum_{configurations} e^{- β \sum_{ℓ, b} \sum_{γ} (ε_{γ} - μ_{γ}) n_{ℓ, b}^{(γ)}},

(8)

where we have suppressed the G subscript for simplicity.

Since a vacant site does not really correspond to a particle, we recognize that energy and chemical potential of the vacancy must be related such that

ε^{(v)} - μ_{v} = 0

. In addition, because the dimers adsorbed on the surface and those in solution are in equilibrium,

μ_{‖} = μ_{⊥} = μ_{dimer}

is the chemical potential of dimers in solution at the appropriate concentration. We thus need an estimate for

μ_{dimer}

. Approximating the dimer as a uniformly-charged cylinder, this value is shown in Appendix B to be

β μ_{dimer} ≃ 0.79

at the physiological temperature of

T ≃ 310

The average occupancy

{〈 n^{(γ)} 〉}_{NI}

for species

γ

per site is found by taking the derivative of this partition function with respect to

μ_{γ},

{〈 n^{(γ)} 〉}_{NI} = \frac{1}{N_{sites} β Z_{NI}} \frac{\partial Z_{NI}}{\partial μ_{γ}} .

(9)

This can be verified by taking the derivative of Eq. (8).

{〈 n^{(γ)} 〉}_{NI} = \frac{1}{N_{sites} Z_{NI}} \sum_{configurations} \sum_{ℓ, b} n_{ℓ, b}^{(γ)} e^{- β \sum_{ℓ, b} \sum_{γ} (ε_{γ} - μ_{γ}) n_{ℓ, b}^{(γ)}},

(10)

which is by definition the average number per site of species

γ

Continuing with the evaluation of the partition function, we note that the exponential of a sum can be written as the product of exponentials, enabling us to rewrite the partition function (8) as

Z_{NI} = \sum_{configurations} \prod_{ℓ, b} \prod_{γ} {[e^{- β (ε_{γ} - μ_{γ})}]}^{n_{ℓ, b}^{(γ)}} .

(11)

In order to simplify and to appreciate the physical meaning of this expression, it is useful to define the relative activity [40] of species

γ

in the noninteracting lattice-gas model, given by

a_{NI}^{(γ)} = e^{- β (ε_{γ} - μ_{γ})} .

(12)

Note that this is independent of the lattice site ℓ or chain b. The partition function can then be written in the simple form

Z_{NI} = \sum_{configurations} \prod_{γ} \prod_{ℓ, b} {[a_{NI}^{(γ)}]}^{n_{ℓ, b}^{(γ)}} .

(13)

Because we have assumed that there can be only one species per site, parallel dimer, perpendicular dimer, or vacancy, the sum over configurations can now be done over each site separately, where there are three possible configurations

{n_{ℓ, b}^{(⊥)}, n_{ℓ, b}^{(‖)}, n_{ℓ, b}^{(v)}} = {1, 0, 0}, {0, 1, 0}, o r {0, 0, 1} .

(14)

This is the same as saying that

n_{ℓ, b}^{(γ)} = 1

for one and only one of

γ = ⊥, ‖

, or v and is zero otherwise. This means that

\sum_{\begin{matrix} \sin gle - site \\ configurations \end{matrix}} \prod_{γ = ⊥, ‖, v} {[a_{NI}^{(γ)}]}^{n_{ℓ, b}^{(γ)}} = \sum_{γ} a_{NI}^{(γ)},

(15)

and so the grand partition function is

Z_{NI} = \prod_{ℓ, b} (\sum_{γ} a_{NI}^{(γ)}) .

(16)

Since every term in the product is the same, the expression in parentheses is simply raised to the power

N_{sites}

, giving

Z_{NI} = {(\sum_{γ} a_{NI}^{(γ)})}^{N_{sites}} .

(17)

At this point, we can easily see that we can obtain the average number per site using our derivative form from Eq. (9) as

{〈 n^{(γ)} 〉}_{NI} = \frac{1}{N_{sites} β Z_{NI}} \frac{\partial}{\partial μ_{γ}} {(\sum_{γ^{'}} a_{NI}^{(γ^{'})})}^{N_{sites}} .

(18)

Taking the derivative and substituting the expression for

Z_{NI}

in the denominator, we have

{〈 n^{(γ)} 〉}_{NI} = \frac{N_{sites}}{N_{sites} β (\sum_{γ^{''}} a_{NI}^{(γ^{''})}) N_{sites}} {(\sum_{γ^{'}} a_{NI}^{(γ^{'})})}^{N_{sites} - 1} \frac{\partial}{\partial μ_{γ}} a_{NI}^{(γ)},

(19)

where the derivative of the activity is

\frac{\partial}{\partial μ_{γ}} a_{NI}^{(γ)} = \frac{\partial}{\partial μ_{γ}} e^{- β (ε_{γ} - μ_{γ})} = β a_{NI}^{(γ)} .

(20)

Using this in the expression for the mean site occupancy in Eq. (19) and cancelling

N_{sites} - 1

factors of the sum over

γ

, the mean occupancy for species

γ

is given by

{〈 n^{(γ)} 〉}_{NI} = \frac{a_{NI}^{(γ)}}{\sum_{γ^{'}} a_{NI}^{(γ^{'})}} .

(21)

In Figure 5, we show the mean occupancies for the three species versus

ε_{‖}

, with

ε_{‖} = 2 ε_{⊥}

. Negative

ε_{‖}

corresponds to an attraction of dimers to the lattice, and we see that parallel adsorption of dimers dominates because of the stronger binding, followed by perpendicular adsorption, with vacancies becoming nonexistent. Positive

ε_{‖}

corresponds to a repulsion of dimers from the lattice, so that at large

ε_{‖}

, the ordering is reversed, for the same reasons. At small positive

ε_{‖}

, the dimers are repelled from the lattice, but this competes with the chemical potential, which tries to put dimers back onto the lattice. In this low-coverage regime, perpendicular adsorption dominates, while vacancies dominate for large

β ε_{‖}

because the dimers would prefer to stay in solution. For

ε_{‖}, ε_{⊥} \to 0

, the parallel and perpendicular mean occupancies become the same. Similarly, where

ε_{‖} - μ_{dimer} = 0

{〈 n^{(‖)} 〉}_{NI} = {〈 n^{(v)} 〉}_{NI}

because, as mentioned earlier,

ε_{v} - μ_{dimer}

is always zero. Also,

{〈 n^{(⊥)} 〉}_{NI} = {〈 n^{(v)} 〉}_{NI}

where

ε_{‖} = 2 μ_{dimer}

because that is where

ε_{⊥} = μ_{dimer}

. It is at this point where

{〈 n^{(‖)} 〉}_{NI}

becomes a maximum.

The average charge per site can now be determined by multiplying

q_{γ}

, the charge of species

γ

, by

n_{ℓ, b}^{(γ)}

the occupation of species

γ

on site ℓ of chain b,

ρ_{ℓ b} = \sum_{γ} q_{γ} n_{ℓ b}^{(γ)} .

(22)

Since we assume that every site has either a parallel dimer, a perpendicular dimer, or a vacancy, with single occupancy, then for a given site and species,

n_{ℓ b}^{(γ)}

is either 0 or 1. The average charge density is then the sum the averages of the individual terms,

ρ_{NI} \equiv {〈 ρ_{ℓ b} 〉}_{NI} = \sum_{γ} q_{γ} {〈 n^{(γ)} 〉}_{NI} .

(23)

This charge density is plotted as the solid blue curve in Figure 6. Because a parallel dimer has no charge,

q_{‖} = 0

, a perpendicular dimer has a positive unit charge,

q_{⊥} = + 1

, and a vacancy has a negative unit charge,

q_{v} = - 1

, the total mean field charge per site

ρ_{c}

in Figure 6 is the difference between the curves

{〈 n^{(⊥)} 〉}_{NI}

and

{〈 n^{v} 〉}_{NI}

in Figure 5 and goes to zero where those two curves cross.

A measure of the magnitude of the charge fluctuations is given by the average of the square of the charge on a site minus the square of its average, which is the charge variance

σ_{N I}^{2}

, given by

σ_{NI}^{2} = {〈 ρ^{2} 〉}_{NI} - ρ_{NI}^{2},

(24)

where

{〈 ρ^{2} 〉}_{NI} = {〈 ρ_{ℓ b}^{2} 〉}_{NI},

(25)

and

ρ_{ℓ b}^{2} = {[\sum_{γ} q_{γ} n_{ℓ b}^{(γ)}]}^{2} = \sum_{γ, γ^{'}} q_{γ} q_{γ^{'}} n_{ℓ b}^{(γ)} n_{ℓ b}^{(γ^{'})} .

(26)

The product

n_{ℓ b}^{(γ)} n_{ℓ b}^{(γ^{'})}

describes a double occupancy of site

(ℓ, b)

by species

γ

and

γ^{'}

. Since we have required single occupancy, then we must have

γ = γ^{'}

. Also, since

n_{ℓ b}^{(γ)}

can only take the values 0 or 1,

{[n_{ℓ b}^{(γ)}]}^{2} = n_{ℓ b}^{(γ)} .

(27)

Then the charge density per site reduces to

ρ_{ℓ b, NI}^{2} = \sum_{γ} q_{γ}^{2} n_{ℓ b, NI}^{(γ)} .

(28)

Taking the thermal average of both sides, we have

{〈 ρ^{2} 〉}_{NI} \equiv {〈 ρ_{ℓ b}^{2} 〉}_{NI} = \sum_{γ} q_{γ}^{2} {〈 n^{(γ)} 〉}_{NI} .

(29)

In Figure 6, we have plotted the standard deviation in the charge density,

σ_{NI}

, which is the square root of the charge variance, for the noninteracting model. As we saw in Figure 5, there are three distinct regions,

ε_{‖} ≪ 0

ε_{‖} ≫ 0

and

ε_{‖}

near

μ_{dimer}

. These three regions are also reflected in Figure 6. At large negative

ε_{‖}

, parallel adsorption dominates, which leads to

ρ \approx 0

. At large positive

ε_{‖}

, vacancies dominate, leading to

ρ < 0

, and for

ε_{‖}

near

μ_{dimer}

, there is a small window where perpendicular adsorption of dimers dominates, leading to positive values of

ρ

. Correspondingly, the fluctuations, represented by the standard deviation

σ_{N I}

, become small when

| ε_{‖} |

becomes large, and the fluctuations are largest at small

| ε_{‖} |

, when there are comparable numbers of all species. The phenomenon of charge inversion is demonstrated in Figure 6 because the average charge is positive, indicating that sufficiently many dimers adsorb in a perpendicular configuration to invert the charge on the molecule from negative to positive. The magnitude of charge inversion increases in the weak-binding limit

ε_{⊥}, ε_{‖} \to 0

. The maximum of this inversion in Figure 6 is about 0.25, which is much larger than the estimate of 0.07 from the work of Nguyen and Shklovskii[26] , as discussed in the Introduction. That work treated the charge on DNA as a continuum, while we have used the discrete lattice of phosphates in a lattice-gas model. We have not included interactions between sites yet, although we have assumed that there can only be one species per site, either a parallel dimer, a perpendicular dimer, or a vacancy. We will see later in this paper that interactions reduce this value somewhat, but it will still be much larger than 0.07.

The noninteracting entropy

S_{NI}

can be obtained from the partition function using Eq. (7) as

S_{NI} = k_{B} ln [{(\sum_{γ} a_{NI}^{(γ)})}^{N_{sites}}] - \frac{k_{B} β}{{(\sum_{γ^{'}} a_{NI}^{(γ^{'})})}^{N_{sites}}} \frac{\partial}{\partial β} {(\sum_{γ} a_{NI}^{(γ)})}^{N_{sites}} .

(30)

Because the derivative of the activity with respect to

β

can be written in terms of its logarithm as

β \frac{\partial}{\partial β} a_{NI}^{(γ)} = a_{NI}^{(γ)} ln a_{NI}^{(γ)},

(31)

the entropy can be written in the simple form

\frac{S_{NI}}{k_{B} N_{sites}} = - \sum_{γ} \frac{a_{NI}^{(γ)}}{\sum_{γ^{''}} a_{NI}^{(γ^{''})}} ln (\frac{a_{NI}^{(γ)}}{\sum_{γ^{'}} a_{NI}^{(γ^{'})}}) .

(32)

Substituting the mean occupancy for the noninteracting model from Eq. (21) allows us to write the dimensionless entropy per site for the noninteracting model in a simplified form as

\frac{S_{NI}}{k_{B} N_{sites}} = - \sum_{γ} {〈 n^{(γ)} 〉}_{NI} ln {〈 n^{(γ)} 〉}_{NI},

(33)

which agrees with the standard result for the entropy of mixing of an ideal solution with species

γ = (‖, ⊥, v)

[40].

In Figure 7, we show the entropy

S_{N I}

of the noninteracting model as a function of the binding energy

β ε_{‖}

, assuming

ε_{‖} = 2 ε_{⊥}

. We also show the individual contributions of Eq. (33) to the entropy. The entropy is a maximum when disorder is greatest, and this occurs when the numbers of each of the species are as close to equal as possible, which occurs at

ε_{‖} = μ_{dimer}

, the maximum of

〈 n^{(⊥)} 〉

in Figure 5.

3. Geometry of the Charged Double Helix of DNA

While stored in the nucleus of a cell, DNA is wrapped compactly both around histone protein complexes and around itself, but on sufficently small scales (≈ 150 base pairs [41], or 15 turns [30], or 50 nm [8]), DNA’s dominant geometrical structure is the familiar double-helix structure shown in Figure 2. Each strand of the DNA takes the shape of a helix with a characteristic radius

R_{DNA} = 0.946

nm and pitch angle

ψ = 29 . 6^{\circ}

, as shown in Figure 8. Here the pitch angle

ψ

denotes the angle with respect to the

x y

-plane that gives the appropriate altitude per unit circumferential winding; in cylindrical coordinates,

tan ψ = Δ z / R_{DNA} Δ ϕ

, where

Δ z

is the distance along the z axis. Each strand of DNA has a “direction", identified by a particular carbon on the backbone structure, corresponding to the chirality of the helix. In the DNA double-helix, the two strands are antiparallel and therefore have opposite chiralities. As a consequence, the azimuthal angle between the two helices is always a constant,

Δ ϕ = 160^{\circ}

. Because this phase shift is not exactly

180^{\circ}

, the chains have unequal separation in the clockwise and counterclockwise azimuthal directions. The larger gap is referred to as the major groove, and the smaller as the minor groove.

When the hydrogen atoms dissociate under physiological conditions, the resulting negative charges (

- e

, where e is the magnitude of the charge of the electron), can be regarded as located at the mean position of the oxygen atoms on the backbone, as shown in Figure 2. This figure was constructed using geometrical data taken from Bishop and McMullen [28], which is based on experimental x-ray diffraction data [42,43] as input to the SYBYL molecular modeling program[44]. The oxygen atoms occur at regular intervals along each strand, separated by a helix segment of arc length

a^{(1)} = 0.684

nm. It is these sites located at regular intervals along the helix to which dimers will adsorb. These negatively-charged sites do not occur at the same altitudes on both strands, however. Rather, there is a vertical separation

Δ z = 0.023

nm between corresponding sites on the two strands. With the relative phase of the strands and the vertical separation between sites on those strands taken together, corresponding sites on the two strands may be viewed as connected by a helical segment of arc length

a^{(2)} = 3.34

nm at a pitch angle of

α = 0 . 394^{\circ}

. This geometry is shown in Figure 8.

When positively-charged dimers approach the DNA molecule, they will be attracted to the negative charges at the sites on the double-helix. We consider dimers with a length comparable to the spacing

a^{(1)}

between sites on a strand and having positive charges

+ e

at either end. The dimers can then adsorb onto the surface in two possible orientations, either parallel to the strand or perpendicular to the helix axis. (See Figure 4.) If the dimer adsorbs parallel to the strand, the positive charges from the dimer lie directly over the negative charges on the strand, neutralizing the charge on two adjacent sites. If the dimer adsorbs perpendicular to the surface of the bounding cylinder, one end of the dimer sits atop the site, while the other extends radially outward. This perpendicular adsorption effectively inverts the charge on the site from

- e

+ e

. In order to use a lattice-gas model, this geometric constraint is loosened by having the parallel dimer block only a single site. This deficiency can be somewhat compensated by making binding energy of the parallel dimer twice as large,

ε_{‖} \approx 2 ε_{⊥}

Note that because the length of the dimer (equal to the same-chain site spacing

a^{(1)} = 0.684

nm) is much smaller than both the cross-chain site spacing

a^{(2)} = 3.34

nm and the vertical separation

Δ z = 3.4

nm between turns of the helix, other orientations of the dimer are not possible.

The problem thus described is a complex one, but the similarities with the lattice gas models of condensed matter physics provide guidelines for how to proceed. These prescriptions, however, are aimed at the treatment of a periodic crystalline lattice, and, although the DNA sites exhibit helical symmetry, they do not constitute a periodic lattice in the strict sense of the term. However, an appropriate choice of coordinates can take advantage of the helical symmetry, so that, in these new coordinates, the positions of the sites will fall on a regular, one-dimensional lattice.

We will define these coordinates on a cylinder of radius

R_{D N A}

, as shown in Figure 8 and Figure 9. The first coordinate

x^{(1)}

traces out a path with pitch angle

ψ

along a single helical strand, and the other coordinate

x^{(2)}

traces out a path with pitch angle

α

that connects corresponding sites on the two strands. Geometrically, a cylinder can be regarded as flat in the sense that it has no curvature. In Figure 9, we show the way that the cylinder can be cut with scissors and unwrapped so that this lattice can be mapped on a flat surface as shown in Figure 10. If we define the origin of coordinates

x^{(1)}

and

x^{(2)}

to be halfway along the helical path between the partners on the two chains, as shown in Figure 9 and Figure 10, then the positions of the sites on both strands form a one-dimensional lattice in the coordinates

(x^{(1)}, x^{(2)})

These coordinates can be written simply in terms of cylindrical coordinates

(ϕ, z)

in matrix form as

[\begin{matrix} R_{DNA} ϕ \\ z \end{matrix}] = [\begin{matrix} cos ψ & cos α \\ sin ψ & sin α \end{matrix}] [\begin{matrix} x^{(1)} \\ x^{(2)} \end{matrix}] .

(34)

Inverting this gives definitions of the two coordinates as

x^{(1)} = (\frac{sin α}{sin (α - ψ)}) R_{DNA} ϕ - (\frac{cos α}{sin (α - ψ)}) z

(35)

and

x^{(2)} = - (\frac{sin ψ}{sin (α - ψ)}) R_{DNA} ϕ + (\frac{cos ψ}{sin (α - ψ)}) z .

(36)

With these definitions, the difference in coordinates between adjacent sites on the same strand is

Δ x^{(1)} = a^{(1)}

, and the difference in coordinates between corresponding sites on the two strands is

Δ x^{(2)} = a^{(2)}

. That is,

a^{(1)}

is the distance along the helical path of a single chain from one phosphate ion to the next, and

a^{(2)}

is the distance along a helical path from a phosphate ion on one chain to its partner phosphate ion on the other chain.

Next we define a lattice index ℓ, which specifies the cell (altitude on the double-helix) and chain index b, which specifies the basis site, where

b = - \frac{1}{2}

for the “down"(↓) chain and

b = \frac{1}{2}

for the “up"(↑) chain, as shown in Figure 11. Using these variables, the coordinates can be written as

(x^{(1)}, x^{(2)}) = (ℓ a^{(1)}, b a^{(2)}),

(37)

where

ℓ = 0, \pm 1, \pm 2, \dots; b = \pm \frac{1}{2} .

(38)

Thus, although the sites on the DNA molecule do not constitute a periodic lattice in real space, they do constitute a lattice in an appropriately-defined coordinate space (see Figure 11). As we will see, however, this choice of coordinates will make the form of the interaction potential more complicated as a result.

The use of

(x^{(1)}, x^{(2)})

instead of the cylindrical coordinates

(ϕ, z)

indicates a more fundamental shift in our description of the DNA double-helix. The two-dimensional surface on which the helices lie is a cylinder of radius

R_{D N A}

, and the helices inherit the cylinder’s geometric properties. The geometry of the cylinder, however, is locally indistinguishable from the geometry of the flat plane. One common consequence of this is that it is possible to smoothly wrap a flat sheet of paper around a cylinder. In contrast, it is not possible to smoothly wrap a sheet of paper around a sphere; this problem is well-known because of the geometrical distortions that occur in flat maps of a spherical Earth. Maps of a cylindrical surface, however, have no such distortions. This geometrical difference is quantified by the Riemannian curvature tensor, which vanishes for both the cylinder and the plane, but not for the sphere[45]. This means that the local geometry of the cylinder behaves in exactly the same way as the local geometry of the plane, so that a helix on a cylinder is geometrically equivalent to a line on a plane.

Our choice of coordinates is simply a map of the cylindrical surface that reduces the double-helix to two parallel lines, as shown in Figure 11. The result of this map is that we have a linear lattice with two phosphate sites per cell, with the cells labeled from

ℓ = - N

N

. All

2 N + 1

cells are identical, and the lattice can be imagined to satisfy periodic boundary conditions.

The analogy describing DNA as a ladder wrapped around a cylinder of radius

R_{DNA}

then has a true mathematical basis, because the local structure of the double-helix is equivalent to the structure of a “ladder" — a one-dimensional chain with a unit cell containing one site from each strand. If interactions are ignored, then the problem is described simply by this one-dimensional lattice.

4. DNA Interaction Potential

The geometrical structure of the double helix is important in determining the electrostatic energy

U_{int}

of the sites (with and without dimers) interacting with one another. The full Hamiltonian must contain these contributions and can be written as

H = \sum_{ℓ, b} \sum_{γ} ε_{γ} n_{ℓ, b}^{(γ)} + U_{int},

(39)

where the interaction energy

U_{int}

is the total interaction energy between all the charges on all the sites. If

V_{(ℓ_{1}, b_{1}), (ℓ_{2}, b_{2})}

is the screened Coulomb energy between a charge

q_{γ_{1}}

at site

ℓ_{1}

on chain

b_{1}

with another charge

q_{γ_{2}}

at site

ℓ_{2}

on chain

b_{2}

, then

U_{int}

can be written as

U_{int} = \frac{1}{2} \sum_{γ_{1}, γ_{2}} \sum_{(ℓ_{1}, b_{1}), (ℓ_{2}, b_{2})} q_{γ_{1}} n_{ℓ_{1}, b_{1}}^{(γ_{1})} V_{(ℓ_{1}, b_{1}), (ℓ_{2}, b_{2})} n_{ℓ_{2}, b_{2}}^{(γ_{2})} q_{γ_{2}},

(40)

where

n_{ℓ, b}^{(γ)}

is the number of particles of species

γ

on the lattice site at

(ℓ, b)

and

q_{γ}

is the charge of that species, in units of the magnitude e of the charge of an electron. The factor of

1 / 2

is to ensure that we are not double counting when we sum over all lattice sites, and we implicitly exclude same-site interactions

(ℓ_{1}, b_{1}) \neq (ℓ_{2}, b_{2})

. The total charge on a given site is given by summing all the charges on that site, so that the total charge on the site

(ℓ, b)

ρ_{ℓ, b} = \sum_{γ} q_{γ} n_{ℓ, b}^{(γ)} .

(41)

The interaction potential can then be written in terms the the total charge on each site as

U_{int} = \frac{1}{2} \sum_{(ℓ_{1}, b_{1}), (ℓ_{2}, b_{2})} ρ_{(ℓ_{1}, b_{1})} V_{(ℓ_{1}, b_{1}), (ℓ_{2}, b_{2})} ρ_{(ℓ_{2}, b_{2})},

(42)

where

V

is the electrostatic interaction energy between the sites. This screened electrostatic energy between charges

q_{γ_{1}}

and

q_{γ_{2}}

at positions

(ℓ_{1}, b_{1})

and

(ℓ_{2}, b_{2})

, (in SI units) is

V_{(ℓ_{1}, b_{1}), (ℓ_{2}, b_{2})} = \frac{e^{2}}{4 π ϵ} \frac{e^{- q_{s} d_{b_{1} b_{2}} (ℓ_{1} - ℓ_{2})}}{d_{b_{1} b_{2}} (ℓ_{1} - ℓ_{2})},

(43)

where

ϵ

is the electric permittivity of the medium between the two charges, and

d_{b_{1} b_{2}} (ℓ_{1} - ℓ_{2})

is the straight-line distance between the two charges. Distances along the chain are invariant under translations by a lattice spacing, and so

d_{b_{1} b_{2}} (ℓ_{1} - ℓ_{2})

depends only on the difference between the lattice site indices

ℓ_{1} - ℓ_{2}

. As in the noninteracting lattice-gas model, we have three species of “particles" on the lattice sites, vacancies of charge

q_{v} = - 1

, parallel (‖) dimers of charge

q_{‖} = 0

, and perpendicular (⊥) dimers of charge

q_{⊥} = + 1

Under physiological conditions, the presence of monovalent salt ions such as Na

^{+}

leads to screening of the bare charges. Traditionally, screening is treated by modeling the ions as a continuous density, resulting in the nonlinear Poisson-Boltzmann equation [19,46]. The Poisson-Boltzmann equation is not analytically solvable, so it is often further approximated by linearization. The resulting Thomas-Fermi model is analytically solvable and gives the screened Coulomb (or Yukawa) potential between two charges given in Eq. (43) where

ϵ = 78.5 ϵ_{0}

is the permittivity of water (with

ϵ_{0}

the permittivity of free space) and

q_{s}

is the magnitude of the screening wave vector [47]. Various names are ascribed to both the nonlinear and linearized equations, including Debye-Hückel, Thomas-Fermi, and Poisson-Boltzmann, but we will always refer to the Poisson-Boltzmann equation when we mean the nonlinear form and the (linearized) Thomas-Fermi equation when we mean the linearized form.

One must be cautious in using the screened Coulomb potential for screened interactions. The continuous density approximation from which the nonlinear Poisson-Boltzmann equation was derived constitutes a form of mean-field theory[19], which fails to describe any of the effects due to correlations between the ions such as charge inversion and condensation. Thus we cannot model screening of the DNA molecule by the dimers using the Thomas-Fermi model, or even the Poisson-Boltzmann equation. Instead, we must treat the dimers as individual particles and compute their interactions so that we do not ignore correlation effects.

For the screening due to the monovalent salt, however, the valence involved is small enough and the concentration is low enough that a mean-field treatment is generally believed to be a good approximation [19,48]. Thus, we will treat the interaction energy

U_{int}

in Eq. (42) using the screened Coulomb potential for the dimer-dimer interactions, with the screening vector

q_{s}

given in the Thomas-Fermi model as [47]

q_{s} = \sqrt{2 n \frac{β e^{2}}{ϵ}} = \sqrt{8 π n ℓ_{B}},

(44)

where

β = 1 / k_{B} T

is the inverse temperature, n is the concentration of ions, and

ℓ_{B} = β e^{2} / (4 π ϵ)

is the Bjerrum length [32]. The Bjerrum length is a characteristic length scale equal to the distance at which two proton charges interact with energy

k_{B} T

. It should be noted that these expressions differ slightly in the literature because of various authors’ choices of units for the electrostatics. Here, we use SI units.

By choosing a coordinate system based on the arc lengths around the surface of the cylinder, we have managed to align the sites into a regular lattice, but the potential

V_{(ℓ_{1}, b_{1}), (ℓ_{2}, b_{2})}

depends on the straight-line distance between the sites rather than the surface arc-length connecting them. We must then express Eq. (43) in terms of the straight-line distance function

d_{b_{1} b_{2}} (ℓ_{1} - ℓ_{2})

between the sites at

(ℓ_{1} a^{(1)}, b_{1} a^{(2)})

and

(ℓ_{2} a^{(1)}, b_{2} a^{(2)})

. The straight-line distance in cylindrical coordinates between two points on the surface of a cylinder of radius

R_{DNA}

d i s t a n c e (ϕ_{1}, z_{1}, ϕ_{2}, z_{2}) = {\{2 R_{DNA}^{2} [1 - cos (ϕ_{1} - ϕ_{2})] + {(z_{1} - z_{2})}^{2}\}}^{\frac{1}{2}},

(45)

and applying our change of variables (35,36) gives the straight-line distance between sites as

\begin{matrix} d_{b_{1} b_{2}} (ℓ) & = & R_{DNA} \{2 [1 - cos (\frac{a^{(1)}}{R_{DNA}} ℓ cos ψ + \frac{a^{(2)}}{R_{DNA}} b cos α)] \\ + {[\frac{a^{(1)}}{R_{DNA}} ℓ sin ψ + \frac{a^{(2)}}{R_{DNA}} b sin α]}^{2}\}^{\frac{1}{2}}, \end{matrix}

(46)

where

ℓ = ℓ_{1} - ℓ_{2}

and

b = b_{1} - b_{2}

. Because of the translational invariance of the lattice, the distance in Eq. (46) depends only on the differences between the lattice and basis indices, and not their values individually. From this relation, we see the symmetry property of the distance,

\begin{matrix} d_{b_{1} b_{2}} (ℓ) & = d_{b_{2} b_{1}} (- ℓ) \end{matrix}

(47a)

\begin{matrix} d_{↑ ↑} (ℓ) & = d_{↓ ↓} (- ℓ) . \end{matrix}

(47b)

Since

b_{1}

and

b_{2}

are either

+ \frac{1}{2}

- \frac{1}{2}

b = b_{1} - b_{2}

takes the three values

b = 0

for interactions of one site on a single chain with all other sites on the same chain,

b = 1

for interactions between a site on the “up" chain with with all the sites on the “down" chain, and

b = - 1

for interactions of one site on the “down" chain with all the sites on the “up" chain. We will write an up arrow ↑ will indicate

b_{1, 2} = \frac{1}{2}

, and a down arrow ↓ will indicate

b_{1, 2} = - \frac{1}{2}

. Plots of

d_{↑ ↑} (ℓ) / R_{DNA}

and

d_{↑ ↓} (ℓ) / R_{DNA}

are shown in Figure 12a. There are wiggles in the curve because, as the chain wraps around the cylinder (see Figure 4), the distances between lattice sites can become larger or smaller than they would be on a linear chain.

In order to understand the consequences of the screened Coulomb potential (43) depending on the straight-line distances d between sites, we define a new notation for the potential, which is expressed in terms of the difference in lattice site indices

ℓ_{1} - ℓ_{2}

V_{b_{1} b_{2}} (ℓ_{1} - ℓ_{2}) \equiv V_{(ℓ_{1}, b_{1}), (ℓ_{2}, b_{2})} .

(48)

This potential decays exponentially as a function of the straight-line distance d (inset of Figure 12(b)). However, as a function of the lattice index ℓ as in Figure 12(b), there are corresponding “wiggles" in the decay. This occurs because the distance between the sites decreases slightly as the helix completes a turn. These wiggles are present for every turn the helix makes, but the effect on the potential becomes negligibly small after about the first turn. Thus, by using the indices ℓ and b to describe the relationships between sites, we have reduced the DNA double-helix structure to a regular one-dimensional lattice with two sites per cell at the cost of an irregular potential due to the geometrical structure of the DNA molecule. The potentials for other combinations of

b_{1}

and

b_{2}

can be related to those in Figure 12(b) through the symmetry relations

\begin{matrix} V_{b_{1} b_{2}} (ℓ) & = V_{b_{2} b_{1}} (- ℓ) \end{matrix}

(49a)

\begin{matrix} V_{↑ ↑} (ℓ) & = V_{↓ ↓} (- ℓ) . \end{matrix}

(49b)

Fourier transforming in the site index ℓ block diagonalizes this matrix into

2 \times 2

blocks, corresponding to the chain indices

b_{1}

and

b_{2}

. Explicitly, the Fourier transform is

\tilde{V} = F^{†} V F,

(50)

where the elements of

F

are independent of

b_{1}

and

b_{2}

and are given by

F_{(ℓ, b_{1}), (k, b_{2})} = \frac{1}{\sqrt{2 N + 1}} e^{i k ℓ},

(51)

and the dagger denotes the adjoint (complex conjugate transpose). We use periodic boundary conditions, so that the wave vector k appearing here, which is dimensionless, takes the

2 N + 1

values

k = - N Δ k \dots N Δ k,

(52)

where

Δ k = (\frac{2 π}{2 N + 1}) .

(53)

N

is large, the range of k is essentially continuous from

- π

π

The matrix elements of this Fourier transform are

{\tilde{V}}_{(k_{1}, b_{1}), (k_{2}, b_{2})} = \frac{1}{2 N + 1} \sum_{ℓ_{1}, ℓ_{2} = - N}^{N} e^{- i k_{1} ℓ_{1}} V_{(ℓ_{1} b_{1}), (ℓ_{2}, b_{2})} e^{i k_{2} ℓ_{2}} .

(54)

Substituting for

V

from Eq. (48), we have

{\tilde{V}}_{(k_{1}, b_{1}), (k_{2}, b_{2})} = \frac{1}{2 N + 1} \sum_{ℓ_{1}, ℓ_{2} = - N}^{N} e^{- i (k_{1} ℓ_{1} - k_{2} ℓ_{2})} V_{b_{1} b_{2}} (ℓ_{1} - ℓ_{2}) .

(55)

Since we regard the chain as having periodic boundary conditions, changing summation indices to

ℓ = ℓ_{1} - ℓ_{2}

and

ℓ_{2}

gives

{\tilde{V}}_{(k_{1}, b_{1}), (k_{2}, b_{2})} = \frac{1}{2 N + 1} \sum_{ℓ = - N}^{N} e^{- i k_{1} ℓ} V_{b_{1} b_{2}} (ℓ) \sum_{ℓ_{2} = - N}^{N} e^{- i (k_{1} - k_{2}) ℓ_{2}},

(56)

where the sum over

ℓ_{2}

becomes

\sum_{ℓ_{2} = - N}^{N} e^{- i (k_{1} - k_{2}) ℓ_{2}} = (2 N + 1) δ_{k_{1}, k_{2}} .

(57)

We can now write the Fourier transform of

V_{b_{1} b_{2}} (ℓ)

{\tilde{V}}_{b_{1} b_{2}} (k) = \sum_{ℓ = - N}^{N} e^{- i k ℓ} V_{b_{1} b_{2}} (ℓ) .

(58)

Substituting this back into the matrix, we have

{\tilde{V}}_{(k_{1}, b_{1}), (k_{2}, b_{2})} = δ_{k_{1}, k_{2}} {\tilde{V}}_{b_{1} b_{2}} (k_{1}) .

(59)

Thus, the Fourier transform

\tilde{V}

V

is diagonal in the k indices and contains

2 \times 2

blocks

\tilde{V} (k) = [\begin{matrix} {\tilde{V}}_{↓ ↓} (k) & {\tilde{V}}_{↓ ↑} (k) \\ {\tilde{V}}_{↑ ↓} (k) & {\tilde{V}}_{↑ ↑} (k) \end{matrix}] .

(60)

The symmetries (49) of the screened Coulomb potential on the DNA lattice are reflected in the Fourier transforms (58) as well:

\begin{matrix} {\tilde{V}}_{b_{1} b_{2}} (k) & = {\tilde{V}}_{b_{2} b_{1}} {(k)}^{*} = {\tilde{V}}_{b_{2} b_{1}} (- k) \end{matrix}

(61a)

\begin{matrix} {\tilde{V}}_{↑ ↑} (k) & = {\tilde{V}}_{↓ ↓} (k) . \end{matrix}

(61b)

Also, because

V_{↑ ↑} (ℓ)

is an even function of ℓ,

{\tilde{V}}_{↑ ↑} (k)

is real. The Fourier transforms (58) are plotted in Figure 13.

As can be seen in Figure 13, the Fourier transforms

\tilde{V} (k)

have regions of k that are negative, which is a result of transforming with respect to our helical coordinate system. When the screened Coulomb potential (43) is Fourier transformed with respect to the straight-line distance d, the function is positive definite. We have instead Fourier transformed the potential a function of the lattice index ℓ, resulting in the “wiggles" in Figure 12(b) caused by the turns of the helix. These deviations from the exponential decay of

V (d)

result in the deviations here from the positive-definite Fourier transform.

Using the symmetry operations in Eqs. (61a) and (61b) obeyed by the Fourier transforms allows us to simplify the

2 \times 2

blocks

\tilde{V} (k)

in Eq. (60) to

\tilde{V} (k) = [\begin{matrix} {\tilde{V}}_{↑ ↑} (k) & {\tilde{V}}_{↑ ↓}^{*} (k) \\ {\tilde{V}}_{↑ ↓} (k) & {\tilde{V}}_{↑ ↑} (k) \end{matrix}] .

(62)

This matrix is easily diagonalized, yielding the real eigenvalues

λ_{k, \pm} = {\tilde{V}}_{↑ ↑} (k) \pm | {\tilde{V}}_{↑ ↓} (k) |,

(63)

which are plotted in Figure 14. The transformation that diagonalizes

\tilde{V} (k)

is a unitary matrix

ξ (k)

whose columns are the eigenvectors of

\tilde{V} (k)

. Choosing the

λ_{k +}

eigenvector to be first, this transformation matrix is given by

ξ (k) = \frac{1}{\sqrt{2}} [\begin{matrix} 1 & 1 \\ \frac{{\tilde{V}}_{↑ ↓} (k)}{| {\tilde{V}}_{↑ ↓} (k) |} & - \frac{{\tilde{V}}_{↑ ↓} (k)}{| {\tilde{V}}_{↑ ↓} (k) |} \end{matrix}] .

(64)

This diagonalizes the

2 \times 2

matrix

\tilde{V} (k)

according to

[\begin{matrix} λ_{+} (k) & 0 \\ 0 & λ_{-} (k) \end{matrix}] = ξ^{†} (k) \tilde{V} (k) ξ (k) .

(65)

We can write the combined process of Fourier transforming

V

into block-diagonal form and then diagonalizing the

2 \times 2

blocks

\tilde{V} (k)

as a single unitary transformation

M

given by

M_{(ℓ, b), (k, σ)} = \frac{ξ_{b, σ} (k)}{\sqrt{2 N + 1}} e^{i k ℓ},

(66)

which completely diagonalizes

V

such that

Λ = M^{†} V M .

(67)

Here

Λ

is a diagonal matrix, with each element along the diagonal given by:

Λ_{(k_{1}, σ_{1}), (k_{2}, σ_{2})} = δ_{k_{1}, k_{2}} δ_{σ_{1}, σ_{2}} λ_{k_{1} σ_{1}},

(68)

where

σ

is either + or −.

Although we have diagonalized the potential with a unitary transformation, the fact that the transformation matrix is complex means that the charge density in the diagonal basis could also be complex, which is physically undesirable. However, the potential matrix in the position basis is real and symmetric, which means that it should be diagonalizable by a real orthogonal transformation matrix

W

. In fact, the transformation matrix is not unique, since there is a degeneracy in the eigenvalues, since

λ_{k, σ} = λ_{- k, σ}

. This means that we can choose eigenvectors that are real by taking two orthogonal linear combinations of the eigenvectors of the unitary transformation

M

for k and

- k

. Those two new eigenvectors, which represent the columns

(k, σ)

for

k > 0

and

k < 0

of the new transformation matrix

W

, are given by

W_{(ℓ, b), (k, σ)} = \frac{1}{\sqrt{2}} \{M_{(ℓ, b), (k, σ)} + M_{(ℓ, b), (- k, σ)}\}, k > 0,

(69)

and

W_{(ℓ, b), (k, σ)} = i \frac{1}{\sqrt{2}} \{M_{(ℓ, b), (k, σ)} - M_{(ℓ, b), (- k, σ)}\}, k < 0 .

(70)

Since

ξ_{b σ} (- k) = ξ_{b σ}^{*} (k)

, these can be written in terms of real and imaginary parts as

W_{(ℓ, b), (k, σ)} = \frac{\sqrt{2}}{\sqrt{(2 N + 1)}} Re \{e^{i k ℓ} ξ_{b σ} (k)\}, k > 0 .

(71)

and

W_{(ℓ, b), (k, σ)} = - \frac{\sqrt{2}}{\sqrt{(2 N + 1)}} Im \{e^{i k ℓ} ξ_{b σ} (k)\}, k < 0 .

(72)

When

k = 0

, the orthogonal transformation is the same as the unitary transformation, that is

W_{(ℓ, b), (0, σ)} = M_{(ℓ, b), (0, σ)} = \frac{ξ_{b, σ} (0)}{\sqrt{2 N + 1}},

(73)

where

ξ (0) = \frac{1}{\sqrt{2}} [\begin{matrix} 1 & 1 \\ 1 & - 1 \end{matrix}] .

(74)

Here the columns are labeled by the eigenvalue

σ = \pm

, and the rows are labeled by the chain

b = - 1 / 2

for the first row and

b = 1 / 2

for the second row.

This transformation can be written in a more compact form by introducing the unitary transformation matrix

Z

, whose elements are defined by

Z_{(k σ), (k^{'} σ^{'})} = δ_{σ, σ^{'}} \{\begin{matrix} \frac{i}{\sqrt{2}} (δ_{k, k^{'}} - δ_{k, - k^{'}}) & k < 0 \\ δ_{k, k^{'}} & k = 0 \\ \frac{1}{\sqrt{2}} (δ_{k, k^{'}} + δ_{k, - k^{'}}) & k > 0 \end{matrix} .

(75)

Then the orthogonal transformation is

W = M Z .

(76)

In deriving the thermodynamic quantities for the interacting lattice gas model, it will be convenient to be able to use this orthogonal transformation to diagonalize the potential.

5. Partition Function for Interacting DNA Chains

For the interacting lattice gas, the partition function is similar to the noninteracting one in Eqs. (5) and (8). However, here the full Hamiltonian

H

from Eq. (39) is used, and then the partition function is given by

Z_{G} = \sum_{configurations} e^{- β (H - \sum_{ℓ, b} \sum_{γ} μ_{γ} n_{ℓ, b}^{(γ)})} .

(77)

The full Hamiltonian, including the interaction term from Eq. (42) written in matrix form, is

H = \sum_{ℓ, b} \sum_{γ} ε_{γ} n_{ℓ, b}^{(γ)} + \frac{1}{2} ρ^{T} V ρ .

(78)

In Section 4, we showed that the orthogonal transformation matrix

W

diagonalizes

V

. In order to streamline the calculation, it will be useful to insert the identity matrix in the form

W W^{T}

, to the left and right of of

V

in the interaction term of the Hamiltonian, which gives

H = \sum_{ℓ, b} \sum_{γ} ε_{γ} n_{ℓ, b}^{(γ)} + \frac{1}{2} (ρ^{T} W) (W^{T} V W) (W^{T} ρ),

(79)

where we have grouped factors to show the transformed quantities.

The Hamiltonian, with the interaction term written in the diagonal basis, can then be written as

\begin{matrix} H & = & \sum_{ℓ, b} \sum_{γ} ε_{γ} n_{ℓ, b}^{(γ)} + \frac{1}{2} {\tilde{ρ}}^{2} Λ \\ = & \sum_{ℓ, b} \sum_{γ} ε_{γ} n_{ℓ, b}^{(γ)} + \frac{1}{2} \sum_{k, σ} {({\tilde{ρ}}_{k, σ})}^{2} λ_{k, σ}, \end{matrix}

(80)

where

\tilde{ρ} = W^{T} ρ

is the transformed charge density. We have not transformed the first term in the Hamiltonian to the diagonal basis because it is more convenient later in the calculation to have it in the position basis. Substituting this form into the partition function and writing the resulting expression as two separate exponentials, we have

Z_{G} = \sum_{configurations} e^{- β \sum_{ℓ, b} \sum_{γ} (ε_{γ} - μ_{γ}) n_{ℓ, b}^{(γ)}} e^{- \frac{β}{2} \sum_{k, σ} {\tilde{ρ}}_{k, σ}^{2} λ_{k, σ}} .

(81)

We can write the exponential of the sum in the interaction term as a product of exponentials as

Z_{G} = \sum_{configurations} e^{- β \sum_{ℓ, b} \sum_{γ} (ε_{γ} - μ_{γ}) n_{ℓ, b}^{(γ)}} \prod_{k, σ} e^{- \frac{β}{2} {\tilde{ρ}}_{k, σ}^{2} λ_{k, σ}} .

(82)

From this expression, we can see that the mean site occupancy for the interacting lattice-gas model can be obtained in the same way as in the noninteracting case (9),

〈 n^{(γ)} 〉 = \frac{1}{N_{sites} β Z_{G}} \frac{\partial Z_{G}}{\partial μ_{γ}},

(83)

by differentiating the partition function with respect to the chemical potential.

In the noninteracting case, since all the information about the configuration was contained in

n_{ℓ, b}^{(γ)}

, we were able to decouple the sum over configurations from the sum over lattice sites because the activity was independent of position, and

n_{ℓ, b}^{(γ)}

only appeared in its exponent. This is because only linear factors of

n_{ℓ, b}^{(γ)}

were in the exponent in the partition function. While it is still true that

n_{ℓ, b}^{(γ)}

contains the configuration dependence, it now appears quadratically in the exponent of the partition function, since

{\tilde{ρ}}_{k, σ}

contains linear factors of the

n_{ℓ, b}^{(γ)}

, and the exponent in the interaction term contains

{\tilde{ρ}}_{k, σ}^{2}

. This makes it impossible to decouple the sum over configurations from the sum over lattice sites. However, we can use an integral identity, known as the Hubbard-Stratonovich transformation, to replace the term quadratic in

{\tilde{ρ}}_{k, σ}

in the exponential with a term linear in

{\tilde{ρ}}_{k, σ}

, meaning that there will only be linear terms in

n_{ℓ, b}^{(γ)}

. This requires introducing the integral over an auxiliary field

{\tilde{Δ}}_{k, σ}

. Then the sum over configurations is possible to do exactly, although at the cost of having to integrate over the auxiliary fields.

The integral identity, which is a complicated way of writing unity, as is shown in Appendix A, is given by

\int \sqrt{\frac{β λ}{- 2 π}} d \tilde{Δ} e^{\frac{β}{2} λ {\tilde{Δ}}^{2} - β λ \tilde{Δ} \tilde{ρ}} e^{\frac{β}{2} λ {\tilde{ρ}}^{2}} = 1,

(84)

where the integration path is over the real axis from

- \infty

to ∞ when

λ < 0

and over the imaginary axis from

- i \infty

i \infty

when

λ > 0

. In the previous work by Bishop and McMullen[28], the problem was done in the position basis, and this integral identity, known as the Hubbard-Stratonovich transformation, was written with the matrix version of the potential in the exponent, as given in Negele and Orland[48]. This form creates a dilemma when there are negative and possibly zero eigenvalues of the potential, and it is difficult to determine the path of integration, since it changes depending on the sign of the eigenvalues. Zero eigenvalues would make the determinant of the matrix in that formula zero, and one then finds that there is a division by zero in the formula. By transforming to the diagonal basis, all these difficulties are avoided, since there is a separate integral for each eigenvalue, and if a particular eigenvalue is zero, the identity is not used at all.

To use the identity in Eq. (84), we make the identifications that

λ = λ_{k, σ}

\tilde{ρ} = {\tilde{ρ}}_{k, σ}

, and

\tilde{Δ} = {\tilde{Δ}}_{k, σ}

. Substituting this “1" into the partition function, we have

\begin{matrix} Z_{G} & = & \sum_{configurations} \prod_{k, σ} \int \sqrt{\frac{β λ_{k, σ}}{- 2 π}} d {\tilde{Δ}}_{k, σ} e^{\frac{β}{2} λ_{k, σ} {\tilde{Δ}}_{k, σ}^{2} - β λ_{k, σ} {\tilde{Δ}}_{k, σ} {\tilde{ρ}}_{k, σ}} e^{\frac{β}{2} λ_{k, σ} {\tilde{ρ}}_{k, σ}^{2}} \times \\ \times e^{- β \sum_{ℓ, b} \sum_{γ} (ε_{γ} - μ_{γ}) n_{ℓ, b}^{(γ)}} e^{- \frac{β}{2} {\tilde{ρ}}_{k, σ}^{2} λ_{k, σ}} . \end{matrix}

(85)

Now we see that the two exponentials containing

{\tilde{ρ}}_{k, σ}^{2}

cancel, as we planned. Of course, we have gained this convenience by introducing the auxiliary field

{\tilde{Δ}}_{k, σ}

, and we will have to do the integral over this variable at a later stage. However, since

{\tilde{Δ}}_{k, σ}

does not depend on the configuration of the system, we can bring the sum over configurations and the leading exponential factors in the partition function inside the integral, and the expression for the partition function becomes

\begin{matrix} Z_{G} & = & \int (\prod_{k, σ} \sqrt{\frac{β λ_{k, σ}}{- 2 π}} d {\tilde{Δ}}_{k, σ}) \sum_{configurations} exp \{- β \sum_{ℓ, b} \sum_{γ} (ε_{γ} - μ_{γ}) n_{ℓ, b}^{(γ)} + \\ + \sum_{k, σ} [\frac{β}{2} λ_{k, σ} {\tilde{Δ}}_{k, σ}^{2} - β λ_{k, σ} {\tilde{ρ}}_{k, σ} {\tilde{Δ}}_{k, σ}]\} . \end{matrix}

(86)

It will be useful to define an action

S

such that the partition function can be written in the form

Z_{G} = \int D [\tilde{Δ}] \sum_{configurations} e^{- S [\tilde{Δ}]},

(87)

where

S [\tilde{Δ}] = β \{\sum_{ℓ, b} \sum_{γ} (ε_{γ} - μ_{γ}) n_{ℓ, b}^{(γ)} + \sum_{k_{,} σ} [- \frac{β}{2} λ_{k, σ} {\tilde{Δ}}_{k, σ}^{2} + β λ_{k, σ} {\tilde{ρ}}_{k, σ} {\tilde{Δ}}_{k, σ}]\}

and

D [\tilde{Δ}] = (\prod_{k, σ} \sqrt{\frac{β λ_{k, σ}}{- 2 π}} d {\tilde{Δ}}_{k, σ}) .

(88)

Note that the factors of

β λ_{k, σ}

are included here in the grand differential

D [\tilde{Δ}]

, rather than incorporating them in the action

S [\tilde{Δ}]

, as was done in Bishop and McMullen[28] and in Sievert[33]. When these factors appear in the action, they introduce a term of the form

ln \sqrt{det (β v)}

. This adds a large constant value to the mean field results, which is subtracted out when the fluctuation terms are included. It actually has no real physical meaning and is part of the normalization of the integral[49].

We see that by diagonalizing first, our auxiliary fields are in the diagonal basis of the potential. If we transform the terms containing

ε_{γ}

and

μ_{γ}

to this basis, they will no longer be diagonal. Therefore, we will leave those terms in the position basis. It will also be convenient to have the term containing

\sum_{k_{,} σ} λ_{k, σ} {\tilde{ρ}}_{k, σ} {\tilde{Δ}}_{k, σ}

in the position basis also. Therefore, the expression for the transformed charge density is

\begin{matrix} {\tilde{ρ}}_{k, σ} & = & \sum_{ℓ, b} W_{(k, σ), (ℓ, b)}^{T} \sum_{γ} ρ_{ℓ, b}^{(γ)} \\ = & \sum_{ℓ, b} W_{(k, σ), (ℓ, b)}^{T} \sum_{γ} q_{γ} n_{(ℓ, b)}^{(γ)}, \end{matrix}

(89)

in terms of the quantities in real space.

Substituting this into the action, we have

S [\tilde{Δ}] = β \{\sum_{ℓ, b} \sum_{γ} (ε_{γ} - μ_{γ}) n_{(ℓ, b)}^{(γ)} - \sum_{k_{,} σ} [\frac{1}{2} λ_{k, σ} {\tilde{Δ}}_{k, σ}^{2} + {\tilde{Δ}}_{k, σ} λ_{k, σ} \sum_{ℓ, b} W_{(k, σ), (ℓ, b)}^{T} \sum_{γ} q_{γ} n_{(ℓ, b)}^{(γ)}]\} .

(90)

It is convenient to identify the last term in this expression as a self energy of species

γ

{\tilde{Σ}}_{ℓ, b}^{(γ)} [\tilde{Δ}] = (\sum_{k, σ} {\tilde{Δ}}_{k, σ} λ_{k, σ} W_{(k, σ), (ℓ, b)}^{T}) q_{γ} .

(91)

By transforming

{\tilde{Δ}}_{k, σ}

to the position basis as

{\tilde{Δ}}_{k, σ} = \sum_{ℓ^{'}, b^{'}} Δ_{ℓ^{'}, b^{'}} W_{(ℓ^{'}, b^{'}), (k, σ)},

(92)

this self-energy can also be written in the position basis as

\begin{matrix} Σ_{ℓ, b}^{(γ)} [Δ] & = & {\tilde{Σ}}_{ℓ, b}^{(γ)} [\tilde{Δ}] \\ = & q_{γ} \sum_{ℓ^{'}, b^{'}} Δ_{ℓ^{'}, b^{'}} (\sum_{k, σ} W_{(ℓ^{'}, b^{'}), (k, σ)} λ_{k, σ} W_{(k, σ), (ℓ, b)}^{T}) \\ = & q_{γ} \sum_{ℓ^{'}, b^{'}} Δ_{ℓ^{'}, b^{'}} V_{(ℓ^{'}, b^{'}), (ℓ, b)} . \end{matrix}

(93)

This will be a useful form to use when we discuss the saddle point, or mean-field, approximation.

Keeping the first term in the position basis and the second term in the diagonal basis, we now write the action in a compact form, combining the self energy term with the first term in the action to obtain

S [\tilde{Δ}] = β \{\sum_{ℓ, b} \sum_{γ} (ε_{γ} + Σ_{ℓ, b}^{(γ)} [Δ] - μ_{γ}) n_{(ℓ, b)}^{(γ)} - \sum_{k_{,} σ} [\frac{1}{2} λ_{k, σ} {\tilde{Δ}}_{k, σ}^{2}]\} .

(94)

Analogous to the noninteracting case, we define the activity as

{\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}] = a_{ℓ, b}^{(γ)} [Δ] = e^{- β (ε_{γ} + Σ_{ℓ, b}^{(γ)} [Δ] - μ_{γ})} .

(95)

With this definition, the partition function becomes

Z_{G} = \int D [\tilde{Δ}] \sum_{configurations} (\prod_{ℓ, b} \prod_{γ} {a_{ℓ, b}^{(γ)} [Δ]}^{n_{(ℓ, b)}^{(γ)}}) e^{β \sum_{k_{,} σ} [\frac{1}{2} λ_{k, σ} {\tilde{Δ}}_{k, σ}^{2}]} .

(96)

The trace over configurations is now done over each site separately exactly as we did for the noninteracting case, where

n_{(ℓ, b)}^{(γ)} = 1

for one and only one of

γ = ‖, ⊥

, or v, and zero otherwise. Thus, performing the trace gives

\sum_{\begin{matrix} \sin gle - site \\ configurations \end{matrix}} \prod_{γ} {{\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}^{n_{(ℓ, b)}^{(γ)}} = \sum_{γ} {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}],

(97)

and the grand partition function becomes

Z_{G} = \int D [\tilde{Δ}] e^{- S_{eff} [\tilde{Δ}]},

(98)

where the effective action is

S_{eff} [\tilde{Δ}] = - \frac{β}{2} \sum_{k_{,} σ} λ_{k, σ} {\tilde{Δ}}_{k, σ}^{2} - \sum_{ℓ, b} ln (\sum_{γ} {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]) .

(99)

This is the general result, which is in principle exact. To understand what the Hubbard-Stratonovich transformation has accomplished for us, recall that the interaction energy

U_{int}

posed two difficulties: the additional configuration dependence and the coupling between the sites. By introducing auxiliary fields through the Hubbard-Stratonovich transformation (84), we managed to separate these two complications so that one factor

exp [- \frac{β}{2} \sum_{k_{,} σ} λ_{k, σ} {\tilde{Δ}}_{k, σ}^{2}]

contains interactions between field fluctuations but no configuration dependence, and another factor

exp [β \sum_{k, σ} λ_{k, σ} {\tilde{ρ}}_{k, σ} {\tilde{Δ}}_{k, σ}]

that is configuration-dependent (through

{\tilde{ρ}}_{k, σ}

) but decoupled. This allowed us to define a modified activity (95) that incorporated all the configuration dependence, making it possible to evaluate the sum over configurations directly, as in the noninteracting case.

What we have done in using the Hubbard-Stratonovich transformation is replace the interaction part of the partition function with its functional integral representation. From the form of Eq. (96), we see that the largest contribution to the partition function comes from the values of

{\tilde{Δ}}_{k, σ}^{2}

for which the effective action

S_{eff}

is stationary, i.e., at a saddle point. Thus a Taylor series expansion of the effective action (99) about the saddle point will yield an order-by-order approximation to the exact partition function (98).

6. Expansion in Powers of the Auxiliary Field

Since it is not possible to evaluate the integral in the partition function of Eq. (98) exactly, we will approximate it by expanding the action in Eq. (99) about a mean field

Δ_{c}

, and include fluctuations to lowest order about this mean field. This mean field is determined by the saddle point of the integrand in Eq. (98), which is the largest contribution to the integral.

We can write the effective action then as an expansion about this uniform saddle point. As an aid to keeping track of the order of the expansions, we add a multiplicative factor m in front of the effective action. as

\begin{matrix} m S_{eff} (\tilde{Δ} = {\tilde{Δ}}_{c} + δ \tilde{Δ} / \sqrt{m}) & = & m S_{eff} [{\tilde{Δ}}_{c}] + {\sum_{k, σ} \frac{δ {\tilde{Δ}}_{k, σ}}{\sqrt{m}} m \frac{\partial S_{eff} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k, σ}}|}_{{\tilde{Δ}}_{c}} + \\ + & {\frac{1}{2} \sum_{k_{1}, σ_{1}, k_{2}, σ_{2}} \frac{δ {\tilde{Δ}}_{k_{1}, σ_{1}}}{\sqrt{m}} m \frac{\partial^{2} S_{eff} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k_{1}, σ_{1}} \partial {\tilde{Δ}}_{k_{2}, σ_{2}}}|}_{{\tilde{Δ}}_{c}} \frac{δ {\tilde{Δ}}_{k_{2}, σ_{2}}}{\sqrt{m}} + \dots . \end{matrix}

(100)

Here the expansion parameter

δ \tilde{Δ}

is scaled by

m^{- 1 / 2}

by writing

δ {\tilde{Δ}}_{k, σ} \equiv \sqrt{m} [{\tilde{Δ}}_{k, σ} - {\tilde{Δ}}_{k, σ, c}]

. The factors of m cancel in the quadratic term, so that it is of order

m^{0}

In evaluating the integral in the partition function, the largest contribution comes from the region near a saddle point, and the terms beyond the saddle point give corrections to the integral. Physically, the saddle point solution corresponds to a mean field approximation, and we begin with that. The saddle point is found by setting the first derivative in the expansion to zero, that is

{\frac{\partial S_{eff} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k, σ}}|}_{{\tilde{Δ}}_{c}} = 0 .

(101)

The second derivatives evaluated at the saddle point are proportional to the components of a matrix

{\tilde{D}}^{- 1}

, which is the inverse propagator for the fluctuations of the auxiliary Hartree field

Δ

that we introduced to decouple the interparticle interactions. Because we want to absorb the normalization factor in the grand differential

D [\tilde{Δ}]

, we have defined each of the elements of

{\tilde{D}}^{- 1}

by the second derivative of

S_{eff} [\tilde{Δ}]

divided by

- β λ_{k, σ}

{({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})} = - (\frac{1}{β λ_{k, σ}}) {\frac{\partial^{2} S_{eff} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k_{1}, σ_{1}} \partial {\tilde{Δ}}_{k_{2}, σ_{2}}}|}_{{\tilde{Δ}}_{c}} .

(102)

We will show in Section 8 that, for a spatially uniform saddle point, this matrix is diagonal, that is,

S_{eff} [\tilde{Δ}]

is diagonal in the

δ {\tilde{Δ}}_{k_{1}, σ_{1}}

’s, that is,

{({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})} = {({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{1}, σ_{1})} δ_{k_{1}, k_{2}} δ_{σ_{1}, σ_{2}} .

(103)

Although the Gaussian integral can still be done if

{\tilde{D}}^{- 1}

is not diagonal, the argument becomes much easier to follow if we assume at this point that

{\tilde{D}}^{- 1}

is diagonal because then the Gaussian integral can be done straightforwardly.

With this assumption that

{\tilde{D}}^{- 1}

is diagonal, the expansion of the effective action reduces to

m S_{eff} (\tilde{Δ} = {\tilde{Δ}}_{c} + δ \tilde{Δ} / \sqrt{m}) ≃ m S_{eff} [{\tilde{Δ}}_{c}] - \frac{1}{2} \sum_{k, σ} β λ_{k, σ} {({\tilde{D}}^{- 1})}_{(k, σ), (k, σ)} {(δ {\tilde{Δ}}_{k, σ})}^{2} .

(104)

The grand canonical partition function can then be written as

Z_{G} ≃ e^{- m S_{eff} [{\tilde{Δ}}_{c}]} \int (\prod_{k, σ} \sqrt{\frac{β λ_{k, σ}}{- 2 π}} d (δ {\tilde{Δ}}_{k, σ})) e^{\frac{1}{2} β \sum_{k, σ} λ_{k, σ} {({\tilde{D}}^{- 1})}_{(k, σ), (k, σ)} {(δ {\tilde{Δ}}_{k, σ})}^{2}} .

(105)

This can be written as a product of Gaussian integrals as

Z_{G} ≃ e^{- m S_{eff} [{\tilde{Δ}}_{c}]} \prod_{k, σ} \int (\sqrt{\frac{β λ_{k, σ}}{- 2 π}} d (δ {\tilde{Δ}}_{k, σ})) e^{\frac{1}{2} β λ_{k, σ} {({\tilde{D}}^{- 1})}_{(k, σ), (k, σ)} {(δ {\tilde{Δ}}_{k, σ})}^{2}},

and each Gaussian integral can be evaluated using the same change of variable and path of integration as was used in the Hubbard-Stratonovich transformation in Eq. (84) that originally was used to define the auxiliary fields, and as is discussed in Appendix A. Each Gaussian integral produces a result of the same form, regardless of whether

{({\tilde{D}}^{- 1})}_{(k, σ), (k, σ)}

is positive or negative. This quantity can never be zero, since we would not have defined an auxiliary field for that case, which is the same as the case in which an eigenvalue of the potential is zero. Therefore, the integral can always be evaluated, and the partition function can be written as

\begin{matrix} Z_{G} & ≃ & e^{- m S_{eff} [{\tilde{Δ}}_{c}]} \prod_{k, σ} \frac{1}{\sqrt{{({\tilde{D}}^{- 1})}_{(k, σ), (k, σ)}}} . \end{matrix}

(106)

Note that all the factors of

β λ_{k, σ}

have cancelled out due to our choices of the form of the effective action

S_{eff}

and the inverse Hartree field fluctuation propagator

{\tilde{D}}^{- 1}

. Choosing these quantities carefully allows for the correct normalization of the integral[49]. The product can now be brought inside the square root, and since

{\tilde{D}}^{- 1}

is diagonal, this gives the determinant of the matrix inside the square root, and so the partition function assumes the form

\begin{matrix} Z_{G} & ≃ & e^{- m S_{eff} [{\tilde{Δ}}_{c}]} \frac{1}{\sqrt{det ({\tilde{D}}^{- 1})}}, \end{matrix}

(107)

and the partition function can be written as a single exponential,

\begin{matrix} Z_{G} & ≃ & e^{- {m S_{eff} [{\tilde{Δ}}_{c}] + \frac{1}{2} ln det ({\tilde{D}}^{- 1})}} . \end{matrix}

(108)

The grand canonical potential is then

Ω_{G} = - \frac{1}{β} ln Z_{G} ≃ \frac{1}{β} {m S_{eff} [{\tilde{Δ}}_{c}] + \frac{1}{2} ln det ({\tilde{D}}^{- 1})} .

(109)

If we formally write the expansion in terms of orders in m, we have

Ω_{G} ≃ m Ω_{m = 1} + Ω_{m = 0} + \frac{1}{\sqrt{m}} Ω_{m = - \frac{1}{2}} + \frac{1}{m} Ω_{m = - 1} + \dots .

(110)

The leading term in the expansion is therefore given by the saddle point value

m Ω_{m = 1} = m Ω_{c} = \frac{m}{β} S_{eff} [{\tilde{Δ}}_{c}],

(111)

and what is known as the “one-loop" correction term is

Ω_{m = 0} = \frac{1}{2} ln det ({\tilde{D}}^{- 1}) = \frac{1}{2} Tr ln {\tilde{D}}^{- 1} .

(112)

7. The Saddle-Point or Mean Field Approximation

The first task in this section is to find an equation that determines the value of the auxiliary field at the saddle-point or mean-field value of the the effective action. In order to find this saddle point, we must set the first derivative of the effective action in Eq. (99) with respect to the auxiliary field

{\tilde{Δ}}_{k, σ}

to zero. That derivative is given by

\frac{\partial S_{eff} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k, σ}} = - β λ_{k, σ} {\tilde{Δ}}_{k, σ} - β \sum_{ℓ, b} \frac{1}{(\sum_{γ^{'}} {\tilde{a}}_{ℓ, b}^{(γ^{'})} [\tilde{Δ}])} \sum_{γ} \frac{\partial {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k, σ}} .

(113)

The activity

a_{ℓ, b}^{(γ)} [\tilde{Δ}]

for species

γ

is given by Eq. (95), and its derivative is

\frac{\partial {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k, σ}} = - β {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}] \frac{\partial {\tilde{Σ}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k, σ}},

(114)

where from Eq. (91), the derivative of the self-energy is

\frac{\partial {\tilde{Σ}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k, σ}} = q_{γ} λ_{k, σ} W_{(k, σ), (ℓ, b)}^{T} .

(115)

Substituting this into the derivative of the action, we have

\frac{\partial S_{eff} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k, σ}} = - β λ_{k, σ} {\tilde{Δ}}_{k, σ} + β λ_{k, σ} \sum_{ℓ, b} W_{(k, σ), (ℓ, b)}^{T} \frac{\sum_{γ} q_{γ} {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{\sum_{γ^{'}} {\tilde{a}}_{ℓ, b}^{(γ^{'})} [\tilde{Δ}]} .

(116)

Setting this first derivative to zero, we obtain an equation whose solution gives the auxiliary field in the saddle point approximation,

{\tilde{Δ}}_{(k, σ)} = \sum_{ℓ, b} W_{(k, σ), (ℓ, b)}^{T} (\frac{\sum_{γ} q_{γ} {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{\sum_{γ^{'}} {\tilde{a}}_{ℓ, b}^{(γ^{'})} [\tilde{Δ}]}) .

(117)

We see that on the right-hand side of the equation is the transformation to the diagonal basis of the potential, as given in Eq. (92). We could therefore write the auxiliary field in the position basis as

Δ_{(ℓ, b)} = \frac{\sum_{γ} q_{γ} a_{ℓ, b}^{(γ)} [Δ]}{\sum_{γ^{'}} a_{ℓ, b}^{(γ^{'})} [Δ]},

(118)

where

a_{ℓ, b}^{(γ)} [Δ]

is the activity with the self energy in the diagonal basis

{\tilde{Σ}}_{ℓ, b}^{(γ)} [\tilde{Δ}]

, which by Eq. (93) has been replaced by the self energy in the position basis,

Σ_{ℓ, b}^{(γ)} [Δ]

. Either of these two equations, Eq. (117) or (118), may be considered the equation giving the saddle-point, or mean-field, value of the auxiliary field, and we will refer to this as the mean field equation. In this expression

a_{ℓ, b}^{(γ)} [Δ_{c}]

is a function of all the

Δ_{(ℓ, b), c}

elements, and so, for

N = 30

, which we have used in our calculations, there are

N_{sites} = 2 (2 N + 1) = 122

coupled nonlinear equations. Therefore, for simplicity, we make the assumption that the mean field is spatially uniform, which means that

Δ_{(ℓ, b), c} = Δ_{c},

(119)

where

Δ_{c}

is constant. This is still a nonlinear equation in

Δ_{c}

, and so this must be solved numerically with an iterative approach.

Before solving the mean field equation, it is useful to have a physical interpretation of the mean auxiliary field

Δ_{c}

. The first step in this direction is the second task of this section, which is to find the mean site occupancies of species

γ

at the mean field level. This is done as in the noninteracting case, by taking the derivative of the partition function with respect to

μ_{γ}

. At this mean field level, the

μ_{γ}

dependence is contained in

S_{eff} [{\tilde{Δ}}_{c}]

in Eq. (108), neglecting the term containing the inverse propagator

D^{- 1}

. The expression for the mean occupancy can be gotten using Eq. (83) as

{〈 n^{(γ)} 〉}_{c} = \frac{1}{N_{sites} β} \frac{\partial}{\partial μ_{γ}} ln [e^{- S_{eff} [{\tilde{Δ}}_{c}]}] .

(120)

Taking the derivative of the exponential, we can cancel out the partition function in the denominator and we are left with the derivative of the effective action as

{〈 n^{(γ)} 〉}_{c} = - \frac{1}{N_{sites} β} \frac{\partial}{\partial μ_{γ}} S_{eff} [{\tilde{Δ}}_{c}] .

(121)

Substituting the effective action from Eq. (99), we have

{〈 n^{(γ)} 〉}_{c} = \frac{1}{N_{sites} β} \sum_{ℓ, b} \frac{1}{(\sum_{γ^{'}} a_{ℓ, b}^{(γ^{'})} [Δ_{c}])} \frac{\partial a_{ℓ, b}^{(γ)} [Δ_{c}]}{\partial μ_{γ}} .

(122)

The derivative of the activity is

\frac{\partial a_{ℓ, b}^{(γ)} [Δ_{c}]}{\partial μ_{γ}} = β a_{ℓ, b}^{(γ)} [Δ_{c}],

(123)

and so the mean occupancy becomes

{〈 n^{(γ)} 〉}_{c} = \frac{1}{N_{sites}} \sum_{ℓ, b} \frac{a_{ℓ, b}^{(γ)} [Δ_{c}]}{(\sum_{γ^{'}} a_{ℓ, b}^{(γ^{'})} [Δ_{c}])} .

(124)

The third task of this section is to relate the mean charge density per site to the mean occupancy. The charge density has the same form as in Eq. (22) for the noninteracting model, so that at the mean field level we obtain

ρ_{c} = {〈 ρ 〉}_{c} = \sum_{γ} q_{γ} {〈 n^{(γ)} 〉}_{c} = \frac{\sum_{γ} q_{γ} a_{ℓ, b}^{(γ)} [Δ_{c}]}{\sum_{γ^{'}} a_{ℓ, b}^{(γ^{'})} [Δ_{c}]},

(125)

which, like the mean site occupancy, is independent of the site index because of our choice of spatially uniform mean field. This is equivalent to the equation for the saddle-point value of the auxiliary field because the right-hand side of the equation shows that

ρ_{c} = Δ_{c} .

(126)

on comparison with Eq. (118)

In the mean field or saddle point equation for the spatially uniform saddle point, the auxiliary field is the same for all lattice sites, and the self energy becomes spatially uniform also and can be written as

Σ_{ℓ, b}^{(γ)} [Δ_{c}] \equiv Σ_{c}^{(γ)} = ρ_{c} q_{γ} S_{lattice} .

(127)

where

S_{lattice}

is a lattice sum that is independent of the choice of lattice site

(ℓ, b)

for a long DNA double helix, and is given by

S_{lattice} = \sum_{ℓ^{'}, b^{'}} V_{(ℓ^{'}, b^{'}), (ℓ, b)} .

(128)

For the parameters used in the figures,

β S_{lattice} \approx 2.71

Consequently, at the mean field level, the self-energy simply represents the electrostatic energy of charge

q_{γ}

interacting with the rest of the DNA lattice. This term plays the role of the self-energy

Σ

in many-body quantum mechanics [28,48,49]. Since the self-energy is independent of lattice site, so is the activity, which can be written as

a_{c}^{(γ)} = a_{ℓ, b}^{(γ)} [Δ_{c}] = e^{- β (ε_{γ} - μ_{γ} + Σ_{c}^{(γ)})} .

(129)

The simplifications made possible by the assumption of a spatially uniform saddle point allow us to rewrite the mean field equation given by Eq. (118) in an explicit form as

ρ_{c} = \frac{\sum_{γ} q_{γ} e^{- β (ε_{γ} - μ_{γ} + q_{γ} ρ_{c} S_{lattice})}}{\sum_{γ} e^{- β (ε_{γ} - μ_{γ} + q_{γ} ρ_{c} S_{lattice})}} .

(130)

Again, recall that

ε_{v} - μ_{v} = 0

, identically,

q_{‖} = 0

, and

μ_{‖} = μ_{⊥} = μ_{dimer}

. Therefore, when

ρ_{c} = 0

, we can see from Eq. (130) that

q_{⊥} e^{- β (ε_{⊥} - μ_{dimer})} + q_{v} = 0

, and since

q_{⊥} = + 1

, and

q_{v} = - 1

, it follows that

ε_{⊥} = μ_{dimer}

. Therefore, this is the point at which the charge density

ρ_{c}

goes to zero. When

ε_{‖} = 2 ε_{⊥}

, the charge density goes to zero at

ε_{‖} = 2 μ_{dimer}

, which is the same place it went to zero for the noninteracting model, as shown in Figure 6.

Returning to the mean site occupancy in Eq. (124), the quantity inside the sum over ℓ and b is constant and the sum yields

N_{sites}

, and so the average occupancy per species becomes

{〈 n^{(γ)} 〉}_{c} = \frac{a_{c}^{(γ)}}{(\sum_{γ^{'}} a_{c}^{(γ^{'})})} .

(131)

Using the mean-field value

ρ_{c} = Δ_{c}

of the charge density, calculated from Eq. (130), we have calculated the average occupation numbers

〈 n^{(γ)} 〉

via Eq. (131) using Eq. (129) for the activities. In Figure 15, we have plotted

δ {〈 n^{(γ)} 〉}_{c} = {〈 n^{(γ)} 〉}_{c} - {〈 n^{(γ)} 〉}_{NI}

, which are the differences in the mean-field occupancies from the occupancies in noninteracting case shown in Figure 5. The corresponding change in the charge density

δ ρ_{c} = ρ_{c} - ρ_{NI}

is shown as the solid blue curve in Figure 16.

The shifts in occupancy plotted in Figure 15 can be primarily understood as a consequence of the electromagnetic “self energy”

Σ_{c}^{(γ)} = ρ_{c} q_{γ} S_{lattice}

of each species

γ

interacting with the total charge

ρ_{c}

of the lattice. Immediately, this implies that the parallel dimers are hardly modified at all, since they are electrically neutral and not directly affected by the total charge

ρ_{c}

of the lattice. For the vacancies and perpendicular dimers, which are charged, the mean field corrections plotted in Figure 15 and Figure 16 act to reduce the overall charge of the lattice. Below

β ε_{‖} \approx 2

, the total charge of the lattice is positive, so the mean field corrections penalize the positively-charged perpendicular dimers and enhance the negatively-charged vacancies. Above

β ε_{‖} \approx 2

, the total charge of the lattice is negative, and this effect is reversed. Thus, the mean-field corrections tend to reduce (but not eliminate) the charge inversion at weak binding seen in Figure 5. Note that the biggest effect of this electrostatic self energy is to heavily penalize the “naked lattice” of negatively-charged vacancies which was the favored ground state at large positive

ε_{‖}

for the non-interacting Hamiltonian.

The fourth task of this section is to determine variance of the charge density from the mean field, similar to Eq. (24) for the noninteracting model, which gives a measure of the importance of spatial fluctuations in the charge density, can be written in the form

σ_{c}^{2} = {〈 ρ^{2} 〉}_{c} - ρ_{c}^{2},

(132)

where the average

{〈 ρ^{2} 〉}_{c}

of the square of the charge density, similar to Eq. (29) for the noninteracting model, is

{〈 ρ^{2} 〉}_{c} = \sum_{γ} q_{γ}^{2} {〈 n^{(γ)} 〉}_{c} = \frac{\sum_{γ} q_{γ}^{2} a_{c}^{(γ)}}{\sum_{γ^{'}} a_{c}^{(γ^{'})}} .

(133)

The variance

σ_{c}^{2}

will be useful in knowing the size of the charge fluctuations about the mean field, and this quantity will appear in the next level of approximation. The difference in the standard deviation

δ σ_{c} = σ_{c} - σ_{NI}

is plotted as the red dashed curve in Figure 16, where

σ_{NI}

for the noninteracting model was plotted as the red dashed curve in Figure 6.

Using a spatially uniform saddle point, which yields a charge density

ρ_{c}

that is equal at the mean field level to the auxiliary field

Δ_{c}

in real space, it is interesting to write down the auxiliary field in the diagonal basis, given by

{\tilde{Δ}}_{(k, σ), c} = \sum_{ℓ, b} W_{(k, σ), (ℓ, b)}^{T} {〈 ρ_{ℓ, b} 〉|}_{c} = ρ_{c} \sum_{ℓ, b} W_{(k, σ), (ℓ, b)}^{T} .

(134)

From Eqs. (71) and (72), we see that when

k \neq 0

W_{(k, σ), (ℓ, b)}^{T}

has the ℓ dependence of the form

e^{\pm i k ℓ}

, which by Eq. (57) sums to zero. The only surviving term is for

k = 0

which when summed over ℓ gives

2 N + 1

, and we have from Eqs. (73) and (134)

{\tilde{Δ}}_{(k, σ), c} = {\tilde{ρ}}_{(k, σ), c} = δ_{k, 0} ρ_{c} \sqrt{2 N + 1} \sum_{b} ξ_{b, σ} (0) .

(135)

There are two different results corresponding to the two different eigenvalue labels

σ = +

and

σ = -

in Eq. (63). Summing over b, these are

\begin{matrix} {\tilde{Δ}}_{(0, +), c} & = & {\tilde{ρ}}_{(0, +), c} = ρ_{c} \sqrt{N_{sites}} \\ {\tilde{Δ}}_{(0, -), c} & = & {\tilde{ρ}}_{(0, -), c} = 0, \end{matrix}

(136)

where

N_{sites} = 2 (2 N + 1)

. The interpretation of Eq. (136) can be understood from Eq. (74). The sum over b in Eq. (135) is a sum over the elements rows in Eq. (74), which for

σ = +

sums the mean charge densities on the two chains and for

σ = -

subtracts the charge densities on the two chains.

The fifth task of this section is to calculate the entropy for the spatially uniform saddle point, which is our mean field value of the entropy. This requires the effective action at this saddle point, which becomes

S_{eff, c} = - N_{sites} ln (\sum_{γ} a_{c}^{(γ)}) - \frac{ρ_{c}^{2}}{2} N_{sites} β S_{lattice} .

(137)

The grand canonical potential is then

Ω_{m = 1} = Ω_{c} = \frac{1}{β} S_{eff, c} .

(138)

We can now compute the entropy from the derivative of the grand thermodynamic potential with respect to the inverse temperature

β

, as in Section 2. This derivative becomes much more complicated with the introduction of

U_{int}

because the screened Coulomb potential (43) depends on

β

through the screening vector

q_{s}

(44), where

\frac{\partial q_{s}}{\partial β} = \frac{q_{s}}{2 β} .

(139)

This introduces a

β

-dependence into both terms of Eq. (137), so that the mean field entropy is given by the more complex expression

{\bar{S}}_{c} = \frac{S_{c}}{k_{B} N_{sites}} = \frac{S_{c}^{(A)} + S_{c}^{(B)}}{k_{B} N_{sites}},

(140)

where

{\bar{S}}_{c}^{(A)} = \frac{S_{c}^{(A)}}{k_{B} N_{sites}} = - \sum_{γ} {〈 n^{(γ)} 〉}_{c} ln {〈 n^{(γ)} 〉}_{c}

(141)

is the mixing entropy of the three species using the mean field occupation numbers and

{\bar{S}}_{c}^{(B)} = \frac{S_{c}^{(B)}}{k_{B} N_{sites}} = \frac{ρ_{c}^{2}}{2} β^{2} \frac{\partial S_{lattice}}{\partial β}

(142)

arises directly from the self energy

Σ \propto S_{lattice}

. The occupation numbers

{〈 n^{(γ)} 〉}_{c}

used in this expression are those of the mean-field level, given by Eq. (131) and shown in Figure 15. The mean-field entropy

{\bar{S}}^{(0)}

, given in Eq. (140), is plotted in Figure 17, together with

- {〈 n^{(γ)} 〉}_{c} ln {〈 n^{(γ)} 〉}_{c}

for each species. Also shown are the two constituents

{\bar{S}}_{c}^{(A)}

and

{\bar{S}}_{c}^{(B)}

of the total entropy. This plot shows that, as with

S_{N I}

in Figure 7, the mean-field entropy is greatest for

ε_{‖}

near

μ_{dimer}

and decreases outside this region. The difference

δ {\bar{S}}_{c} = \frac{S_{c} - S_{NI}}{k_{B} T}

between this mean-field entropy for the interacting model and the entropy for the noninteracting model is plotted in Figure 18. Note that this curve has roughly the same shape as that of

δ σ_{c}

in Figure 16, with a large increase in disorder at large positive

ε_{‖}

and small changes elsewhere.

By employing field-theoretic techniques borrowed from quantum mechanics, we have included the interaction energy

U_{int}

in the partition function and found a mean-field approximation to the exact partition function. However, these mean-field level calculations assume the same average charge

ρ_{c}

for all DNA molecules in the same solution. Thus, mean-field calculations would predict a repulsive interaction between two DNA molecules, which would not lead to DNA condensation. Indeed, this repulsive interaction is all that this zeroth-order calculation can predict. It is the fluctuations of the charge from its mean-field value that enable the net attraction observed experimentally; to understand the attractive forces between DNA molecules in solution, it is necessary to extend this treatment by at least one additional order. The next order in the expansion (100) will yield Gaussian-type integrals, which are analytically solvable for the lowest-order fluctuations in the average charge and other parameters. Furthermore, once lowest-order fluctuations have been included, it will be possible to calculate thermodynamic properties of the system from the partition function to meaningful levels, such as the entropy changes due to small fluctuations in the local charge.

8. Inverse-Hartree-field-fluctuation propagator

In order to calculate the inverse-Hartree-field-fluctuation propagator

{\tilde{D}}^{- 1}

, we need to find the second derivatives of the effective action, evaluated at the saddle point. Since

{\tilde{D}}^{- 1}

is diagonal, from Eq. (102), we can write

{({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})} = (- \frac{1}{β λ_{k_{1}, σ_{1}}}) {\frac{\partial^{2} S_{eff} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k_{1}, σ_{1}} \partial {\tilde{Δ}}_{k_{2}, σ_{2}}}|}_{{\tilde{Δ}}_{c}} .

(143)

Since we already calculated the first derivative in Section 7, we need one more derivative, now with respect to

{\tilde{Δ}}_{k_{2}, σ_{2}}

. Substituting from Eq. (116) yields

\begin{matrix} {({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})} & = & (- \frac{1}{β λ_{k_{1}, σ_{1}}}) \frac{\partial}{\partial {\tilde{Δ}}_{k_{2}, σ_{2}}} \{- β λ_{k_{1}, σ_{1}} {\tilde{Δ}}_{k_{1}, σ_{1}} + \\ {+ β λ_{k_{1}, σ_{1}} \sum_{ℓ, b} W_{(k_{1}, σ_{1}), (ℓ, b)}^{T} \frac{\sum_{γ} q_{γ} {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{\sum_{γ^{'}} {\tilde{a}}_{ℓ, b}^{(γ^{'})} [\tilde{Δ}]}\}|}_{{\tilde{Δ}}_{c}} . \end{matrix}

(144)

Performing the derivative, we have

\begin{matrix} {({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})} & = & \{1 - \sum_{ℓ, b} W_{(k_{1}, σ_{1}), (ℓ, b)}^{T} \frac{\sum_{γ} q_{γ} \frac{\partial {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k_{2}, σ_{2}}}}{\sum_{γ^{'}} {\tilde{a}}_{ℓ, b}^{(γ^{'})} [\tilde{Δ}]} + \\ {+ \sum_{ℓ, b} W_{(k_{1}, σ_{1}), (ℓ, b)}^{T} \frac{\sum_{γ} q_{γ} {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{{(\sum_{γ^{'}} {\tilde{a}}_{ℓ, b}^{(γ^{'})} [\tilde{Δ}])}^{2}} \sum_{γ^{''}} \frac{\partial {\tilde{a}}_{ℓ, b}^{(γ^{''})} [\tilde{Δ}]}{\partial {\tilde{Δ}}_{k_{2}, σ_{2}}}\}|}_{{\tilde{Δ}}_{c}} . \end{matrix}

(145)

Substituting the derivative of the activity from Eqs. (114) and (115), we have

\begin{matrix} {({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})} & = & \{1 + β λ_{k_{1}, σ_{1}} \sum_{ℓ, b} W_{(k_{1}, σ_{1}), (ℓ, b)}^{T} W_{(k_{2}, σ_{2}), (ℓ, b)}^{T} \times \\ {\times [\frac{\sum_{γ} q_{γ}^{2} {\tilde{a}}_{ℓ, b}^{(γ)}}{\sum_{γ^{'}} {\tilde{a}}_{ℓ, b}^{(γ^{'})} [\tilde{Δ}]} - {(\frac{\sum_{γ} q_{γ} {\tilde{a}}_{ℓ, b}^{(γ)} [\tilde{Δ}]}{\sum_{γ^{'}} {\tilde{a}}_{ℓ, b}^{(γ^{'})} [\tilde{Δ}]})}^{2}]\}|}_{{\tilde{Δ}}_{c}} . \end{matrix}

(146)

The quantity in square brackets is just the constant

σ_{c}^{2}

, the variance of the charge density, given in Eq. (132). Since

W

from Eqs. (71), (72), and (73), is an orthogonal matrix,

\sum_{ℓ, b} W_{(k_{1}, σ_{1}), (ℓ, b)}^{T} W_{(k_{2}, σ_{2}), (ℓ, b)}^{T} = δ_{k_{1} k_{2}} δ_{σ_{1} σ_{2}},

(147)

and this means that the matrix is diagonal. With these substitutions, the inverse-Hartree-field-fluctuation propagator becomes

{({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})} = δ_{k_{1} k_{2}} δ_{σ_{1} σ_{2}} (1 + σ_{c}^{2} β λ_{k_{1}, σ_{1}}) .

(148)

This is the inverse propagator for the fluctuations of the auxiliary Hartree field that we introduced to decouple the interparticle interactions, expressed in the diagonal basis.

The inverse propagator

{({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})}

in the diagonal basis is plotted in Figure 19 for the two eigenvalues

λ_{k \pm}

, shown in Figure 14. Notice that the shapes are those of the eigenvalues, but shifted upward. They both have minima at

k = π

, with the one corresponding to

λ_{k +}

having its maximum at

k = 0

and the one corresponding to

λ_{k -}

having maxima near

k = 0

. The corresponding plots of the propagator

{\tilde{D}}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})}

are shown in Figure 20, also in the diagonal basis. The propagator increases with

β ε_{‖}

near

k = \pm π

, and this can be attributed, at least in part, to the increase in

σ_{c}

, shown in Figure 16, because of the dependence shown in Eq. (146). Although the auxiliary field propagator normally describes the behavior of fluctuations of a physical field like the charge, its definition given by Eq. (106) shows that the field here have been rescaled by the square root of the eigenvalue. This was discussed earlier in the context of entropy in the paragraph after Eq. (88), and so this form is convenient for the entropy calculation, our focus here, but its interpretation is not quite so direct.

9. Inclusion of Lowest-Order Fluctuations

Going beyond mean field to one-loop order requires use of the full partition function given by Eqs. (106) and (108). This has consequences for the site occupancies, the average charge per site, the charge variance, and the entropy. The expression for site occupancy has the same form in terms of the partition function as for the noninteracting and mean field cases in Eqs. (9) and (83), and can be written explicitly for the one-loop order of approximation as

{〈 n^{(γ)} 〉}_{tot} = \frac{1}{N_{sites} β Z_{G}} \frac{\partial}{\partial μ_{γ}} [\frac{e^{- S_{eff} [{\tilde{Δ}}_{c}]}}{\sqrt{det ({\tilde{D}}^{- 1})}}],

(149)

which is obtained from Eq. (111) with m set equal to one. In that expression, m was the parameter that was introduced in Section 6 as an aid to identifying the various orders in the expansion. This can be written as the sum of two terms,

{〈 n^{(γ)} 〉}_{tot} = {〈 n^{(γ)} 〉}_{c} + δ 〈 n^{(γ)} 〉,

(150)

where

{〈 n^{(γ)} 〉}_{c}

is the occupancy of species

γ

at the mean field level, given by Eq. (131) ,

{〈 n^{(γ)} 〉}_{c} = - \frac{1}{N_{sites} β} \frac{\partial}{\partial μ_{γ}} S_{eff} [{\tilde{Δ}}_{c}],

(151)

and

δ 〈 n^{(γ)} 〉

is the correction to that mean site occupancy due to fluctuations, which is given by

δ 〈 n^{(γ)} 〉 = \frac{1}{N_{sites} β} \frac{\partial}{\partial μ_{γ}} ln [det ({\tilde{D}}^{- 1})],

(152)

with the inverse propagator

{\tilde{D}}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})}^{- 1}

from Eq. (148) written in matrix form as

{\tilde{D}}^{- 1} = I + σ_{c}^{2} β Λ,

(153)

where I is the identity matrix and

Λ

is the (diagonal) matrix of eigenvalues of the potential. Since

{\tilde{D}}^{- 1}

is diagonal, its determinant is given by the product of its diagonal elements as

det {\tilde{D}}^{- 1} = \prod_{k, σ} (1 + β λ_{k, σ} σ_{c}^{2}) .

(154)

The logarithm of this determinant then gives the sum over logarithms of diagonal elements as

ln (det {\tilde{D}}^{- 1}) = \sum_{k, σ} ln (1 + β λ_{k, σ} σ_{c}^{2}) .

(155)

The only

μ_{γ}

dependence is contained in

σ_{c}^{2}

, and its derivative is given by

\frac{\partial σ_{c}^{2}}{\partial μ_{γ}} = β {〈 n^{(γ)} 〉}_{c} [q_{γ}^{2} - {〈 ρ^{2} 〉}_{c} - 2 ρ_{c} (q_{γ} - ρ_{c})],

(156)

so that the derivative of

ln (det D^{- 1})

is given by the coefficient

C = \frac{1}{2 N_{sites}} \sum_{k, σ} \frac{β λ_{k σ}}{[1 + β λ_{k, σ} σ_{c}^{2}]} .

(157)

Then the mean fluctuation in occupancy

δ 〈 n^{(γ)} 〉

becomes

δ 〈 n^{(γ)} 〉 = C {〈 n^{(γ)} 〉}_{c} [q_{γ}^{2} - {〈 ρ^{2} 〉}_{c} - 2 ρ_{c} (q_{γ} - ρ_{c})] .

(158)

These changes in the mean field occupancies at this one loop order are plotted in Figure 21. These changes are not large, and the fluctuation corrections amount to no more than about 10% of the total. The most dramatic effects are seen in the region of large positive

ε_{‖}

, with the corrections to the vacancies and perpendicular dimers going in the opposite direction to the corrections due to the mean field in Figure 15. This can be interpreted as a relaxation of the rigid penalties imposed by the electrostatic self energies at the mean field level. An enhancement of parallel dimers is also seen across the whole range of

ε_{‖}

that peaks around

ε_{‖} = μ_{dimer}

The mean charge density at this one-loop-correction level of approximation is given by the same form as Eqs. (22) and (125) for the noninteracting and mean field approximations and is given by

ρ_{tot} = {〈 ρ 〉}_{tot} = \sum_{γ} q_{γ} {〈 n^{(γ)} 〉}_{tot},

(159)

where now

{〈 n^{(γ)} 〉}_{tot}

is taken from Eqs. (150) and (158). and the charge density becomes

ρ_{tot} = ρ_{c} + δ ρ_{tot},

(160)

where

δ ρ_{tot} = ρ_{c} C (1 - 3 {〈 ρ^{2} 〉}_{c} + 2 ρ_{c}^{2}) .

(161)

The change in the charge density

δ ρ_{tot}

is shown as the solid blue curve in Figure 22. The most notable features are a negative contribution to the total charge above

β ε_{‖} \approx 2

, reflecting the addition of more negatively charged vacancies due to the fluctuations, and a positive contribution to the total charge around

ε_{‖} \approx μ_{dimer}

. Interestingly, this positive contribution is not associated with the increase of parallel dimer occupancy, since parallel dimers are electrically neutral. Rather, it is due to the fact that, although both the negatively charged vacancies and positively charged perpendicular dimers are both reduced around

ε_{‖} \approx μ_{dimer}

, the vacancies are decreased more.

The variance in the charge is given by

σ_{tot}^{2} = {〈 ρ^{2} 〉}_{tot} - ρ_{tot}^{2} .

(162)

The average of the square of the charge density is given by

{〈 ρ^{2} 〉}_{tot} = \sum_{γ} q_{γ}^{2} {〈 n^{(γ)} 〉}_{tot}

(163)

and is written explicitly as

{〈 ρ^{2} 〉}_{tot} = {〈 ρ^{2} 〉}_{c} + δ {〈 ρ^{2} 〉}_{tot},

(164)

where

{〈 ρ^{2} 〉}_{c}

is given by Eq. (133), and

δ {〈 ρ^{2} 〉}_{tot}

works out to be

δ {〈 ρ^{2} 〉}_{tot} = C [{〈 ρ^{2} 〉}_{c} (1 - {〈 ρ^{2} 〉}_{c} - 2 ρ_{c}^{2}) - 2 ρ_{c}^{2}] .

(165)

The charge variance can then be written as

σ_{tot}^{2} = σ_{c}^{2} + σ_{f}^{2},

(166)

where the fluctuation correction is

\begin{matrix} σ_{f}^{2} & = & - {〈 ρ^{2} 〉}_{c}^{2} C (1 + 9 ρ_{c}^{2} C) + {〈 ρ^{2} 〉}_{c} C [(1 + 8 ρ_{c}^{2}) + 6 ρ_{c}^{2} C (1 + 2 ρ_{c}^{2})] \\ - & ρ_{c}^{2} C [4 (1 + ρ_{c}^{2}) - C (1 + 4 ρ_{c}^{2} + 4 ρ_{c}^{4})] . \end{matrix}

(167)

The standard deviation can then be written as

σ_{tot} = σ_{c} + δ σ_{tot},

(168)

The change

δ σ_{tot}

in the standard deviation of the charge densities due to fluctuations are given in Figure 22, where the fluctuations account for changes typically of the order of 10% of the mean field value.

Having obtained results for the effects of fluctuations on the mean occupancies and charge densities, we now consider the corresponding corrections to the entropy. For that, we need the grand thermodynamic potential, which was expanded in Section 6, as Eqs. (110)-(112), with the mean field and lowest order fluctuation terms given by

Ω_{G} ≃ Ω_{c} + \frac{1}{2 β} ln det D^{- 1} .

(169)

The corresponding expression for the interacting entropy is

S_{tot} = S_{c} + δ S_{tot},

(170)

where the mean field entropy

S_{c}

was derived and the results presented in Section 7 in Eqs. (140)-(142). The fluctuation contribution to the entropy

δ S_{tot}

is given by

δ S_{tot} = k_{B} β^{2} \frac{\partial}{\partial β} \{\frac{1}{2 β} ln [det (D^{- 1})]\},

(171)

which requires performing another derivative.

To evaluate this contribution to the entropy

δ S_{tot}

in Eq. (171), we return to Eq. (155) and substitute into the fluctuation contribution to the entropy from Eq. (171), which after performing derivatives gives

δ S_{tot} = S_{f}^{(A)} + S_{f}^{(B)},

(172)

where

S_{f}^{(A)} = - \frac{k_{B}}{2} \sum_{k, σ} ln (1 + β λ_{k, σ} σ_{c}^{2})

(173)

and

S_{f}^{(B)} = \frac{k_{B}}{2} \sum_{k, σ} \frac{1}{(1 + β λ_{k, σ} σ_{c}^{2})} (β λ_{k, σ} σ_{c}^{2} + σ_{c}^{2} β^{2} \frac{\partial λ_{k, σ}}{\partial β} + β λ_{k, σ} β \frac{\partial σ_{c}^{2}}{\partial β}) .

The derivative of the eigenvalue

λ_{k, σ}

of the potential is given by

\frac{\partial λ_{k, σ}}{\partial β} = \frac{\partial}{\partial β} {\tilde{V}}_{↑ ↑} (k) + σ \frac{\partial}{\partial β} | {\tilde{V}}_{↑ ↓} (k) |,

(174)

where

{\tilde{V}}_{↑ ↑} (k)

is the Fourier transform of the interaction potential of a chain with itself and

{\tilde{V}}_{↑ ↓} (k)

is the Fourier transform of the interaction potential between the two chains. Both of these potentials depend on the screening vector

q_{s}

, and the only

β

dependence of the potential is contained in

q_{s}

, which is proportional to

\sqrt{β}

. The derivative we need is then given by Eq. (139). As a consequence, the derivative of the Fourier transform of the potential is

\frac{\partial {\tilde{V}}_{b_{1}, b_{2}} (k)}{\partial β} = \frac{1}{2} \sum_{ℓ = - N}^{N} [e^{- i k ℓ} \frac{\partial V_{b_{1}, b_{2}} (ℓ)}{\partial β} + e^{i k ℓ} \frac{\partial V_{b_{2}, b_{1}} (ℓ)}{\partial β}],

(175)

where the derivative of the potential in the position basis is given by

\frac{\partial V_{b_{1}, b_{2}} (ℓ)}{\partial β} = V_{b_{1}, b_{2}} (ℓ) (- \frac{q_{s}}{2 β} d_{b_{1}, b_{2}} (ℓ)) .

(176)

Substituting and using the symmetry relations in Eqs. (47) and (49), the derivative of the Fourier transform of the potential is

\frac{\partial {\tilde{V}}_{b_{1}, b_{2}} (k)}{\partial β} = - \frac{q_{s}}{2 β} \sum_{ℓ = - N}^{N} e^{- i k ℓ} V_{b_{1}, b_{2}} (ℓ) d_{b_{1}, b_{2}} (ℓ) .

(177)

Using these relations, the derivative of the eigenvalues becomes

\begin{matrix} β^{2} \frac{\partial λ_{k, σ}}{\partial β} & = & - \frac{q_{s}}{2} \sum_{ℓ = - N}^{N} \{β V_{↑ ↑} (ℓ) d_{↑ ↑} (ℓ) cos (k ℓ) \\ + σ V_{↑ ↓} (ℓ) d_{↑ ↓} (ℓ) [\frac{Re [{\tilde{V}}_{↑ ↓} (k)]}{| {\tilde{V}}_{↑ ↓} (k) |} cos (k ℓ) - \frac{Im [{\tilde{V}}_{↑ ↓} (k)]}{| {\tilde{V}}_{↑ ↓} (k) |} sin (k ℓ)]\}, \end{matrix}

(178)

where

Re [{\tilde{V}}_{↑ ↓} (k)]

and

Im [{\tilde{V}}_{↑ ↓} (k)]

are the real and imaginary parts of

{\tilde{V}}_{↑ ↓} (k)]

Eq. (172) also requires the derivative of the variance

σ_{c}^{2}

, where

σ_{c}^{2}

is given by Eq. (132). This contains the average charge density from Eq. (125) and the average of the square of the charge density from Eq. (133). This derivative is

\begin{matrix} β \frac{\partial σ_{c}^{2}}{\partial β} & = & \frac{ρ_{c}}{1 + σ_{c}^{2} β S_{lattice}} \{({[ρ E]}_{av} - ρ_{c} E_{c}) [2 + (1 - {〈 ρ^{2} 〉}_{c}) β S_{lattice}] + \\ + ρ_{c} (3 σ_{c}^{2} + ρ_{c}^{2} - 1) β^{2} \frac{\partial S_{lattice}}{\partial β}\} - ({[ρ^{2} E]}_{av} - {〈 ρ^{2} 〉}_{c} E_{c}), \end{matrix}

(179)

where

E \equiv β (ε_{γ} - μ_{γ} + ρ_{c} q_{γ} S_{lattice}),

(180)

E_{c} = \sum_{γ} E {〈 n^{(γ)} 〉}_{c},

(181)

{[ρ E]}_{av} = \sum_{γ} q_{γ} E {〈 n^{(γ)} 〉}_{c},

(182)

and

{[ρ^{2} E]}_{av} = \sum_{γ} q_{γ}^{2} E {〈 n^{(γ)} 〉}_{c} .

(183)

Here

{[\dots]}_{av}

is the weighted average of the specified quantity over the distribution given by the mean-field occupancies

{〈 n^{(γ)} 〉}_{c}

of the three species.

It is interesting to note that the combinations

{[ρ^{2} E]}_{av} - {〈 ρ^{2} 〉}_{c} E_{c}

and

{[ρ E]}_{av} - ρ_{c} E_{c}

can be written in form similar to that of the zeroth-order entropy. These forms are

{[ρ^{2} E]}_{av} - {〈 ρ^{2} 〉}_{c} E_{c} = - \sum_{γ} [{〈 n^{(γ)} 〉}_{c} ln {〈 n^{(γ)} 〉}_{c} (1 - \sum_{γ^{'}} {〈 n^{(γ^{'})} 〉}_{c} ln {〈 n^{(γ^{'})} 〉}_{c})]

(184)

and

{[ρ E]}_{av} - ρ_{c} E_{c} = - \sum_{γ} q_{γ} {〈 n^{(γ)} 〉}_{c} ln {〈 n^{(γ)} 〉}_{c} + ρ_{c} \sum_{γ} {〈 n^{(γ)} 〉}_{c} ln {〈 n^{(γ)} 〉}_{c} .

(185)

The fluctuation entropy contribution

S_{f}^{(A)}

is easy to calculate, given the eigenvalues

λ_{k, σ}

. Eqs. (174)-(185) simply collect the results that allow us to calculate

S_{f}^{(B)}

. The results for the total fluctuation entropy

δ S_{tot}

and the two contributions to it are shown in Figure 23 for

ε_{⊥} = \frac{1}{2} ε_{‖}

. Like the mean field entropy, the fluctuation entropy becomes larger in magnitude as the binding becomes weak. However, the fluctuation contribution is actually negative in this region, suggesting that the fluctuations are leading to increasing order. These corrections to the entropy are typically an order of magnitude larger than the difference between the mean field and noninteracting values of the entropy.

10. Discussion

Entropy forms an important contribution to the free energies of many biological systems. Here we summarize the results for the entropies and particle densities for a lattice gas model representing the adsorption of molecules on double-stranded DNA. The entropies and particle densities are presented at three successive levels of approximation, first, for the noninteracting system then at the mean-field level, and finally with the inclusion of fluctuations to one-loop order. Those fluctuation corrections result from correlations induced by the electrostatic forces among the dimer molecules themselves and with the electrically charged DNA substrate.

Perhaps the most transparent descriptors of the DNA system’s behavior are the average occupation numbers

〈 n^{(‖)} 〉

〈 n^{(⊥)} 〉

, and

〈 n^{(v)} 〉

representing the thermal average of the numbers of parallel-adsorbed dimers, perpendicular-adsorbed dimers, and vacancies, respectively, on each site. These particle densities are proportional to the probability that a given site would be found to be occupied by a particular species. These site-occupation numbers are shown in Figure 24 for the three levels of approximation. The short-dashed curves are for the noninteracting level of approximation

{〈 n^{(γ)} 〉}_{NI}

, the long-dashed curves for the mean field level

{〈 n^{(γ)} 〉}_{c}

, and the solid curves for the inclusion of fluctuations

{〈 n^{(γ)} 〉}_{tot}

. The corresponding charge densities and standard deviations of the charge density are shown in Figure 25.

The site occupancies exhibit similar qualitative behavior in all three levels of approximation. Due to the energetics of binding

ε_{‖} = 2 ε_{⊥}

, parallel dimers occupy the greatest fraction of sites for all negative binding energies

(ε_{‖}, ε_{⊥} < 0)

, followed the perpendicular dimers and then the vacancies, at least for the parameters used in these calculations. At weak binding (

ε_{‖}, ε_{⊥} \to 0

) in the noninteracting system, the difference between these energies becomes negligible, and both

n^{(‖)}

and

n^{(⊥)}

approach the same value of

40 %

. When the binding energy

ε_{‖}

ε_{⊥}

increases and reaches the chemical potential

μ_{dimer}

of the dimers in solution, they are repelled from the surface of the DNA and go into solution, leaving a “naked” lattice of negatively-charged vacancies, with the parallel dimers being more repelled than perpendicular ones.

In contrast to the case of noninteracting sites, the inclusion of interactions between sites gives a pronounced gap of about

25 %

by which parallel adsorption exceeds perpendicular adsorption in the weak binding limit. This enhancement of parallel adsorption in the mean-field approximation results from the inclusion of the charge on the perpendicular dimers in an average or mean field sense, which penalizes the addition of positively-charged perpendicular dimers when the total charge on the DNA lattice is positive. Fluctuations reduce this effect slightly. Recall that the perpendicular dimers are positively charged because one end is not attached to a negative binding site, and this means that it is energetically unfavorable to have a second one nearby. Parallel binding, on the other hand, neutralizes the negative charge on the sites, so that a DNA double strand completely covered in parallel-adsorbed dimers would be electrically neutral. The decrease in perpendicular-dimer occupancy caused by this electrostatic repulsion of like charges leads to an increase in sites occupied by vacancies. Vacancies are negatively charged and are energetically favored when the perpendicular dimers provide a positively-charged region. There may, in fact, be correlations resulting from the lower electrostatic energy of a configuration in which sites alternate between perpendicular dimers and vacancies, i.e., between positively and negatively charged sites, in a Wigner-lattice-like state similar to those described in the literature [8,19,26].

The average charge

ρ_{c}

per site depends only on the perpendicular dimers and vacancies since the parallel dimers neutralize sites, so that the charge density is written simply as

ρ = \sum_{γ} q_{γ} 〈 n^{(γ)} 〉 = 〈 n^{(⊥)} 〉 - 〈 n^{(v)} 〉,

(186)

and this connection can be seen in the plots of charge density

ρ

in Figure 25, when compared with Figure 24. For all approximations, the average charge is positive at negative and slightly positive binding energies, indicating charge inversion. This is a result of the occupation numbers shown in Figure 24, because parallel binding, which neutralizes the charge, increasingly dominates in the strong binding limit (

ε_{‖} = 2 ε_{⊥} ≪ 0

). Similarly, the decrease in the magnitude of the charge inversion from the noninteracting to the mean-field approximations can be explained by the enhancement of parallel adsorption and reduction of perpendicular adsorption due to like-charge repulsion. The fluctuation corrections to the site occupancies are not large when compared with the mean-field values. They only seem significant at positive binding energies where the site occupancies are shifted in the direction of the noninteracting system. Of course, the site occupancies are single-particle properties and do not measure spatial correlations.

The behavior of the entropy is shown in Figure 26, in which the entropy per site is plotted versus

β ε_{‖}

for all three levels of approximation. The three curves are practically indistinguishable for strong binding (negative values of

β ε_{‖}

) until the binding energy becomes quite weak (approaching zero), indicating little effect of correlations induced by electrostatic repulsion on the entropy in that region. This is because the dimers are closely packed, leaving little freedom for disorder. The pronounced differences are seen in the region of

ε_{‖} \approx μ_{dimer}

and more positive binding energies (

ε_{‖} = 2 ε_{⊥} ≫ 0

), because the system becomes more disordered. There, the dimers tend to leave the surface and enter the solution, and the fluctuation corrections to the entropy are greater in that region and act to reduce the entropy. This shows that the electrostatic correlations increase the order and reduce the entropy. Surprisingly, this decrease in entropy at large positive

ε_{‖}

due to fluctuations is so significant that it overcompensates for the entropy generated at the mean field level, leading to a total entropy

S_{tot}

that is even lower than in the noninteracting case

S_{NI}

The form of the noninteracting entropy comes from the lattice gas model, in which parallel adsorption is treated as “local," only occupying one site. When the geometrical blocking effect is included, in which parallel-adsorbed dimers occupy two sites, we would expect the entropy to differ significantly from the

n ln n

result even at the noninteracting level. It is unclear how large a contribution to the full entropy the blocking effect provides, but it is certainly lower for dimers than for longer polymers, for which the nonlocality is greater. Because of this, it will be important in future work to develop a way of incorporating the blocking effect into these types of field-theoretic models.

Dimers are certainly the simplest example of polymers, although biological polymers are typically considerably longer. There is extensive work done on the dimer model, beginning with the work of Fisher [50,51]. Some of this work has been motivated by the analogy between dimer models and quantum spin systems [36]. In this context, Fisher has derived expressions for the contributions of hard-core crowding effects to the partition function, including the geometrical blocking effect. Fisher’s results include an expression of the noninteracting partition function for dimers on a 1D linear lattice. In a future publication, we plan to adapt Fisher’s approach to study the statistical consequences of these blocking effects[52].

In addition, it has been suggested that methods can be adapted from analogous problems in particle physics[37] and condensed matter physics[53]. This involves using numerical codes like those of Adams and Chandrasekharan[37] that are used for simulations in lattice quantum chromodynamics.

Modeling the behavior of DNA and other biologically active polymeric molecules under physiological conditions is an area in which quantitative calculation is particularly difficult. This is because it is necessary to accommodate simultaneously the influences of geometry, electrostatic forces, and charge correlations. All of these play roles of varying significance in determining the properties of these molecules. Despite these challenges, considerable progress has been made in extending our physical understanding of these systems, and some surprising new physics, such as charge inversion and condensation, have been discovered in the process.

The work presented here focuses on electrostatic effects and especially on the role they play in the entropy, with the overall goal of determining the entropy as a contribution to the free energy and as a driving force in biochemical reactions. For the DNA-based model that we consider, we find that, particularly in the low-coverage regime, a uniform mean field theory gives changes

\sim 20 %

from the usual noninteracting entropy term. We also find, though, that the fluctuation contributions are both considerably larger and in the opposite direction. This establishes that the fluctuations are significant, and the opposite sign of their contribution suggests that they lead to additional order in the system.

Several aspects of the formalism and calculations presented here have been done in such a way as to be accessible to students and others wishing to extend these techniques to more realistic models of polyelectrolytes condensing on the surfaces of DNA. We have shown in Section 3 why the DNA helices can be treated as a pair of one-dimensional lattices (Figure 9), albeit with unusual interactions due to the three-dimensional nature of screened Coulomb potential, as shown in Figure 12. We show that by writing the interaction term in the diagonal basis, as in Eq. (68) and Figure 14, the problem can be greatly simplified. In addition, we have chosen a basis in which the eigenvalues and eigenvectors are real, in Eqs. (71) and (72), which makes the physical interpretation much more transparent. Then we show in detail in Eq. (84) and Appendix A how to perform a Hubbard-Stratanovich transformation of the partition function to decouple the sum over configurations from the sum over lattice sites, which allows one to perform the sum over all configurations of the system exactly. This introduces an integral over an auxiliary field

\tilde{Δ}

. Often such an auxiliary field emerges as a complicated complex matrix, but in our diagonal representation it is real and represents the charge density in the diagonal basis, which was shown explicitly for the mean field by comparing Eqs. (118) and (125). Also in this diagonal basis, it becomes clear how to choose a proper integration path to have a converging integral. One must then expand the effective action in a power series in the fluctuations of auxiliary field

δ \tilde{Δ}

, as in Eq. (100), which physically is in powers of the fluctuations of the charge density about the mean field. Setting the first derivative term in this expansion to zero yields the mean field, which itself includes terms of the interaction to all orders, as can be seen in Eq. (117) or (118). Whether the expansion is useful depends on the size of the charge fluctuations. The series is an asymptotic one, which means that the infinite series does not necessarily converge. However, as is true for all asymptotic expansions, it is useful as long as the last term kept in the expansion is smaller than the term before it, which is true in our case. This indicates that the mean field, which is the saddle point, is a reasonable first approximation of the system. The fluctuations about this mean field then give insight into how the actual solution differs from the mean field.

In conclusion, we have shown with this simple model the importance of including fluctuation contributions beyond the mean field. The method we have used has allowed us to include vacancies, and we have found that the inclusion of the fluctuation term decreases the entropy by ∼50% in the weak binding regime. This is because, due to the presence of vacancies, the bound dimer concentration is low because of the competition between dimers being repelled from the DNA molecule and the the chemical potential driving them from the solution onto the DNA surface. It is surprising that this decrease in entropy due to correlations is so significant that it overcompensates for the entropy increase at the mean field level, so that the total entropy is even lower than in the absence of interactions between lattice sites. This effect of fluctuations could be important for other biological systems as well, and this approach can be useful where correlations are significant.

Author Contributions

Writing–original draft preparation, M. D. S., M. F. B. and T. M.; writing–review and editing, M. D. S., M. F. B. and T. M. All authors contributed to this paper equally. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

We are grateful to Dr. Kevin R. Ward, Director of Research, Department of Emergency Medicine, University of Michigan, for introducing us to the role of charge in biological systems and to the practical consequences that could be achieved by controlling the charge distribution on biological molecules.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. A Gaussian Integral Identity

In this appendix, we prove the integral identity that we use in the derivation of the partition function

Z_{G}

for the interacting model. The identity is

I = \int \sqrt{\frac{β λ}{- 2 π}} d \tilde{Δ} e^{\frac{β}{2} λ {\tilde{Δ}}^{2} - β λ \tilde{Δ} \tilde{ρ}} e^{\frac{β}{2} λ {\tilde{ρ}}^{2}} = 1 .

(A1)

The integration path is over the real axis from

- \infty

to ∞ when

λ < 0

and over the imaginary axis from

- i \infty

i \infty

when

λ > 0

. This integral can be evaluated by noticing that the exponent is a factor times a perfect square,

\frac{β}{2} λ {\tilde{Δ}}^{2} - β λ \tilde{Δ} \tilde{ρ} + \frac{β}{2} λ {\tilde{ρ}}^{2} = \frac{β}{2} λ {(\tilde{Δ} - \tilde{ρ})}^{2} .

(A2)

Substituting, the integral becomes

I = \int \sqrt{\frac{β λ}{- 2 π}} d \tilde{Δ} e^{\frac{β}{2} λ {(\tilde{Δ} - \tilde{ρ})}^{2}} .

(A3)

Now it is clear why the path of integration must be chosen differently for positive versus negative

λ

, since it must be chosen such that the integral converges. If

λ = 0

, the identity fails, but in that case it is clearly unnecessary. If

λ < 0

, we can write

λ = - | λ |

, and the integral converges for

\tilde{Δ}

ranging over the real axis from

- \infty

to ∞. Substituting

y = \tilde{Δ} - \tilde{ρ}

, with y ranging from

- \infty

to ∞ the integral can be evaluated as

\begin{matrix} I & = & \int_{- \infty}^{\infty} \sqrt{\frac{β | λ |}{2 π}} d y e^{- \frac{β}{2} | λ | y^{2}} = \sqrt{\frac{β | λ |}{2 π}} \sqrt{\frac{2 π}{β | λ |}} \\ = & 1, (λ < 0) . \end{matrix}

(A4)

For

λ > 0

, the situation is slightly more complicated. In that case, in order to have a decaying Gaussian in the integrand, we must integrate

\tilde{Δ}

over the imaginary axis from

- i \infty

i \infty

. This is accomplished by substituting

y = - i (\tilde{Δ} - \tilde{ρ})

, for y ranging from

- \infty

to ∞. In this case, we have

\begin{matrix} I & = & i \int_{- \infty}^{\infty} \sqrt{\frac{β λ}{- 2 π}} d y e^{- \frac{β}{2} λ y^{2}} = i \sqrt{\frac{β λ}{- 2 π}} \sqrt{\frac{2 π}{β λ}} \\ = & 1, \end{matrix}

(A5)

where the i the numerator cancels the

\sqrt{- 1}

in the denominator. This proves the identity for positive

λ

. Therefore, we have seen that this identity works for positive and negative values of

λ

, and, although it does not work for

λ = 0

, it is not needed in that case.

Appendix B. Chemical Potential of Free Dimers

The chemical potential of a solution of free dimers is the derivative of the Helmholtz free energy of the dimers

F_{dimer}

with respect to the number of dimers

N_{dimer}

, with temperature and volume held constant,

μ_{dimer} = {(\frac{\partial F_{dimer}}{\partial N_{dimer}})}_{T, V} .

(A6)

Assuming that

F_{dimer}

is due only to electrostatic interactions, in a dilute solution the free energy is just the number of dimers times the electrostatic self energy

E_{dimer}

of a single dimer. Then the chemical potential becomes

μ_{dimer} = {(\frac{\partial (N_{dimer} E_{dimer})}{\partial N_{dimer}})}_{T, V} = E_{dimer},

(A7)

and so we need the electrostatic energy of a single dimer. Suppose we model a dimer as a cylinder of cross-section radius R and length L of total charge Q, as shown in Figure A1. An exact solution to this problem has been provided by Cifta[54], and the derivation proceeds as follows. The volume charge density of the cylinder is given by

ρ_{0} = \frac{Q}{π R^{2} L},

(A8)

and the Coulomb self-energy is given by

E_{s} = \frac{1}{4 π ϵ} (\frac{ρ_{0}^{2}}{2}) \int_{V} d^{3} r_{1} \int_{V} d^{3} r_{2} \frac{1}{| {\vec{r}}_{1} - {\vec{r}}_{2} |},

(A9)

where

ϵ

is the dielectric permeability of the medium surrounding the cylinder. In evaluating the integrals, the length to radius ratio is defined as

ξ = \frac{L}{R},

(A10)

and the exact result, as calculated by Cifta[54], is given by

E_{s} (ξ) = \frac{1}{4 π ϵ} (\frac{4 Q^{2}}{R ξ^{2}}) \{\frac{ξ}{16} [4 ln (2 ξ) - 3] + \frac{32}{45 π} - (\frac{1}{16 ξ})_{4} F_{3} (\frac{1}{2}, 1, 1, \frac{5}{2}; 2, 3, 4; - \frac{4}{ξ^{2}})\},

(A11)

where the function

_{p} F_{q} (a_{1}, a_{2}, \dots, a_{p}; b_{1}, b_{2}, \dots, b_{p}; z)

is the generalized hypergeometric function whose series expansion is

_{p} F_{q} (a_{1}, a_{2}, \dots, a_{p}; b_{1}, b_{2}, \dots, b_{p}; z) = \sum_{n = 0}^{\infty} \frac{{(a_{1})}_{n} {(a_{2})}_{n} \dots {(a_{p})}_{n}}{{(b_{1})}_{n} {(b_{2})}_{n} \dots {(b_{q})}_{n}} \frac{z^{n}}{n!},

(A12)

where the variable z can be either real or complex and

{(c)}_{n}

is the Pochhammer symbol defined as

{(c)}_{n} = \frac{Γ (c + n)}{Γ (c)}, n = 1, 2, \dots .

(A13)

Here

Γ (z)

is the gamma function and

{(c)}_{0} = 1

. For a long, thin rod, the logarithm term would dominate, but a dimer approximated by a cylinder, pictured in Figure A1, does not fall into this regime.

Figure A1. A free dimer approximated by a cylinder, shown in cross section, where the dimer is designed to attach to two phosphates separated by lattice spacing a. The length of the cylinder is

L = 2 a

and the radius is

R = \frac{a}{2}

, making the ratio

ξ = \frac{L}{R} = 4

Figure A1. A free dimer approximated by a cylinder, shown in cross section, where the dimer is designed to attach to two phosphates separated by lattice spacing a. The length of the cylinder is

L = 2 a

and the radius is

R = \frac{a}{2}

, making the ratio

ξ = \frac{L}{R} = 4

As shown in the picture, for the dimer, the radius of the dimer is

R = 2 a

, where a is the lattice spacing along a DNA helical chain. The length of the dimer is

L = 2 a

, and so the length ratio is

ξ = \frac{L}{R} = 4

. The charge is

Q = 2 e

, where e is the magnitude of the charge of an electron, and the chemical potential becomes

\begin{matrix} μ_{dimer} & = & E_{dimer} = \\ = & \frac{1}{4 π ϵ} (\frac{e^{2}}{8 a}) \{\frac{1}{4} [4 ln (8) - 3] + \frac{32}{45 π} - (\frac{1}{128})_{4} F_{3} (\frac{1}{2}, 1, 1, \frac{5}{2}; 2, 3, 4; - \frac{1}{4})\} . \end{matrix}

(A14)

For the parameters given in the text, the chemical potential for the dimer, in units of

β = \frac{1}{k_{B} T}

T = 300 K

is given by

β μ_{dimer} = 0.79

. It is interesting to note that approximately the same value was obtained by Nguyen and Shklovskii[27] and Bishop and McMullen[28] assuming that the dimer is an infinitely thin rod of length L but that there were upper and lower screening cutoffs. This exact solution avoids the use of those cutoffs.

References

Udgaonkar, J.B. Entropy in Biology. Resonance 2001, 6, 61–66. [Google Scholar] [CrossRef]
Lodish, H.; Berk, A.; Zipursky, S.L.; Matsudaira, P.; Baltimore, D.; Darnell, J. Molecular Cell Biology, 4th edition ed.; W.H. Freeman: New York, 2004. [Google Scholar]
Yoshidome, T. General Framework of Pressure Effects on Structures Formed by Entropically Driven Self-Assembly. Entropy 2010, 12, 1632–1652. [Google Scholar] [CrossRef]
Bishop, M.F.; Ferrone, F.A. The Sickle-Cell Fiber Revisited. Biomolecules 2023, 13, 413. [Google Scholar] [CrossRef] [PubMed]
Cao, Z.; Ferrone, F.A. Homogeneous Nucleation in Sickle Hemoglobin: Stochastic Measurements with a Parallel Method. Biophys. J. 1997, 72, 342–352. [Google Scholar] [CrossRef]
Wong, G.C.L.; Lin, A.; Tang, J.X.; Li, Y.; Janmey, P.A.; Safinya, C.R. Lamellar phase of two-dimensional rafts of actin filaments. Phys. Rev. Lett. 2003, 91, 018103. [Google Scholar] [CrossRef]
Angelini, T.; Liang, H.; Wriggers, W.; Wong, G. Like-charge attraction between polyelectrolytes induced by counterion charge density waves. Proc Natl Acad Sci U S A. 2003, 100, 8634–8637. [Google Scholar] [CrossRef]
Gelbart, W.M.; Bruinsma, R.F.; Pincus, P.A.; Parsegian, V.A. DNA-inspired electrostatics. Physics Today 2000, 53, 38–44. [Google Scholar] [CrossRef]
Bloomfield, V.A. Condensation of DNA by multivalent cations: Considerations on mechanism. Biopolymers 1991, 31, 1471–1481. [Google Scholar] [CrossRef]
Bloomfield, V.A. DNA Condensation by Multivalent Cations. Biopoly 1997, 44, 269–282. [Google Scholar] [CrossRef]
Raspaud, E.; Chaperon, I.; Leforestier, A.; Livolant, F. Spermine-Induced Aggregation of DNA, Nucleosome, and Chromatin. Biophysical Journal 1999, 77, 1547–1555. [Google Scholar] [CrossRef]
Manning, G.S. Limiting Laws and Counterion Condensation in Polyelectrolyte Solutions I. Colligative Properties. J. Chem. Phys. 1969, 51, 924–933. [Google Scholar] [CrossRef]
Manning, G.S. Limiting Laws and Counterion Condensation in Polyelectrolyte Solutions II. Self-Diffusion fo the Small Ions. J. Chem. Phys. 1969, 51, 934–938. [Google Scholar] [CrossRef]
Bloomfield, V.A. DNA Condensation. Current Opinion in Structural Biology 1996, 6, 334–341. [Google Scholar] [CrossRef] [PubMed]
Hud, N.V.; Vilfan, I.D. Toroidal DNA Condensates: Unraveling the Fine Structure and the Role of Nucleation in Determining Size. Ann. Rev. Biophys. Biomol. Struct. 2005, 34, 295–318. [Google Scholar] [CrossRef]
Raspaud, E.; Pelta, J.; de Frutos, M.; Livolant, F. Solubility and Charge Inversion of Complexes of DNA and Basic Proteins. Phys. Rev. Lett. 2006, 97, 068103. [Google Scholar] [CrossRef]
Kornyshev, A.A.; Lee, D.J.; Leikin, S.; Wynveen, A. Structure and interactions of biological helices. Reviews of Modern Physics 2007, 79, 943–996. [Google Scholar] [CrossRef]
Shklovskii, B.I. Screening of a macroion by multivalent ions: Correlation-induced inversion of charge. Phys. Rev. E 1999, 60, 5802–5811. [Google Scholar] [CrossRef]
Grosberg, A.Y.; Nguyen, T.T.; Shklovskii, B.I. Colloquium: The physics of charge inversion in chemical and biological systems. Rev. Mod. Phys. 2002, 74, 329–345. [Google Scholar] [CrossRef]
Ha, B.Y.; Liu, A.J. Conterion-Mediated Attraction between Two Like-Charged Rods. Phys. Rev. Lett. 1997, 79, 1289–1292. [Google Scholar] [CrossRef]
Ha, B.Y.; Liu, A.J. Effect of non-pairwise-additive interactions on bundles of rodlike polyelectrolytes. Phys. Rev. Lett. 1998, 81, 1011–1014. [Google Scholar] [CrossRef]
Ha, B.Y.; Liu, A.J. Counterion-mediated, non-pairwise-additive attractions in bundles of like-charged rods. Phys. Rev. E 1999, 60, 803–813. [Google Scholar] [CrossRef]
Ha, B.Y.; Liu, A.J. Effect of nonzero chain diameter on “DNA” condensation. Phys. Rev. E 2001, 63, 021503. [Google Scholar] [CrossRef]
Podgornik, R.; Parsegian, V.A. Charge-Fluctuation Forces between Rodlike Polyelectrolytes: Pairwise Summability Reexamined. Phys. Rev. Lett. 1998, 80, 1560. [Google Scholar] [CrossRef]
Shklovskii, B.I. Wigner crystal model of counterion induced bundle formation of rodlike polyelectrolytes. Phys. Rev. Lett. 1999, 82, 3286–3271. [Google Scholar] [CrossRef]
Nguyen, T.T.; Shklovskii, B.I. Model of inversion of DNA charge by a positive polymer: Fractionalization of the polymer charge. Phys. Rev. Lett. 2002, 89, 018101. [Google Scholar] [CrossRef] [PubMed]
Nguyen, T.T.; Shklovskii, B.I. Inversion of DNA charge by a positive polymer via fractionalization of the polymer charge. Physica A 2002, 310, 197–211. [Google Scholar] [CrossRef]
Bishop, M.F.; McMullen, T. Lattice-gas model of DNA charge inversion by a positively charged polyelectrolyte. Phys. Rev. E 2006, 74, 021906–(8). [Google Scholar] [CrossRef]
Sharp, K.A. Polyelectrolyte Electrostatics: Salt Dependence, Entropic, and Enthalpic Contributions to Free Energy in the Nonlinear Poisson-Boltzmann Model. Biopolymers 1995, 36, 227–243. [Google Scholar] [CrossRef]
Lehninger, A. Principles of Biochemistry; Worth: New York, 1982. [Google Scholar]
Maltsev, E.; Wattis, J.A.D.; Byrne, H.M. DNA charge neutralization by linear polymers. II. Reversible binding. Phys. Rev. E 2006, 74, 041918. [Google Scholar] [CrossRef]
Kunze, K.K.; Netz, R.R. Complexes of semiflexible polyelectrolytes and charged spheres as models for salt-modulated nucleosomal structures. Phys. Rev. E 2002, 66, 011918. [Google Scholar] [CrossRef]
Sievert, M.D. Entropy of fluctuations about the mean field in charge inversion on DNA. Master’s thesis, Virginia Commonwealth University, Richmond, VA, 2007.
Deng, H.; Bloomfield, V.A.; Benevides, J.M.; Thomas, G.J.J. Structural basis of polyamine–DNA recognition: Spermidine and spermine interactions with genomic B-DNAs of different GC content probed by Raman spectroscopy. Nucleic Acids Research 2000, 28, 3379–3385. [Google Scholar] [CrossRef] [PubMed]
van Dam, L.; Korolev, N.; Nordenskiöld. Polyamine-nucleic acid interactions and the effets on structure in oriented DNA fibers. Nucleic Acids Research 2002, 30, 419–428. [Google Scholar] [CrossRef]
Fisher, M.E. On the Dimer Solution of Planar Ising Models. J. Math. Phys. 1966, 7, 1776–1781. [Google Scholar] [CrossRef]
Adams, D.H.; Chandrasekharan, S. Chiral limit of strongly coupled lattice gauge theories. Nuclear Physics B 2003, 662, 220–246. [Google Scholar] [CrossRef]
Plischke, M.; Bergersen, B. Equilibrium Statistical Mechanics; Prentice-Hall: Englewood Cliffs, 1989. [Google Scholar]
Reif, F. Fundamentals of Statistical and Thermal Physics; McGraw-Hill: New York, 1965. [Google Scholar]
Moore, W.J. Physical Chemistry, fourth ed.; Prentice-Hall: Englewood Cliffs, 1972. [Google Scholar]
Moukhtar, J.; Fontaine, E.; Faivre-Moskalenko, C.; Arneodo, A. Probing Persistence in DNA Curvature Properties with Atomic Force Microscopy. Phys. Rev. Lett. 2007, 98, 178101. [Google Scholar] [CrossRef]
Arnott, S.; Hukins, D.W.L. Conservation of Conformation in Mono and Poly-nucleotides. Nature (London) 1969, 224, 886. [Google Scholar] [CrossRef]
Arnott, S.; Hukins, D.W.L. Optimised parameters for A-DNA and B-DNA. Biochem. Biophys. Res. Commun. 1972, 47, 1504. [Google Scholar] [CrossRef]
SYBYL. Molecular modeling package, Version 6.9.2. Tripos Associates: St. Louis, MO, 2006.
Misner, C.W.; Thorne, K.S.; Wheeler, J.A. Gravitation; Freeman: San Francisco, 1973. [Google Scholar]
Kittel, C. Introduction to Solid State Physics, seventh ed.; Wiley: Hoboken, NJ, 1996. [Google Scholar]
Ashcroft, N.W.; Mermin, N.D. Solid State Physics; Holt, Rinehart and Winston: New York, 1976. [Google Scholar]
Negele, J.W.; Orland, H. Quantum Many-Particle Systems; Addison-Wesley: Redwood City, California, 1988. [Google Scholar]
Altland, A.; Simons, B. Condensed Matter Field Theory, 2 ed.; Cambridge University Press: New York, 2010. [Google Scholar]
Fisher, M.E. Statistical Mechanics of Dimers on a Plane Lattice. Phys. Rev. 1961, 124, 1664–1672. [Google Scholar] [CrossRef]
Fisher, M.E.; Stephenson, J. Statistical Mechanics of Dimers on a Plane Lattice. II. Dimer Correlations and Monomers. Phys. Rev. 1963, 132, 1411–1431. [Google Scholar] [CrossRef]
Baker III, J.C. Application of the Fisher Dimer model to DNA condensation. Master’s thesis, Virginia Commonwealth University, Richmond, VA, 2017.
Syljuasen, O.F.; Sandvik, A.W. Quantum Monte Carlo with directed loops. Phys. Rev. E 2002, 66, 046701. [Google Scholar] [CrossRef]
Ciftja, O. Coulomb self-energy of a uniformly charged three-dimensional cylinder. Physica B 2012, 407, 2803–2807. [Google Scholar] [CrossRef]

Figure 1. Charge fractionalization occurs when only a fraction of the positive charges of the polyelectrolyte attach to the negatively-charged surface. The charges that are not attached then cause the surface to become positively charged. This shows a freely jointed polyelectrolyte chain of charge

3 e

is adapted from Figure 1b of Nguyen and Shklovskii [26].

3 e

is adapted from Figure 1b of Nguyen and Shklovskii [26].

Figure 2. Model of DNA as two helical chains of phosphates, shown as red and blue balls, with the yellow lines representing the base pairs. These have been labeled “up" and “down" chains, corresponding to the direction of the carbon atoms in the sugar backbone.

Figure 3. Polyamines as candidates for attachment to DNA. The diamines, including 1,3-diaminopropane (DAP

^{2 +}

), putrescine (Put

^{2 +}

), and cadaverine (Cad

^{2 +}

) have charge

+ 2 e

, with sufficient length to attach to adjacent phosphates along the DNA chain, with DAP

^{2 +}

the best fit. These are the most likely polyelectrolytes that the dimers in our model represent. Higher-charged polyamines spermidine(Spd

^{3 +}

) and spermine (Spn

^{4 +}

), shown for comparison, are considerably longer. The lattice spacing

a^{(1)}

between phosphates along a helix is shown as a dashed line for comparison.

Figure 3. Polyamines as candidates for attachment to DNA. The diamines, including 1,3-diaminopropane (DAP

^{2 +}

), putrescine (Put

^{2 +}

), and cadaverine (Cad

^{2 +}

) have charge

+ 2 e

, with sufficient length to attach to adjacent phosphates along the DNA chain, with DAP

^{2 +}

the best fit. These are the most likely polyelectrolytes that the dimers in our model represent. Higher-charged polyamines spermidine(Spd

^{3 +}

) and spermine (Spn

^{4 +}

), shown for comparison, are considerably longer. The lattice spacing

a^{(1)}

between phosphates along a helix is shown as a dashed line for comparison.

Figure 4. Double helix of negatively-charged phosphate chains of red and blue balls are wrapped around a cylinder. A parallel dimer attaching to one of the chains would neutralize two sites, while a perpendicular dimer would have one excess charge dangling into the solution, causing that site to have a net positive charge.

Figure 5. Plots of the mean occupation numbers

{〈 n^{(γ)} 〉}_{N I}

in the noninteracting lattice gas model. Curves are shown for

γ = ‖

(blue,solid), ⊥ (red,long-dashed), and v (green,short-dashed). The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. For negative

ε_{‖} - μ_{dimer}

, dimers are attracted to the lattice, while for positive values of this quantity, they are repelled.

Figure 5. Plots of the mean occupation numbers

{〈 n^{(γ)} 〉}_{N I}

in the noninteracting lattice gas model. Curves are shown for

γ = ‖

(blue,solid), ⊥ (red,long-dashed), and v (green,short-dashed). The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. For negative

ε_{‖} - μ_{dimer}

, dimers are attracted to the lattice, while for positive values of this quantity, they are repelled.

Figure 6. Plots of the charge densities for the noninteracting lattice gas model. The charge density

ρ_{N I}

is shown as the solid blue curve, and the standard deviation of the charge density,

σ_{c}

, is shown as the dashed red curve. The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. As in Figure 5, dimers are attracted to the lattice when

ε_{‖} - μ_{dimer} < 0

, and dimers are repelled from the lattice when

ε_{‖} - μ_{dimer} > 0

Figure 6. Plots of the charge densities for the noninteracting lattice gas model. The charge density

ρ_{N I}

is shown as the solid blue curve, and the standard deviation of the charge density,

σ_{c}

, is shown as the dashed red curve. The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. As in Figure 5, dimers are attracted to the lattice when

ε_{‖} - μ_{dimer} < 0

, and dimers are repelled from the lattice when

ε_{‖} - μ_{dimer} > 0

Figure 7. The entropy

S_{N I}

per site in the noninteracting lattice gas model. The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. The dashed curves are the individual contributions in Eq. (33).

Figure 7. The entropy

S_{N I}

per site in the noninteracting lattice gas model. The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. The dashed curves are the individual contributions in Eq. (33).

Figure 8. A segment of a single DNA strand. The helix winds along the cylindrical surface with a pitch angle

ψ

, and a helix of pitch angle

α

connects a site with its neighbor on the other chain. The helix has been stretched in the z direction so that the angle

α

, which is only

\approx 0 . 4^{\circ}

, can be clearly labeled on the diagram.

Figure 8. A segment of a single DNA strand. The helix winds along the cylindrical surface with a pitch angle

ψ

, and a helix of pitch angle

α

connects a site with its neighbor on the other chain. The helix has been stretched in the z direction so that the angle

α

, which is only

\approx 0 . 4^{\circ}

, can be clearly labeled on the diagram.

Figure 9. DNA double helix wrapped around a cylindrical tube. If the scissors cut along the solid green line, which is the center of the minor groove, then the paper can be put on a flat surface, with the double helix forming a one-dimensional chain. The coordinate

x^{(1)}

lies along the dashed brown line in the major groove, midway between the up and down chains, and

x^{(2)}

lies along a path on the surface of the cylinder that connects the two phosphates on the two sub-chains that are connected by base pairs.

x^{(1)}

lies along the dashed brown line in the major groove, midway between the up and down chains, and

x^{(2)}

lies along a path on the surface of the cylinder that connects the two phosphates on the two sub-chains that are connected by base pairs.

Figure 10. Diagram of the new coordinates

x^{(1)}

and

x^{(2)}

in terms of the cylindrical coordinates

(ϕ, z)

Figure 10. Diagram of the new coordinates

x^{(1)}

and

x^{(2)}

in terms of the cylindrical coordinates

(ϕ, z)

Figure 11. In the new coordinates

x^{(1)}

and

x^{(2)}

, the positions of the sites constitute a one-dimensional lattice with two sites per unit cell. Here

b = - \frac{1}{2}

for the “down"(↓) chain and

b = \frac{1}{2}

for the “up"(↑) chain.

Figure 11. In the new coordinates

x^{(1)}

and

x^{(2)}

, the positions of the sites constitute a one-dimensional lattice with two sites per unit cell. Here

b = - \frac{1}{2}

for the “down"(↓) chain and

b = \frac{1}{2}

for the “up"(↑) chain.

Figure 12. (a) Distance

d_{b_{1} b_{2}} (ℓ) / R_{DNA}

between sites and (b) screened Coulomb potential

V_{b_{1} b_{2}} (ℓ_{1} - ℓ_{2})

as a function of lattice index ℓ, within the same chain (

↑ ↑

) and between the two chains (

↑ ↓

). The red curve (

↑ ↑

), representing same-chain repulsion, is an even function, and the blue curve (

↑ ↓

), representing opposite-chain repulsion, is neither even nor odd. The point

ℓ = 0

for the same chain repulsion is omitted because it would correspond to a site interacting with itself. The inset in (b) is the screened Coulomb interaction as a function of the straight-line distance d between charges. At the physiological temperature of

T = 37^{\circ}

C (300.15 K),

β \equiv 1 / k_{B} T \approx

1/(27 meV).

Figure 12. (a) Distance

d_{b_{1} b_{2}} (ℓ) / R_{DNA}

between sites and (b) screened Coulomb potential

V_{b_{1} b_{2}} (ℓ_{1} - ℓ_{2})

as a function of lattice index ℓ, within the same chain (

↑ ↑

) and between the two chains (

↑ ↓

). The red curve (

↑ ↑

), representing same-chain repulsion, is an even function, and the blue curve (

↑ ↓

), representing opposite-chain repulsion, is neither even nor odd. The point

ℓ = 0

T = 37^{\circ}

C (300.15 K),

β \equiv 1 / k_{B} T \approx

1/(27 meV).

Figure 13. Plots of the Fourier transforms

{\tilde{V}}_{↑ ↑} (k)

(solid red curve) and

{\tilde{V}}_{↑ ↓} (k)

against the dimensionless wavevector k. The symbols

R e

and

I m

correspond to the real (blue dashed curve) and imaginary parts (green dot-dashed curve) of the complex Fourier transform, respectively.

Figure 13. Plots of the Fourier transforms

{\tilde{V}}_{↑ ↑} (k)

(solid red curve) and

{\tilde{V}}_{↑ ↓} (k)

against the dimensionless wavevector k. The symbols

R e

and

I m

correspond to the real (blue dashed curve) and imaginary parts (green dot-dashed curve) of the complex Fourier transform, respectively.

Figure 14. Plots of the eigenvalues

λ_{k \pm}

of the

2 \times 2

blocks of

\tilde{V} (k)

Figure 14. Plots of the eigenvalues

λ_{k \pm}

of the

2 \times 2

blocks of

\tilde{V} (k)

Figure 15. A plot of the differences

δ {〈 n^{(γ)} 〉}_{c}

in mean-field average occupation numbers from their noninteracting values for ‖ (blue, solid),

γ = ⊥

(red, long-dashed), and v (green, short-dashed). The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. The noninteracting results were shown in Figure 5.

Figure 15. A plot of the differences

δ {〈 n^{(γ)} 〉}_{c}

in mean-field average occupation numbers from their noninteracting values for ‖ (blue, solid),

γ = ⊥

(red, long-dashed), and v (green, short-dashed). The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. The noninteracting results were shown in Figure 5.

Figure 16. Plots of the differences in the charge densities and standard deviations of the charge density for the mean field of the interacting lattice gas model and the noninteracting model.

δ ρ_{c} = ρ_{c} - ρ_{NI}

is shown as the solid blue curve, and

δ σ_{c} = σ_{c} - σ_{NI}

is shown as the dashed red curve. The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. The noninteracting results were shown in Figure 6.

Figure 16. Plots of the differences in the charge densities and standard deviations of the charge density for the mean field of the interacting lattice gas model and the noninteracting model.

δ ρ_{c} = ρ_{c} - ρ_{NI}

is shown as the solid blue curve, and

δ σ_{c} = σ_{c} - σ_{NI}

is shown as the dashed red curve. The parameters used in the calculation are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

. The noninteracting results were shown in Figure 6.

Figure 17. The functional form of mean-field entropy

\frac{S^{0}}{k_{B} N_{sites}}

for the interacting lattice gas model at the mean field level. The constituent parts of the entropy are shown as well.

Figure 17. The functional form of mean-field entropy

\frac{S^{0}}{k_{B} N_{sites}}

for the interacting lattice gas model at the mean field level. The constituent parts of the entropy are shown as well.

Figure 18. The difference between the mean-field entropy for the interacting lattice gas model and the entropy for the noninteracting model.

Figure 19. The inverse propagator for the fluctuations of the auxiliary Hartree field

{({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})}

Figure 19. The inverse propagator for the fluctuations of the auxiliary Hartree field

{({\tilde{D}}^{- 1})}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})}

Figure 20. The propagator for the fluctuations of the auxiliary Hartree field

{\tilde{D}}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})}

Figure 20. The propagator for the fluctuations of the auxiliary Hartree field

{\tilde{D}}_{(k_{1}, σ_{1}), (k_{2}, σ_{2})}

Figure 21. The changes in the occupancies

δ 〈 n^{(γ)} 〉

due to fluctuations. These should be compared with the changes between occupancies of the mean-field and noninteracting systems shown in Figure 15.

Figure 21. The changes in the occupancies

δ 〈 n^{(γ)} 〉

due to fluctuations. These should be compared with the changes between occupancies of the mean-field and noninteracting systems shown in Figure 15.

Figure 22. The changes in the charge density

δ ρ_{tot}

and in the standard deviation

δ σ_{tot}

of the charge density due to fluctuations. These are further corrections to the mean-field values of Figure 16.

Figure 22. The changes in the charge density

δ ρ_{tot}

and in the standard deviation

δ σ_{tot}

of the charge density due to fluctuations. These are further corrections to the mean-field values of Figure 16.

Figure 23. The fluctuation contribution to the entropy

{\bar{S}}_{f} = \frac{S_{f}}{k_{B} N_{sites}}

per site. The two contributions,

{\bar{S}}_{f}^{(A)}

and

{\bar{S}}_{f}^{(B)}

, to the total entropy are also shown, and they are defined similarly.

Figure 23. The fluctuation contribution to the entropy

{\bar{S}}_{f} = \frac{S_{f}}{k_{B} N_{sites}}

per site. The two contributions,

{\bar{S}}_{f}^{(A)}

and

{\bar{S}}_{f}^{(B)}

, to the total entropy are also shown, and they are defined similarly.

Figure 24. The average occupation numbers in the noninteracting approximation (

{〈 n^{(γ)} 〉}_{N I}

), shown as dotted curves, the mean-field approximation (

{〈 n^{(γ)} 〉}_{c}

), shown as dashed curves, and the model including the lowest order corrections to mean field, shown as solid curves, plotted for

γ = ‖

(blue), ⊥ (red), and v(green). Parameters used in the plot are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

Figure 24. The average occupation numbers in the noninteracting approximation (

{〈 n^{(γ)} 〉}_{N I}

), shown as dotted curves, the mean-field approximation (

{〈 n^{(γ)} 〉}_{c}

), shown as dashed curves, and the model including the lowest order corrections to mean field, shown as solid curves, plotted for

γ = ‖

(blue), ⊥ (red), and v(green). Parameters used in the plot are

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

Figure 25. The average charge per site in the noninteracting

{〈 ρ 〉}_{N I}

(green, short dashes), mean-field (blue, long dashes) and inclusion of fluctuations (red, solid). This plot assumes

β μ_{dimer} = 0.79

and

ε^{(‖)} = 2 ε^{(⊥)}

Figure 25. The average charge per site in the noninteracting

{〈 ρ 〉}_{N I}

(green, short dashes), mean-field (blue, long dashes) and inclusion of fluctuations (red, solid). This plot assumes

β μ_{dimer} = 0.79

and

ε^{(‖)} = 2 ε^{(⊥)}

Figure 26. The entropy S per site, in units of the Boltzmann constant

k_{B}

, in the noninteracting (green, short dashes), mean-field (blue, long dashes), and and inclusion of fluctuations (red, solid) approximations. These curves assume

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

Figure 26. The entropy S per site, in units of the Boltzmann constant

k_{B}

, in the noninteracting (green, short dashes), mean-field (blue, long dashes), and and inclusion of fluctuations (red, solid) approximations. These curves assume

β μ_{dimer} = 0.79

and

ε_{‖} = 2 ε_{⊥}

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.