MIT8 09F14 Chapter 1
MIT8 09F14 Chapter 1
MIT8 09F14 Chapter 1
1.1 Introduction
These lecture notes cover the third course in Classical Mechanics, taught at MIT since
the Fall of 2012 by Professor Stewart to advanced undergraduates (course 8.09) as well as
to graduate students (course 8.309). In the prerequisite classical mechanics II course the
students are taught both Lagrangian and Hamiltonian dynamics, including Kepler bound
motion and central force scattering, and the basic ideas of canonical transformations. This
course briefly reviews the needed concepts, but assumes some familiarity with these ideas.
References used for this course include
• Landau and Lifshitz vol.6, Fluid Mechanics. Symon, Mechanics for reading material
on non-viscous fluids.
• Review: Landau & Lifshitz vol.1, Mechanics. (Typically used for the prerequisite
Classical Mechanics II course and hence useful here for review)
1
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
Velocities are represented through vi ≡ ṙi for spatial coordinates, or through q̇i for
generalized coordinates. Note that dots above a symbol will always denote the total time
derivative dtd . Momenta are likewise either Newtonian pi = mi vi or generalized pi .
For a fixed set of masses mi Newton’s 2nd law can be expressed in 2 equivalent ways:
1. It can be expressed as N second-order equations Fi = dtd (mi ṙi ) with 2N boundary
conditions given in ri (0) and ṙi (0). The problem then becomes one of determining the
N vector variables ri (t).
2. It can also be expressed as an equivalent set of 2N 1st order equations Fi = ṗi &
pi /mi = ṙi with 2N boundary conditions given in ri (0) and pi (0). The problem then
becomes one of determining the 2N vector variables ri (t) and pi (t).
Note that F = ma holds in inertial frames. These are frames where the motion of a
particle not subject to forces is in a straight line with constant velocity. The converse does not
hold. Inertial frames describe time and space homogeneously (invariant to displacements),
isotropically (invariant to rotations), and in a time independent manner. Noninertial frames
also generically have fictitious “forces”, such as the centrifugal and Coriolis effects. (Inertial
frames also play a key role in special relativity. In general relativity the concept of inertial
frames is replaced by that of geodesic motion.)
The existence of an inertial frame is a useful approximation for working out the dynam-
ics of particles, and non-inertial terms can often be included as perturbative corrections.
Examples of approximate inertial frames are that of a fixed Earth, or better yet, of fixed
stars. We can still test for how noninertial we are by looking for fictitious forces that (a) may
point back to an origin with no source for the force or (b) behave in a non-standard fashion
in different frames (i.e. they transform in a strange manner when going between different
frames).
We will use primes will denote coordinate transformations. If r is measured in an inertial
frame S, and r0 is measured in frame S 0 with relation to S by a transformation r0 = f (r, t),
then S 0 is inertial iff r̈ = 0 ↔ r̈0 = 0. This is solved by the Galilean transformations,
r0 = r + v0 t
t0 = t,
which preserves the inertiality of frames, with F = mr̈ and F0 = mr̈0 implying each other.
Galilean transformations are the non-relativistic limit, v c, of Lorentz transformations
which preserve inertial frames in special relativity. A few examples related to the concepts
of inertial frames are:
1. In a rotating frame, the transformation is given by
0
x cos(θ) sin(θ) x
=
y0 − sin(θ) cos(θ) y
If θ = ωt for some constant ω, then r̈ = 0 still gives r̈0 6= 0, so the primed frame is
noninertial.
2
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
Even if ¨r = 0 we can still have r̈ =6 0 and θ¨ 6= 0, and we can not in general form
a simple Newtonian force law equation mq̈ = Fq for each of these coordinates. This
is different than the first example, since here we are picking coordinates rather than
changing the reference frame, so to remind ourselves about their behavior we will call
these ”non-inertial coordinates” (which we may for example decide to use in an inertial
frame). In general, curvilinear coordinates are non-inertial.
Lagrangian Mechanics
In Lagrangian mechanics, the key function is the Lagrangian
L = L(q, q̇, t). (1.3)
Here, q = (q1 , . . . , qN ) and likewise q̇ = (q̇1 , . . . , q̇N ). We are now letting N denote the
number of scalar (rather than vector) variables, and will often use the short form to denote
dependence on these variables, as in Eq. (1.3). Typically we can write L = T − V where
T is the kinetic energy and V is the potential energy. In the simplest cases, T = T (q̇)
and V = V (q), but we also allow the more general possibility that T = T (q, q̇, t) and
V = V (q, q̇, t). It turns out, as we will discuss later, that even this generalization does not
describe all possible classical mechanics problems.
The solution to a given mechanical problem is obtained by solving a set of N second-order
differential equations known as Euler-Lagrange equations of motion,
d ∂L ∂L
− = 0. (1.4)
dt ∂q̇i ∂qi
3
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
These equations involve q̈i , and reproduce the Newtonian equations F = ma. The principle
of stationary action (Hamilton’s principle),
Z t2
δS = δ L(q, q̇, t) dt = 0, (1.5)
t1
is the starting point for deriving the Euler-Lagrange equations. Although you have covered
the Calculus of Variations in an earlier course on Classical Mechanics, we will review the
main ideas in Section 1.5.
There are several advantages to working with the Lagrangian formulation, including
1. It is easier to work with the scalars T and V rather than vectors like F.
2. The same formula in equation (1.4) holds true regardless of the choice of coordinates.
To demonstrate this, let us consider new coordinates
Therefore
∂Q˙ ∂Q
= , (1.10)
∂q̇ ∂q
4
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
∂Q ∂L0 ∂Q
Since ∂q̇
= 0 there is no term∂Q ∂q̇
in the last line.
Plugging these results into 0 = dtd ∂∂q̇L − ∂L
∂q
gives
∂ L0 ∂ L0
d
0= − . (1.13)
dt ∂Q˙ ∂Q
• This implies we can freely use the Euler-Lagrange equations for noninertial coor-
dinates.
• We can formulate L in whatever coordinates are easiest, and then change to
convenient variables that better describe the symmetry of a system (for example,
Cartesian to spherical).
3. Continuing our list of advantages for using L, we note that it is also easy to incorporate
constraints. Examples include a mass constrained to a surface or a disk rolling without
slipping. Often when using L we can avoid discussing forces of constraint (for example,
the force normal to the surface).
Lets discuss the last point in more detail (we will also continue to discuss it in the next
section). The method for many problems with constraints is to simply make a good choice for
the generalized coordinates to use for the Lagrangian, picking N − k independent variables
qi for a system with k constraints.
Example: For a bead on a helix as in Fig. 1.2 we only need one variable, q1 = z.
Example: A mass m2 attached by a massless pendulum to a horizontally sliding mass m1
as in Fig. 1.3, can be described with two variables q1 = x and q2 = θ.
Example: As an example using non-inertial coordinates consider a potential V = V (r, θ)
in polar coordinates for a fixed mass m at position r = rr̂. Since ṙ = ṙr̂ + rθθ ˙ ˆ we have
T = m2 ṙ2 = m2 ṙ2 + r2 θ˙2 , giving
m 2
L= ṙ + r2 θ˙2 − V (r, θ). (1.14)
2
5
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
Figure 1.3: Pendulum of mass m2 hanging on a rigid bar of length ` whose support m1 is a
frictionless horizontally sliding bead
This gives
∂V
mr̈ − mrθ˙2 = − = Fr , (1.16)
∂r
6 mr̈. For θ the Euler-Lagrange equation is
from which we see that Fr =
d ∂L ∂L d 2 ˙ ∂ V
0= − = mr θ + . (1.17)
dt ∂θ˙ ∂θ dt ∂θ
This gives
d 2 ˙ ∂V
mr θ = − = Fθ , (1.18)
dt ∂θ
which is equivalent to the relation between angular momentum and torque perpendicular to
the plane, L˙ z = Fθ = τz . (Recall L = r × p and τ = r × F.)
6
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
Example: Let us consider a particle rolling due to gravity in a frictionless cone, shown
in
p Fig. 1.4, whose opening angle α defines an equation for points on the cone tan(α) =
x2 + y 2 /z. There are 4 steps which we can take to solve this problem (which are more
general than this example):
1. Formulate T and V by N = 3 generalized coordinates. Here it is most convenient
to
m 2 2 ˙2 2
choose cylindrical coordinates denoted (r, θ, z), so that T = 2 ṙ + r θ + ż and
V = mgz.
3. Find the Euler-Lagrange equations. For r, 0 = dtd ∂∂ṙL − ∂∂rL , which here is
d
m 1 + cot2 α ṙ − mrθ˙2 + mg cot α
0= (1.20)
dt
giving
1 + cot2 α r̈ − rθ˙2 + g cot α = 0.
(1.21)
d ∂L ∂L
For θ we have 0 = dt ∂θ˙
− ∂θ
, so
d 2
0= mr θ̇ − 0, (1.22)
dt
giving
(2ṙθ˙ + rθ)r
¨ = 0. (1.23)
7
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
Hamiltonian Mechanics
∂L
In Hamiltonian mechanics, the canonical momenta pi ≡ ∂q̇ i
are promoted to coordinates
on equal footing with the generalized coordinates qi . The coordinates (q, p) are canonical
variables, and the space of canonical variables is known as phase space.
∂L
The Euler-Lagrange equations say ṗi = ∂q i
. These need not equal the kinematic momenta
mi q̇i if V = V (q, q̇). Performing the Legendre transformation
(where for this equation, and henceforth, repeated indices will imply a sum unless otherwise
specified) yields the Hamilton equations of motion
∂H
q̇i = (1.25)
∂pi
∂H
ṗi = −
∂qi
which are 2N 1st order equations. We also have the result that
∂H ∂L
=− . (1.26)
∂t ∂t
Proof: (for N = 1) Consider
∂H ∂H ∂H
dH = dq + dp + dt (1.27)
∂q ∂p ∂t
∂L ∂L ∂L
= pdq̇ + q̇dp − dq − dq̇ − dt . (1.28)
∂q ∂q̇ ∂t
∂L ∂L
Since we are free to independently vary dq, dp, and dt this implies ∂q̇
= p, ∂q
= ṗ, and
∂H
∂t
= − ∂L
∂t
.
We can interpret the two Hamilton equations as follows:
∂H ∂L
• q̇i = ∂pi
is an inversion of pi = ∂q̇i
= pi (q, q̇, t).
• ṗi = − ∂H
∂qi
provides the Newtonian dynamics.
However, these two equation have an have equal footing in Hamiltonian mechanics, since
∂L
the coordinates and momenta are treated on a common ground. We can use pi = ∂q̇ i
to
construct H from L and then forget about L.
8
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
Here we consider transforming between L and H either before or after removing the redun-
dant coordinate z, but in this course we will only consider constraints imposed on Lagrangians
and not in the Hamiltonian formalism (the step indicated by =⇒). For the curious, the topic
of imposing constraints on Hamiltonians, including even more general constraints than those
we will consider, is covered well in Dirac’s little book “Lectures on Quantum Mechanics”.
Although Hamiltonian and Lagrangian mechanics provide equivalent formalisms, there is of-
ten an advantage to using one or the other. In the case of Hamiltonian mechanics potential
advantages include the language of phase space with Liouville’s Theorem, Poisson Brackets
and the connection to quantum mechanics, as well as the Hamilton-Jacobi transformation
theory (all to be covered later on).
Special case: Let us consider a special case that is sufficient to imply that the Hamiltonian
is equal to the energy, H = E ≡ T + V . If we only have quadratic dependence on velocities
in the kinetic energy, T = 21 Tjk (q)q̇j q̇k , and V = V (q) with L = T − V , then
∂L 1 1
q̇i pi = q̇i = q̇i T ikq̇k + q̇j Tji q̇i = 2T. (1.30)
∂q̇i 2 2
Hence,
H = q̇i pi − L = T + V = E (1.31)
which is just the energy.
9
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
vectors from the left imply transposition of that vector. Note that ~q is an unusual vector,
since its components can have different dimensions, eg. ~q = (x, θ), but nevertheless this
notation is useful. To find H,
∂L
pj = = aj + Tjk q̇k , (1.34)
∂q̇j
meaning p~ = ~a + Tˆ · ~q˙. Inverting this gives ~q˙ = Tˆ−1· (p~ − ~a), where Tˆ−1 will exist because
of the positive-definite nature of kinetic energy, which implies that Tˆ is a postive definite
matrix. Thus, H = ~q˙ · p~ − L yields
1
H = (p~ − ~a) · Tˆ−1 · (p~ − ~a) − L0 (q, t) (1.35)
2
as the Hamiltonian. So for any Lagrangian in the form of Eq. (1.32), we can find Tˆ−1 and
write down the Hamiltonian as in Eq. (1.35) immediately.
A more detailed Example. Find L and H for the frictionless pendulum shown in Fig. 1.3.
This system has two constraints, that m1 is restricted to lie on the x-axis sliding without
friction, and that the rod between m1 and m2 is rigid, giving
y1 = 0 , (y1 − y2 )2 + (x1 − x2 )2 = `2 . (1.37)
Prior to imposing any constraints the Lagrangian is
m1 2 m2 2
L=T −V = ẋ + (ẋ + ẏ22 ) − m2 gy2 − m1 gy1 . (1.38)
2 1 2 2
Lets choose to use x ≡ x1 and the angle θ as the independent coordinates after imposing
the constraints in Eq. (1.37). This allows us to eliminate y1 = 0, x2 = x + ` sin θ and
y2 = −` cos θ, together with ẋ2 = ẋ + ` cos θ θ, ˙ ẏ2 = ` sin θ θ,
˙ ẋ1 = ẋ. The Lagrangian with
constraints imposed is
m1 2 m2 2
ẋ + 2` cos θ ẋθ˙ + `2 cos2 θ θ˙2 + `2 sin2 θ θ˙2 + m2 g` cos θ .
L= ẋ + (1.39)
2 2
Next we determine the Hamiltonian. First we find
∂L ˙ = (m1 + m2 )ẋ + m2 ` cos θ θ˙ ,
px = = m1 ẋ + m2 (ẋ + ` cos θθ) (1.40)
∂ẋ
∂L
pθ = = m2 ` cos θ ẋ + m2 `2 θ˙ .
∂θ˙
10
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
∂H
ṗx = − = 0.
∂x
As we might expect, the CM momentum is time independent. Next for (θ, pθ ):
∂H 1 h i
θ̇ = = (m 1 + m 2 )pθ − m 2 ` cos θp x , (1.45)
∂pθ m2 `2 (m1 + m2 sin2 θ)
∂H sin θ cos θ h
2 2 2
i
ṗθ = − = 2 m2 ` p x + (m 1 + m 2 )p θ − 2m 2 ` cos θp x p θ
∂θ ` (m1 + m2 sin2 θ)2
sin θpx pθ
− m2 g` sin θ − .
`(m1 + m2 sin θ)
These non-linear coupled equations are quite complicated, but could be solved in math-
ematica or another numerical package. To test our results for these equations of motion
analytically, we can take the small angle limit, approximating sin θ ' θ, cos θ ' 1 to obtain
px pθ 1 h i
ẋ = − , ṗx = 0 , θ˙ = ( m1 + m )p
2 θ − m2 x ,
`p
m1 `m1 m1 m2 `2
θ h 2 2 2
i θp p
x θ
ṗθ = 2 2 m2 ` px + (m1 + m2 )pθ − 2m2 ` cos θpx pθ − − m2 g`θ . (1.46)
` m1 `m1
11
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
To simplify it further we can work in the CM frame, thus setting px = 0, and linearize the
equations by noting that pθ ∼ θ̇ should be small for θ to remain small, and hence θp2θ is a
higher order term. For the non-trivial equations this leaves
pθ pθ
ẋ = − , θ˙ = 2 , ṗθ = −m2 g`θ , (1.47)
`m1 µ`
where µ = m1 m2 /(m1 + m2 ) is the reduced mass for the two-body system. Thus θ¨ =
ṗθ /(µ`2 ) = − mµ2`g θ as expected for simple harmonic motion.
FR = −∇R V = 0 (1.48)
Note that i mi r0i = 0 from the definitions of M , R, and r0i , so T splits into two
P
terms, one for the CM motion and one for relative motion. We also observe that T is
independent of R. This means that R is cyclic for the full Lagrangian L, so P = M R ˙
is a conserved quantity. In our study of rigid bodies we will also need the forms of M
and R for a conRtinuous body with massR distribution ρ(r), which for a three dimensional
body are M = d3 r ρ(r) and R = M1 d3 r ρ(r) r.
12
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
2. Let us consider a system that is invariant with respect to rotations of angle φ about a
symmetry axis. This has a conserved angular momentum. If we pick φ as a generalized
coordinate, then L = T −V is independent of φ, so ṗφ = ∂L
∂φ
= 0 meaning pφ is constant.
In particular, for a system where V is independent of the angular velocity φ˙ we have
∂T X ∂ṙi X ∂ ri
pφ = = mi ṙi · = mi vi · . (1.51)
∂ϕ̇ i
∂ϕ̇ i
∂ϕ
Note
P that L about the CM is conserved for systems with no external torque,
ext ext
τ = i ri ×Fi = 0 and internal forces that are all central. Defining rij ≡ ri −rj and
its magnitude appropriately, this means Vij = Vij (rij ). This implies that Fji = −∇i Vij
(no sum on the repeated index) is parallel to rij . Hence,
dL X X X
= ri × ṗi = ri × Fext
i + ri × Fji . (1.53)
dt i i i,j
13
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
ri × Fext
P
However, i i = 0, so
dL X
= rij × Fji = 0. (1.54)
dt i<j
3. One can also consider a scaling transformation. Suppose that under the transformation
ri → λri the potential is homogeneous and transforms as V → λk V for some constant
k. Letting T be quadratic in ṙi and taking time to transform as t → λ1−k/2 t then
gives ṙi → λk/2 ṙi . So by construction T → λk T also, and thus the full Lagrangian
L → λk L. This overall factor does not change the Euler-Lagrange equations, and hence
the transformation is a symmetry of the dynamics, only changing the overall scale or
units of the coordinate and time variables, but not their dynamical relationship. This
can be applied for several well known potentials:
a) k = 2 for a harmonic oscillator. Here the scaling for time is given by 1 − k/2 = 0,
so it does not change with λ. Thus, the frequency of the oscillator, which is a
time variable, is independent of the amplitude.
b) k = −1 for the Coulomb potential. Here 1−k/2 = 3/2 so there is a more intricate
relation between coordinates and time. This power is consistent with the behavior
of bound state orbits, where the period of the orbit T obeys T 2 ∝ a3 , for a the
semi-major axis distance (Kepler’s 3rd law).
√ Here 1 − k/2 = 1/2 so for a freely falling
c) k = 1 for a uniform gravitational field.
object, the time of free fall goes as h where h is the distance fallen.
5. Let us consider the conservation of energy and the relationship between energy and
the Hamiltonian. Applying the time derivative gives H˙ = ∂H
∂q
q̇ + ∂H
∂p
ṗ + ∂H
∂t
. However,
q̇ = ∂H
∂p
and ṗ = − ∂H
∂q
. Thus
∂H ∂L
Ḣ = =− . (1.55)
∂t ∂t
There are two things to consider.
If H = E then the two points are equivalent, but otherwise either of the two could be
true while the other is false.
14
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
of m from the fixed wall and x0 is its displacement from the center of the moving cart.
Using x,
m k
L(x, ẋ) = T − V = ẋ2 − (x − v0 t)2 , (1.56)
2 2
where the kinetic term is quadratic in ẋ and the potential term is independent of ẋ.
This means that H falls in the special case considered in Eq. (1.31) so
p2 k
H =E =T +V = + (x − v0 t)2 , (1.57)
2m 2
However ∂H∂t
6= 0 so the energy is not conserved. (Of course the full energy would
be conserved, but we have not accounted for the energy needed to pull the cart at a
constant velocity, treating that instead as external to our system. That is what led to
the time dependent H.)
If we instead choose to use the coordinate x0 = x − v0 t, then
m 02 m k
L0 (x0 , ẋ0 ) = ẋ + mv0 x0 + v02 − x02 . (1.58)
2 2 2
Note that p0 = mẋ0 +mv0 = mẋ = p. This Lagrangian fits the general form in equation
(1.32) with a = mv0 and L0 = mv02 /2 − kx02 /2. So
1 2 k m
H 0 (x0 , p0 ) = ẋ0 p0 − L0 = (p0 − mv0 ) + x02 − v02 , (1.59)
2m 2 2
15
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
Here the last terms is a constant shift. The first and second terms in this expression for
H 0 look kind of like the energy that we would calculate if we were sitting on the cart
and did not know it was moving, which is not the same as the energy above. Hence,
0
H 0 6= E, but H˙ 0 = 0 because ∂H
∂t
= 0, so H 0 is conserved.
16
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
disk never tips over. The no-slip condition is v = aφ˙ where v is the velocity of the center of
the disk, and v = |v|. This means ẋ = v sin(θ) = a sin(θ)φ˙ and ẏ = −v cos(θ) = −a cos(θ)φ,˙
or in differential notation, dx − a sin(θ)dφ = 0 and dy + a cos(θ)dφ = 0.
In general, constraints of the form
X
aj (q)dqj + at (q)dt = 0 (1.61)
j
are not holonomic. We will call this a semi-holonomic constraint, following the terminology
of Goldstein.
Let us consider the special case of a holonomic constraint in differential form, f (q1 , ..., q3N , t) =
0. This means
X ∂f ∂f
df = dqj + dt = 0, (1.62)
j
∂qj ∂t
17
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
∂f ∂f
so aj = ∂qj
and at = ∂t
. The symmetry of mixed partial derivatives means
These conditions imply that a seemingly semi-holonomic constraint is in fact holonomic. (In
math we would say that we have an exact differential form df for the holonomic case, but
the differential form in Eq.(1.61) need not always be exact.)
Example: To demonstrate that not all semiholonomic constrants are secretly holo-
nomic, consider the constraint in the example of the vertical disk. Here there is no func-
tion h(x, y, θ, φ) that we can multiply the constraint df = 0 by to make it holonomic.
For the vertical disk from before, we could try (dx − a sin(θ) dφ)h = 0 with ax = h,
∂a
aφ = −a sin(θ)h, aθ = 0, and ay = 0 all for some function h. As we must have ∂θφ = ∂a ∂φ
θ
,
∂h k ∂ax ∂aθ ∂h
then 0 = −a cos(θ) − a sin(θ) ∂θ , so h = sin(θ) . That said, ∂θ = ∂x gives ∂θ = 0 which is a
contradiction for a non-trivial h with k 6= 0.
If the rolling is instead constrained to a line rather than a plane, then the constraint is
holonomic. Take as an example θ = π2 for rolling along x̂, then ẋ = aφ˙ and ẏ = 0. Integrating
we have x = aϕ + x0 , y = y0 , and θ = π2 , which together form a set of holonomic constraints.
A useful concept for discussing constraints is that of the virtual displacement δri of
particle i. There are a few properties to be noted of δri .
• It is infinitesimal.
• It is carried out at a fixed time (so time dependent constraints do not change its form).
Example: let us consider a bead constrained to a moving wire. The wire is oriented along
the x-axis and is moving with coordinate y = v0 t. Here the virtual displacement of the
18
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
bead δr is always parallel to x̂ (since it is determined at a fixed time), whereas the real
displacement dr has a component along ŷ in a time interval dt.
For a large number of constraints, the constraint force Zi is Rperpendicular to δri , meaning
Zi · δri = 0, so the “virtual work” (in analogy to work W = F · dr) Pof a constraint force
vanishes. More generally, there is no net work from constraints, so i Zi · δri = 0 (which
holds for the actions of surfaces, rolling constraints, and similar things). The Newtonian
equation of motion is ṗi = Fi + Zi , where Fi encapsulates other forces. Vanishing virtual
work gives X
(ṗi − Fi ) · δri = 0 (1.64)
i
which is the D’Alembert principle. This could be taken as the starting principal for classical
mechanics instead of the Hamilton principle of stationary action.
Of course Eq.(1.64) is not fully satisfactory since we are now used to the idea of working
with generalized coordinates rather than the cartesian vector coordinates used there. So lets
∂ ri
transform to generalized coordinates through ri = ri (q, t), so δri = ∂q j
δqj , where again we
sum over repeated indicies (like j here). This means
∂ri
Fi · δri = Fi · δqj ≡ Qj δqj (1.65)
∂qj
where we have defined generalized forces
∂ri
Qj ≡ Fi · . (1.66)
∂qj
We can also transform the ṗi · δri term using our earlier point transformation results as well
∂ 2 ri
d ∂ ri
P ∂ 2 ri ∂vi
as the fact that dt ∂qj = ∂q j ∂t
+ k ∂qj ∂qk q̇k = ∂qj . Writing out the index sums explicitly,
this gives
X X ∂ri
ṗi · δri = mi r̈i · δqj
i i,j
∂qj
X d ∂ ri
d ∂ri
= mi ṙi · − mi ṙi · δqj
i,j
dt ∂q j dt ∂q j
X d ∂vi
∂ vi
= mi vi · − mi vi · δ qj
i,j
dt ∂q̇j ∂qj
X d ∂T ∂T
= − δqj (1.67)
j
dt ∂q̇ j ∂q j
for T = 12 i mi vi2 . Together with the D’Alembert principle, we obtain the final result
P
X d ∂T ∂T
− − Qj δqj = 0. (1.68)
j
dt ∂q̇j ∂qj
19
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
We will see momentarily that this result is somewhat more general than the Euler-Lagrange
equations, containing them as a special case.
We will start by considering systems with only holonomic constraints, postponing other
types of constraints to the next section. Here we can find the independent coordinates qj
with j = 1, . . . , N − k that satisfy the k constraints. This implies that the generalized virtual
displacements δqj are independent, so that their coefficients in Eq. (1.68) must vanish,
d ∂T ∂T
− − Qj = 0 . (1.69)
dt ∂q̇j ∂qj
There are several special cases of this result, which we derived from the d’Alembert principle.
ri ∂V
Qj = −(∇i V ) · =− (1.70)
∂qj ∂qj
where we assume that the potential can be expressed in the generalized coordinates
as V = V (q, t). Then using L = T − V , we see that Eq. (1.69) simply reproduces the
d ∂L ∂L
Euler-Lagrange equations dt ∂q̇j − ∂q j
= 0.
∂V d ∂V
2. If Qj = − ∂qj
+ dt
for V = V (q, q̇, t), which is the case for velocity dependent
∂q̇j
forces derivable from a potential
(like the electromagnetic Lorentz force), then the
d ∂L ∂L
Euler-Lagrange equations dt ∂q̇j
− ∂qj
= 0 are again reproduced.
• Sliding friction is F = −µFN vv , so this is a constant force that is always opposite the
direction of motion (but vanishes when there is no motion).
20
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
A general form for a friction force is Fi = −hi (vi ) vvii (where as a reminder there is no
implicit sum on i here since we specified i on the right-hand-side). For this form
X vi ∂rj X vi ∂vi
Rj = − hi · =− hi · . (1.72)
i
vi ∂qj i
vi ∂q̇j
∂F
=− (1.73)
∂q̇j
where vi
XZ
F= dvi0 hi (vi0 ) (1.74)
i 0
is the “dissipation function”. This is a scalar function like L so it is relatively easy to work
with.
Example: Consider a sphere of radius a and mass m falling in a viscous fluid. Then
T = 12 m0 ẏ 2 where m0 < m accounts for the mass of displaced fluid (recall Archimedes princi-
ple that the buoyant force on a body is equal to the weight of fluid the body displaces). Also
V = m0 gy, and L = T −V . Here h ∝ ẏ, so F = 3πηaẏ 2 , where by the constant
of proportion-
ality is determined by the constant η, which is the viscosity. From this, dt ∂∂ẏL − ∂∂yL = − ∂F
d
∂ẏ
gives the equation of motion m0 ÿ + m0 g = −6πηaẏ. The friction force 6πηaẏ is known as
Stokes Law. (We will derive this equation for the friction force from first principles later
on, in our discussion of fluids.) This differential equation can be solved by adding a par-
ticular solution yp (t) to a solution of the homogeneous equation m0 ÿH + 6πηaẏH = 0. For
the time derivatives the results are ẏp = −m0 g/(6πηa) and ẏH = A exp(−6πηat/m0 ), where
the constant A must be determined by an initial condition. The result ẏ = ẏH + ẏp can be
integrated in time once more to obtain the full solution y(t) for the motion.
Example: if we add sliding friction to the case of two masses on a plane connected by
a spring (considered on problem set #1), then hi = µf mi g for some friction coefficient µf ,
and q q
F = µf g(m1 v1 + m2 v2 ) = µf g m1 ẋ21 + ẏ12 + m2 ẋ22 + ẏ22 . (1.75)
21
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
Using integration by parts on the second term, and the vanishing of the variation at the
R s2 P h ∂f d ∂f
i
endpoints to remove the surface term, δJ vanishes when s1 i ∂yi − ds ∂ẏi δyi (s) ds =
0. For independent variations δyi (for example, after imposing holonomic constraints), this
can only occur if
∂f d ∂f
− = 0. (1.78)
∂yi ds ∂ẏi
The scope of this calculus of variation result for extremizing the integral over f is more
general than its application to classical mechanics.
Example: Hamilton’s principle states that motion qi (t) extremizes the action, so in this
case s = t, yi = qi , f = L, and J = S. Demanding δS = 0 then yields the Euler-Lagrange
equations of motion from Eq. (1.78).
Example: As an example outside of classical mechanics, consider showing that the shortest
distance between points on aR sphere of radius a are great circles. This can be seen by
s
minimizing the distance J = s12 ds where for a spherical surface,
p q
ds = (dx) + (dy) + (dz) = a2 (dθ)2 + a2 sin2 (θ)(dφ)2
2 2 2 (1.79)
22
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
q
so f = 1 + sin2 (θ)φ˙ 2 . The solution for the minimal path is given by solving dθ
d ∂f
∂ϕ̇
∂f
− ∂ϕ =
0. After some algebra these are indeed found to be great circles, described by sin(φ − α) =
β cot(θ) where α, β are constants.
Example: Hamilton’s principle can also be used to yield the Hamilton equations of motion,
by considering the variation of a path in phase space. In this case
Z t2 h i
δJ[q, p] = δ dt pi q̇i − H(q, p, t) = 0 (1.81)
t1
must be solved with fixed endpoints: δqi (t1 ) = δqi (t2 ) = 0 and δpi (t1 ) = δpi (t2 ) = 0. Here,
the role of yi , of is played by the 2N variables (q1 , . . . , qN , p1 , . . . , pN ). As f = pi q̇i − H, then
d ∂f ∂f ∂H
− =0 =⇒ ṗi = − , (1.82)
dt ∂q̇i ∂qi ∂qi
d ∂f ∂f ∂H
− =0 =⇒ q̇i = ,
dt ∂ṗi ∂pi ∂pi
giving Hamilton’s equations as expected. Note that because f is independent of ṗi , the
term (∂f /∂ṗi )δṗi = 0, and it would seem that we do not really need the condition that
δpi (t1 ) = δpi (t2 ) = 0 to remove the surface term. However, these conditions on the variations
δpi are actually required in order to put qi and pi on the same footing (which we will exploit
later in detail when discussing canonical transformations).
It is interesting and useful to note that D’Alembert’s principle
d ∂L ∂L
− − Rj δqj = 0 (1.83)
dt ∂q̇j ∂ qj
is a “differential” version of the equations that encode the classical dynamics, while Hamil-
ton’s principle Z t2
∂L d ∂L
δJ = dt − δqj = 0 (1.84)
t1 ∂qj dt ∂ q̇j
(for Rj = 0 where all forces come from a potential) is an integrated version.
23
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
Let us consider k constraints for n coordinates, with α ∈ {1, . . . , k} being the index
running over the constraints. These holonomic or semi-holonomic constraints take the form
gα (q, q̇, t) = ajα (q, t)q̇j + atα (q, t) = 0 (1.85)
where again repeated indices are summed. Thus, gα dt = ajα dqj + atα dt = 0. For a virtual
displacement δqj we have dt = 0, so
n
X
ajα δqj = 0 , (1.86)
j=1
which gives us k equations constraining the virtual displacements. For each equation we
can multiply by a function λα (t) known as Lagrange multipliers, and sum over α, and the
combination will still be zero. Adding this zero to D’Alembert’s principle yields
d ∂L ∂L
− − λα ajα δqj = 0 (1.87)
dt ∂q̇j ∂ qj
where the sums implicitly run over both α and j. Its clear that the Lagrange multiplier
term is zero if we sum over j first, but now we want to consider summing first over α for
a fixed j. Our goal is make the term in square brackets zero. Only n − k of the virtual
displacements δqj are independent, so for these values of j the square brackets must vanish.
For the remaining k values of j we can simply choose the k Lagrange multipliers λα to force
the k square bracketed equations to be satisfied. This is known as the method of Lagrange
multipliers. Thus all square bracketed terms are zero, and we have the generalization of the
Euler-Lagrange equations which includes terms for the constraints:
d ∂L ∂L
− = λα ajα . (1.88)
dt ∂q̇j ∂qj
This is n equations, for the n possible values of j, and on the right-hand-side we sum over α
for each one of these equations. The sum λα ajα can be interpreted as a generalized constraint
force Qj . The Lagrange multipliers λα and generalized coordinates qj together form n + k
parameters, and equation (1.88) in conjunction with gα = 0 for each α from (1.85) together
form n + k equations to be solved.
There are two important cases to be considered.
1. In the holonomic case, fα (q, t) = 0. Here, gα = f˙α = ∂f α
q̇ + ∂f∂tα , so ajα = ∂f
∂qj j ∂qj
α
. This
gives
k
d ∂L ∂ L X ∂fα
− = λα (1.89)
dt ∂q̇j ∂qj α=1
∂qj
for holonomic constraints. The same result can be derived from a generalized Hamil-
ton’s principle Z t2
J[qj , λα ] = (L + λα fα ) dt (1.90)
t1
24
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
for semi-holonomic constraints. This result cannot be derived from Hamilton’s principle
in general, justifying the time we spent discussing d’Alembert’s principle, which we
have used to obtain (1.91). Recall that static friction imposes a no-slip constraint in
the form of our equation gα = 0. For g ∝ q̇, the form , ∂g
∂q̇
, is consistent with the form
of generalized force we derived from our dissipation function, ∂F ∂q̇
from our discussion
of friction.
We end this chapter with several examples of the use of Lagrange multipliers.
Example: Consider a particle of mass m at rest on the top of a sphere of radius a, as shown
above in Fig. 1.7. The particle is given an infinitesimal displacement θ = θ0 so that it slides
down. At what angle does it leave the sphere?
We use the coordinates (r, θ, φ) but set φ = 0 by symmetry as it is not important. The
constraint r ≥ a is non-holonomic, but while the particle is in contact with the sphere the
constraint f = r − a = 0 is holonomic. To answer this question
we will look for the point
m 2 2 ˙2
where the constraint force vanishes. Here T = 2 ṙ + r θ and V = mgz = mgr cos(θ) so
that L = T − V , then dtd ∂∂ṙL − ∂∂rL = λ ∂f
∂r
gives
mr̈ − mrθ˙2 + mg cos(θ) = λ, (1.92)
while d
dt
∂L
∂θ˙
− ∂L
∂θ
= λ ∂f
∂θ
= 0 gives
d 2
mr θ̇ − mgr sin(θ) = 0. (1.93)
dt
This in conjunction with r = a gives 3 equations for the 3 variables (r, θ, λ). Putting them
together gives ṙ = 0 so r̈ = 0. This means
ma2 θ¨ = mga sin(θ), −maθ˙2 + mg cos(θ) = λ.
Multiply the first of these by θ˙ and integrate over time, knowing that θ˙ = 0 when θ = 0,
gives θ˙2 = 2g
a
(1 − cos(θ)). Thus,
λ = mg(3 cos(θ) − 2) (1.94)
25
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
is the radial constraint force. The mass leaves the sphere when λ = 0 which is when
cos(θ) = 23 (so θ ≈ 48o ).
What if we instead imposed the constraint f 0 = r2 − a2 = 0? If we call its Lagrange
0
multiplier λ0 we would get λ0 ∂f
∂r
= 2aλ0 when r = a, so 2aλ0 = λ is the constraint force from
0
before. The meaning of λ is different, and it has different units, but we still have the same
constraint force.
What are the equations of motion for θ > arccos 32 ? Now we no longer have the
constraint so
˙ 2 d 2 ˙
mr̈ − mrθ + mg cos(θ) = 0 and mr θ − mgr sin(θ) = 0.
dt
The initial conditions are r1 = a, θ1 = arccos 23 , ṙ1 = 0, and θ˙12 = 3a
2g
from before. Simpler
coordinates are x = r sin(θ) and z = r cos(θ), giving
m 2
ẋ + ż 2 − mgz,
L= (1.95)
2
√
so ẍ = 0 and z̈ = −g with initial conditions z1 = 2a 3
, x1 = 35a , and the initial velocities
simply left as ż1 and ẋ1 for simplicity in writing (though the actual values follow from
ż1 = −a sin θ1 θ˙1 and ẋ1 = a cos θ1 θ˙1 ). This means
Example: Consider a hoop of radius a and mass m rolling down an inclined plane of angle
φ without slipping as shown in Fig. 1.11, where we define the x̂ direction as being parallel to
the ramp as shown. What is the friction force of constraint, and how does the acceleration
compare to the case where the hoop is sliding rather than rolling?
The no-slip constraint means aθ˙ = ẋ, so h = aθ˙ − ẋ = a, which can be made holonomic
but which we will treat as semi-holonomic. Then T = TCM + Trotation = 12 mẋ2 + 12 ma2 θ˙2 as
Ihoop = ma2 . Meanwhile, V = mg(l − x) sin(φ) so that V (x = l) = 0. This means
m 2 ma2 2
L=T −V = ẋ + θ̇ + mg(x − l) sin(φ). (1.99)
2 2
26
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
along with ẋ = aθ.˙ Taking a time derivative of the constraint gives ẍ = aθ,¨ so mẍ = λ, and
g
ẍ = 2 sin(φ). This is one-half of the acceleration of a sliding mass. Plugging this back in we
find that
1
λ = mg sin(φ) (1.101)
2
is the friction force in the −x̂ direction for the no-sliding constraint, and also θ¨ = g sin(φ). 2a
Example: Consider a wedge of mass m2 and angle α resting on ice and moving without
friction. Let us also consider a mass m1 sliding without friction on the wedge and try to
find the equations of motion and constraint forces. The constraints are that y2 = 0 so the
27
CHAPTER 1. A REVIEW OF ANALYTICAL MECHANICS
energy is V = m1 gy1 + m2 g(y2 + y0 ), where y0 is the CM of the wedge taken from above its
bottom. Then L = T − V , with the constraints f1 = (y1 − y2 ) − (x1 − x2 ) tan(α) = 0 and
f2 = y2 = 0. The equations of motion from the Euler-Lagrange equations with holonomic
constraints are
d ∂L ∂L ∂f1 ∂f2
− = λ1 + λ2 =⇒ m1 ẍ1 = −λ1 tan(α), (1.102)
dt ∂ẋ1 ∂x1 ∂x1 ∂x1
d ∂L ∂L ∂f1 ∂f2
− = λ1 + λ2 =⇒ m1 ÿ1 + m1 g = λ1 ,
dt ∂ẏ1 ∂y1 ∂y1 ∂y1
d ∂L ∂L ∂f1 ∂f2
− = λ1 + λ2 =⇒ m2 ẍ2 = λ1 tan(α),
dt ∂ẋ2 ∂x2 ∂x2 ∂x2
d ∂L ∂L ∂f1 ∂f2
− = λ1 + λ2 =⇒ m2 ÿ2 + m2 g = −λ1 + λ2 ,
dt ∂ ẏ2 ∂y2 ∂y2 ∂ y2
tan(α)
ẍ2 = λ1 , (1.104)
m2
tan(α)
ẍ1 = − λ1 ,
m1
λ1
ÿ1 = − g,
m1
the accelerations are constant. As a check on our results, if m2 → ∞, then ẍ2 = 0 so indeed
the wedge is fixed; and for this case, ẍ1 = −g sin(α) cos(α) and ÿ1 = −g sin2 (α) which both
vanish as α → 0 as expected (since in that limit the wedge disappears, flattening onto the
icy floor below it).
28
MIT OpenCourseWare
https://ocw.mit.edu
For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms.