Conics Section Module
1. Conic Sections
A mathematical topic has a physical/perceptual aspect, a geometric description, and an
algebraic formulation. Important features visible in any of these three aspects should be visible in
the others also, and one should search for and exploit these correspondences. The organizing
principle of this module is the interrelationships between physical/perceptual, geometric, and
algebraic forms. We refer to this principle as the PGA way of thinking. Throughout the module,
we develop the PGA way of thinking, which mathematicians value and use. A
central instructional objective is to help students acquire this way of thinking. The specific
context (circles, ellipses, etc.) chosen to implement this objective is only a vehicle, and less
important than the objective itself. The particular context of conic sections is very rich and, in
our view, suitable for high school students.
Beginning with the circle as the most familiar conic section, and continuing through the
others, we follow the sequence: perception, geometry, algebra. That is, what is our intuitive
understanding of a circle based on our perception of physical examples? Then the need for
communication (in this case, the need to communicate to others a precise description of the curve
formalizing the intuition) demands a characterization of the curve as a geometric locus. The
adequacy of the geometric description is tested by formally proving some intuitive properties
about the curve. Together with the need for communication, we utilize the need for computation.
Some problems about these properties invoke computation, in the form of algebraic
representations of the curve in terms of Cartesian or parametric equations and manipulations of
these representations. With geometric and algebraic descriptions in hand, new properties of the
locus can be discovered and proved. Proving, in general, draws on the need for certainty: the need to
know that something is true. Truth alone, however, is not our only aim, and we desire to educate
students to strive to know why something is true (the cause that makes it true), a need we refer
to as the need for causality. Finally, by comparing and reflecting on the results, we aim to instill
in the students the desire to reveal a structure: the need for structure. Specifically, our aim is to
lead students to recognize a common structure among all the conic sections, geometric as well
as algebraic. This sequence from perception through formalization and generalization
characterizes the development of many mathematical concepts. It serves to humanize
mathematics, showing students that definitions and theorems are not handed down by the gods
but arise from human experience. We have designed problems giving students ample opportunity
to develop and explore all these viewpoints (physical/perceptual, geometric, and algebraic) and
to contrast perceptual justifications with geometric and algebraic proofs of the properties of the
conic sections. We want students to understand that the mathematical definition of a circle, say,
is the basis for deriving all the properties of this locus, and the means for proving that some
newly encountered object is or is not a circle. Although we begin with perception, it is critical
that students also encounter situations in which perception alone leads to ambiguous or incorrect
expectations. The need for certainty compels students to resolve these situations by deductive
reasoning, which, in turn, promotes the replacement of perceptual reasoning by deductive
reasoning.
These five needs manifest a crucial principle, called the necessity principle. It claims: For
students to learn what we intend to teach them, they must have a need for it, where "need" refers
to intellectual need, not only psychological need. Intellectual need has to do with disciplinary
knowledge being born out of people's current knowledge through engagement in problematic
situations conceived as such by them. Psychological need, on the other hand, has to do with
people's desire, volition, interest, self-determination, and the like. Indeed, before one immerses
oneself in a problem, one must be willing to engage in the problem and persist in the
engagement. Our focus in this module is on intellectual rather than psychological needs. As the
module unfolds, we urge the reader to contrast this Necessity approach with the current
standards-driven approach in high school teaching.
The module also emphasizes another crucial principle: the repeated reasoning principle.
It claims: Students must practice reasoning in order to internalize, organize, and retain the
mathematics they have learned. Repeated reasoning, not mere drill and practice of routine
problems, is essential to the process of internalization, a state where one is able to apply
knowledge autonomously and spontaneously. The sequence of problems must continually call
for reasoning through the situations and solutions and must respond to the students' changing
intellectual needs.
As the discussion of each curve moves from visual intuition to geometric characterization
to algebraic formalization, natural questions recur in each case, for example, from "does a circle
uniquely determine its center?" to "does an ellipse uniquely determine its foci?" Students should
develop a structural way of thinking, where they spontaneously ask these questions and explore
the relationships between properties of different conic sections. The Dandelin sphere
construction will be presented for the ellipse, but students must rethink it carefully for the
parabola and hyperbola. Since the focus/directrix definition applies uniformly to all conics,
students must carefully formulate the reasons for their differing appearances. The problem of
finding tangent lines recurs for each curve. Algebraic techniques such as completing the square
also apply to the equations of all conics and must be reasoned out in each case. Repeated
reasoning, in the context of the PGA way of thinking, promotes in particular attention to
meaning, especially for the algebraic symbols, which we call referential symbolic reasoning.
Since the algebraic symbols can have both geometric and intuitive meanings, students have
numerous opportunities to interpret their algebraic manipulations in geometric or physical terms.
As a result they notice patterns and opportunities to simplify calculations which would otherwise
be overlooked. A related goal is for students to develop the algebraic invariance way of
thinking, a habit of mind where one manipulates an algebraic expression not haphazardly but
with the purpose of arriving at a desired form and maintaining certain properties of the
expression invariant.
In designing and teaching this module, we have struck balances in emphasis between
several aspects of the subject which are in tension. These tensions include:
• Specific properties of each conic versus general properties common to all.
• The use of elementary methods suitable for high school presentation versus links to
  more advanced (calculus, linear algebra) methods.
• Use of synthetic versus analytic geometry methods.
• Emphasis on 3D (sections of a cone) versus 2D (focus/directrix) definitions of the
  conics.
Lastly, our approach is to carefully attend to subject matter (definitions, theorems,
proofs, problems and their solutions, etc.) as well as to ways of thinking (WoT), such as the PGA
way of thinking, referential symbolic way of thinking, and algebraic invariance mentioned
above. We will refer to elements of subject matter as ways of understanding, to differentiate
them from ways of thinking. For example, the following are different ways of understanding the
phrase "derivative of a function at $a$," or the symbol $f'(a)$: "the slope of a line tangent to the
graph of a function at $a$," or "the limit $\lim_{h \to 0} \frac{f(a+h) - f(a)}{h}$," or "the instantaneous rate of change
at $a$," or "the slope of the best linear approximation to a function near $a$." Other ways of
understanding and ways of thinking will emerge as the module unfolds.
The module consists of six units. Each unit begins with a list of focus ways of
understanding and ways of thinking, and proceeds with the classroom problems that attend to
them. A pedagogical discussion on these problems, including observations from our own classes,
then follows. The unit concludes with a set of practice problems.
construction: that there was a reason to think of the two chords. In our approach to this module,
we aim at demystifying mathematics for the students, in that solutions to mathematical problems
do not emerge from nothing but from careful reasoning and a representation of the necessary and
sufficient conditions of the problem.
There is an algebraic solution mirroring this geometric one. Let the point $P_i$ have
coordinates $(x_i, y_i)$. The conditions telling algebra that the three points lie on a circle with center
$(a, b)$ and radius $r$ are the three equations $(x_i - a)^2 + (y_i - b)^2 = r^2$. They state that each point is
$r$ units from the center. Subtracting a pair of these equations gives a condition that the center is
equidistant from two of the given points, the specific distance $r$ having cancelled out (referential
symbolic reasoning). This turns out to be, for example,

$$b - \frac{y_1 + y_2}{2} = \frac{x_1 - x_2}{y_2 - y_1}\left(a - \frac{x_1 + x_2}{2}\right),$$

precisely the statement that $(a, b)$ lies on the perpendicular bisector of the segment $P_1P_2$. This is
the algebraic proof of the claim that the perpendicular bisector is the locus of equidistant points.
The center can be found by solving any two of these linear equations simultaneously. The third
linear equation is dependent on the two chosen, and therefore the center lies on it too.
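The algebraic procedure above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `circle_center` and the use of exact fractions are choices made for this example, not part of the module. It subtracts the first circle equation from the other two, which cancels $r$ and leaves two linear equations in $a$ and $b$, and then solves them by Cramer's rule.

```python
from fractions import Fraction


def circle_center(p1, p2, p3):
    """Center (a, b) of the circle through three non-collinear points.

    Hypothetical helper for illustration. Subtracting the equation
    (x1 - a)^2 + (y1 - b)^2 = r^2 from the other two cancels r^2 and
    leaves two linear equations in a and b.
    """
    (x1, y1), (x2, y2), (x3, y3) = [tuple(map(Fraction, p)) for p in (p1, p2, p3)]
    # Linear system:  a11*a + a12*b = c1,  a21*a + a22*b = c2
    a11, a12 = 2 * (x2 - x1), 2 * (y2 - y1)
    a21, a22 = 2 * (x3 - x1), 2 * (y3 - y1)
    c1 = x2**2 - x1**2 + y2**2 - y1**2
    c2 = x3**2 - x1**2 + y3**2 - y1**2
    det = a11 * a22 - a12 * a21
    if det == 0:
        raise ValueError("the three points are collinear; no circle passes through them")
    # Cramer's rule
    a = (c1 * a22 - a12 * c2) / det
    b = (a11 * c2 - c1 * a21) / det
    return a, b
```

For instance, the points $(6, 2)$, $(1, 7)$, and $(-4, 2)$ all lie at distance 5 from $(1, 2)$, and `circle_center((6, 2), (1, 7), (-4, 2))` returns that center. The collinear case corresponds to parallel perpendicular bisectors, where the two linear equations have no common solution.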
Students may wish to locate the center as the intersection of two diameters. How easy this
is depends on the available tools. Given a ruler, one might locate two points on the circle a
maximal distance apart and join them. (Is this an allowed use of a ruler?) With straightedge and
compass, it can be done by inscribing a right angle at a point of the circle; its sides will meet the
circle at ends of a diameter. This could necessitate the theorem that an inscribed angle measures
half of the intercepted arc. It is useful for students to realize how geometric solutions depend on
the available tools.
In our classroom one student drew a circle on a sheet of paper and folded the paper so as
to bring one half of the circle onto the other, thus making the fold line a diameter. Two such
diameters intersect at the center. This is an excellent example of an empirical/perceptual proof as
opposed to a deductive one. The perceptual approach is valuable for highlighting intuitive
properties of the circle, which should be deductively verifiable later: its symmetry about any
diameter, for example.
$(x - a)^2 + (y - b)^2 = r^2$.
The problem that Bruce encountered can be generalized: If we expand the brackets in the
circle's equation and collect the corresponding terms, we can rewrite the above equation in the
form $x^2 + y^2 - 2ax - 2by + (a^2 + b^2 - r^2) = 0$.
This shows that the equation of a circle can be written in two different forms: as
1. $(x - a)^2 + (y - b)^2 = r^2$
or as
2. $x^2 + y^2 - 2ax - 2by + (a^2 + b^2 - r^2) = 0$.
We can make the second equation more compact and easier to remember by substituting
$f = -2a$, $g = -2b$, and $h = a^2 + b^2 - r^2$ to obtain $x^2 + y^2 + fx + gy + h = 0$. This shows
that every circle can be represented by an equation of the form
3. $x^2 + y^2 + fx + gy + h = 0$.
Question: Does every equation of the form $x^2 + y^2 + fx + gy + h = 0$ represent a circle?
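One way to explore this question is to reverse the substitution numerically. The sketch below is an illustration only, with a hypothetical function name `general_to_standard`. It completes the square in $x^2 + y^2 + fx + gy + h = 0$, recovering $a = -f/2$, $b = -g/2$, and $r^2 = a^2 + b^2 - h$, and reports what kind of locus results.

```python
def general_to_standard(f, g, h):
    """Complete the square in x^2 + y^2 + f*x + g*y + h = 0.

    Returns ((a, b), r_squared, kind), where (x - a)^2 + (y - b)^2 = r_squared.
    Hypothetical helper for illustration.
    """
    a, b = -f / 2, -g / 2
    r_squared = a * a + b * b - h
    if r_squared > 0:
        kind = "circle"
    elif r_squared == 0:
        kind = "single point"
    else:
        kind = "empty set"
    return (a, b), r_squared, kind
```

With $f = -2$, $g = -4$, $h = -11$ the function returns center $(1, 2)$ and $r^2 = 16$, a circle of radius 4. With $f = g = 0$ and $h = 1$ it reports the empty set, since $x^2 + y^2 = -1$ has no real solutions.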
h . This
2
2
4
between the line and any chosen line in the plane which meets it. The angle between the line and
its orthogonal projection into the plane will do, and this is equivalent to taking the complement
of the angle between the line and the plane's normal.
To define the angle between curves, and specifically the idea of orthogonal curves,
students suggested using their tangent lines. Asked why this makes sense, they responded that
the lines locally approximate the curves and we already know what the angle between lines
means. This is a proper answer. The definition for the angle between two curves emerged
naturally for the students from their image of tangent line as a local approximation to a curve.
Here the instructor can discuss the relationship between this meaning of tangent line and
Euclid's meaning as a line intersecting exactly once (which may be familiar to students from
high school). This is part of the need for communication, in that different meanings of the same
term must be equivalent, for otherwise it wouldn't be possible to communicate meaningfully
about the term. Furthermore, this would advance another important way of thinking: that a term
can have multiple interpretations, and it is advantageous to have multiple interpretations for a
term. Once this equivalency has been established, the instructor may ask, "Imagine that I am
blind. How can I prove that, given a circle, there is a line that does not intersect it?" The game
of "playing blind" is to remove the use of visual perception, which is common among students. A
geometric approach to this depends on a careful axiomatization of geometry, so algebra is more
appropriate here. If the circle and line have equations $(x - a)^2 + (y - b)^2 = r^2$ and $y = mx + k$,
how many intersection points can they have? Students should explain clearly why substituting
$y = mx + k$ into the equation of the circle has the meaning of locating intersection points. Since a
quadratic equation results, there are two, one, or no intersection points, and any of these
possibilities can occur by adjusting the parameters. The case with no intersections answers the
given question, and the case of a single intersection defines the idea of a tangent line.
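The counting argument can be made concrete with a short sketch. The function name `count_intersections` is hypothetical, and the example assumes exact (integer or fraction) inputs so that the test for a zero discriminant is reliable. Substituting $y = mx + k$ into the circle's equation gives $(1 + m^2)x^2 + 2(m(k - b) - a)x + (a^2 + (k - b)^2 - r^2) = 0$, and the sign of its discriminant decides the number of intersection points.

```python
def count_intersections(a, b, r, m, k):
    """Number of points where y = m*x + k meets (x - a)^2 + (y - b)^2 = r^2.

    Hypothetical helper; intended for exact inputs (int or Fraction).
    """
    # Coefficients of the quadratic in x obtained by substitution
    A = 1 + m * m
    B = 2 * (m * (k - b) - a)
    C = a * a + (k - b) ** 2 - r * r
    discriminant = B * B - 4 * A * C
    if discriminant > 0:
        return 2
    if discriminant == 0:
        return 1  # the line is tangent to the circle
    return 0
```

For the unit circle centered at the origin, the horizontal lines $y = 0$, $y = 1$, and $y = 2$ give 2, 1, and 0 intersection points respectively, exhibiting all three cases.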
between the centers $s$, the circles will be orthogonal if each radius to $A$ or $B$ is tangent to the
other circle, that is, if the radii are perpendicular there. This happens if $r_1^2 + r_2^2 = s^2$ (by the
converse to the Pythagorean theorem). Thus, the construction can be made as follows. Draw one
circle with any center $O$ and radius $r_1$; at the endpoint $A$ of a radius, draw a perpendicular
segment $AO'$ of any length $r_2$, and a second circle centered at $O'$ and passing through $A$ will be orthogonal to the first
circle at $A$. Why also at the second intersection $B$? By the earlier argument that there are pairs
of congruent triangles! So students see that the first approach was not "wrong" or a "waste of
time"; the reasoning is valuable after we see how it fits into the structure of the problem.
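The orthogonality condition and the construction can be checked with a small sketch. The names `are_orthogonal` and `constructed_center`, and the choice of coordinates (first center at the origin, $A$ on the positive $x$-axis), are assumptions made for this illustration.

```python
def are_orthogonal(center1, r1, center2, r2):
    """True if two circles meet at right angles, i.e. r1^2 + r2^2 = s^2,
    where s is the distance between the centers. Use exact inputs."""
    dx = center2[0] - center1[0]
    dy = center2[1] - center1[1]
    s_squared = dx * dx + dy * dy
    return r1 * r1 + r2 * r2 == s_squared


def constructed_center(r1, r2):
    """Center of the second circle in the construction, assuming the first
    center O is the origin and A = (r1, 0). The segment from A of length r2
    is perpendicular to the radius OA, so it is vertical."""
    return (r1, r2)
```

For example, with $r_1 = 3$ and $r_2 = 4$ the constructed center is $(3, 4)$, at distance 5 from the origin, and `are_orthogonal((0, 0), 3, (3, 4), 4)` is true.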
Classroom Problem 6: A circle of radius 4 is centered at $(1, 2)$. Where does it cross the
coordinate axes? What lines are tangent to it at those points?

The equation of the circle is $(x - 1)^2 + (y - 2)^2 = 16$,
from which the intercepts are easily found as
$(0, 2 \pm \sqrt{15})$ and $(1 \pm 2\sqrt{3}, 0)$. They also follow from
applying the Pythagorean theorem to a diagram,
which should not be surprising since the Pythagorean
theorem is the basis for the equation of the circle.
They are of course symmetric about the vertical and horizontal diameters of the circle.
To find the tangent lines, we need to tell algebra or geometry what a tangent line is. As
we saw, for circles one can use the simplest definition: a line meeting the circle in exactly one
point. However, students should have no trouble giving examples of lines which meet a parabola
in one point but are not tangent to it, or lines tangent to a non-convex closed curve which meet it
more than once.
As Euclid shows, the line perpendicular to the radius of a circle at some point $P$ on the
circle meets the circle only there and is otherwise external to it. Indeed, if $O$ is the center and $Q$
some other point on this line, then $OQ$ is greater than the radius $OP$ by the Pythagorean
theorem, so $Q$ is outside the circle. This can be confirmed by students who know calculus:
implicit differentiation of the equation of the circle gives $dy/dx = -(x - 1)/(y - 2)$, the negative
reciprocal of the slope of the radius to $(x, y)$. The slopes are $\pm 1/\sqrt{15}$ at the $y$-intercepts and
$\pm\sqrt{3}$ at the $x$-intercepts, which look plausible from a picture. The two signs again reflect the
symmetry of the circle about its diameters. Since the equation of the circle presents it as a level
set of a function $f(x, y)$, one can also find a normal vector by taking the gradient, which gives
twice the radius vector. One can finesse the meaning of the tangent line by simply defining it to
be the line having the slope computed by calculus, but this gives no insight unless the process of
differentiation is understood.
An algebraic approach may lead to more insight than calculus for many students. We
seek a line $y = mx + b$ intersecting the circle only once, say at a $y$-intercept $x = 0$. Solving the
equations of the line and circle simultaneously leads to the quadratic
$(1 + m^2)x^2 + 2(mb - 2m - 1)x + (b^2 - 4b - 11) = 0$. If this has only one root $x = 0$, then two
coefficients must vanish: $b^2 - 4b - 11 = 0$ gives $b = 2 \pm \sqrt{15}$, so that the line and the circle have
the same $y$-intercept, and then $m = 1/(b - 2) = \pm 1/\sqrt{15}$, the expected slope.
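The vanishing of the two coefficients can be confirmed numerically. The sketch below is an illustration with hypothetical function names. It builds the coefficients of the quadratic obtained by substituting $y = mx + b$ into $(x - 1)^2 + (y - 2)^2 = 16$ and evaluates them for the two candidate tangent lines.

```python
import math


def quadratic_coefficients(m, b):
    """Coefficients (A, B, C) of A*x^2 + B*x + C = 0, obtained by substituting
    y = m*x + b into (x - 1)^2 + (y - 2)^2 = 16."""
    return (1 + m * m, 2 * (m * b - 2 * m - 1), b * b - 4 * b - 11)


def tangent_lines_at_y_intercepts():
    """(slope, intercept) pairs for the tangent lines at the two y-intercepts."""
    lines = []
    for sign in (1, -1):
        b = 2 + sign * math.sqrt(15)  # root of b^2 - 4b - 11 = 0
        m = 1 / (b - 2)               # makes the linear coefficient vanish
        lines.append((m, b))
    return lines
```

For both lines the last two coefficients come out as zero up to rounding error, so the quadratic reduces to $(1 + m^2)x^2 = 0$, whose only root is the double root $x = 0$.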
form $(x - a)^2 + (y - b)^2 = r^2$. Now choose another set of values for $f$, $g$, and $h$, but
this time $f$ should be a fraction, $g$ an irrational number, and $h$ a negative integer.
Again convert the resulting equation into an equation of the form
$(x - a)^2 + (y - b)^2 = r^2$.
Problem 2. Find the equation of a circle centered at the origin that is tangent to the
line 2 x 2 y 39 .
Problem 3: Let P, Q, R, and S be four distinct points in the plane such that no three
( x 2) 2 y 2 1.
Problem 6: How many common tangents can there be for two circles in the plane?
Exhibit all the possibilities, and prove that there are no others.
Problem 7: Given 2 intersecting circles in a plane, consider the 4 tangents at the
points of intersection (1 tangent at each of 2 points for each of 2 circles). Draw
perpendicular lines from each circle center to each tangent, thus obtaining 6 points of
intersection: 4 lines from each center yields 8 points, only 6 of which are distinct.
Show that all these points (A,B,C,D,E,F) are on the same circle.
( py x )( x p 3) 0
In our classroom we did not introduce parametric equations of the circle. Some of the
following problems serve to do so. In Problem 8, for example, students need to use the definition
of a circle, including the part about being planar, to verify that the given curves are circles. They
also need to observe that parametric equations describe not only a locus, but also the motion of a
point along that locus in time. This motion is useful for physical modeling but irrelevant for
geometric properties of the locus. The meaning of the parameter t needs to be deeply understood
in order to distinguish between intersection of the loci and collision of the points moving along
them. Here algebra encodes more information than the geometry requires.
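The distinction between intersection of loci and collision of moving points can be illustrated with a small sketch. The two motions `path_1` and `path_2` below are hypothetical examples chosen for this illustration; they are not taken from the practice problems. Their loci are the lines $y = x$ and $x + y = 2$, which cross at $(1, 1)$, but the particles pass through that point at different times.

```python
def path_1(t):
    """Hypothetical particle moving along the line y = x."""
    return (t, t)


def path_2(t):
    """Hypothetical particle moving along the line x + y = 2."""
    return (t + 1, 1 - t)


def times_at(path, point, times):
    """Sampled times at which the particle is at the given point."""
    return [t for t in times if path(t) == point]


def collision_times(path_a, path_b, times):
    """Sampled times at which both particles occupy the same point."""
    return [t for t in times if path_a(t) == path_b(t)]


sample_times = range(-5, 6)  # integer times only, enough for this example
crossing = (1, 1)            # the only common point of the two lines
```

Here `times_at(path_1, crossing, sample_times)` gives `[1]` and `times_at(path_2, crossing, sample_times)` gives `[0]`, while `collision_times(path_1, path_2, sample_times)` is empty. In fact no collision occurs at any time, since the $x$-coordinates $t$ and $t + 1$ are never equal.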
Problem 10: A particle moves along the line 2 x
3
y 2 0 . At 12:00 AM, the
5
2
. What is the particle's y-coordinate at that time? The
2
particle's velocity in the direction of the x-axis is 4/9 meter/second. What is the
particle's position (in the x-y plane) at any given time t? What is the particle's
velocity along the y-axis?
Problem 11: The position function $\mathbf{r}(t) = (x(t), y(t), z(t))$ is often written as
$\mathbf{r}(t) = x(t)\mathbf{i} + y(t)\mathbf{j} + z(t)\mathbf{k}$. Consider the two parametric curves:
$\mathbf{r}_1(t) = (\cos t)\mathbf{i} + (\sin t)\mathbf{j}$ and $\mathbf{r}_2(t) = (\cos t)\mathbf{j} + (\sin t)\mathbf{k}$. What are these curves? Do they
intersect?
Problem 12: A particle moves so that its position in the plane at time t seconds from
the start of its movement is $(\cos t, \sin t)$. If the particle's movement was observed for
2 seconds, what curve did the particle complete? If the particle's movement was
particles x -coordinate is
seconds, what curve did the particle complete? Answer the same
2
question for 4 seconds and for 1 second.
Problem 13: Three particles, A, B, and C, move in space. The positions of the three
objects in space at time t seconds from the start of their movement are, respectively:
$(\cos t, \sin 2t, 0)$, $(\cos \frac{t}{2}, 0, \sin t)$, $(0, \cos 2t, \sin 4t)$. What are the curves along which
the particles move? Can you tell which of the particles is the fastest or the slowest?
Problem 14: The hands of a clock have lengths 4 and 8 inches. Where are they
located t minutes after noon? What is the angle between them at that time, and what
is the distance between their tips? At what times do the hands coincide?
Referential symbolic way of thinking. Attending, when there is a need, to the meaning
of symbols and their manipulations.
Deductive reasoning: Logical structure of proofs: what is given, what is proved, what
can be assumed or chosen freely. Distinguishing between a theorem and its converse.
Algebra as deduction: Every algebraic computation is a proof of something. Try to
state explicitly what it proved.
PGA way of thinking: Attending to interrelationships between physical/perceptual,
geometric, and algebraic realities.
Algebraic invariance way of thinking. An equation can be rewritten in various forms
which make certain properties more noticeable. The link between the forms is
provided by something which remains invariant, for example the solution set.
Algebraic way of thinking. This is a broad way of thinking. One of its instantiations is
the realization that when applying algebra, to a geometry problem for example, one
must tell algebra all the relevant geometric constraints. Likewise, we may talk of a
geometric way of thinking in this manner; namely, in solving a geometry problem,
one must tell geometry all the given conditions.
Usefulness of completing the square.
such that $PF_1 + PF_2 = d$. The instructor drew one on the board using a shoelace wrapped around
two pegs (perceptual verification that the definition is compatible with our intuition).
This problem serves to check that our formal definition
can justify properties of the ellipse that we intuitively
expect to hold. Let $T$, $T'$ be the vertices on the major
axis with $T$ closer to $F_2$, $A$ a vertex on the minor axis,
[Figure 2: Ellipse axes]
conjectured that AF1 BF2 but could not justify it; also
Closely related to this problem is the claim that the ellipse is symmetric about its center.
We can prove this, and in fact more: the ellipse is symmetric about both axes. Reflection of any
point P on the ellipse about the line joining the foci (the major axis) preserves both focal
distances and so maps P to another point on the ellipse. Reflection about the perpendicular
bisector of the focal segment (minor axis) interchanges the two focal distances and so preserves
their sum. The composition of these reflections is reflection about the center, so the ellipse is
symmetric about its center.
Classroom Problem 4: Find a
Cartesian equation of an ellipse.
Does it confirm the intuition that
an ellipse can be obtained by
squashing a circle?
of the foci.)
Classroom Problem 5: We know that the tangent line to a circle is perpendicular to the radius at the
point of tangency. Is there a similar characterization of the tangent line to an ellipse?

In our classroom students briefly considered using the
radius from the center, but preferred the focal radii.
They conjectured that the angles between the tangent
line and these may sum to $90^\circ$, but then switched to the
conjecture that these angles are equal. Students
observed that this is true in the special case of a circle,
and at the vertices of an ellipse, providing some initial
evidence for the conjecture. Let a line be tangent to an ellipse with equation in standard form at
point $P(x_1, y_1)$ in the first quadrant, and make angle $\alpha$ with the major axis (see Figure 4). Let
the angles between the tangent line and the focal radii $PF_i$ be $\theta_i$. Let the angles between
$PF_1$, $PF_2$ and the major axis be $\beta$, $\gamma$, respectively. We want to prove that $\theta_1 = \theta_2$.

[Figure 4: Characterization of the tangent line to an ellipse]

Students had several approaches to this problem, each promoting important Ways of Thinking,
and the different approaches can be productively compared. First, we need to determine the
equation of the tangent line, that is, tell algebra that the line is tangent to the ellipse. This can be
done using calculus, or the algebraic method of finding a line having a single intersection of
multiplicity two with the ellipse. The result is $(x_1 x / a^2) + (y_1 y / b^2) = 1$. The notational
distinction between $(x_1, y_1)$, the chosen point of tangency, and $(x, y)$, a variable point on the
tangent line, deserves emphasis. The Need for Communication requires distinguishing them;
using the same symbol for two objects would tell algebra that they are the same. $(x_1, y_1)$ satisfies
both the equation of the ellipse and that of the tangent line; $(x, y)$ satisfies only the latter. Some
angles in Figure 4 (more precisely, their tangents) can be computed from the slopes of the lines:

$$\tan\alpha = \frac{b^2 x_1}{a^2 y_1}, \qquad \tan\beta = \frac{y_1}{x_1 + c}, \qquad \tan\gamma = \frac{y_1}{c - x_1}.$$

We can use the exterior angle theorem to find $\theta_1$ and $\theta_2$: $\theta_1 = \alpha + \beta$ and $\theta_2 = \gamma - \alpha$. Then trig
identities [prove them!] allow us to compute, for example,

$$\tan\theta_1 = \frac{\tan\alpha + \tan\beta}{1 - \tan\alpha\tan\beta}.$$

Here is an example of the Necessity for such trig identities ("When will I ever use this?").
One finds straightforwardly (don't fear complex algebra!)

$$\tan\theta_1 = \frac{\dfrac{b^2 x_1}{a^2 y_1} + \dfrac{y_1}{x_1 + c}}{1 - \dfrac{b^2 x_1}{a^2 (x_1 + c)}}, \qquad
\tan\theta_2 = \frac{\dfrac{y_1}{c - x_1} - \dfrac{b^2 x_1}{a^2 y_1}}{1 + \dfrac{b^2 x_1}{a^2 (c - x_1)}}.$$

We first clear denominators to obtain

$$\tan\theta_1 = \frac{b^2 x_1 (x_1 + c) + a^2 y_1^2}{y_1 [a^2 (x_1 + c) - b^2 x_1]}, \qquad
\tan\theta_2 = \frac{a^2 y_1^2 - b^2 x_1 (c - x_1)}{y_1 [a^2 (c - x_1) + b^2 x_1]}.$$

One can check the equality of these by brute force (e.g. multiplying one by the reciprocal of the
other and simplifying), but the Referential Symbolic WoT suggests a meaning-based
simplification. Indeed, from the equation of the ellipse, the terms $b^2 x_1^2 + a^2 y_1^2$ in the numerators
simplify to $a^2 b^2$, and in the denominators $a^2 - b^2 = c^2$ can be used. Each tangent then reduces to
$\frac{b^2}{c y_1}$, so they are equal (note also that referential symbolic reasoning provides students with a
check of their work: since the expression computed is a tangent, it should have no units and thus
the numerator and denominator must have the same overall power). Finally, we need to consider
whether $\tan\theta_1 = \tan\theta_2$ suffices to conclude that $\theta_1 = \theta_2$. This provides Necessity for the concept
of a one-to-one function on a specific domain, here $0 < \theta < \pi$.
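Before attempting the general simplification, students can test the conjecture on a specific ellipse. The sketch below is an illustration only: the function name `tangent_angle_tans` and the Greek-letter naming (`alpha` for the angle of the tangent line with the major axis, `beta` and `gamma` for the angles of the focal radii $PF_1$, $PF_2$ with the major axis) are assumptions of this example. It uses exact fractions and the foci $F_1 = (-c, 0)$, $F_2 = (c, 0)$.

```python
from fractions import Fraction


def tangent_angle_tans(a, b, c, x1, y1):
    """Tangents of the two angles between the tangent line at P(x1, y1) and
    the focal radii of x^2/a^2 + y^2/b^2 = 1, computed from
    theta1 = alpha + beta and theta2 = gamma - alpha."""
    a, b, c, x1, y1 = map(Fraction, (a, b, c, x1, y1))
    tan_alpha = (b * b * x1) / (a * a * y1)  # tangent line vs. major axis
    tan_beta = y1 / (x1 + c)                 # PF1 vs. major axis
    tan_gamma = y1 / (c - x1)                # PF2 vs. major axis
    tan_theta1 = (tan_alpha + tan_beta) / (1 - tan_alpha * tan_beta)
    tan_theta2 = (tan_gamma - tan_alpha) / (1 + tan_gamma * tan_alpha)
    return tan_theta1, tan_theta2
```

For the ellipse with $a = 5$, $b = 3$, $c = 4$ and the point $P(3, 12/5)$ on it, both values equal $15/16$, which agrees with $b^2 / (c y_1) = 9 / (4 \cdot 12/5)$.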
A second proof of the equal-angles property of the tangent line requires less computation
and provides more insight. It suffices to show that the normal line at the point $P$ of the ellipse
bisects angle $P$ in the triangle $F_1PF_2$. If this normal meets the opposite side $F_1F_2$ at a point $K$,
then by Euclid, Prop. VI.3, this is equivalent to showing that $F_1K / F_2K = F_1P / F_2P$. (An angle
bisector in a triangle is characterized by dividing the opposite side in proportion to the adjacent
sides.) Since the normal line has equation [Prove this!] $(a^2 x / x_1) - (b^2 y / y_1) = c^2$, the point $K$
has coordinates $(c^2 x_1 / a^2, 0)$ and the required proportion becomes

$$\frac{c + (c^2 x_1 / a^2)}{c - (c^2 x_1 / a^2)} = \frac{r_1}{r_2},$$

which follows from our earlier computation of the focal radii, (1.3).
There is a third, physics-based proof which provides insight. Imagine an object moving around the ellipse, with any
velocity $v$, not necessarily constant. Since the sum of its distances to the foci remains constant,
the sum of the rates of change of the focal distances is zero. But the rate of change of the
distance to a focus is simply (minus) the component of velocity in the direction of the focus. This
says that $v\cos\theta_1 - v\cos\theta_2 = 0$, again implying $\theta_1 = \theta_2$.
Finally, a useful follow-up question is: given an ellipse with its foci, perform a Euclidean
construction of the tangent line at a specified point. Besides the solution following from the
equal-angles property, namely drawing the focal radii and bisecting the angle they form, there is
another following from the "squashed circle" viewpoint. Begin with the circle $x^2 + y^2 = a^2$ and its
tangent line at some point $(x_1, y_1)$. Squashing by a factor $b/a$ in the vertical direction produces
the ellipse $(x/a)^2 + (y/b)^2 = 1$ along with its tangent line at the corresponding point
$P(x_1, by_1/a)$, and the squashing does not change the $x$-intercept of the tangent line. Therefore,
from the tangent line to the circle we obtain this intercept, and the tangent line to the ellipse is
drawn by joining the intercept to P .
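The claim that squashing preserves the $x$-intercept of the tangent line can be checked on an example. The sketch below uses hypothetical function names and exact fractions. It computes the intercept of the tangent to the circle $x^2 + y^2 = a^2$ at $(x_1, y_1)$ and the intercept of the tangent to the ellipse at the squashed point $(x_1, by_1/a)$, using the slopes $-x_1/y_1$ and $-b^2 x_1 / (a^2 y)$ respectively.

```python
from fractions import Fraction


def x_intercept(point, slope):
    """x-intercept of the line through `point` with the given nonzero slope."""
    px, py = point
    return px - py / slope


def tangent_x_intercepts(a, b, x1, y1):
    """x-intercepts of the tangent to the circle at (x1, y1) and of the tangent
    to the squashed ellipse at (x1, b*y1/a). Assumes x1, y1 are nonzero and
    (x1, y1) lies on the circle x^2 + y^2 = a^2."""
    a, b, x1, y1 = map(Fraction, (a, b, x1, y1))
    circle_slope = -x1 / y1
    squashed_point = (x1, b * y1 / a)
    ellipse_slope = -(b * b * x1) / (a * a * squashed_point[1])
    return (x_intercept((x1, y1), circle_slope),
            x_intercept(squashed_point, ellipse_slope))
```

With $a = 5$, $b = 3$ and the circle point $(3, 4)$, both intercepts equal $25/3 = a^2/x_1$, so joining $(25/3, 0)$ to $P(3, 12/5)$ gives the tangent line to the ellipse.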
Classroom Problem 6: Is the
focus/directrix definition of an
ellipse equivalent to the two-focus
definition? That is, is the locus of
points whose distances from focus
and directrix are in the ratio $e < 1$
$f = a^2 / c$. That is, $r_2$ is $e$ times the distance from $(x, y)$ to the vertical line $l$ with equation
$x = f$. [We observe that $f > a$, so that this line lies to the right of every point on the ellipse,
and also that $e < 1$.]
A useful Way of Thinking is that every algebraic computation is a proof of something,
and one should try to state explicitly what it proved. Our computation of $r_2$ proved that, if $P$ lies
on the ellipse, then $PF_2 = e \cdot Pl$. That is, the distance from $P$ to the focus $F_2$ is $e$ (the
eccentricity) times the distance to $l$, the directrix.
[Figure 5: Focus/directrix definition of ellipse]
Most students do not yet possess the WoT that the
converse should automatically be investigated, and
that reversing the algebraic reasoning may prove it.
Here we ask whether a point satisfying $PF_2 = e \cdot Pl$
is necessarily on the ellipse. If so, then we have an
alternative definition of an ellipse, the
focus/directrix definition. One way to establish this
is to begin with $r_2 = \frac{c}{a}\left(\frac{a^2}{c} - x\right)$ and apply the
Pythagorean Theorem (twice) to Figure 5 to obtain
it!], we can take the square root to obtain $r_1 = a + \frac{c}{a}x$, and add to obtain $r_1 + r_2 = 2a$. Thus, the
focus/directrix property implies the two-focus definition of the ellipse.
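The relation between the two definitions can be checked numerically on a sample point. The sketch below is an illustration with hypothetical function names. It computes the focal radii directly as distances to $F_1 = (-c, 0)$ and $F_2 = (c, 0)$, and separately computes $e$ times the distance to the directrix $x = a^2/c$.

```python
import math


def focal_radii(a, b, x, y):
    """Distances (r1, r2) from the point (x, y) to the foci (-c, 0) and (c, 0)."""
    c = math.sqrt(a * a - b * b)
    return math.hypot(x + c, y), math.hypot(x - c, y)


def e_times_directrix_distance(a, b, x):
    """e times the distance from a point with abscissa x to the line x = a^2/c."""
    c = math.sqrt(a * a - b * b)
    e = c / a          # eccentricity
    f = a * a / c      # position of the directrix
    return e * (f - x)
```

For $a = 5$, $b = 3$ and the point $(3, 2.4)$ on the ellipse, the focal radii are approximately $7.4$ and $2.6$, their sum is $2a = 10$, and $r_2$ agrees with `e_times_directrix_distance(5, 3, 3)` up to rounding.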
In 1822, G.P. Dandelin discovered a beautiful proof
that conic sections are indeed plane sections of a
cone. In our classroom we motivated this
construction and presented the reasoning in the case of the ellipse. It is also possible, but
complicated, to take an algebraic approach: obtain equations for a cone and a plane cutting it in
three-dimensional Cartesian coordinates, and derive an equation for their intersection. In addition
to the complexity of the algebra, a conceptual difficulty is that the three-dimensional Cartesian
coordinates x, y are not also standard Cartesian coordinates in the cutting plane, where we want
to have an equation for the locus.
Intuition tells us that a horizontal plane cuts a cone in a circle, whereas if the plane is
tilted slightly the intersection is an oval curve which may be an ellipse. Perceptually, it seems
unlikely that this oval will be symmetric about its center rather than egg-shaped; we need
deductive reasoning to correct our intuition in this case. In Figure 6, a cone has vertex V and is
tangent to a sphere along a circle C, which lies in a plane and has center O. A cutting plane meets
the cone in the curve E (which we assume is closed), is tangent to the sphere at F, and cuts the
plane of C in the line d. Conversely, for any plane cutting the cone in a closed curve E, we can
draw a sphere tangent to both the cone and the plane as shown. Our objective is to show that E is
an ellipse. P is an arbitrary point on E, the generator VP meets the plane of C at L, and PM is
drawn perpendicular to the plane of C. PD is drawn perpendicular to d. The cutting angle is
$\angle PDM$ and the cone's elevation angle is $\angle VLM$. It is important for students to attend
to the temporal (logical) sequence in which elements are added to the diagram: which points, lines,
and so forth are chosen first and which others are then determined by these choices. Some elements
are given, such as the cone and plane, some are fixed, like the point of tangency F, and some are
variable like P. We introduce the variable point P in order to show that however it is chosen on E it
will satisfy one of the definitions of an ellipse. Students need practice (Repeated Reasoning) with the
overall logical structure of such proofs.
Figure 6: Dandelin sphere construction of ellipse
Classroom Problem 7: Is a (suitable)
plane section of a cone an ellipse?
To prove that E is an ellipse, we need to identify candidates for the focus and directrix.
The point of tangency F might be the focus, and d might be the directrix. Some support for this
conjecture comes from the facts that, as the plane becomes horizontal, F moves to the center of
the circle E and d moves off to infinity. By way of telling geometry that the sphere is inscribed
in the cone, we note that PL and PF are tangents to the sphere from the same external point, so
$PL = PF$. (Students are familiar with the fact that tangents to a circle from an external point are
equal, but had difficulty deducing the corresponding fact for spheres. To obtain the latter from the
former, one has to find the right circular cross-section of the sphere.) We also have
$PM / PL = PM / PF = \sin \angle VLM$ and $PM / PD = \sin \angle PDM$. Therefore,
$$\frac{PF}{PD} = \frac{\sin \angle PDM}{\sin \angle VLM} = e,$$
a constant independent of P. If we know that $e < 1$, then we have shown that E is an ellipse.
This follows from our implicit assumption that the plane cuts the cone in a closed curve, which
occurs only when the plane cuts each generator of the cone. That in turn requires the cutting angle
$\angle PDM$ to be smaller than the elevation angle $\angle VLM$, which
implies $e < 1$.
Although Dandelin's proof is visual and elegant, it also raises several subtle issues which
should be made explicit if not necessarily resolved rigorously in class. First, how do we know
that there is always a (unique) sphere tangent to both the cone and the cutting plane? An intuitive
continuity argument is convincing but not rigorous: begin with a small spherical balloon tangent
to the cone only, and "blow it up" until it becomes tangent to the plane as well. A geometric
proof can be given by using a suitable plane cross-section of the three-dimensional diagram and
the fact that a triangle has a unique inscribed circle. Second, the fact that a cutting plane whose
cutting angle is smaller than the cone's elevation angle
produces a closed curve of intersection relies on some intuitive topology. (Indeed, the
proof shows only that the curve of intersection is contained in an ellipse. We are relying on this
intuitive topology for the conclusion that the curve is a complete ellipse.) We did not press the
issue of what it means to be a closed curve at this point, but we returned to it when comparing
the Dandelin sphere proofs for the different conic sections. Third, the definition of a cone as a
figure of solid geometry may still be at the intuitive level for students and should be formalized.
For example, begin with a circle and a point other than its center on the line through its center
and perpendicular to its plane; take the union of all lines joining this point to the points of the
circle. Students do not always realize initially that this defines a double cone.
Supplementary and Practice Problems:
Problem 1. Given an ellipse, construct (synthetically) its foci.
Although the construction is purely synthetic, the proof uses coordinates. First, we find
the center. Construct a pair of parallel chords (AB and CD in Figure 7). Connect the midpoints
$M_1$ and $M_2$. Construct a second pair of parallel chords EF and GH not parallel to the first pair,
and connect their midpoints $M_3$ and $M_4$. Then the intersection of $M_1M_2$ and $M_3M_4$ is the
center of the ellipse. To prove this, it suffices to show that the center lies on the line
joining the midpoints of any two parallel chords. Let the ellipse be given by $(x/a)^2 + (y/b)^2 = 1$.
Suppose one chord is given by the line $y = mx + n_1$ and the parallel chord is given by
$y = mx + n_2$. Then the endpoints of the first chord are found by solving
$$(b^2 + m^2a^2)x^2 + 2mn_1a^2x + a^2(n_1^2 - b^2) = 0.$$
However, we can find the midpoint without solving this equation. Its x-coordinate is
$$x_1 = -\frac{mn_1a^2}{b^2 + m^2a^2}$$
(the average of the two solutions to a quadratic is $-B/2A$), so its y-coordinate is
$$y_1 = -\frac{m^2n_1a^2}{b^2 + m^2a^2} + n_1.$$
Similarly, the other midpoint will be
$$(x_2, y_2) = \left(-\frac{mn_2a^2}{b^2 + m^2a^2},\; -\frac{m^2n_2a^2}{b^2 + m^2a^2} + n_2\right).$$
To show that the line connecting the midpoints passes through the center (the origin), it suffices
to show that
$$\frac{y_1}{x_1} = \frac{y_2}{x_2}.$$
We compute
$$\frac{y_i}{x_i} = \frac{-m^2n_ia^2 + n_i(b^2 + m^2a^2)}{-mn_ia^2} = \frac{-m^2a^2 + (b^2 + m^2a^2)}{-ma^2} = -\frac{b^2}{ma^2},$$
which is independent of $n_i$, giving the desired equality.
To find the axes, we pick any point on the ellipse and draw a circle centered at O
through that point. The circle will intersect the ellipse at 4 points, which we connect to get a
rectangle whose sides are parallel to the desired axes [prove this!]. We can then draw
lines parallel to these sides through the origin.
With the axes in hand, pick a point P on the ellipse that is also on the minor axis. Draw a circle
centered at P with radius equal to the semimajor axis. The intersection points of this circle with
the major axis are the foci.
Figure 7: Constructing ellipse foci 1
The most difficult step in this proof can be viewed as showing that the locus of midpoints
of a family of (all possible) parallel chords of an ellipse is a diameter of the ellipse. Analogous
statements for the parabola and hyperbola can be proved similarly.
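The coordinate argument for Problem 1 can be checked numerically. The following is a minimal Python sketch, not part of the original module; the function names `chord_midpoint` and `foci_from_axes` and the sample values are illustrative assumptions. It computes a chord's midpoint from the coefficients of the quadratic (without solving it) and lets one confirm that midpoints of parallel chords lie on a line through the center with slope $-b^2/(ma^2)$.

```python
import math


def chord_midpoint(a, b, m, n):
    """Midpoint of the chord cut from (x/a)^2 + (y/b)^2 = 1 by the line y = m*x + n.

    Uses the fact that the average of the two roots of A x^2 + B x + C = 0 is -B/(2A).
    """
    A = b * b + m * m * a * a
    B = 2.0 * m * n * a * a
    C = a * a * (n * n - b * b)
    if B * B - 4.0 * A * C <= 0.0:
        raise ValueError("the line does not cut the ellipse in a chord")
    x_mid = -B / (2.0 * A)
    return x_mid, m * x_mid + n


def foci_from_axes(a, b):
    """Foci found as in the construction: a circle of radius a centered at the
    minor-axis endpoint (0, b) meets the major axis (the x-axis) at (+-c, 0)."""
    c = math.sqrt(a * a - b * b)
    return (-c, 0.0), (c, 0.0)


# Two parallel chords of slope 0.7 on the ellipse with a = 3, b = 2 (sample values).
x1, y1 = chord_midpoint(3.0, 2.0, 0.7, 1.0)
x2, y2 = chord_midpoint(3.0, 2.0, 0.7, -0.5)
# The two midpoints and the origin are collinear when this cross product vanishes.
print(x1 * y2 - x2 * y1)
```

The cross product printed at the end is zero up to roundoff, which is the numerical counterpart of the equality $y_1/x_1 = y_2/x_2$.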
Problem 2. Use a second Dandelin sphere to confirm that an ellipse has a second
focus/directrix pair.
Problem 3. Let E be an ellipse with center O, major axis of length 2a, and minor axis of
length 2b. Prove that if $P \in E$, then $b \le PO \le a$.
Problem 4. Find the equation of the ellipse with foci 23 ,0 and 23 ,0 and directrix the line
x 4.
Problem 5. Find the eccentricity and the directrices of the ellipse
2
2
x 2 y 3
1.
5 4
Problem 6. Consider the equations
4 x 2 5 y 2 16 x 10 y 39 0
4 x 2 5 y 2 16 x 10 y 39 0.
a. Show that one of the above equations defines an ellipse and that the other equation does
not define an ellipse.
b. For the equation which does define an ellipse, find the coordinates of the center, the
coordinates of the foci, the length of the major axis, and the length of the minor axis.
Problem 7. Find an equation for an ellipse with the same foci as the ellipse
$\frac{x^2}{25} + \frac{y^2}{16} = 1$ but
greater eccentricity. Graph both ellipses on the same set of axes.
Problem 8. Let E be an ellipse with center O. Let $P \in E$. Find an expression for PO in terms
of the angle between PO and the major axis of E.
Problem 9. Find a tangent line to the ellipse $x^2 + 4y^2 = 196$ which is perpendicular to the line
x 5.
Problem 10. Find a normal line to the ellipse $3x^2 + 2y^2 = 50$ which is perpendicular to the line
6x y 7 .
Problem 11. Prove that if PQ is a diameter of an ellipse, then the tangent lines to the ellipse at
the points P and Q are parallel.
Problem 12. Let PQ be a diameter of an ellipse.
a. Prove that the midpoints of the chords of the ellipse which are parallel to PQ all lie on
the same line.
b. Prove that the line containing the midpoints of the chords of the ellipse which are
parallel to PQ is parallel to the tangent lines to the ellipse at the points P and Q .
Problem 13. Is a plane section of a circular cylinder also an ellipse?
Perceptually, it seems unlikely that sections of cylinders and cones have the same shape.
The Dandelin sphere argument actually applies to the cylinder as well as the cone. Again, this is
an excellent opportunity for students to appreciate the advantages of the Deductive over the
Empirical proof scheme.
Problem 14. A mining company drills a mineshaft into the ground. The hole is a circular
cylinder, with its axis making an angle $\theta$ with the vertical. If the (elliptical) opening at the
surface must fit within the square plot of ground owned by the company, whose side is twice the
diameter of the cylinder, how does this restrict $\theta$?
If the axes of the ellipse are assumed parallel to the sides of the square, this is an easy
application of the squashing of the circle to an ellipse. However, a larger ellipse will fit
diagonally. The sides of the square are then tangent to the ellipse and make a 45 degree angle
with the axes. Determining where such lines are tangent is a nontrivial exercise.
If we tilt our heads by the angle $\theta$, we see the cylindrical mineshaft as if its axis were vertical,
and the ground as a cutting plane with elevation angle $\theta$. Then the elliptical opening can be
described by the equation $x^2\cos^2\theta + y^2 = r^2$, where r is the radius of the hole. Here the ellipse
is obtained by stretching, rather than squashing, a circle.
In the simplest approach to the problem, one might assume that the ellipse's axes are parallel to
the sides of the square, so the constraint is simply that the major axis $2r/\cos\theta$ not exceed the
side $4r$ of the square. This gives $\cos\theta \ge 1/2$, or $\theta \le \pi/3$.
However, a larger ellipse will fit in the square with the major axis along the diagonal. Rather
than inscribe a tilted ellipse inside the square, it may be easier to draw a tilted square around the
standard ellipse. The sides of the square, with slopes $\pm 1$, must then be tangent to the ellipse. To
achieve this we use implicit differentiation to locate the points where the ellipse has slope $\pm 1$,
finding $2x\cos^2\theta + 2yy' = 0$ with $y' = \pm 1$, so that $y = \mp x\cos^2\theta$. (Of course, other methods of
finding the tangent lines, discussed above, lead to the same result.) Substituting back into the
equation of the ellipse gives the points of tangency as
$$(x, y) = \frac{r}{\cos\theta\sqrt{1 + \cos^2\theta}}\,(\pm 1, \pm\cos^2\theta),$$
where all four sign combinations are possible. The x-intercept of the tangent line at the point
with positive coordinates can be computed as
$$\frac{r}{\cos\theta}\sqrt{1 + \cos^2\theta}.$$
The four tangent lines form a square whose diagonal is twice this value, so its side is this value
times $\sqrt{2}$. The problem constraint is then
$$\frac{r\sqrt{2}}{\cos\theta}\sqrt{1 + \cos^2\theta} \le 4r,$$
which can be solved to yield simply $\cos\theta \ge 1/\sqrt{7} \approx 0.37796$. This translates into
$\theta \le 67.79^\circ$, versus $60^\circ$ as found previously when the major axis was parallel to the sides.
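The two bounds in Problem 14 can be reproduced numerically. This is a small Python sketch under the assumptions used in the solution (opening $x^2\cos^2\theta + y^2 = r^2$, square of side $4r$); the symbol `theta` and the function names are illustrative choices, not taken from the original module.

```python
import math


def tilted_square_side(r, theta):
    """Side of the square whose sides have slopes +-1 and are tangent to the
    ellipse x^2 cos^2(theta) + y^2 = r^2: sqrt(2) times the x-intercept of a tangent."""
    c = math.cos(theta)
    x_intercept = r * math.sqrt(1.0 + c * c) / c
    return math.sqrt(2.0) * x_intercept


def max_tilt_axis_aligned():
    """Largest theta when the ellipse's axes are parallel to the sides: 2r/cos(theta) <= 4r."""
    return math.acos(0.5)


def max_tilt_diagonal():
    """Largest theta when the major axis lies along the diagonal: cos(theta) >= 1/sqrt(7)."""
    return math.acos(1.0 / math.sqrt(7.0))


print(math.degrees(max_tilt_axis_aligned()))          # axis-aligned bound, in degrees
print(math.degrees(max_tilt_diagonal()))              # diagonal bound, in degrees
print(tilted_square_side(1.0, max_tilt_diagonal()))   # side of the square at the bound, r = 1
```

At the diagonal bound the circumscribing square has side $4r$, which is exactly the side of the plot.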
Problem 15. A dog is tied up in a large field by a loop of rope running through its collar and
around two posts. If the posts are separated by a distance d, and the rope has length l, this
restricts the dog to an elliptical region in which the sum of focal distances is less than $l - d$. If a
straight river runs through the field along a given line, how close to this river can the dog get?
Consider a numerical example in which $l = 20$ and $d = 6$. Let the ellipse be centered at
the origin with major axis along the x-axis, while the river has equation $y = x - 16$. Then
$a = 7$, $c = 3$, and the equation of the ellipse is $(x^2/49) + (y^2/40) = 1$. As in Problem 9,
there are two tangent lines to the ellipse parallel to the river, namely $y = x \pm \sqrt{89}$. The minus
sign gives the tangent line nearer to the river, and the distance between this line and the river is
the closest the dog can get. The distance between parallel lines of slope m whose y-intercepts
differ by $\Delta b$ is $\Delta b/\sqrt{m^2 + 1}$, which in our case gives a distance of
$(16 - \sqrt{89})/\sqrt{2} \approx 4.6429$.
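The numerical example in Problem 15 can be reproduced with a short Python sketch. It assumes, as the solution does, that the ellipse is centered at the origin with its major axis on the x-axis and that the river is a line $y = mx + n$; the function name `closest_approach` and its parameters are illustrative, not from the original module. It uses the standard tangency condition that $y = mx + t$ touches $x^2/a^2 + y^2/b^2 = 1$ when $t^2 = a^2m^2 + b^2$.

```python
import math


def closest_approach(rope_length, post_separation, river_intercept, slope=1.0):
    """Distance from the river y = slope*x + river_intercept to the dog's elliptical region."""
    a = (rope_length - post_separation) / 2.0   # sum of focal distances is 2a
    c = post_separation / 2.0                   # posts (foci) at (+-c, 0)
    b_squared = a * a - c * c
    # Tangent lines parallel to the river are y = slope*x +- t.
    t = math.sqrt(a * a * slope * slope + b_squared)
    gap = abs(river_intercept) - t
    if gap <= 0.0:
        return 0.0  # the river touches or crosses the region
    # Parallel lines whose intercepts differ by gap are gap / sqrt(slope^2 + 1) apart.
    return gap / math.sqrt(slope * slope + 1.0)


print(closest_approach(20.0, 6.0, -16.0))
```

With $l = 20$, $d = 6$ and the river $y = x - 16$ this evaluates $(16 - \sqrt{89})/\sqrt{2}$.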
Students should see a need to reconcile these equations with their previous understanding of
a parabola, probably defined as the graph of a quadratic function $y = ax^2 + bx + c$. Only one of
the above equations has this form, but students should recognize that reversing the roles of x
and y allows all but the last to be put into this standard form. The last example, with a general
line as directrix, creates Necessity for the formula for the distance from a point (focus) to a line
(see line unit).
One pair of students in our classroom interpreted this problem geometrically rather than
algebraically, and they presented a geometric construction of points on the locus. Given the focus
F and directrix l , choose a point Q on l , draw FQ and mark its midpoint M . Then the
Figure 8: Geometric construction of points on parabola
( x0 / 2, k / 2) and P is at ( x, y ) where
$$\int_0^{\sqrt{2}-1} (1 - 2x - x^2)\,dx = (4\sqrt{2} - 5)/3 \approx 0.219.$$
Unit 4: The Hyperbola
Structural way of thinking. Try to generalize established properties and explore a parameter
space (if an ellipse has $0 < e < 1$, what happens for $e > 1$?).
Usefulness of completing the square.
Classroom Problems
Classroom Problem 1: What locus satisfies the focus/directrix definition with $e > 1$?
Classroom Problem 2: What are the asymptotes of a hyperbola? Does a parabola have
asymptotes?
Classroom Problem 3: (a) Find a condition on a, b, m, n so that the line $y = mx + n$ is tangent
to the hyperbola $(x^2/a^2) - (y^2/b^2) = 1$.
(b) Prove that the tangent line to a hyperbola bisects the angle between the focal radii.
(c) Find the angle between a hyperbola and an ellipse that share the same foci.
Pedagogical Considerations
$$(x - k)^2 + y^2 = e^2x^2.$$
We now wish to simplify this equation to a symmetric form like that of the ellipse, for
example making any symmetry of the locus obvious. The following computation is remarkable
in that no high school student would recognize it as simplification. The equation gets more
complicated at each step, and even the final form is about as long as the initial form. It counts as
simplification only because we possess the PGA Way of Thinking that translating the coordinate
axes and introducing new parameters for combinations of e and k will clarify the geometry of
the locus. Expanding, collecting terms, and completing the square in x leads to
$$(e^2 - 1)\left(x + \frac{k}{e^2 - 1}\right)^2 - y^2 = \frac{k^2e^2}{e^2 - 1}.$$
Introducing $a^2 = k^2e^2/(e^2 - 1)^2$ and $b^2 = k^2e^2/(e^2 - 1)$ (since both quantities are positive)
leads to
$$\frac{\left(x + \frac{k}{e^2 - 1}\right)^2}{a^2} - \frac{y^2}{b^2} = 1.$$
This tells us that there is a better choice of coordinates than the one we made: we should
transform $x \to x + \frac{k}{e^2 - 1}$ and $y \to y$. The arrows mean that each point (x, y) on the curve should
be moved right to the location given by the shifted coordinates. The final form of the equation of
the hyperbola is indeed $(x^2/a^2) - (y^2/b^2) = 1$. Its x-intercepts are $(\pm a, 0)$.
What can be said about the precise shape of the hyperbola whose equation we have
found? It is now clearly symmetric about both the x and y axes (hence the origin too) and
therefore its shape is determined by the portion in the first quadrant, $y = (b/a)\sqrt{x^2 - a^2}$.
Perhaps it is monotonically increasing, like
student suggested that larger x's give larger y's. How can we tell algebra this meaning? One
possibility is that if $x_1 < x_2$, then $y(x_1) < y(x_2)$; that is,
$(b/a)\sqrt{x_1^2 - a^2} < (b/a)\sqrt{x_2^2 - a^2}$. This is
true because the composition of the increasing functions $x^2 - a^2$ and $\sqrt{x}$ is increasing. Using the
derivative to test for an increasing function would be overkill; calculus is not needed here.
We need to locate the focus and directrix of the hyperbola given by our final equation, which
have been lost in translation (bad pun). Since the shift right was by $a/e$, the directrix is now the
line $x = a/e$. Applying the same shift to the focus locates it at $(ae, 0)$. Note that $a/e < a < ae$.
Because of the symmetry of the curve, there is a second focus at $(-ae, 0)$ and a corresponding
directrix at $x = -a/e$.
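The location of the focus and directrix after the shift can be sanity-checked numerically. The sketch below is illustrative only: it assumes the standard form $x^2/a^2 - y^2/b^2 = 1$ with $b^2 = a^2(e^2 - 1)$ (which follows from the definitions of a and b in terms of e and k), and the hyperbolic parametrization and the function name `focus_directrix_ratio` are choices made here, not part of the original module.

```python
import math


def focus_directrix_ratio(a, e, t):
    """For the point (a cosh t, b sinh t) on the right branch of x^2/a^2 - y^2/b^2 = 1,
    return (distance to the focus (a*e, 0)) / (distance to the directrix x = a/e)."""
    b = a * math.sqrt(e * e - 1.0)
    x, y = a * math.cosh(t), b * math.sinh(t)
    distance_to_focus = math.hypot(x - a * e, y)
    distance_to_directrix = abs(x - a / e)
    return distance_to_focus / distance_to_directrix


# The ratio should equal the eccentricity for every point on the branch.
for t in (0.0, 0.5, 2.0):
    print(focus_directrix_ratio(2.0, 1.5, t))
```

Each printed ratio agrees with the chosen eccentricity up to roundoff, consistent with the focus at $(ae, 0)$ and the directrix $x = a/e$.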
In our classroom we distributed a textbook derivation of the equation of the hyperbola to
our students for comparison with their solution to this problem. The textbook derivation employs
20/20 hindsight to situate the focus at $(ae, 0)$ and the directrix at $x = a/e$ from the beginning,
miraculously resulting in the simple standard form of the equation. We contrasted this with our
implementation of the Necessity Principle and the PGA WoT. Students should see for
themselves the effect of various placements of the focus and directrix against the coordinate axes
and decide which they consider simplest.
$$\frac{x_1}{y_1} = \frac{a}{b}\sqrt{1 + (b/y_1)^2}.$$
Thus, as $y_1 \to \infty$, we must have $x_1 \to \infty$ also, in such a way that $x_1/y_1$ has the limiting
value $a/b$. Therefore, the tangent line (1.4) has the limiting form $y = \pm(b/a)x$, and the
asymptotes are these lines through the origin (center). One can show also that the vertical (or
perpendicular) distance from the hyperbola to an asymptote goes to zero in the limit.
To see the three-dimensional origin of the asymptotes, take a plane cutting a cone in the
hyperbola, and a parallel plane through the vertex cutting it in a pair of lines. The hyperbola
meets every generator of the cone except for these two lines. They should be related to the
asymptotes, but unfortunately do not lie in the plane of the hyperbola. Projecting them into this
plane, and translating them to pass through the center, indeed gives the asymptotes.
A parallel treatment of the parabola shows that it has no asymptotes. (It also has no center.) The
tangent line to the parabola $y^2 = 2kx$ at $(x_1, y_1)$ has the equation $yy_1 = k(x + x_1)$, which can also
be written
$$y = \frac{k}{y_1}x + \frac{y_1}{2}.$$
In the limit $y_1 \to \infty$, the slope goes to zero and the intercept to infinity; the tangent line has no
limiting position.
Classroom Problem 3: (a) Find a condition on a, b, m, n so that the line $y = mx + n$ is tangent to
the hyperbola $(x^2/a^2) - (y^2/b^2) = 1$.
(b) Prove that the tangent line to a hyperbola bisects the angle between the focal radii.
(c) Find the angle between a hyperbola and an ellipse that share the same foci.
Figure 10: Ellipse and hyperbola with the same foci
x 16 / 5 .
Problem 2: Find the center, eccentricity, foci, directrices, and asymptotes of the hyperbola
x 2 2 x 4 y 2 3 0.
x2
a2
y2
b2
and $Q(x_2, y_2)$. Let $R(x, y)$ be any other point on the line l. Without loss of generality, we may
assume that $x_1 < x_2 < x$. Let S and T be the points with coordinates $(x_2, y_1)$ and $(x, y_2)$,
respectively. Since RT is parallel to the y axis and QT is parallel to the x axis, $\angle QTR$ is a
right angle. Similarly, $\angle PSQ$ is a right angle. Moreover, $\angle QRT$ and $\angle PQS$ are corresponding
angles formed by the parallel lines RT and QS and the transversal l and are therefore
congruent. Thus $\triangle QRT$ is similar to $\triangle PQS$ by AA similarity. It follows that
$$\frac{QS}{PS} = \frac{RT}{QT},$$
that is:
$$\frac{y_2 - y_1}{x_2 - x_1} = \frac{y - y_2}{x - x_2}.$$
Naming this common ratio m and solving for y, we find that $y = mx + b$, where $b = y_2 - mx_2$.
Students should be able to explain why generalizing this argument implies that any point ( x, y )
on the line satisfies the equation y mx b . They should also realize the reason for fixing two
points that define the line, which we denote by particular coordinates, and considering any other
point on the line, which we denote with general coordinates ( x, y ) .
A second approach is to suppose that the given line l is the perpendicular bisector of
some segment PQ . If the coordinates of P and Q are ( x1 , y1 ) and ( x2 , y2 ) , respectively, and
$(x, y)$ is a point on l, then since $(x, y)$ is equidistant from P and Q, we have
$(x - x_1)^2 + (y - y_1)^2 = (x - x_2)^2 + (y - y_2)^2$, or equivalently, $Ax + By + C = 0$, where
means R is between P and Q) is to take the concept of distance as primary and define a line by
requiring that distances add. That is, given points P and Q, we define the line PQ as the union of 3 sets:
1. $\{R : R \sim Q \sim P \;\&\; RQ + QP = RP\}$
2. $\{R : Q \sim R \sim P \;\&\; RQ + RP = QP\}$
3. $\{R : Q \sim P \sim R \;\&\; QP + PR = QR\}$
The necessity for all 3 of these sets can be established through overarching problem 8. If one
blindly applies the distance formula, the calculation is lengthy. However, we can choose (say)
the x-axis to be along the direction of PQ , in which case it is clear that R has the desired
properties iff it has y-coordinate of 0. From an expert perspective, this is really defining a line
through defining an axis. That is, we define a primary line as the set of all ordered pairs (x,0), or
y=0. From this, we can apply translation and rotation to obtain the general form of any line.
Classroom Problem 2: Show that if A and B are both nonzero, then the equation
$Ax + By + C = 0$ defines a line.
QS
PS
and
follows from the equations $Ax_i + By_i + C = 0$ for $i = 1, 2, 3$. The conclusion then follows
since $\angle PQS$ and $\angle RQT$ are complementary and $\angle SQT$ is a right angle by construction. Note
that the cases of horizontal and vertical lines ($Ax + By + C = 0$ where exactly one of the
coefficients A and B is zero) can then be treated as a composition of a rotation and a
translation.
For the second approach, consider the segment whose endpoints have coordinates $(0, k)$
and $(A, k + B)$ where $k = -\frac{A^2 + B^2 + 2C}{2B}$. Then $(x, y)$ is on the perpendicular bisector of
this segment if and only if $x^2 + (y - k)^2 = (x - A)^2 + (y - k - B)^2$, or equivalently, if and only if
$Ax + By + C = 0$.
The third approach relies on the converse of the Betweenness Theorem (namely, if P , Q
and R are distinct points such that PQ + QR = PR , then Q is between P and R ). If we
assume that $(x_i, y_i)$ satisfy the equation $Ax + By + C = 0$, for $i = 1, 2, 3$ and that the points satisfy
the hypotheses of the theorem, then the result follows from a straightforward calculation.
Supplementary and Practice Problems:
Problem 1: In school you learned that a line in the plane has the equation $y = mx + b$. What
does this mean? Specifically, explain the meaning of the following two assertions:
a. A line in the x-y plane has the equation $y = mx + b$.
b. The equation $y = mx + b$ represents a line.
Prove these two assertions.
Problem 2: Given two points in the plane, $(a, b)$ and $(c, d)$, prove that the equation of the line
that goes through these points is
$$\frac{y - b}{d - b} = \frac{x - a}{c - a}.$$
Problem 3: Show that the distance from the point $(x_0, y_0)$ to the line $Ax + By + C = 0$ is given
by
$$\frac{|Ax_0 + By_0 + C|}{\sqrt{A^2 + B^2}}.$$
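For readers who want to experiment with the formula in Problem 3 before proving it, here is a minimal Python sketch. The function name `point_line_distance` and the sample values are illustrative assumptions, not part of the original module.

```python
import math


def point_line_distance(x0, y0, A, B, C):
    """Distance from (x0, y0) to the line A*x + B*y + C = 0."""
    norm = math.hypot(A, B)
    if norm == 0.0:
        raise ValueError("A and B cannot both be zero")
    return abs(A * x0 + B * y0 + C) / norm


# Distance from the origin to the line 3x + 4y - 10 = 0.
print(point_line_distance(0.0, 0.0, 3.0, 4.0, -10.0))
```

The same function gives the distance between two parallel lines when it is applied to any point of one line and the equation of the other.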
.
0 2
0 1 / 2
The quadratic form appearing in the equation of the ellipse is $X^tAX$, where
$$A = (T^{-1})^tT^{-1} = \begin{pmatrix} 1 & -1 \\ -1 & 5/4 \end{pmatrix}.$$
A student who suspects that the curve is an ellipse should try to locate its vertices, the points of
maximum and minimum distance from the center (which is at the origin due to the symmetry of
the equation about the origin). One plausible way to do this is by using Lagrange multipliers to
extremize $x^2 + y^2$ subject to the equation as a constraint. This leads quickly to the equations
$$x - y = \lambda x, \qquad -x + (5/4)y = \lambda y, \qquad (1.5)$$
which might be recognized as precisely the eigenvalue problem for the matrix A (not T). The
eigenvalues are $(9 \pm \sqrt{65})/8$, approximately 2.133 and 0.117. Taking the dot product of
the eigenvalue equations (1.5) with $(x, y)$ and using the equation of the curve leads to
Classroom Problem 2: Intrigued by the idea of obtaining an ellipse by squashing a circle,
a student investigates the effect of other simple (linear) transformations on the circle.
Beginning with the circle $x^2 + y^2 = r^2$, she creates a new curve by plotting the new
points $(x', y') = (x + 2y, 2y)$ for each point $(x, y)$ on the circle. What curve does she
obtain? What equation could she have graphed to obtain the new curve directly?
$$\left(\frac{x}{2.92}\right)^2 + \left(\frac{y}{0.685}\right)^2 = r^2,$$
with its axes along the eigenvectors of A . The major axis would be along the unit vector
(0.750, 0.662) , which makes an angle of about 0.723 radians with the x -axis. To verify their
conjecture, they could use the formulas for rotation of axes at this point. Alternatively, they
could locate the foci of the supposed ellipse along its axes and write down the condition for the
sum of the focal distances to be the appropriate constant, namely the major axis. This would
produce the equation of the curve once again, up to roundoff errors.
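The numerical values quoted in this discussion can be reproduced with a few lines of Python. This is only a sketch: it assumes the matrix is $A = \begin{pmatrix} 1 & -1 \\ -1 & 5/4 \end{pmatrix}$ (the off-diagonal sign is inferred from the quoted eigenvector direction), and the helper name `symmetric_2x2_eigenvalues` is introduced here for illustration.

```python
import math


def symmetric_2x2_eigenvalues(p, q, s):
    """Eigenvalues of the symmetric matrix [[p, q], [q, s]], larger one first."""
    mean = (p + s) / 2.0
    radius = math.hypot((p - s) / 2.0, q)
    return mean + radius, mean - radius


# Assumed matrix A = [[1, -1], [-1, 5/4]].
lam_large, lam_small = symmetric_2x2_eigenvalues(1.0, -1.0, 1.25)

# For X^t A X = r^2 the semi-axes are r / sqrt(lambda); take r = 1 here.
semi_major = 1.0 / math.sqrt(lam_small)
semi_minor = 1.0 / math.sqrt(lam_large)

# Eigenvector for lam_small from the first row of (A - lambda I) v = 0:
# (1 - lambda) x - y = 0, so v = (1, 1 - lambda).
major_direction = (1.0, 1.0 - lam_small)
major_angle = math.atan2(major_direction[1], major_direction[0])

print(lam_large, lam_small)
print(semi_major, semi_minor)
print(major_angle)
```

The smaller eigenvalue corresponds to the major axis, because a small coefficient in the quadratic form lets the curve extend farther in that direction.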
Classroom Problem 3: What curve is represented by a general equation
$Ax^2 + Bxy + Cy^2 + F = 0$? By a more general equation
$Ax^2 + Bxy + Cy^2 + Dx + Ey + F = 0$? Describe the curves geometrically as
completely as possible. What sets of coefficients lead to the same curve, up
to a rigid motion? How do they determine its geometric parameters?
We briefly discuss the simpler equation first. Once again, students who are familiar with
diagonalizing symmetric matrices and their associated quadratic forms will have an easier time
than those who are not. Those without this background may again start by locating the vertices
of the (presumed) conic, either as the points of maximum/minimum distance from the center (the
origin, by symmetry) or the points where the position vector is normal to the curve. The position
vectors of these points are eigenvectors of the symmetric matrix
$$M = \begin{pmatrix} A & B/2 \\ B/2 & C \end{pmatrix}.$$
The quadratic equation for the eigenvalues is
$$\lambda^2 - (A + C)\lambda - \frac{B^2 - 4AC}{4} = 0.$$
From this we learn that the sum of the eigenvalues, $\lambda_1 + \lambda_2$, is $A + C$ (the trace of M); the
eigenvalues themselves are
$$\lambda_{1,2} = \left[A + C \pm \sqrt{(A - C)^2 + B^2}\right]/2.$$
In particular, the eigenvalues are always real. The simpler equation $\lambda_1x^2 + \lambda_2y^2 + F = 0$
definitely represents a conic, and would lead to a matrix having the same trace and determinant.
However, at this point students really must use the formulas for rotation of axes and explicitly
transform the original equation into this simpler form. The type of conic depends on whether the
eigenvalues have the same or opposite sign; it is a hyperbola when $B^2 - 4AC > 0$ and an ellipse
when $B^2 - 4AC < 0$. The semiaxes of the ellipse are $\sqrt{|F/\lambda_i|}$. In the remaining case
$B^2 - 4AC = 0$, one eigenvalue vanishes and the corresponding semiaxis is undefined, so
something is wrong. In fact, the original quadratic form is then a perfect square, and the conic
is either empty or degenerate: a pair of lines. An equation of the given form cannot represent a
parabola, which is evident from its symmetry about the origin.
Students should attempt to reduce the more general equation to the less general one by
translation of coordinate axes to remove the linear terms. Replacing x by $x - a$, and y by
$y - b$, the linear terms cancel out if the linear equations
$$2Aa + Bb = D, \qquad Ba + 2Cb = E$$
are satisfied. That is, the plan succeeds unless $B^2 - 4AC = 0$ (the determinant of this linear
system is $4AC - B^2$). In that case the locus is not
symmetric about any origin. It is still (generally) possible to remove the xy term by rotating one
axis to lie along the eigenvector with the nonvanishing eigenvalue, and to remove the term linear
in either x or y by a translation. The resulting locus, if nondegenerate, is a parabola.
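The discriminant test and the removal of the linear terms can be sketched in Python as follows. The function names `classify_conic` and `conic_center` are illustrative; the classification reports only the type suggested by the sign of $B^2 - 4AC$ and does not detect empty or degenerate loci. The center is computed as the point where the linear terms vanish, that is, as the solution of $2Ax_0 + By_0 + D = 0$, $Bx_0 + 2Cy_0 + E = 0$.

```python
def classify_conic(A, B, C):
    """Type of A x^2 + B xy + C y^2 + D x + E y + F = 0 from the sign of B^2 - 4AC.
    Degenerate and empty cases are not distinguished in this sketch."""
    discriminant = B * B - 4 * A * C
    if discriminant > 0:
        return "hyperbola type"
    if discriminant < 0:
        return "ellipse type"
    return "parabola type"


def conic_center(A, B, C, D, E):
    """Center (x0, y0): solve 2A x0 + B y0 = -D and B x0 + 2C y0 = -E by Cramer's rule."""
    det = 4 * A * C - B * B
    if det == 0:
        raise ValueError("B^2 - 4AC = 0: the locus has no center")
    x0 = (B * E - 2 * C * D) / det
    y0 = (B * D - 2 * A * E) / det
    return x0, y0


# Sample equation x^2 + 4y^2 - 2x - 16y + 13 = 0, i.e. (x - 1)^2 + 4(y - 2)^2 = 4.
print(classify_conic(1, 0, 4))
print(conic_center(1, 0, 4, -2, -16))
```

Translating the origin to the computed center leaves an equation of the simpler form $Ax^2 + Bxy + Cy^2 + F' = 0$, to which the eigenvalue discussion applies.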
Supplementary and Practice Problems:
Problem 1. Identify the type of conic section defined by the given equation (none of the conics
are degenerate):
a. x 2 2 3 xy 3 y 2 2 3 x 2 y 0
b. x 2 4 xy y 2 1
c. 73x 2 72 xy 52 y 2 30 x 40 y 75 0
d. Use rotation of axes to graph the conic sections in parts (a)-(c).
$$y = \pm x\sqrt{\frac{a^2 - m^2}{m^2}}.$$
Together with $z = ax$ these are equations of two lines through the origin, two generators of the
cone. One possible geometric explanation is as follows. The plane will meet a circular
cross-section of the cone in two points, and therefore it contains the two generators through these
points and no point of any other generator. Visually, as a rotating generator sweeps out the cone,
it passes from one side of the cutting plane to the other and back again. Therefore there are two
moments when it lies in the plane, by continuity.
Considering planes not containing the vertex, visual intuition may suggest that the intersections
are curves which are either closed and oval, or open and roughly parabolic. A direct algebraic
approach is challenging. For example, the plane $z = ax + 1$ leads to
$(m^2 - a^2)x^2 + m^2y^2 - 2ax - 1 = 0$. Students may observe that the relative size of m and a
determines the sign of the $x^2$ term, and that the case $m = a$ does look parabolic. However, this is
the equation for the projection of the intersection curve into the xy plane, not its equation as a
plane curve in the cutting plane, so the interpretation is problematic. Eventually, students may
relate the coordinate $x'$ in the cutting plane to the spatial Cartesian coordinate x by
$x = x'\cos\alpha$, where $\alpha$ is the cutting angle; for now they may continue intuitively. Imagine
that an initially horizontal cutting plane slowly rotates to a position whose cutting angle is smaller
than the cone's elevation angle. The circular intersection deforms into an oval shape and clearly
remains closed. In particular, the intersection curve contains a single point on each generator.
The deformation continues until the cutting angle equals the elevation angle, when the cutting
plane becomes parallel to a single generator of the cone, which it no longer meets: that point of
intersection moved off to infinity as the oval stretched. The intersection curve is now open and
unbounded. Continuing to a cutting angle larger than the elevation angle, the
plane is parallel to two generators (the same two we saw when the plane contained the vertex)
and also meets both nappes of the cone in open and unbounded curves. The three qualitatively
distinct intersection curves are called ellipses (from ellipsis, meaning deficiency or falling short),
parabolas (placing beside or comparable to), and hyperbolas (throwing beyond). (The names, due
to Apollonius, do not arise directly from the picture of slicing a cone.)
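The passage from the projected equation to the true curve in the cutting plane can be explored numerically. The sketch below is illustrative and rests on stated assumptions: the cone is taken to be $z^2 = m^2(x^2 + y^2)$, the plane is $z = ax + 1$ with $0 \le a < m$ (the closed, elliptical case), and the in-plane coordinate u satisfies $x = u\cos\alpha$ with $\tan\alpha = a$. The function names are hypothetical. It compares the eccentricity read off from the in-plane equation with the ratio of sines of the cutting and elevation angles obtained from the Dandelin sphere argument.

```python
import math


def section_eccentricity(m, a):
    """Eccentricity of the section of z^2 = m^2 (x^2 + y^2) by z = a*x + 1, for 0 <= a < m.

    Substituting x = u*cos(alpha), y = v into (m^2 - a^2) x^2 + m^2 y^2 - 2 a x - 1 = 0
    gives P u^2 + Q v^2 + (linear terms) = 0 with the coefficients below.
    """
    cos_squared = 1.0 / (1.0 + a * a)      # cos^2(alpha) when tan(alpha) = a
    P = (m * m - a * a) * cos_squared      # coefficient of u^2
    Q = m * m                              # coefficient of v^2
    # Semi-axes are proportional to 1/sqrt(P) and 1/sqrt(Q), with P <= Q.
    return math.sqrt(1.0 - P / Q)


def dandelin_eccentricity(m, a):
    """sin(cutting angle) / sin(elevation angle), with tan(cutting) = a, tan(elevation) = m."""
    return math.sin(math.atan(a)) / math.sin(math.atan(m))


print(section_eccentricity(2.0, 0.5), dandelin_eccentricity(2.0, 0.5))
```

The two values agree, and a horizontal plane ($a = 0$) gives eccentricity zero, the circle.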
Problem 2 A sphere is illuminated by a point light source some distance away. Explain why its
shadow on a plane surface will be a conic section.
Draw the line through the light source and the center of the sphere. This will be the axis of a
cone. Rotate this line about the light source until it is tangent to the sphere. It is now a generator
of the cone. Rotating the generator about the axis produces the cone, which is tangent to the
sphere along a circle. The elevation angle $\beta$ is given by $\cos\beta = R/D$, where R is the radius of
the sphere and D is the distance from the light source to the center of the sphere. Now consider
the sphere's shadow on a plane placed behind it. Light rays from the source reach this plane if
they are external to the cone, and do not if internal. The boundary of the shadow is therefore the
intersection of the cone with the plane. Only the single nappe of the cone tangent to the sphere is
relevant; only one branch of a hyperbolic shadow appears. As mentioned, if the sphere is actually
in contact with (resting on) the plane, we call the point of contact a focus of the conic section.
Every (nondegenerate) conic section arises as the shadow of a sphere in contact with the cutting
plane, either by an intuitive argument in which one blows up a balloon enclosed between the
cone and plane until it is tangent to both, or by the fact, applied to a plane cross-section of the
diagram, that every triangle has an inscribed circle (applicable to the ellipse case) as well as three
excircles (in the other cases). It may not be obvious at this stage that the position of the focus
depends only on the conic section and not on the cone and plane used to obtain it. This is similar
in spirit to Problem 1 of the circle unit, whether a circle determines its center, and may necessitate a
characterization of a conic section as a locus in its plane.
Problem 3 We define the distance from a point to a line as the perpendicular distance. How
might we define the distance from a point to an ellipse (parabola, hyperbola)? Given a point not
on an ellipse (parabola, hyperbola), construct the point on the ellipse that realizes this distance.
Problem 4 Given two points A and B in the plane, find the locus of all points in the plane that are
twice as far away from A as from B. How would you generalize this problem?
Problem 5 Prove algebraically that, in any triangle, the following lines meet at a point.
a. The three medians (lines from a vertex to the midpoint of the opposite side)
b. The three altitudes (lines through a vertex perpendicular to the opposite side, extended
if necessary)
c. The three angle bisectors
Problem 6 Show that the line connecting the topmost point to the rightmost point of the circle
given by $x^2 + y^2 = a^2 + b^2$ is tangent to the ellipse $\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1$.
Problem 7: Fix two points A and B in the plane. What is the locus of all points P such that
$AP + BP = AB$?
Students may recognize this as a degenerate case of the ellipse: the sum of distances is
fixed by AB. If students are not careful with the distances, they may incorrectly answer that the
locus is a line, rather than merely the segment. This provides an opportunity to challenge their
intuitive assumption that an equation derived from a locus is equivalent to that locus.
Problem 8: A segment AB with endpoints on the sides of a right angle moves so that the
distance AB remains fixed. What is the locus of all the midpoints of AB ?
References
There are innumerable sources for information on conic sections. There follow a few which are
less well-known, or take an unusual approach.
W. H. Drew, A Geometrical Treatise on the Conic Sections, Macmillan 1875 .
J. K. Johnstone and C.-K. Shene, Computing the intersection of a plane and a natural quadric,
Computers and Graphics 16(2)(1992)179 -- 186 .