Astro Book

Download as pdf or txt
Download as pdf or txt
You are on page 1of 233

Introduction to Astronautics

Mechanics of Solar System Flight

1.2

2.0 1.7

3.0

0.8 1.7 2.0 0.4 1.51 3.0 5.0 0.0

1.495 1.5

1.51 1.594

1.7

1.594 1.51 0.4 1.5 1.495 0.8 1.7 2.0 1.2 0.8 0.4 0.0 0.4 0.8 1.2 1.6 3.0 2.0 1.7 2.0

1.2

by

David F. Chichka
June 23, 2010

The cover illustration is a sketch of the curves of zero velocity in the Earth-moon three-body problem.

Contents

1 Introduction 1.1 Newton . . . . . . . . . . . . 1.2 Reference Frames . . . . . . . 1.3 A Few Notational Comments 1.3.1 Vectors . . . . . . . . . 1.3.2 Astronomical symbols

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . .

1 2 2 3 3 3 5 5 7 7 8 9 10 11 11 12 12 13 14 17 17 18 20 20 20 22 22 23 24 25

2 The Two-Body Problem 2.1 The General Equation of Motion . . . 2.2 Solution of the Equation of Motion . . 2.2.1 Specic Energy . . . . . . . . . 2.2.2 Angular Momentum . . . . . . 2.2.3 The Laplace Vector . . . . . . . 2.2.4 The Orbit Equation . . . . . . . 2.3 Relative and Inertial Motion . . . . . . 2.3.1 Potential Energy . . . . . . . . 2.3.2 Kinetic Energy . . . . . . . . . 2.4 Properties of Conic Sections . . . . . . 2.4.1 Terminology . . . . . . . . . . . 2.5 Elliptic Orbits . . . . . . . . . . . . . . 2.6 Hyperbolic Orbits . . . . . . . . . . . . 2.6.1 Geometry of the Hyperbola . . 2.6.2 Hyperbolic Orbits . . . . . . . . 2.7 Special Cases and Degenerate Conics . 2.7.1 Circular Orbits . . . . . . . . . 2.7.2 The Parabolic Orbit and Escape 2.7.3 Degenerate Conics . . . . . . . 2.8 Keplers Laws . . . . . . . . . . . . . . 2.8.1 Keplers Second Law . . . . . . 2.8.2 Keplers Third Law . . . . . . . 2.8.3 Measurement of Planetary Mass

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Speed . . . . . . . . . . . . . . . . . . . .

iii

iv

Contents 2.9 Canonical Units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.10 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 28 31 31 34 37 37 39 40 41 42 44 47 48 50 50 51 52 55 55 55 57 57 58 58 59 59 60 61 62 63 64 65 65 66 66

3 Position in Orbit and Time in Transit 3.1 Keplers Equation . . . . . . . . . . . . . . 3.2 Lamberts Problem . . . . . . . . . . . . . 3.3 Hyperbolic Trajectories . . . . . . . . . . . 3.3.1 An Analogue of Keplers Equation . 3.3.2 Time in Transit . . . . . . . . . . . 3.4 Parabolic Trajectories . . . . . . . . . . . 3.5 Implementation Considerations . . . . . . 3.5.1 Finding the Zeroes of Functions . . 3.5.2 Solving Keplers Equation . . . . . 3.5.3 Time of Flight Problems . . . . . . 3.5.4 Finding the Orbital Characteristics 3.6 Choosing the Right Solution . . . . . . . . 3.6.1 Parabola and Hyperbola . . . . . . 3.6.2 Elliptic transfer . . . . . . . . . . . 3.7 Problems . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

4 Orbital Elements Putting the Orbit in its Place 4.1 Coordinate Frames . . . . . . . . . . . . . . . . . . 4.1.1 Heliocentric Orbits . . . . . . . . . . . . . . 4.1.2 Geocentric Orbits . . . . . . . . . . . . . . . 4.2 The Classical Orbital Elements . . . . . . . . . . . 4.2.1 Size and Shape of the Orbit . . . . . . . . . 4.2.2 The Angular Elements . . . . . . . . . . . . 4.2.3 Position on the Orbit . . . . . . . . . . . . . 4.2.4 Summary . . . . . . . . . . . . . . . . . . . 4.3 Special Cases and Singularities . . . . . . . . . . . . 4.4 The Orbital Elements from Position and Velocity . 4.5 From Elements to Position and Velocity . . . . . . 4.5.1 The Perifocal Coordinate Frame . . . . . . . 4.5.2 Perifocal to Inertial Transformation . . . . . 4.6 Alternate Element Sets . . . . . . . . . . . . . . . . 4.6.1 Equinoctial Elements . . . . . . . . . . . . . 4.6.2 Delauney Elements . . . . . . . . . . . . . . 4.7 Problems . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

Contents 5 Impulsive Orbital Maneuvers 5.1 Initial Considerations . . . . . . . . . . . . 5.2 Single-Impulse Changes . . . . . . . . . . . 5.2.1 Change of Inclination . . . . . . . . 5.2.2 Changes in Orbital Elements . . . . 5.2.3 A Note on Coordinate Frames . . . 5.3 Transfer Between Circular Orbits . . . . . 5.3.1 Hohmann Transfer . . . . . . . . . 5.3.2 Three-Impulse Transfer . . . . . . . 5.3.3 Transfers Including Plane Change . 5.4 More General Transfer Problems . . . . . 5.4.1 Transfer to Coplanar Elliptic Orbit 5.4.2 Transfer from Ellipse to Ellipse . . 5.5 Changes Due to Small Impulse . . . . . . . 5.6 Problems . . . . . . . . . . . . . . . . . . .

v 69 69 70 71 72 73 74 75 77 78 79 80 83 84 85 89 89 90 91 92 94 94 95 97 98 100 100 102 104 104 106 108 110 110 113 115 119 120 122 126

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

6 Interplanetary Trajectories 6.1 Introduction Patched Conics . . . . . . . . . . . . . . . . . 6.2 The Sphere of Inuence . . . . . . . . . . . . . . . . . . . . . 6.2.1 Equations of Motion . . . . . . . . . . . . . . . . . . 6.2.2 The Sphere of Inuence . . . . . . . . . . . . . . . . 6.3 The Heliocentric Phase . . . . . . . . . . . . . . . . . . . . . 6.3.1 Hohmann Transfers . . . . . . . . . . . . . . . . . . . 6.3.2 Non-Hohmann Transfers . . . . . . . . . . . . . . . . 6.3.3 Phase Angle at Departure . . . . . . . . . . . . . . . 6.3.4 The Synodic Period and Mission Duration . . . . . . 6.4 Departure . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.4.1 Patch Conditions . . . . . . . . . . . . . . . . . . . . 6.4.2 Departure Hyperbola . . . . . . . . . . . . . . . . . . 6.5 Planetary Arrival and Capture . . . . . . . . . . . . . . . . . 6.5.1 Equating the Heliocentric and Planetary Trajectories 6.5.2 Velocity Oset and the Impact Parameter . . . . . . 6.5.3 Planetary Capture . . . . . . . . . . . . . . . . . . . 6.6 Gravity Assist Trajectories . . . . . . . . . . . . . . . . . . . 6.6.1 Planar Fly-bys . . . . . . . . . . . . . . . . . . . . . 6.6.2 Fly-bys for Plane Change The Ulysses Mission . . . 6.7 A Fairly Involved Example . . . . . . . . . . . . . . . . . . . 6.8 Planetary Ephemeris . . . . . . . . . . . . . . . . . . . . . . 6.8.1 Julian Dates and Ephemeris Time . . . . . . . . . . . 6.8.2 Mean Planetary Elements . . . . . . . . . . . . . . . 6.9 Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . .

vi 7 Restricted Three-body Problem 7.1 Formulation . . . . . . . . . . . . 7.2 Equilibrium Points . . . . . . . . 7.3 Stability of the Lagrange Points . 7.4 Jacobis Integral . . . . . . . . . . 7.4.1 A Constant of the Motion 7.4.2 Surfaces of Zero Velocity .

Contents 131 131 132 134 137 137 139 143 143 143 147 148 151 152 152 153 153 154 156 157 160 161 163 163 164 167 167 169 169 170 172 174 174 175 177 178 180 180 182 184

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

8 Gravitational Presence of the Earth 8.1 Gravitational Potentials . . . . . . . . . . . 8.1.1 Conservative Force Fields . . . . . . 8.1.2 Gravitational elds . . . . . . . . . . 8.2 Equations of Motion using Potentials . . . . 8.3 Gravitational Potential of the Earth . . . . . 8.3.1 Legendre Polynomials . . . . . . . . 8.3.2 Expression of the Gravitational Field 9 Satellite Clusters and Relative Motion 9.1 Rendezvous in Circular Orbit . . . . . . . 9.1.1 Equations of Relative Motion . . . 9.1.2 Specialization to Circular Orbits . . 9.1.3 Natural Motion . . . . . . . . . . . 9.1.4 The Transition Matrix . . . . . . . 9.1.5 Impulsive Rendezvous . . . . . . . 9.2 Natural Motion and Orbital Elements . . . 9.2.1 Expansions of the Orbital Elements 9.2.2 An Order Analysis . . . . . . . . . . . . . . . . . .

10 Rocket Performance 10.1 Thrust of a Rocket . . . . . . . . . . . . . . . 10.1.1 The Rocket Equation . . . . . . . . . . 10.2 The Sounding Rocket . . . . . . . . . . . . . . 10.2.1 Powered Boost . . . . . . . . . . . . . 10.2.2 Coast Phase . . . . . . . . . . . . . . . 10.3 Aerodynamic Eects on Rocket Performance . 10.3.1 Pressure Thrust . . . . . . . . . . . . . 10.3.2 Drag . . . . . . . . . . . . . . . . . . . 10.4 Specic Impulse . . . . . . . . . . . . . . . . . 10.5 Single Stage Performance and Mass Fractions 10.6 Multi-Stage Rockets . . . . . . . . . . . . . . 10.6.1 Two-Stage Rockets . . . . . . . . . . . 10.6.2 The n-Stage Rocket . . . . . . . . . . . 10.7 Problems . . . . . . . . . . . . . . . . . . . . .

Contents 11 Solar Sailing A Sidereal Time and Various Years A.1 Solar vs. Sidereal Days . . . . . . . . . . . . . . . . . . . . . . . . . . A.1.1 The Mean Solar Day . . . . . . . . . . . . . . . . . . . . . . . A.2 Tropical and Sidereal Years . . . . . . . . . . . . . . . . . . . . . . . B Review of Vectors and Kinematics B.1 Vector Arithmetic . . . . . . . . . . . . . . B.1.1 Notation . . . . . . . . . . . . . . . B.1.2 Addition and Scalar Multiplication B.1.3 Magnitude . . . . . . . . . . . . . . B.1.4 Expression through Unit Vectors . B.1.5 Dot and Cross Products . . . . . . B.2 Coordinate Transformations . . . . . . . . B.2.1 Single-axis Rotations . . . . . . . . B.2.2 Euler Rotations . . . . . . . . . . . B.3 Kinematics . . . . . . . . . . . . . . . . . B.3.1 Position and Velocity . . . . . . . . B.3.2 Motion in a Rotating Frame . . . . B.3.3 Direction Cosines . . . . . . . . . . B.3.4 Inversion . . . . . . . . . . . . . . . C Series Expansions C.1 Taylor Expansions . . . . . . C.2 Lagrange Expansions . . . . . C.3 Some Trigonometric Identities C.3.1 Identities . . . . . . . C.3.2 Expansions . . . . . .

vii 187 191 191 192 192 193 193 193 194 194 194 195 196 196 198 198 198 199 201 202 203 203 204 205 205 205 207 207 208 208 209 211 212 215 216 217

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

D A Brief Review of System Stability D.1 Problem Formulation and Equilibrium Points D.2 Motion Near an Equilibrium Point . . . . . . D.2.1 Linearizing the System . . . . . . . . . D.2.2 Solution of the Linearized System . . . D.2.3 Motion Near the Equilibrium Point . . D.2.4 Stability . . . . . . . . . . . . . . . . . D.3 Second-Derivative Systems . . . . . . . . . . . D.3.1 Motion near the Equilibrium Point . . D.3.2 Stability of Non-dissipative Systems . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

viii E Tables E.1 Basic Solar System Units . . . E.2 Earth Physical Data . . . . . E.3 Planetary Data . . . . . . . . E.3.1 Orbital Characteristics E.3.2 Physical Data . . . . . E.4 The Moon . . . . . . . . . . .

Contents 219 220 220 220 220 221 221

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

List of Figures

2.1 2.2 2.3 2.4 2.5 2.6 3.1 3.2 3.3 4.1 4.2 4.3 5.1 5.2 5.3 5.4 5.5 5.6 5.7 5.8 6.1 6.2 6.3 6.4 6.5 6.6 6.7 6.8

A two-body system. . . . . . . . . Variable denitions for the ellipse. Flight path angle. . . . . . . . . . Terminology of the hyperbola. . . Area swept in time dt. . . . . . . Relating r, v, and . . . . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

5 14 15 17 23 24 32 34 35 56 58 63 71 72 74 75 80 83 83 86 91 92 96 98 101 102 104 105

Deriving Keplers equation. . . . . . . . . . . . . . . . . . . . . . . . The elliptic transit problem. . . . . . . . . . . . . . . . . . . . . . . . Solving Lamberts problem. . . . . . . . . . . . . . . . . . . . . . . . Heliocentric coordinate system. . . . . . . . . . . . . . . . . . . . . . Classical orbital elements . . . . . . . . . . . . . . . . . . . . . . . . . The perifocal coordinate frame. . . . . . . . . . . . . . . . . . . . . . Rotating the angular momentum vector. . Eect of impulse on orbital elements. . . . Coplanar transfer between circular orbits. Hohmann transfer. . . . . . . . . . . . . . Coplanar transfer to elliptic orbit. . . . . . General transfer to coplanar elliptic orbit. Transfer between aligned elliptic orbits. . . Problem 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

Three mass diagram . . . . . . . . . . . . . . . . . . . . Simplied sphere of inuence system. . . . . . . . . . . . Candidate Earth-to-Venus transfer ellipse (Example 6.2). Planetary motion during Earth-Mars Hohmann transfer. Heliocentric velocity of the spacecraft. . . . . . . . . . . Escape trajectory starting from circular parking orbit. . . Heliocentric view of planetary arrival. . . . . . . . . . . . Coordinate frame. . . . . . . . . . . . . . . . . . . . . . .

ix

x 6.9 6.10 6.11 6.12 6.13 6.14 6.15 6.16 6.17 6.18 7.1 7.2 7.3 7.4 8.1 8.2 9.1 9.2 9.3 9.4

List of Figures Velocity oset at planetary arrival . . . . . . . . . . . . . . . . . . . . 107 Adjusting the miss distance. . . . . . . . . . . . . . . . . . . . . . . . 107 Geometry of a y-by trajectory. . . . . . . . . . . . . . . . . . . . . . 110 Rotation of spacecraft velocity vector during y-by. . . . . . . . . . . 111 Fly-by from the planetary point of view . . . . . . . . . . . . . . . . . 112 Ulysses Earth-to-Jupiter and rst solar orbit (from NASA/ESA website)114 Martian y-by. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118 Sidereal vs. solar motion of a planet. . . . . . . . . . . . . . . . . . . 122 Oberth two-impulse escape. . . . . . . . . . . . . . . . . . . . . . . . 127 Grand tours of Voyagers 1 and 2. . . . . . . . . . . . . . . . . . . . . 129 Geometry of three-body problem. . . . . . X-axis Lagrange points. . . . . . . . . . . Triangular Lagrange points . . . . . . . . . Curves of zero velocity for the Earth-moon . . . . . . . . . . . . . . . system. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131 133 134 141

Integration over annulus. . . . . . . . . . . . . . . . . . . . . . . . . . 147 The n-body problem. . . . . . . . . . . . . . . . . . . . . . . . . . . . 149 Spacecraft in ight near space station. . . . . . . . . . . Spacecraft in ight near space station. . . . . . . . . . . Relative motion of satellite (example 9.1). . . . . . . . . Motion of astronaut relative to spacecraft (example 9.2). . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154 156 160 162

10.1 Sketch of rocket and exhaust . . . . . . . . . . . . . . . . . . . . . . . 168 10.2 Atmospheric pressure on a rocket . . . . . . . . . . . . . . . . . . . . 174 B.1 B.2 B.3 B.4 Rotation about the Z -axis. . . . . . . . . Position vector. . . . . . . . . . . . . . . PLEASE FIND A NAME FOR ME . . . I AM NOTHING WITH OUT A NAME . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196 198 199 200

Chapter 1

Introduction

Man has been trying to explain the motion of the planets (and the Sun and Moon, which were initially regarded in the same manner) for aeons. For most of this time, the attempt was to explain the apparent motion in terms of then-current theories. In Ptolemys time, for example, the circle was considered a perfect shape, and it was assumed that the heavens were expressions of perfection. Therefore, it seemed clear that the motion of the heavenly bodies must be basically circular. When this failed to accord with observation, Ptolemy revised the simple scheme of nested spheres to include epicycles. The natural assumption that the Earth was the center of the motion was also slightly modied to allow for a center of rotation slightly removed from the center of the Earth. In the sixteenth century (1543), Nicolaus Copernicus noticed that all the epicycles of the ptolemaic system had periods of exactly one year, and this led him to postulate that the Sun, rather than the Earth, was at the center. He still thought that orbits should be circular, and thus he still required small epicycles to explain retrograde motion. Still, he achieved a great reduction in the complexity of the description. In the late sixteenth century, Johannes Kepler met Tycho Brahe. Brahe was a meticulous astronomer; Kepler a sickly mathematician. Kepler took observational data provided by Brahe and, for the rst time, attempted to derive a theory to t the data, rather than force the data to t a philosophical assumption. Kepler published the rst two of his three laws of planetary motion in 1609; the third followed nine years later. Keplers three laws will keep; we will not need to spend years tting curves to data to derive them. Instead, they will become statements of the results of two statements made in the 1660s by Isaac Newton.

Chapter 1. Introduction

1.1

Newton

Just about the entirety of basic orbital mechanics can be derived from two statements made by Isaac Newton. The rst of these is his second law of motion: The rate of change of momentum of a body (or system) is proportional to the impressed force, and is in the direction of the force. The mathematical formulation of this law is F= d (mv) dt

For our purposes, mass will almost always be xed, so that we have instead the famous statement F = ma The second basic principle is Newtons law of universal gravitation. This might be expressed in words as Any two bodies experience an attraction towards each other. This attractive force is proportional to the product of the masses of the two bodies and inversely proportional to the square of the distance between them. This is a vector equation, and can be written F1,2 = Gm1 m2 (r2 r1 ) r2 r1 3

where F1,2 is read, the force on the rst body (that is, the body labeled 1) due to the second body. The vectors r1 and r2 are the positions of the two bodies, and m1 and m2 their masses. G is the universal gravitational constant, dened in an appropriate set of units. This is all we need to know to derive Keplers laws, if we use the knowledge correctly.

1.2

Reference Frames

Newton formulated his laws of motion with respect to an inertial frame. Such a frame must be non-accelerating, which requires both that the origin of the frame be moving at constant velocity and that the frame not be rotating. Such a frame is now known not to exist, and we settle for one that is inertial enough; it doesnt move enough, quickly enough, to measurably aect the outcome of our calculations. For throwing a baseball, a frame xed in the Earth is inertial for all intents and purposes, and in fact the path of the ball covers so short a distance in comparison to the circumfrence of the Earth that we consider the Earth to be at.

1.3: A Few Notational Comments

When ring long-range artillery, the assumption of a xed Earth is no longer sucient. In this case, the eect of the rotation of the Earth, and of the dierent radius from the axis of rotation to the surface at the ring point and at the target, is too large to ignore. For such problems, we include the rotation of the Earth, but ignore its motion through space. What frame we choose for our problems will depend upon what we are studying. For orbital problems, the eect of a spinning planet will be negligible; for problems concerning rocket launch, it certainly will not.

1.3

A Few Notational Comments

Most of the notation of this book should be obvious. All variables will be described upon being introduced.

1.3.1

Vectors

In general, a vector quantity will be shown in the text and equations as a bold lowercase latin character, as was done in the last section. Matrices will be capitalized and not bold. Lowercase letters that are not bold will denote scalar quantities, as will Greek letters nearly always. Throughout this book, vectors will be assumed to be in either two or three dimensions, depending on whether we are discussing planar motion or three-dimensional motion. In the few cases where we might need more general vectors, it will be made explicit. The magnitude of a vector is generally referred to by the lowercase non-bold letter, as a= a The components of a vector in general will be denoted as a1 , a2 , a3 . When each , they may be shown component is associated with a specic unit vector, say , , k instead as ai , aj , ak . A brief review of vectors in included in Appendix B. Finally, the position vector r will generally be assumed to have the elements x, y , and z . This will not always be enforced, particularly when the use of several values would require awkward subscripting.

1.3.2

Astronomical symbols

This is a book about spacecraft and space travel, and will be concerned mostly with travel inside our solar system. When specic bodies within the solar system are used, subscripts denoting these bodies will be taken from traditional astrological usage. These symbols are

4 the Sun the Moon Mercury Venus Earth Mars

Chapter 1. Introduction

Jupiter Saturn Uranus Neptune Pluto

Examples in this book are worked out using the physical constants as listed in the appendices. Usually, the values used will be given in the examples themselves. While answers and preliminary results will only be listed to several signicant digits, all calculations are carried out in extended precision, and intermediate results are used to the full precision to which they are calculated. A Comment on Nomenclature: As celestial mechanics is a very old eld of study, it might be expected that the terminology and symbols used are well-dened. That is not the case. Many dierent symbols are used, and many times dierent terms are used, for the various quantities involved in orbital mechanics. When there appears to be a standard, such as the use of h for the angular momentum vector, we have attempted to adhere to it. However, in the absence of such standards, we have not hesitated to assign notation of our own choosing.

Chapter 2

The Two-Body Problem

We will examine the simplest problem of orbital mechanics, that of two bodies in space, with no forces other than their mutual gravitational attraction. Due to the huge distances involved in orbital mechanics, this is in fact a good approximation of most planets in their motion about their stars, and of most moons about their planets. The eects of other planets and bodies in the same solar system are minor.

2.1

The General Equation of Motion


z Z 1 y x r1 rc r2 r 2

Consider the situation as shown in Figure 2.1. The two masses shown are considered to have negligible diameter when compared to the distance between them, and thus will be treated as points. The X, Y, Z frame is the inertial frame, in which Newtons equations are assumed to hold. The vectors r1 , r2 , and rc are dened in this frame. The vector r, which gives the position of mass 2 with respect to mass 1, is dened in the local frame x, y, z . The x, y, z axes are parallel to their counterparts in the inertial frame, but move along with mass 1. From basic mechanics, we nd the point c, the center of mass of the system, to be X rc = r1 + m2 r m1 r = r2 m1 + m2 m1 + m2 (2.1)

Figure 2.1: A two-body system.

Chapter 2. The Two-Body Problem

From Newtons law of gravitation, we have the forces on the bodies as F1 = Gm1 m2 r r3 F2 = Gm1 m2 r r3 (2.2)

Acceleration is the second derivative of the inertial position of a body. Since r1 and r2 are in the inertial frame, we can use Newtons second law and eqn. (2.2) to write Gm1 m2 r Gm1 m2 r m1 r1 = m (2.3) r = 2 2 r3 r3 With this and eqn. (2.1), we can write r1 = rc m2 r m1 + m2

with a similar result for r2 . Using this, we substitute for r1 and r2 in (2.3) to get m1 rc m2 rc + Adding these equations gives (m1 + m2 ) rc = 0 = rc = 0 Gm1 m2 r m2 r = m1 + m2 r3

Gm1 m2 r m1 r = m1 + m2 r3

This means that the point c moves in a straight line, which depends only on the initial conditions. This is what we expect from applying Newtons rst law to the overall system. Since we assume we know the initial conditions, we will always have rc . We know r1 and r2 in terms of rc , r, and the known values of m1 , m2 , and G. Thus, if we knew r, we would have complete knowledge of the system. We know that r = r2 r1 . From our two equations of motion (2.3), we can write m1 m2 ( r2 r1 ) =
2 Gm2 Gm1 M2 r Gm1 m2 (m1 + m2 )r 1 m2 r = 3 3 r r r3 G(m1 + m2 )r = r= r3

It is convenient to use the notation = G(m1 + m2 ). Thus the equation of motion becomes r= r r3 (2.4)

2.2: Solution of the Equation of Motion

Note: In practice, one mass is almost always vastly larger than the other, as is the case for a planet orbiting the Sun, or (even more so), a spacecraft orbiting a planet. Therefore in such cases it is general practice to ignore the smaller mass in the computation of . In the case of an orbit about the Earth, we would thus ignore the mass of the spacecraft, and write r = r r3

where = Gm , that is, the universal gravitational constant times the mass of the Earth. The value of associated with a particular celestial body is called the gravitational parameter of the body.

2.2

Solution of the Equation of Motion

Now that we have an equation for the motion of the second body about the rst, we proceed to solve it.1 This is a somewhat lengthy and intricate process, and we will proceed via three intermediate steps. Each of the steps is of great interest in its own right, as each involves a constant of the orbital motion. These constants will be used later to describe both the orbit and the motion of the body on the orbit. . In what follows, we will be using the velocity vector v interchangeably with r The velocity is dened as the time rate of change of position, of course, so these quantities are equal by denition. The only reason to choose one over the other in a particular instance is to aid the clarity of the presentation.

2.2.1

Specic Energy

Because there are no dissipative terms in the equation of motion, we know that the total energy of the system is constant. We therefore look for some expression of the total energy. We recall that (see Appendix B) = rr 1d 1d 2 ) = ( rr v 2 dt 2 dt

and note that this is the form obtained by taking the dot product of the left side . Now, v 2 /2 is the per-unit-mass expression of the kinetic energy of of (2.4) with r a particle. Hence, we see that we can obtain an expression for the derivative of the kinetic energy in a straightforward manner.
By solve, we usually mean that we will attempt to nd an explicit function r(t) that will, when dierentiated, satisfy the equation. We will not nd such a function, and o we are using the term rather loosely here. We will, however, be able to fully classify all types of motion that satisfy eqn. (2.4), so it isnt a complete misuse of the term.
1

Chapter 2. The Two-Body Problem

Now, since v is expressed in an accelerating reference frame, it is not the inertial velocity of either particle. Still, this seems a promising avenue, so we take it, forming . We get the dot product of equation (2.4) with r = rr Recalling that r = rr r we can re-write (2.5) as 1d 2 r v = 2 2 dt r Now, d dt so (2.6) becomes d 1d 2 v = 2 dt dt Finally, we can integrate this result to get 1 2 v = +C 2 r (2.7) 1 r 1 r = r r2 (2.6) rr r3 (2.5)

where C is a constant of integration. We refer to C as the Specic Mechanical Energy of the system, and re-write (2.7) to get C= v2 2 r (2.8)

This quantity is often referred to simply as the energy of the orbit. Equation (2.8), in various forms, will be an important tool in describing the motion of bodies in orbit. Recall that the kinetic energy of a body in motion is mv 2 /2. The term specic is used here to denote that the quantity is expressed per unit of mass, so that the m drops out. The rst term in the expression for C above is therefore clearly related to the kinetic energy of the body in orbit. It is somewhat more dicult to see the relationship between the the second term and the potential energy of the body. We will return to this point later in the chapter.

2.2.2

Angular Momentum
rr=0 r3

The next major step is to cross the vector r into equation (2.4). We get r r+

2.2: Solution of the Equation of Motion Since any vector crossed into itself produces zero, this reduces to r r=0 Note that d ) = r (r r r dt Using this, equation (2.9) becomes d (r v) = 0 dt

(2.9)

Integrating this gives a vector constant of integration on the right hand side. Calling this constant vector h, we have r v = h = constant. (2.10)

This vector constant of integration is called the specic angular momentum. Note the similarity to the usual denition of angular momentum as mr v. Again, we are simply dropping the mass term. Note that h being constant requires that the motion be restricted to a plane, because r is always normal to h. Physically, we can see that the velocity is always in the plane to which h is normal; thus, the body can never move out of that plane. The plane of motion is referred to as the orbital plane.

2.2.3

The Laplace Vector

The third intermediate step in solving the equation of motion is to cross the original equation into h. The result is rh= Note that hr r3

d = ( r h) = rh+rh rh dt is of necessity zero). Using some results for vector (because h is a constant, so h triple products (see Appendix B) along with equation (2.10), we can also write h r = (r h) r = [v(r r) r(r v)] = r2 v (rr )r Now, r 1 d r = v 2r dt r r r d d r ( r h) = dt dt r

so our equation becomes

10

Chapter 2. The Two-Body Problem

Since each side of the equation is a pure time derivative, the obvious thing to do is integrate, and so we do, getting r h= +A r r (2.11)

A is another vector constant of integration, known usually as the Laplace vector.2 We re-arrange (2.11) to dene it as A = v (r v) r/r The Laplace vector can be zero. (2.12)

2.2.4

The Orbit Equation

So far, we have derived 7 constants of the motion for the body in motion (because h and A are three-dimensional vectors, they count for three scalar constants each). Due to some relationships between these constants, they actually only provide ve pieces of information. This information is sucient to dene the size, shape, and orientation of the orbit, as will be shown in the next few chapters. However, they do not oer an intuitive feel for the orbit. To get a better idea of the orbit, we turn equation (2.11) into a scalar equation by dotting it with r as follows: h= rr rr +Ar r (2.13)

Using some triple-product identities from Appendix B, we have that h=rr h = h h = h2 rr and recalling the denition of the dot product, this gives us that h2 = r + rA cos (2.14)

where is the included angle between the Laplace vector and the radial vector r. We re-write this equation as h2 / r= (2.15) 1 + (A/) cos This equation is of the form r=
2

p 1 + e cos

(2.16)

The Laplace vector is also associated with the name Runge-Lenz vector, and is referred to by Battin [2] as the eccentricity vector. Some authors go so far as to call it the Laplace-Runge-Lenz vector, but thats just inconvenient.

2.3: Relative and Inertial Motion

11

where p = h2 / and e = (A/). We will generally refer to this as the orbit equation.3 This is the equation of the class of curves known as conic sections in mathematics, described in the polar coordinates r and . The angle is known in orbital mechanics as the true anomaly. The term anomaly is often used in classical mathematics to describe an angle; the specication true serves to dierentiate this angle from others that we will see in later chapters. The value of r is simply the distance between the two bodies.

2.3

Relative and Inertial Motion

Before delving into a discussion of the dierent kinds of orbits, it is convenient at this point to clear up one or two possible points of confusion.

2.3.1

Potential Energy

where T stands for kinetic energy, and U for the potential energy.4 Leaving aside the kinetic energy for now, consider the potential energy term. We dene the potential energy in a conservative force eld as the work done by the conservative force as a body moves from some arbitrary position to the reference position (we state without further explanation that gravitation is a conservative eld). At some point r, the work done in moving a unit mass to r + dr is dW = F dr = r dr = 2 dr 3 r r

We said that eqn. (2.8) is the equation for the specic mechanical energy. In other words C = T + U = v 2 /2 /r

Integrating from r to the reference point R, we get


R

WrR =

dr = r2

1 1 R r

Now, the reference point is arbitrary. We choose it to be at innity, so U (r) = lim WrR = /r
R

It is also known in some texts as the trajectory equation, and many give it no name at all. We use the term here for convenience of reference only. 4 Many texts use a negative sign here. The classic Brouwer and Clemence [4], for example, state the energy equation as C = T U (p. 27) (their symbols are actually dierent; we have converted to our notation for clarity). The dierence comes in the denition of potential energy as the negative of a function known as the potential function; U is then the potential function. See Thomson [15] for more details of the argument presented here; see [4] for the opposite approach.

12

Chapter 2. The Two-Body Problem

and we have the potential as in eqn. (2.8). Note that while the we may choose R freely for the purpose of this exercise, it is not truly arbitrary in the derivation of eqn. (2.8). This is because the value of a constant of integration in general is chosen to meet the boundary conditions of the particular problem being evaluated. Thus the way eqn. (2.8) is derived tells us that C is constant; eqn. (2.8) itself tells us how to evaluate C to meet the conditions of a particular problem. The exercise just completed demonstrates only that /r is indeed the potential energy, if R is taken as the reference.

2.3.2

Kinetic Energy

In expressing the energy of the system, we used the relative velocity v, which we noted is expressed in an accelerating frame. By this we mean that it is the velocity of the second body relative to the rst, and the rst body is itself accelerating. In contrast, the conservation of energy is generally expressed relative to an inertial frame. Consider the total kinetic energy TI of the system, expressed in terms of the inertial velocities v1 and v2 . We have TI = m1 m2 v1 v1 + v2 v2 . 2 2 m1 v . m1 + m2 (2.17)

Recalling the denition of the center of mass (2.1), we have that v1 = vc m2 v m1 + m2 and v2 = vc +

Substitution of these expressions into (2.17) and multiplying out gives the kinetic energy to be m1 + m2 m1 m2 1 TI = v v. vc vc + 2 2 m1 + m2 We have already seen that the velocity of the enter of mass is constant. Thus the inertial kinetic energy is a constant added to the relative kinetic energy. The energy C is therefore not quite the total mechanical energy as seen from the inertial point of view. It is clearly intimately related, however, and is clearly a conserved quantity. The second term, /r, is genuinely the potential energy, and the sum of the two terms is a conserved quantity. It will do no harm to consider C to be the total mechanical energy of the system.

2.4

Properties of Conic Sections

We have said that equation (2.16) is the equation for a conic section. Conic sections are the curves that result when a plane is passed through a pair of innite circular cones sharing a vertex and having the same axis of symmetry. The lines of intersection draw the conic section in space.

2.4: Properties of Conic Sections

13

Another common way that conic sections are dened, especially in geometry texts, is Denition: A conic section is either a circle or a locus of points such that for each point, the ratio of its distance from a given point (the focus) to its distance from a given line (the directrix) is constant. This constant is the eccentricity e. This denition is true, but for us is not terribly useful. Recall equation (2.16).The conic sections are usually thought of as falling into four basic categories, depending upon the value of the eccentricity e. These are: e=0 0<e<1 e=1 e>1 circle ellipse parabola hyperbola

We will mostly be concerned with the ellipse and the hyperbola. Circles are special cases of the ellipse, where many computations simplify because e = 0. The parabola is another special case, and can be thought of as a limiting case of both the ellipse (as the eccentricity rises to 1), and the hyperbola (as e again approaches 1, this time from above). Clearly, a plane passing only through the vertex would create a point, and a plane tangent to the side of the cones would create a line. These are known as degenerate conic sections, and except for a brief discussion in a later section, we will for the most part ignore them. There is also the possibility of the eccentricity being negative; some authors describe an ellipse for which e < 0 as a subcircular ellipse. Such a condition does not arise in orbital mechanics.

2.4.1

Terminology

The following list includes the common terms describing the geometry of conic sections. Not all of them are meaningful for all types of conics. Focus: All conic sections have at least one focus; the ellipse and the hyperbola have two. In these cases, one is known as the empty focus. The circle is a special case of the ellipse, and the focus and empty focus are both at the center. Physically, the focus is the location of the rst body, around which the second body orbits. In the mathematical derivation of this chapter, the focus is the point from which the vector r is drawn. Axes: Conic sections have a major and a minor axis. The major axis is the line that runs through both foci; the minor axis is perpendicular to it and crosses it halfway between the two foci. The axes are not uniquely dened for the circle, and for any non-elliptic section the minor axis has little meaning.

14

Chapter 2. The Two-Body Problem

Latus Rectum: The width of each curve at the focus is the latus rectum; half of this distance is the semi-latus rectum, usually denoted p. Apses: The points on the curve at which it crosses the major axis are called apses. These points occur by denition when the value of the true anomaly equals zero and . The rst of these is called the periapsis, and the second the apoapsis.5 semimajor axis: The semimajor axis a is half of the distance between the apses of a conic section. For a parabola, for which the empty focus is taken to be at innity and which thus has only one apse, a . For a hyperbola, a is taken to be negative. semiminor axis: The semiminor axis b is well-dened only for the ellipse; it is the distance from the point at which the axes cross to the point at which the minor axis crosses the ellipse. c: The distance between the foci is 2c, but there is no name given to this distance in most orbital texts. In the next few sections, we will take a look at the dierent conic sections, beginning with the ellipse. For the most part, relations dened for the ellipse will hold true for other conic sections, when they make sense. Exceptions will be noted.

2.5

Elliptic Orbits

The ellipse is the only closed conic section (we will consider the circle as a special case of the ellipse), and so all closed orbits are ellipses. In this section, we will assume that e > 0, so that the ellipse is not periapsis ra a circle. The special qualities of the circle will be p apoapsis discussed later. Focus Recall the manner in which the trajectory Empty focus c equation was derived. We generally think of a small rp body orbiting a much larger one, for example the b Earth orbiting the Sun. The trajectory equation describes the path of the bodies in relationship to a the other, and thus the focus of the ellipse is at the center of mass of the body being considered the Figure 2.2: Variable denitions stationary mass; the one about which the other for the ellipse. orbits. We made no actual assumptions about the
5 The terms periapsis and apoapsis are often ignored in favor of pericenter and apocenter. This is a matter of taste.

2.5: Elliptic Orbits

15

relative masses, however, and mathematically it would be as valid to say that the Sun describes an elliptic orbit about the Earth. It will not be troublesome, however, to continue to think of the larger mass as stationary, and the smaller the orbiting mass. Let rp and ra be the distances from the focus6 to the periapsis and the apoapsis, respectively. It is easily enough seen that rp = Since rp + ra = 2a, we have 2a = p p + 1+e 1e = 2a(1 + e)(1 e) = (1 e)p + (1 + e)p = p = a(1 e2 ) p 1+e ra = p 1e

This result can be used to obtain simple expressions for the distance to the apses in terms of the semimajor axis as well: rp = p a(1 e2 ) = = a(1 e) 1+e 1+e (2.18)

This is true for all conic sections other than the parabola. The complementary statement that ra = a(1 + e) is clearly true for the ellipse. ra is the largest value r can achieve for an elliptic orbit. Recall that for us, p = h2 /. The gravitational parameter = G(m1 + m2 ) is determined by the phys ical properties of the system, so that for a specied pair of masses, the semi-latus rectum of a particular v orbit is purely dependent upon the magnitude of the angular momentum of the orbit. Recall the vector denition of h: r h=rv This gives that h = rv sin , where is the included angle. Since v is normal to r at the apses, we have that Figure 2.3: Flight path angle. h = v a ra = v p rp (2.19) Consider Figure 2.3. By analogy with aircraft ight, we may call the line normal to the radial vector the local horizontal, and the angle between the local horizontal and the velocity vector the ight path angle. Again using the equation for the
6 From here on, focus will mean primary focus; there is little use for the empty focus, and if it referred to at all, it will be explicitly denoted as empty.

16

Chapter 2. The Two-Body Problem

magnitude of the cross product, we have at any point on the orbit h = rv sin = rv sin = 2 h cos = rv

(2.20)

Because this is derived from the denition of h, it is true for all conic sections. Now consider the value of the energy C for an elliptic orbit. At periapsis, we can write 2 vp C= 2 rp Using the results of this section, we can re-write this as C= p h2 = 2 2 2rp rp 2rp rp

Writing p and rp in terms of a and e produces a(1 e2 ) C= 2 2 2a (1 e) a(1 e) and this expression simplies to (2.21) 2a An obvious result of this is that, since a is clearly positive for any ellipse the energy C of any elliptic orbit is negative. While negative energy may not seem to make much sense, it is due to the reference point for zero potential energy. C=
Example 2.1. A vehicle in orbit about the Earth has periapsis radius rp = 7000 km and velocity vp = 9 km s . Find the apoapsis radius and velocity. Because the orbit is about the Earth, we use the gravitational parameter for the Earth. The value used in this text is = 3.98601 105 km3 /s2 . Note the need to use consistent units. Begin by computing the orbital energy: C= From eqn. (2.21) we then have a= = 12120.7 km 2C rp = 0.42248 a v2 = 16.443 km2 /s2 2 r

We can now compute the eccentricity from the relation rp = a(1 e) to be e=1

2.6: Hyperbolic Orbits

17

and from this the periapsis radius ra = a(1 + e) = 17241.4 km. Then because the angular momentum is constant, we have r p vp = r a v a = va = r p vp km = 3.65540 ra s

Note that there is almost never only one way to solve a problem in orbital mechanics. The method above is one approach to the problem posed. However, the problem could also have been solved by nding the angular momentum h, from that the semi-latus rectum, and then the eccentricity.

2.6

Hyperbolic Orbits

The other main type of conic section is the hyperbola. Like the ellipse, it is a symmetric orbit, but in this case, only mathematically. The two parts never meet.

2.6.1

Geometry of the Hyperbola

A hyperbola is shown in Figure 2.4. The solid line represents the actual orbit, and is curved about the primary focus. The broken line is the imaginary part of the orbit, and curves about the empty focus. The two parts of the orbit are mirror images, and the major axis is the line running through the two foci. Because the eccentricity of a hyperbola is greater than one, there is some angle for which 1 e cos = 0. As that 2a value is approached, the distance r goes to innity. For physical orbits, this is all that is possible. Mathematically, howp Empty ever, the other part of the hyperbola does focus exist. One way to think of this is to imagine that the hyperbola extends to innity and wraps around, to come back in Focus from negative innity. It is not, however, necessary to have any picture in mind to make the equations work. From the orbit equation, it is clear Asymptotes that rp = p/(1 + e), for any conic section. Dening the apoapsis distance as the value of r when the true anomaly equals , we again have ra = p/(1 e), Figure 2.4: Terminology of the hyperbola. and in this case the value is negative. Looking at Figure 2.4, this is perfectly reasonable. Note that at = the vector r would be running to the left in the sketch,

18

Chapter 2. The Two-Body Problem

while the point at which the imaginary arc of the hyperbola crosses the major axis is to the right of the primary focus, indicating that its position is negative in the r direction. Because the equations are mathematically valid, we suppose that it is also still valid to write 2a = rp + ra Then, following what we did before, once again p = a(1 e2 ). Note that the semilatus rectum p still makes perfect sense; the primary branch of the hyperbola will still pass above the focus and the distance will still be positive. We then have a= p <0 1 e2

It should be noted that this does not say that the absolute distance between the periapsis and apoapsis is negative. Recall that the logic for letting ra be negative was that it was a location measured from the primary focus in a specied direction. This also applies to the semimajor axis. Think of it as the distance from the periapsis to the apoapsis, positive in the direction from the periapsis towards the primary focus. There is no semiminor axis for the hyperbola. Of course, the minor axis is still dened, when thought of as the axis of symmetry between the two curves of the hyperbola. However, neither branch of the hyperbola ever intersects that axis, so b cannot be dened.

2.6.2

Hyperbolic Orbits

Now we apply the geometry of the hyperbola to the orbit equation, and vice-versa. Once we accept that the semimajor axis a exists and is meaningful despite being negative, we can repeat the manipulations we did for the elliptic orbit and show that C = /2a Because a is negative, this means that the energy is positive on a hyperbolic orbit. We have said that our reference point for potential energy is R . This means that the potential energy as a function of the distance of the body in orbit from the central body is dened to be zero when the body is at an innite distance. As a result, any body in orbit with a negative specic energy can never reach the reference point. That is, the value of r remains nite. On the other hand, C > 0 implies that not only can the body reach innity, but it will still have velocity when it gets there (because the potential energy is zero, the remainder is kinetic energy). Since the energy of the orbit is constant, we have v2 =C+ 2 r

2.6: Hyperbolic Orbits As r , the last term goes to zero and


2 v = 2C

19

which clearly makes no sense at all for C < 0, but is perfectly acceptable for hyperbolic orbits. The value v is known as the hyperbolic excess speed. It is the speed that a space probe has as it gets far from the solar system, for example. The angle of the asymptotes is also easy to nd. Since r= we have r = 1 + e cos 0. This gives cos = 1/e. (2.22) Because cos( ) = cos( ), this tells us that value of the true anomaly on the primary branch of the hyperbola (which is also the physically realizable trajectory) is limited to . The values of such that cos < 1/e correspond to the imaginary branch of the hyperbola. Note also that because cos < 0 for all nite values of e, the true anomaly always sweeps at least through 90 to +90 .
Example 2.2. What are the conditions at periapsis for a spacecraft on a trajectory that will approach innity with v = 2 km/s with = 120 ? From the information available, we can easily compute the energy: C=
2 v = 2 km2 /s2 2

p , 1 + e cos

Compute the eccentricity from knowledge of : cos = 1 e = e= 1 cos = e=2

Now, a = /2C = 99650 km, so rp = a(1 e) = a = 99650 km The corresponding value of velocity can be computed from
2 vp km2 =C+ =2+ =6 2 2 rp 99650 s

from which vp =

12 3.4641 km/s.

20

Chapter 2. The Two-Body Problem

2.7

Special Cases and Degenerate Conics

We will briey discuss the special cases of circular and parabolic orbits. Circular orbits in particular are very important because many Earth satellite missions require them, and because we often approximate planetary orbits as circular. We will also mention the degenerate conics, though they almost never occur in astrodynamics work.

2.7.1

Circular Orbits

As mentioned, the circle is the special case of the ellipse for which the eccentricity is zero. This means that the orbit equation reduces to r=p=a The Laplace vector for a circular orbit has zero magnitude, and therefore there is nothing from which to measure the true anomaly (in practice, a reference is simply specied). Because r is constant on a circle, the velocity is also constant on a circular orbit. The circular orbit velocity can be found from C=
2 vc 2 r

From above, r = a = /2C , so this becomes v2 = c 2r 2 r = vc = r

Because the velocity is constant, it is easy to compute the period of a circular orbit. The circumference is 2r, so the period P can be found from P vc = 2r = P = 2a3/2

We will see later that this expression is true for all elliptic orbits.

2.7.2

The Parabolic Orbit and Escape Speed

The parabola can be thought of as a transitional case between the elliptical and hyperbolic orbits. On the one hand, it is an ellipse for which e 1. On the other, it is a hyperbola for which e 1.7
7 The symbols and here mean approaches from below and approaches from above, respectively.

2.7: Special Cases and Degenerate Conics

21

In thinking of the parabola as the limiting case of an ellipse, we have the length of the ellipse becoming innite. This is easily seen from rp (2.23) a= 1e which is true for any conic section for which r, a, and e are dened. As e 1, the only way for this relation to be valid is for either rp to become vanishingly small, or a . The rst possibility is nonsensical. Since eqn. (2.23) holds for hyperbolas as well, and rp is positive for hyperbolas, the same argument has a as e 1 for the hyperbola. Thus in the one case, the semimajor axis is going to positive innity, and the apoapsis is then moving towards innity. Then, as e crosses unity, the semimajor axis suddenly goes to negative innity, and the apoapsis, now on the imaginary branch of the hyperbola, is moving back towards the periapsis, but from the negative direction. In both cases, however, the energy is going to zero. Note that lim C = lim =0 a e1 2a and =0 lim C = lim a 2a e1 We therefore see that the specic energy for a parabolic orbit is zero. 2 Consider this in light of the hyperbolic excess speed. Since v = 2C , this clearly indicates that the excess speed is zero. Thus a craft on a parabolic orbit will reach an innite distance from the planet, but will have no energy or velocity left. We might also consider this in light of the velocity at some specied orbital radius. Suppose that the spacecraft is at r = r0 , some specied value. From our previous discussion, it is clear that the parabola is the minimum energy orbit for which the spacecraft can attain an innite distance. Since both r0 and are constants, the energy equation C = v 2 /2 /r0 implies that C increase as v increases. Let 2 ve = . (2.24) r0 This will result in C = 0, so that r as the time of ight gets large. Any lower velocity at r0 will result in a negative value for the specic energy, so that the distance can never become innite, and the orbit is an ellipse. For this reason, the parabolic orbit is also known as an escape orbit, and ve as dened in eqn. (2.24) is known as the escape speed. Note that in practice, innite means very large. Since under any circumstances, innite time would be required to attain innite distance, we are never truly interested in doing so. However, the escape speed is a close approximation to the speed necessary to escape from the gravitational pull of a planet or star, and is used for initial approximations in interplanetary and interstellar mission analyses. These topics will be dealt with in detail in later chapters.

22

Chapter 2. The Two-Body Problem

2.7.3

Degenerate Conics

The degenerate conics are the point and the line. The rst of these is simply disposed of, while the second requires some discussion. Point The point is not a physically realizable orbit. It corresponds to the case in which the body in orbit is at the center of the central body, and has no velocity relative to it. More precisely, it means that the two masses m1 and m2 used in deriving the equation of motion are centered at the same point. Newtons law of gravitation is not dened for this case; it would consider the two bodies to be a single body. We will waste no further space on the point as a possible orbit. Line orbits. r . r3 This is dened (and correct) for all r = 0. However, in deriving the Laplace vector and then the orbit equation, we made the tacit assumption that the angular momentum was nonzero. All of the analysis in section 2.2.3 is still correct, but the Laplace vector comes out to be zero. The line is the orbit resulting from r > 0 and the angular momentum h = 0. In this case, we have that either v = 0 or that v is aligned with r (so that the cross product is zero). In either case, we see that since r is always aligned with r, there will never be a component of velocity that is not aligned with r. Consequently, the velocity will always be either directly towards or away from the central body. There is no version of the orbit equation corresponding to this case. Note that for this orbit, the energy and angular momentum relations still hold. The angular momentum is constant at zero, and the specic energy can still be computed as C = v 2 /2 /r. The only point at which the energy equation breaks down is at r = 0, which can be treated mathematically as a singularity.8 Physically, of course, there is no need to consider what happens as r 0, as the spacecraft (or comet, or asteroid, or planet, or...) will have crashed into the Earth (planet, Sun, ...) before that happens. r= Recall the equation of motion

2.8

Keplers Laws

In the late 16th century, the Danish astronomer and aristocrat Tycho Brahe made meticulous observations of the planets. When Brahe died in 1601, his notebooks
8

We will not get into this here.

2.8: Keplers Laws

23

were inherited by Johannes Kepler (1571 1630), a sickly mathematician that he had chanced to meet about 18 months previously. Brahe appears to have been lacking the gifts of mathematical insight possessed by Kepler, and the latter had already begun working on the observational data. Finally, in 1609, Kepler set forth two laws of planetary motion, and ten years later, a third. Keplers laws of planetary motion are: The orbit of each planet is an ellipse, with the Sun at one focus. The line joining the planet to the Sun sweeps equal areas in equal time. The square of the period of the planets orbit is proportional to the cube of its mean distance from the Sun. These laws are purely descriptive. They make no attempt to explain why the planets behave as they do. They serve, however, as a remarkable example of rigorous reduction and analysis. Since Keplers laws incorporate several years of observations, it is necessary that any theory of planetary motion agree with them. To this end, we can somewhat verify the results of the two-body analysis if we can use these results to derive Keplers laws. The rst of them has already explicitly been derived, when the orbit equation turned out to be the equation of conic sections. In the following sections, we will derive the second and third of Keplers laws.

2.8.1

Keplers Second Law

The second law concerns the area swept by the line joining the Sun to the planet. The derivation requires rst an expression for the rate at which this area is swept. Consider Figure 2.5. In this gure, the shaded area is the area swept by the radius vector in an incremental time dt. Since dt is very small, the corresponding change in the anomaly is also small, and the curve from r to r + r is essentially a straight line segment. Using this approximation, the shaded area A is given as

111111111 000000000 r + r 000000000 111111111 000000000 111111111 000000000 111111111 000000000 111111111 000000000 111111111 r 000000000 111111111 000000000 111111111 000000000 111111111

A =

1 2 r sin + rdr sin 2

Figure 2.5: Area swept in time dt.

where dr is the change in the magnitude of r (note that this is not the same as the magnitude of r). Since is small,

24

Chapter 2. The Two-Body Problem sin . Also, we know that dr

rdt and dt Therefore, we have A =

1 2 r dt + rr dt 2 2

The derivative of A with respect to time is by denition the limit 1 2 1 1 dA A = lim = lim r + rr dt = r2 dt 0 dt 0 dt dt 2 2 2 (2.25)

We can now relate this result to the specic angular momentum. We have that h=rv and therefore that h = rv sin where is the included angle. Figure 2.6 shows this schematically. We can see from this gure that v sin = r and thus that dA 1 r2 v sin 1 = r2 = = h dt 2 2r 2 (2.26)
r v sin v

Since we already know that h is constant, we have shown that dA/dt is constant throughout the orbit. Therefore, over any period, we Figure 2.6: Relating r, v, and . have that h A = t 2 which is a mathematical expression of Keplers second law.

2.8.2

Keplers Third Law

The third law concerns the period of the orbit and its relationship to the semimajor axis. In deriving this law, it helps to know in advance that the area of an ellipse is given by A = ab. If P is the period of an elliptic orbit, we have from (2.26) that
P

A
0

= Adt
0

(h/2)dt =

hP = ab 2

2.8: Keplers Laws The semiminor axis b is related to the semimajor axis by the relation b = a 1 e2 so that a2 1 e2 2ab = 2 P = h h p = a(1 2 ), which we plug in to give 2a2 1 e2 2 P = = a3/2 a(1 e2 ) P2 = 4 2 3 a

25

Recall that h =

This in turn implies

(2.27)

and (2.27) is a mathematical expression of the third law.

2.8.3

Measurement of Planetary Mass

As an implementation of the third law, consider the problem of measuring the mass of a planet that has a satellite. In this situation, there are two orbits to be considered. One is the orbit of the satellite about the planet, and the other the orbit of the planet about the Sun. Let the subscript s denote variables relating to the satellite, and p those relating to the planet. Then by Keplers third law, we can write
2 G(mp + ms ) = 4 2 a3 s /Ps

(2.28) (2.29)

and
2 G(m + mp ) = 4 2 a3 p /Pp

The values of the semimajor axis and the period of the orbits can be obtained to very high accuracy by from observations. Dividing (2.28) by (2.29) gives mp + ms = m + mp as ap
3

Pp Ps

On the assumption that m mp and mp ms , we can then write mp = m as ap


3

Pp Ps

(2.30)

which gives us the mass of the planet in terms of the mass of the Sun.

26

Chapter 2. The Two-Body Problem

Example 2.3. Phobos travels about Mars in only 0.31891 sidereal days and the mean radius of its orbit is 9378 km.9 The orbit of Mars about the Sun requires 688.9 sidereal days and has a mean radius of 227.8106 km Solution: From (2.30), we have m = m as a
3

P Ps

9378 227.8 106

688.9 0.31891

where the subscript s refers to Phobos (the satellite, in this case). Working out the fractions gives m = 3.256 107 = m = 3.072 107 m m From tabulated data in Bate, Mueller, and White [1], we have the answer to be 3.083 107 .

For the Earth-Moon system, the solution to the equations would be signicantly inuenced by the inclusion of the mass of the Moon. As the Earth is only about 81 times as massive as the Moon, the mass of the latter is not always negligible in calculations of this sort.

2.9

Canonical Units

So far, we have only seen units of length and time in examples. This is because the physics and formulae we have investigated are the same, whether measured in seconds or days, feet or furlongs. However, when it comes time to apply these results to systems, we need to have numbers to use, and those numbers are usually a count of some unit. Some of the examples so far in this book have involved the Earth, and we have used the value 3.98601 105 km3 /s2 as the gravitational parameter. This in turn requires the use of kilometers for all length measurements and seconds for all time measurements, with the result that velocities are specied in kilometers per second. As a result, distances tend to be in the thousands and tens of thousands, velocity on the order of ten, and periods in the tens of thousands. For example, a circular low-Earth orbit might have radius r = 6900 km, which gives vc = 7.600km/s and P = 5704 sec. The orbital energy is C = 28.88 km2 /s2 , and the angular momentum is h = 52444 km2 /s. These values become even more disparate when solar scales are involved. Distances are hundreds of millions of kilometers, velocities tens of kilometers per second, and periods in the millions of seconds (a mean solar year is over 31 million seconds). The concept of canonical units unies the various units, and also makes the results for various physical systems similar. The idea of a canonical unit system is
9

Danby [6], appendix C.2.

2.9: Canonical Units

27

that the actual units used for length and time are arbitrary, so that they may chosen to be convenient. In the case of orbital systems, the physical system itself appears in the equations through the gravitational parameter. We can therefore normalize the units, by choosing the units for a particular problem such that the gravitational parameter is unity. To demonstrate, we can go back to the equation of motion: r = r/r3 . r is measured in distance units, and the acceleration r in distance per time unit squared. Abbreviating, and writing the equation only with units, this gives DU 1 2 = TU DU2 which implies that the units of are DU3 /TU2 , as in km3 /s2 . But we can dene any set of units we like. Because mass does not appear in the calculations, we have only one unit to specify; the others will then be computed to be consistent. The usual approach is to specify a distance unit. The distance unit for Solar System work is the Astronomical Unit, AU which is roughly the mean diameter of the orbit of the Earth about the Sun.10 The time unit is then dened such that = 1. To compute this, recall that a3 P = 2 so that the period of a circular orbit with radius a = 1 DU, with = 1 DU3 /TU2 , is P = 2 TU.
Example 2.4. Given that 1AU = 1.495979 108 km, compute the conversions from the canonical time unit and the canonical velocity unit 1 DU/TU to kilometers, and kilometers per second, respectively. Solution: From the preceeding discussion, we have P = 2 TU = 2 a3 / .

Since we are told that a = 1AU = 1.495979 108 km, and we can look up the value of in Appendix C, we have 1 TU = (1.495979 108 km)3 = 5022600 s 1.32715 1011 km3 /s2

One distance unit per time unit is then 1.495979 108 /5022600 = 29.785 km/s.
10

Recall that using only the mass of the larger body in the gravitational parameter is an approximation, since it ignores the mass of the smaller body. This is close enough for most work, but for very precise measurments it is not. Therefore, the astronomical unit is more correctly dened as the distance AU such that 2 (AU )3/2 /(Gm )1/2 equals the period of the Earth. This is itself arbitrary, of course, and it woudl have been just as correct to dene the distance unit as the measured average distance and adjust the time unit accordingly.

28

Chapter 2. The Two-Body Problem

The actual tabulated values of the conversion factors (from [1], p. 429) are 1 TU = 5022675.7 s and 1 DU/TU = 29.784852 km/s. These are calculated from a slightly dierent value of the astronomical unit. Note: Various sources agree on these values to 5, 6, or more decimal places. Vallado [16], for example, lists 1 TU = 58.132440906 solar days = 5022642.8943 s and 1 DU/TU = 29.7846916749 km/s We will seldom need such precision in this book. Because of the diculty of choosing a particular authority, no denitive values for these constants are listed in this book. If needed, the values derived from the physical data in Appendix C will suce.

As mentioned, the distance unit for the Heliocentric system is the astronomical unit. The distance unit for the Geocentric system (in which the Earth is the central body) is the mean radius of the Earth (in this text, r = 6378.145 km). The time unit is then computed as in the above example.

2.10

Problems

1. Classify the following orbits as circular, hyperbolic, et cetera. Assume canonical units. (a) r = 2, v = 2. (b) rp = 1.5, p = 3. , v = 0.5k . (c) r = 1.2k

(d) C = 1/4, p = 2. , v = 0.25 . (e) r = 1.2 + 0.5k + 0.3 + 0.6k

2. What is the eccentricity of an orbit for which ra = krp , in terms of k ? Check your result by showing that the cases k = 1 and k give the appropriate values. 3. Find the ratio of the angular rate d/dt at periapsis to the rate at apoapsis, in terms of the orbital eccentricity. 4. What is the average value of r for an elliptic orbit, taking the true anomaly as the independent variable? 5. What velocity would the moon need to escape from the Earth, starting at its present orbital radius? Compute the answer both with and without including the mass of the moon in computing the gravitational parameter.

2.10: Problems

29

6. A vehicle is traveling near the Earth with velocity v = 5000 meters per second. (a) If the vehicle is in circular orbit, what is its altitude above the surface? (b) What is the closest it can be to the Earth if it is on an escape trajectory? (c) What is the orbital eccentricity if it is at its closest approach, and the distance to the center of the Earth is 24000 kilometers? 7. Take the orbit of the earth about the sun to be circular (a good rst approximation), and consider a meteor approaching the earth on a parabolic orbit about the sun. What is the minimum and maximum speed of the meteor relative to the earth when it strikes? 8. Show that the velocity at the ends of the semiminor axis of an elliptic orbit is the same as the velocity of a satellite on a circular orbit that passes through that point. 9. Show that the ight path angle is 45 when the true anomaly = /2 for all parabolic orbits. 10. What is the value of r on an elliptic orbit such that the vehicle is moving most directly away from the central body (that is, v is most nearly aligned with r)? 11. Halleys comet last passed perihelion on February 9, 1986. It has semimajor axis a = 17.9654 AU and eccentricity e = 0.967298. (a) Calculate the radius and velocity at perihelion and aphelion, in both canonical units and kilometers and seconds. (b) Calculate the period, and date of next perihelion. 12. Derive an expression for the period of an orbit in terms of the speed v0 and radius r0 at some specied point. 13. (Moulton [11]) Suppose a comet on a parabolic orbit with perihelion q strikes and combines with a body of equal mass, which is stationary before the collision. What is the eccentricity and perihelion distance of the combined mass? 14. (Wiesel [17]) Suppose that a spaceship with a solar sail is in motion about the sun. The sail is kept at on to the sun, so that it produces an acceleration from radiation pressure of r a=k 3 r where k is a constant depending on sail reectivity, vehicle mass, and other properties. Show that the equation of motion of the spaceship is r = ( k ) r . r3

30

Chapter 2. The Two-Body Problem What types of consic sections are possible if k < ? When they are equal? When k > ?

15. Assume a comet is on a parabolic trajectory, which lies in the plane of the Earths orbit about the sun. What perihelion maximizes the amount of time that the comet spends within the orbit of the Earth? Assume the orbit of the Earth is circular.

Chapter 3

Position in Orbit and Time in Transit

In the preceding chapter, we have derived the shape and size of orbits. This allows us to predict where a spacecraft or celestial body will travel, and conversely, where it has been, at least to the accuracy of the two-body assumptions we have made. It remains to express the position on the orbit as a function of time, so that we can answer the question not only of where the craft is going, but also when it will get there. We will rst consider these questions for the elliptic orbit, for which we can nd a graphical approach to the solution. After that, we will explore the hyperbolic orbit, and then the parabolic as the limiting case of the other two. We will nd expressions for the position as a function of time, and for the time of transit between two points on a particular orbit. In the nal section of the chapter, we will look at some numerical methods for nding the solutions to the equations we derive. This will complete the necessary machinery for the solution of the mission problems in later chapters.

3.1

Keplers Equation

We will begin by considering the elliptic orbit. The approach we take is due to Kepler, and is geometric in nature. It is based on Keplers second law, that the rate at which area is swept along an orbit is constant. This means that if the area is known, the time it took to sweep the area is known. Unfortunately, the area under an ellipse is not a simple thing to calculate. Therefore, Kepler derived a solution in terms of a superscribed circle, as in Figure 3.1a. First note that we can relate areas in ellipses to areas in circles. We do this by noting that in Figure 3.1a, a vertical line such as P R has the property that PQ b = a PR

31

32

Chapter 3. Position in Orbit and Time in Transit

R Q E P O

B D F e a O

Ac

Figure 3.1: Deriving Keplers equation. The shaded area Ac in Figure 3.1b is thus proportional to Ae in Figure 3.1a. For a circle, however, the area is much easier to calculate. Kepler dened the angle E as the eccentric anomaly (as opposed to the true anomaly ). The area enclosed by the circle, the major axis, and the radius OB of the circle is simply a2 E/2 (E in radians). Then we subtract the area of the triangle, and we have 1 2 1 Ac = a E a2 e sin E 2 2 1 2 a (E e sin E ) = 2 Then the area we wanted in the rst place is just ab bAc = (E e sin E ) a 2 Now recall Keplers 2nd and 3rd laws. The third law gives us Ae = T = 2 a3 /

and we know that the total area of an ellipse is ab. The 2nd law gave us that dA/dt is constant. This means that it must equal the total area divided by the period, so dA ab ab = = 3 dt 2 2 a / a3

(t tp ), where tp is the time at periapsis. Plugging our last Now, Ae must equal A result into this gives Ae = ab ab (E e sin E ) = 2 2 (t tp ) a3 (3.1)

3.1: Keplers Equation

33

We now make two further denitions. The rst is the mean motion n (sometimes called the mean angular velocity), which is simply the average value of the angular velocity. This is the total angle swept in a single orbit, divided by the period of the orbit, and is given by n = 2/T = /a3 The second is the mean anomaly M M = n(t tp ) Using these denitions, 3.1 becomes M = E e sin E This result is known as Keplers Equation. In terms directly of time since periapsis passage, this can be written t tp = a3 / (E e sin E ) (3.2)

and we will see this many times in future chapters. Assuming we can solve Keplers equation, we can use the result to nd , which is what we started out looking for. Going back to Figure 3.1a, we can see that a cos E r cos = ae Recalling the orbit equation, we have cos E = a(1 e2 ) cos e(1 + e cos ) p cos +e = + a(1 + e cos ) a(1 + e cos ) 1 + e cos e + cos = cos E = 1 + e cos

(3.3)

Now, the altitude of the inscribed triangle is related to its intersection with the ellipse as (b/a)a sin E = r sin We again plug in for r and rearrange, arriving at 1 e2 sin sin E = 1 + e cos We rearrange the expression (3.3) for cos E into one for cos instead: cos = e cos E e cos E 1

(3.4)

Now substituting this into equation (3.4) for and manipulating, we nally get 1 e2 sin E sin = 1 e cos E

34 An alternate formulation is

Chapter 3. Position in Orbit and Time in Transit

tan

= 2

E 1+e tan 1e 2

(3.5)

which is very convenient, as /2 and E/2 are always in the same quadrant. This expression is valid only for elliptic orbits. The expression for hyperbolic orbits is similar, and we will derive it later. First, however, we will look at the time of transit on an elliptic orbit.

3.2

Lamberts Problem

Having an expression for the position along an orbit as a function of time is very much like having solved the time in transit problem. We can consider the position problem as the time in transit from one point to another, where the initial point is the periapsis. However, this would require that we know the eccentricity of the orbit. As it turns out, this is not necessary. The problem of transit time between two points on an elliptic orbit is often referred to as Lamberts Problem, after Johann Heinrich Lambert (1728 1779). Lambert explored the problem in depth, and in 1761 postulated what has become known as Lamberts Theorem, which can be stated: The time required to travel between two points on an elliptic arc depends only on the semimajor axis of the ellipse, the straight-line distance between the two points, and the sum of the distances of the points from the focus of the ellipse. This is sketched in Figure 3.2. The values needed for Lamberts problem are the distance d, the semimajor axis, and the sum r1 + r2 . The rst analytical proof of this theorem was not in fact due to Lambert, but was provided by Lagrange in 1778. In this section, we will derive LamP berts result using the geometric proper2 ties of the ellipse. This will result in an P d 1 equation that can be used to solve for the r 2 transfer time given the semimajor axis r 1 of the transfer orbit. The same equation can be used with minor modications for parabolic and hyperbolic orbits as well. Lamberts problem concerns the time it takes to travel from one point to another on an elliptic orbit. This probFigure 3.2: The elliptic transit problem. lem is clearly similar to that of position as a function of time, in which we

3.2: Lamberts Problem

35

d E2 r 1 E1

Figure 3.3: Solving Lamberts problem. found it convenient to work with eccentric anomaly. Thus, we set up the problem as in Figure 3.3. We will assume that E2 is greater than E1 . This is always reasonable, since if it is not the case, we can solve the identical problem of ight from P2 to P1 on an identical orbit in the opposite direction. It will turn out to be useful to dene the angles 1 (E2 + E1 ) 2 1 = (E2 E1 ) 2 = and perform the analysis using these rather than the eccentric anomalies themselves. First consider the distance between the points. We can see from Figure 3.3 that the x and y coordinates of any point on the ellipse are given by x = a cos E and y = b sin E (3.6)

The distance is given by d2 = (x2 x1 )2 + (y2 y1 )2 , which can be written (using the denitions above) as d2 = a2 [cos( + ) cos( )]2 + b2 [sin( + ) sin( )]2 (3.7)

36

Chapter 3. Position in Orbit and Time in Transit

Recalling b2 = a2 (1 e2 ) and simplifying the trigonometric terms leads to d2 = a2 [(cos cos sin sin ) (cos cos + sin sin )]2 +a2 (1 e2 ) [(sin cos + cos sin ) (sin cos cos sin )]2 = 4a2 sin2 sin2 + 4a2 (1 e2 ) cos2 sin2 = 4a2 sin2 (1 e2 cos2 ) (3.8) Because e < 1, the product e cos < 1, so we invent an angle and dene cos = e cos Using this, we can rewrite 3.8 as d2 = 4a2 sin2 (1 cos2 ) = 4a2 sin2 sin2 = d = 2a sin sin Using the fact that r = a(1 e cos E ) and 3.9, the sum of the radii can be expressed as r1 + r2 = 2a ae cos( ) ae cos( + ) = 2a(1 e cos cos ) = 2a(1 cos cos ) Now add this expression to the result (3.10) for the distance. r1 + r2 + d = 2a(1 + sin sin cos cos ) (3.13) (3.11) (3.9)

(3.10)

(3.12)

Recognizing the right hand side as having the form of a trigonometric angle-sum identity, we dene =+ which allows 3.13 to be written as r1 + r2 + d = 2a(1 cos ) = 4a sin2 (/2) Similarly, dening = allows the creation of the identity r1 + r2 d = 2a(1 cos ) = 4a sin2 (/2) (3.15)

(3.14)

3.3: Hyperbolic Trajectories

37

Equations (3.14) and (3.15) allow us to use the known values of a, r1 , r2 , and d to solve for and , and thereby for the eccentric anomalies E1 and E2 , if we also know the eccentricity. This would be all that we need to nd the time of travel between the two points using Keplers equation. The transit time can be written t = t2 t1 = a3 / [(E2 e sin E2 ) (E1 e sin E1 )]

Writing the eccentric anomalies in terms of and produces t = a3/2 [2 + e sin( ) e sin( + )] = a3/2 [2 2e sin cos ] = a3/2 [ 2 sin cos ] = a3/2 [ (sin sin )]

(3.16)

Here, the eccentricity turns out not to be necessary, as we have managed to write the time of transit in terms only of a, , and . Grouping the signicant results, we have t = a3 / [( sin ) ( sin )] r1 + r2 + d 4a r1 + r2 d 4a (3.17) (3.18) (3.19)

= 2 sin1 = 2 sin1

(3.17) is often called Lamberts Equation. For this set of equations to be valid, the transfer must match the assumptions used to derive them. Recall that we assumed a transfer from a point nearer perigee to one farther away. Also, we have tacitly assumed that the satellite does not pass perigee during the transfer. These assumptions will be taken into account when we discuss the implementation of Lamberts work to solving problems in the last section of the chapter.

3.3

Hyperbolic Trajectories

In obtaining our results for time of ight and time in transit for elliptic orbits, it was helpful to be able to use geometric intuition. This is not as easy in the case of hyperbolic trajectories, as there is no superscribed gure with simple, familiar properties to use. Instead, we will use more algebraic approaches, guided by the similarity between these problems and the elliptic problem.

3.3.1

An Analogue of Keplers Equation

For hyperbolic orbits, the semimajor axis is negative. It will turn out to be convenient in the derivations of this section to replace a with its absolute value, which will be denoted a a =

38

Chapter 3. Position in Orbit and Time in Transit

Then we begin with vis-viva equation v= 2 = r a 2 + r a

and substitute the components of the velocity and square the equation to get v2 = r 2 + r2 2 = 2 + r a (3.20)

Recall from the derivation of Keplers second law the result = r2 2A =h and rewrite this to get r2 = a (e2 1) a (e2 1) = 2 = r4 p =

(3.21)

Using eqn. (3.21) to eliminate from eqn. (3.20) gives r 2 = (e2 1) 2 a + r a r2 2 2 = ( ar /)r = 2 ar + r 2 a 2 e2 + a 2 = (r + a )2 a 2 e2 a rdr (r + a )2 a 2 e2 = = dt

(3.22)

We can dene a variable analogous to the mean motion for this case. Recalling that n = /a3 , dene 3 n = /a Then from eqn. (3.22) we can write a 3 / rdr = a dt = a n dt = (r + a )2 a 2 e2 rdr (r + a )2 a 2 e2 (3.23)

This equation can be integrated at least once in terms of hyperbolic functions. Knowing this, we are moved to introduce a value similar to the eccentric anomaly. Denoting this value F , we dene it through the relation r=a (e cosh F 1) (3.24)

3.3: Hyperbolic Trajectories

39

(note the similarity to eqn. (3.6)), and refer to it as the hyperbolic eccentric anomaly. From eqn. (3.24), we see that cosh F = a +r a e and dr = a e sinh F dF

which allows eqn. (3.23) to be written (after quite a bit of manipulation) as n dt = (e cosh F 1)dF This relation is simple to integrate, and results in e sinh F F = n (t tp ) which is the hyperbolic equivalent of Keplers equation. Beginning with the relation r = a(1 e cosh F ) = it is straightforward to derive equations such as cos = cosh F e 1 e cosh F (3.26) a(1 e2 ) 1 + e cos (3.25)

relating the true anomaly to the hyperbolic eccentric anomaly. Further such relations are left as an exercise for the reader.

3.3.2

Time in Transit

Deriving Lamberts Equation for elliptic orbits required a goodly amount of geometric intuition, as well as some fairly intricate algebra. We will use the insight gained in that derivation to help in nding a solution to the problem in the case of hyperbolic orbits. One very useful bit of information is that there exist analogues of eqn. (3.6) and eqn. (3.11) for the hyperbola. We have already dened the hyperbolic eccentric anomaly F ; it turns out that the hyperbola obeys the relations1 x=a cosh F and y = a e2 1 sinh F (3.27) It is easy enough to show that eqn. (3.24) is satised under these relations. Note that F is not an angle in the Euclidean sense, as is the eccentric anomaly E . F does not range from 0 to 2 as does an angle in standard geometry. Rather, this angle is dened on a hyperbola, and runs from to +.
1

When e = 2, the result is an equilateral hyperbola. The results of this section can be derived through geometrical arguments using the equilateral hyperbola in the place of the circumscribed circle used to derive Keplers equation.

40

Chapter 3. Position in Orbit and Time in Transit

Once this relationship is dened, steps nearly identical to those performed to derive Lamberts equation can be followed to get an analogous result for the hyperbola. The resulting equation is t = a 3 / [(sinh ) (sinh )] r1 + r2 + d 4 a r1 + r2 d 4 a (3.28)

where in this case the angles and are given by = 2 sinh1 = 2 sinh1 (3.29) (3.30)

Recall that we are using hyperbolic angles and functions. Therefore, and are not restricted to the range [0, 2 ].

3.4

Parabolic Trajectories

The parabola is in many ways the easiest trajectory for which to derive a relationship for position as a function of time. Beginning with the orbit equation r= p 1 + cos

we use the facts that p = h2 / and 1 + cos = 2 cos2 (/2) to write r= h2 1 2 cos2 (/2)

We recall from the derivation of Keplers Laws that r2 d = h dt dt = so that we now have dt = which in turn implies h3 4 cos 42 2 r2 d h d

h3 d sec4 2 4 0 2 This equation can be integrated to get the relation

t tp =

t tp =

h3 22

tan

1 + tan3 2 3 2

(3.31)

3.5: Implementation Considerations

41

This result is known as Barkers Equation. Barker did not establish the relationship, but provided extensive tables for its solution during the eighteenth century. One of the great benets of Lamberts work is that the eccentricity is not needed to solve for time of ight. This is not a great contribution in the case of parabolic trajectories, as the eccentricity is known for this case. Theoretically, one could iterate on Barkers equation to solve for time of ight. In this case, the result in the style of Lambert produces a simple equation, instead. Beginning with the elliptic time of ight equations, we take the limit as a . Recalling (3.18), we can see that the sine of and thus itself becomes small. This holds true also for . We can therefore replace the sines of the angles with the Taylor expansion about zero to get t = a3 /[(3 /6 ) ( 3 /6 )]

When taking the limit, we will see that all higher order terms go to zero, and after substitution arrive at the result t = (r1 + r2 + d)3/2 (r1 + r2 d)3/2 6 (3.32)

This equation, having been obtained from the expression derived for elliptic orbits, contains all of the assumptions inherent in the earlier result, and more. As a result, (3.32) is valid only when the angle between the two radial vectors is less than ; when the angle is greater than , the minus sign is converted to a plus and the relationship is (r1 + r2 + d)3/2 + (r1 + r2 d)3/2 t = (3.33) 6 This and other issues involved in solving time of ight problems are addressed in the following sections.

3.5

Implementation Considerations

Although the equations derived in this section are framed in the context of nding time of ight, it is also very common to see the inverse problem. That is, given two (or more) observations, nd the semimajor axis and eccentricity of the orbit. This is a problem of orbit determination. In general, we would expect the information we are given to consist of the distances to the object, the times of the observations, and the angle between the radial vectors at the two times. Knowing the angle, we can construct the chord length, giving us most of the information necessary for the time of ight solution. The nal part of the problem is to iterate on the value of the semimajor axis until a value is found that gives a time of ight that matches the known actual time.

42

Chapter 3. Position in Orbit and Time in Transit

Using the results of the previous sections to solve for t is not always straightforward. The equations include trancendental functions which must be solved numerically, and the problem may not t the assumptions used in deriving the equations. Further, the results (except when using universal variables) give dierent equations for elliptic, hyperbolic, and parabolic orbits, which necessitates some way to know which set of equations to use. Even when the type of conic section is known, there may be diculties. In solving position and time of ight problems, we will usually nd it necessary to use some numerical methods. The class of techniques we will need are those involved in nding the zeroes of functions. As this will be a common feature of the problems ahead, we will cover this topic next.

3.5.1

Finding the Zeroes of Functions

The general class of problems we consider here are stated: Given a known function f (x), nd one or more values of x such that f (x) = 0. There are many such techniques, but we will consider only two. Note that in numerical methods, we do not nd actual values of the zeroes. Instead, we nd values that suciently closely approximate the them. The accuracy required is generally decided based on the needs of the particular problem. Bisection The most basic of all zero-nding techniques is bisection. This method usually assumes that there are two known values x and x such that f (x) f (x) < 0 (3.34)

We assume that f (x) is continuous on the interval [x, x], so that this condition guarantees that f ( x) = 0 for some x [x, x]. We will further assume that there is only one such value, though this is not actually necessary. The technique is simplicity itself. Written in algorithmic format, it might be described Given : () x, x, > 0 x = (x + x)/2 if |f ( x)| < exit if f ( x) f (x) < 0 then xx else xx go to ()

3.5: Implementation Considerations

43

This is read something like: Given a lower bound, an upper bound, and a tolerance , set x to the midpoint of the interval. If the value of the function is suciently small, call x a zero and exit. If not, check to see if the zero is between the lower bound and x . If it is, set the upper bound to x . If it is not, set the lower bound to x . Then go back to the top and do it all over again. It is easy to see that the length of the interval will be halved at each step, giving rise to the name of the technique. The value of is an error tolerance; when the absolute value of the function is less than we take it to be eectively zero. The version of the technique given here is the most basic. It is common to add another tolerance, this one on the length of the interval. In this way, it is possible to control the accuracy with which x is found, which is often more important than how close the function is to zero. It also puts a bound on the number of iterations that will be needed to nd a value of x that is close enough to the actual zero of the function. Newton-Raphson Iteration The bisection technique is the simplest and in many ways the most robust of the zero-nding algorithms. All it requires is an interval, a tolerance, and some time. It is also the slowest of the techniques. It is almost never used in practice. The most popular technique in this class is known as Newton-Raphson iteration. This is based on the Taylor expansion of function under consideration about the current point x: h2 f (x + h) = f (x) + hf (x) + f + 2 Since we are looking for the value x for which f ( x) = 0, we use the rst two terms of the expansion to approximate f ( x) = f (x + h) f (x) + hf (x) = 0 We can then easily solve for h as h= f (x) f (x)

The algorithm is usually written something like Given : () f (), f (), x0 , > 0 x x0 If |f (x)| < exit h f (x)/f (x) xxh go to ()

44

Chapter 3. Position in Orbit and Time in Transit

Generally, a count of iterations is kept and the procedure aborted if the number gets very large. Again, there are many enhancements to the basic algorithm presented. It is standard to include a value of as a lower bound on the magnitude of h; when the correction to the current estimate of the zero is less than , the value is close enough.
Example 3.1. Find 2 using Newton-Raphson iteration.

Solution: Finding the square root of a number y is equivalent to nding x such that y x2 = 0. Therefore we set up our equations with y = 2 as f (x) = 2x x 1 1.5 1.41666 1.414216 f (x) 1 0.25 0.006944 6.0 10 6 f (x) = 2 x2

Taking x0 = 1 as our starting guess, we have the sequence of approximations f (x) 2 3 2.83333 h 0.5 0.08333 0.002451

Here we have chosen to stop looking when the value of the function is zero to 6 digits. Note the speed with which the algorithm converges to the solution. Note also, however, that had the starting value been x0 = 0, the rst iteration would have resulted in f = 0, causing the algorithm to fail.

Newton-Raphson iteration is almost always much, much faster than bisection, when it works. It requires more work, in that the derivative of the function must be evaluated as well as the function itself, but this is usually a small price to pay for the much smaller number of total iterations. The drawback to the technique is that, for general functions there is no guarantee that the derivative will not be zero on the interval, in which case the technique will fail. There is also the possibility that if x0 is poorly chosen, the technique will take some time to come near the desired zero, and may in fact nd the wrong zero altogether. In our cases, the functions we will consider will be well-behaved, and we will not need to worry. We will use the Newton-Raphson technique rather than bisection, with some thought given to nding appropriate initial guesses at the zero positions.

3.5.2

Solving Keplers Equation

The only real diculty in applying the Newton-Raphson technique to Keplers equation is the necessity to nd a starting point. This was particularly important in the years before computers, of course, when all computations had to be performed by

3.5: Implementation Considerations

45

hand. As a result, there is an extensive literature devoted to the subject. We will make only a couple of elementary observations here. We rst note that, as e 0, the ellipse becomes a circle. On a circular orbit, E = M , so we might suspect that for small eccentricity, E will be close to M . This is a fair assumption in many cases.
Example 3.2. The eccentricity of Mars orbit is e = 0.0934. Find 200 days2 after perihelion. Solution: First nd the mean anomaly M . For simplicity, we will use tabulated values of Mars orbital period: 200.0 = 1.8292 687.0 Writing Keplers equation in a form suitable for Newton-Raphson gives T = 1y 321.73d = 687.0d = M = 2 M = E e sin E = f (E ) = E e sin E M The derivative f (E ) = 1 e cos E Since e is small, we take as our rst approximation E = M . Then the Newton-Raphson algorithm produces E 1.8292 1.9174 1.9170 f (E ) 0.09030 0.00035 5 109 f (E ) 1.0239 1.0317 h 0.088195 0.000338

After only a single iteration, we have the value of E to four signicant digits; after a second, the accuracy has increased to eight. We then solve for using eqn. (3.5): tan = 1.5637 = = 2.0036 2

Setting E0 = M actually works quite well even for eccentricities in the range of e = 0.5. However, for values of e approaching unity, a better approximation can be made when it is realized that M E M + e. Convergence can be improved by choosing E0 = M + e/2 halfway between the bounds. When solving equations on hyperbolic orbits, the same iteration techniques are used. However, the initial approximation may be harder to nd. For e only slightly larger than unity, we can expand the hyperbolic sine term in eqn. (3.25) to get M e(F + F 3 /6) F
2

M/6

The use of the term day is confusing. In this case, because we have not specied sidereal days, we refer to the mean solar day. Confusingly enough, many tables list sidereal periods in mean solar units. Thus the sidereal period of the Earths orbit is often listed as 365.256d.

46

Chapter 3. Position in Orbit and Time in Transit

As F becomes large, this approximation becomes less valid. Note that an upper bound on F is less easy to dene in this case as well, as sinh F is not bounded. When e is not small, we can begin with the same expansion, but retain e in the simplication: e(F + F 3 /6) F M 0 =
3 F0 +

6M 6(e 1) F0 =0 e e

(3.35)

Solving this cubic will result in a better choice for F0 . When F is large, the cubic approximation to sinh F becomes poor. Considering the exponential expression of sinh F , we have3 sinh F = exp(F ) exp(F ) 2

which, for large F , approaches sinh F = exp(F )/2. Thus, when F is large, we might use F0 = ln(2M/e) (3.36) Which equation to use is hard to know a priori, since we do not know in advance whether F is large. To choose, we pick some test value of F . As in Roy [13], we try F = 5/2 and the expansion eqn. (3.35) becomes M =e 5 125 + 2 48 5 5e 5/2 2 (3.37)

If M is larger than this approximation, then F > 5/2 and we use eqn. (3.36) to generate our rst guess F0 . If M is less, then F is also less and we can use eqn. (3.35).
Example 3.3. Suppose we launch a space probe and accelerate it with a y-by of Venus. If the probe reaches perihelion with a radius of 80 106 km and speed 60 km s on January 1, 2000, on what day will it reach the orbit of Saturn? Solution: This problem does not require iterative solution, because the angles are known. All that is required is to rst compute the eccentricity: C = v 2 /2 /r = 141.25
Since most calculators do not have hyperbolic function keys, it is convenient to recall the expressions exp(F ) + exp(F ) exp(F ) exp(F ) cosh F = sinh F = 2 2 sinh1 x = ln(x + x2 + 1) tanh1 x = 1 ln 2 cosh1 x = ln(x + 1+x 1x x2 1)
3

3.5: Implementation Considerations


= a = 4.697 108 km 1+ 2Ch2 = 1.1703 2

47

h = rp vp = 4.8 109 = e=

We know the radius of the orbit of Saturn is a = 9.5388AU = 1.427 109 km. We can then use the relation r = a(1 e cosh F ) = cosh F = ar = 3.450 ae

from which we nd F = 1.910. We use this in (3.25) to nd the time since perihelion. t tp = e sinh F F = 54.63 106 s = 632.26 days n

Since the year 2000 is a leap year, it has 366 days and the probe will cross the orbit of Saturn on the 266th day of 2001. This turns out to be September 23.

3.5.3

Time of Flight Problems

Solving time of ight problems is slightly more complicated than solving position problems, because we often do not know a priori whether the orbit is elliptic or hyperbolic. There are simple tests for this, however. It can be shown that the parabolic case is the limiting case for time of transfer; all other things being equal, the parabolic time of ight is the lower bound for the elliptic time of ight, and the upper bound for the hyperbolic.
Example 3.4. A comet is sighted at a distance of r1 = 6.336 108 km from the Sun. 110 days later, it is sighted again, having traversed an angle of 20.9 , at a distance of r2 = 1.886 108 km. Find the semimajor axis of the comets trajectory. Solution: We have to begin by guring out what set of equations to use. We do this by trying out the parabolic elapsed time equation rst.
2 + r 2 2r r cos(21 ) Find the chord length: d2 = r1 1 2 2

d = 4.623 108

The parabolic time of ight is given by eqn. (3.32) (note that the angle traversed is less than ) as t = 1.794 107 s 208 days Since the observed time of 110 days is less than this value, the trajectory must be hyperbolic. We now turn to nding the semimajor axis, which will give us all we need to solve the problem. We have discussed no method by which a rst guess is to be made. Here, we will simply try a = r1 + r2 , which results in t( a = 8.222 108 ) = 1.589 107 s 184 days

48

Chapter 3. Position in Orbit and Time in Transit


This is far too high. Since the longest time of ight is given by a , we suspect that moving a towards zero might shorten the time of ight. Since we need to shorten it considerably, lets try a = 3.00 108 . This gives t( a = 3.00 108 ) = 1.367 107 s 158 days Since a reduction of over half in the semimajor axis took us less than halfway to the goal, we might try reducing it by a factor of 4 this time: t( a = 7.50 107 ) = 9.221 106 s 107 days A couple more guesses and we arrive at a = 8.0 107 km.

Note that it is certainly possible to set up time-of-ight problems to be solved using Newtons method. The function and its derivatives are more complicated than for orbital position problems, but the method is valid. This method, coupled with a scheme for generating an initial estimate, would be the probable method for implementation in a computer program.

3.5.4

Finding the Orbital Characteristics

In time-of-ight problems, it is usually not just the semimajor axis but all of the characteristics of the orbit that we seek. In Example 3.4 above, for example, knowing the semimajor axis of the cometary orbit is not sucient to predict, for example, when and where it will reach perihelion. To answer these kinds of questions, we need to nd the eccentricity of the trajectory, as well. This knowledge is assumed in the formulation of Keplers equation, but not so in the derivation of the transit time results. Therefore, we must be able to derive the eccentricity from these results. Recalling the denitions of , , and in section 3.2, we note that = 2 = E2 E1 and + = 2 = cos (3.38)

E2 + E1 + = e cos (3.39) 2 2 These two equations are not enough to solve for e. However, recalling that r = a(1 e cos E ), we can write cos E2 a r2 = (3.40) cos E1 a r1 From (3.38), we can write cos E2 = cos(E1 + 2) = cos E1 cos(2) sin E1 sin(2) which we divide through by cos E1 to get cos E2 = cos(2) tan E1 sin(2) cos E1 (3.41)

3.5: Implementation Considerations We can now include (3.40) in (3.41) to arrive at tan E1 = a r2 1 cos(2) sin(2) a r1

49

(3.42)

Since the values of and are byproducts of the solution of the time-of-ight problem, (3.42) will allow us to nd E1 (and then, should we desire, E2 ). We can then use any number of relations to nd the eccentricity. Kaplan [8] attacks the entire problem in a somwhat dierent matter (though still using the geometry of the transfer). He proceeds to derive a quadratic exression with the semi-latus rectum as the argument. Solving for the case of the ellipse, this provides a(r1 + d r2 )(r2 + d r1 ) 2 + p= sin (3.43) d2 2 and, for the hyperbolic problem, p= a(r1 + d r2 )(r2 + d r1 ) sinh2 d2 + 2 . (3.44)

Taking limiting arguments as in section 3.4, the result for the parabolic case becomes (r1 + d r2 )(r2 + d r1 ) p= d2 r1 + r2 + d + 4 r1 + r2 d 4
2

(3.45)

Between these and (3.42), nding the orbital parameters is relatively straightforward.
Example 3.5. Considering the comet of Example 3.4, how long after the second sighting will the comet reach perihelion, and how far from the Sun will it be? Solution: By reasoning similar to that used above, it is easy to show that in hyperbolic problems a r2 1 cosh(2) (3.46) tanh F1 = sinh(2) a r1 From the information given, we compute = 2.8904 from which (3.46) gives tanh F1 = 1 2.686 108 = 0.9806 1.5956 1.2434 7.136 108 ar = 1.750 a cosh F = F1 = 2.3121 = 1.8470

We can now nd the eccentricity from e=

50

Chapter 3. Position in Orbit and Time in Transit


From the eccentricity, it is now trivial to compute perihelion. rp = a (1 e) = 60.0 106 km Time to perihelion is found easily from the hyperbolic version of Keplers equation. Knowing F1 and e, we solve for time of ight from t tp = a 3 / (e sinh F F ) = 1.264 107 s = 146.25 days

Since this is time to perihelion from the rst sighting, we subtract the 110 days to the second sighting and nd that the comet will reach perihelion in only 36.25 more days.

3.6

Choosing the Right Solution

Although we have derived approaches to the time of ight problem for each of the orbit types, there remains the problem of selecting the appropriate solution. For any problem involving a known focus and two points on the curve, there are two hyperbolic, two parabolic, and four elliptic solutions. We ignored this, tacitly assuming that the solution was straightforward, It is easily seen that for any two points P1 and P2 dened relative to a given focus, there are innitely many of each conic section that pass through the points. It should not be surprising then that there are multiple possible solutions for a specied travel time. In the hyperbolic and parabolic case, these correspond to solutions that sweep angles of less than or greater than . For the elliptic case, there are (usually) two possible ellipses that pass through the points, and the two solutions relate again to sweeps of greater than or less than pi.

3.6.1

Parabola and Hyperbola

As mentioned, for parabolic and hyperbolic orbits, the two possible solutions for a given travel time between two points depends upon the sweep angle. If the two points are directly opposed, so that the observations are separated by 180 , the two paths are identical. For situations in which the observations are separated by an angle less than , there is no way to know purely from their positions and the travel time which solution is correct. It is necessary to know whether the object has in fact traveled the short path, or taken the longer arc so that the transfer angle is greater than . For the parabolic transfer, it has already been mentioned that for transfers of less than , we use (3.32). For transfers of greater than , by contrast, we use (3.33). The only dierence is changing a subtraction to an addition. For the hyperbolic orbit, the result is eectively the same. Recall that we nd the angle as (equation (3.30)) = 2 sinh1 r1 + r2 d . 4 a

3.6: Choosing the Right Solution

51

For transfer angles greater than , we simply take the negative of this. The subtraction in (3.28) is then a subtraction of a negative, becoming an addition.

3.6.2

Elliptic transfer

In the case of elliptic transfer, the two elliptic solutions each provide two possible transfers, one less than and one greater. These two will have dierent travel times, however, so there is still only one solution for each ellipse that satises the problem. It remains, however, to nd which ellipse to use. Recall that the choice between elliptic or hyperbolic solution is made by determining the parabolic transfer time. The parabola can be thought of as the minimum energy hyperbola, and if the required travel time is greater than the parabolic transfer time, the solution must be elliptic. In the same way, we can determine the minimum energy ellipse that satises the geometry for a particular problem, neglecting the transfer time. Recall (3.18): = 2 sin1 r1 + r2 + d . 4a

Clearly, there is a minimum value of the semimajor axis, below which the term under radical is greater than unity and the inverse sine has no real solution. This minimum value of a is the semimajor axis of the minimum energy ellipse: am = r1 + r2 + d . 4 (3.47)

Using this value of a, = and sin = 0, so that tm = a3 m / [ ( sin )] . (3.48)

If the transfer time is greater than this, it must be true that the transfer is the long way around (in time, not necessarily angle swept) on whichever ellipse is the correct solution. Letting 0 and 0 be the values of the angles from (3.18) and (3.19), we can specify the possible solutions as angle swept < = 0 angle swept > = 0 t < tm = 0 t > tm = 2 0 When nding the orbital characteristics using the methods of section 3.5.4, it is necessary to be sure to use the correct values of and . Prussing and Conway [12] give a geometric interpretation of these angles, and a more detailed discussion.

52

Chapter 3. Position in Orbit and Time in Transit

3.7

Problems

1. A minor planet has an orbit with eccentricity e = 0.21654 and period 4.3856 years. Calculate the eccentric anomaly 1.2841 years after perihelion. 2. Halleys comet has semimajor axis a = 17.9654 AU, eccentricity e = 0.967298, and periapsis passage was tp = February 9, 1986. Find the eccentric and true anomalies on October 27, 1960. 3. An object is at perihelion at rp = 8.00 107 km. When will the object be at r = 3rp if: (a) The orbit is elliptic with a = 2.5rp ? (b) The orbit is parabolic? (c) The orbit is hyperbolic with e = 2?

4. Consider the space probe in example 3.3. How far will it be from the Sun when the warranty expires on the rst of January, 2003? 5. A satellite in orbit about the Earth has perigee height 622 kilometers and apogee height 2622 kilometers above the surface. (a) What is the time of ight from = 15 to = 120 ? (b) What is the time of ight from = 240 to = 30 ? 6. A comet has an orbit in the plane of the ecliptic. At aphelion the comets velocity is 9.29 km/s and the distance from the Sun is ra = 3.365 108 km. How long after aphelion will the comet cross the orbit of the Earth? 7. An object is observed at an altitude above the Earth of 366 km and 12.5 minutes later at an altitude of 633 km; the angle between the two observations was 45 . Find the semimajor axis and eccentricity of the orbit, and the true anomaly at the time of the rst observation. 8. An object is observed at an altitude above the Earth of 7202 km and 14.4 minutes later at an altitude of 11980 km; the angle between the two observations was 80 . Find the semimajor axis and eccentricity of the orbit, and the true anomaly at the time of the rst observation. 9. An object is observed at an altitude above the Earth of 92000 km and 2.9 hours later at an altitude of 7062 km; the angle between the two observations was 96 . Find the semimajor axis and eccentricity of the orbit, and the true anomaly at the time of the rst observation.

3.7: Problems

53

10. A periodic scan of the solar system shows an object at a distance of r1 = 1.544 108 km from the sun. A scan 38.25 days later, in a direction 50 away, shows what appears to be the same object at a distance of r2 = 1.602 108 km. What is the eccentricity and semi-major axis of the objects orbit, and the true anomalies of the sightings? 11. An object is sighted at a distance of 2.82 108 km from the Sun. 90 days later it is sighted at a distance of 3.33 108 km, and the angle between the two sightings is 40 . What are the true anomalies at the two sightings? What is the semimajor axis and the eccentricity of the objects orbit? Is it possible that the object has made a complete orbit between the two sightings?

Chapter 4

Orbital Elements Putting the Orbit in its Place

In the last chapters, we discussed the orbit of two generic bodies, somewhere in space. We began with the specication that the bodies existed in some inertial frame, but no mention of any particular frame was later made. We discussed only those characteristics of the orbit that are independent of its inertial characteristics. In this chapter, we consider the orbit in relation to the inertial frame. In a sense, we will be locating the orbit in space.

4.1

Coordinate Frames

The orbital elements that will be introduced later serve, in part, to locate an orbit with respect to an inertial reference frame. For the purposes of denition, we could simply assume the existence of such a frame and go on from there. However, orbital mechanics has been built up from centuries of observation of the motions of the planets their moons, and agreed-upon coordinate frames in which to express these observations have naturally been dened. There are two frames of particular interest for us. Interplanetary travel will require transfers between planetary orbits, which are sun-centered. Such orbits are called heliocentric. The orbits of Earth satellites are dened in the geocentric, meaning Earth-centered, frame.

4.1.1

Heliocentric Orbits

The heliocentric coordinate frame has its origin, not surprisingly, at the center of the Sun. The fundamental plane of this frame is the plane of the ecliptic, the plane in which the orbit of the Earth lies. The Z -axis is then dened to be normal to this plane, along the angular momentum vector of the Earth.

55

56
Z

Chapter 4. Orbital Elements

The axes of the heliocentric system are (eectively) xed with respect to the distant stars. Confirst day of spring sider Figure 4.1. The line running from the center of the Earth first day of summer SUN Y through the center of the Sun on first day of winter the rst day of spring denes the X -axis. The rst day of Spring first day of autumn X is the vernal equinox, and the X axis lies in the vernal equinox diFigure 4.1: Heliocentric coordinate system. rection. The direction is usually given the symbol by astronomers, because it used to point to the constellation Aries.1 The inertial Y -axis is then specied to complete a right-handed orthogonal system. Many of the terms introduced above come through the ages from very early astronomy, and thus some historical perspective will not be amiss at this point. It will also serve to introduce some further terminology, some of which is still in occasional use. From Berry [3], pp.11-14: When the sun crosses the equator the day is equal to the night, and the times when this occurs are consequently known as the equinoxes, the vernal equinox occurring when the sun crosses the equator from south to north (about March 21st), and the autumnal equinox when it crosses back (about September 23rd). The points on the celestial sphere where the sun crosses the equator...are called the equinoctial points, occasionally also the equinoxes. After the vernal equinox the sun in its path along the ecliptic recedes from the equator towards the north.... The time when the sun is at its greatest distance from the equator in the north side is called the summer solstice, because then the northward motion of the sun is arrested and it temporarily appears to stand still. Similarly...the winter solstice.... Among the constellations which rst received names were those through which the sun passes in its annual circuit of the celestial sphere.... This strip of the celestial sphere was called the zodiac, because the constellations in it were (with one exception) named after living things (Greek o , an animal); it was divided into twelve equal parts, the signs of the zodiac, through one of which the sun passed every month.... Owing, however, to an alteration of the position of the equator, the sign Aries, which was dened by Hipparchus in the second century b.c. as beginning at the
Note that the Earth is actually at a negative value of X at this point on its orbit, because the direction dates from when the Earth was considered the center of the celestial sphere. The direction was thus dened as the direction from the Earth to the Sun at the equinox.
1

4.2: The Classical Orbital Elements vernal equinoctial point, no longer contains the constellation Aries, but the preceding one, Pisces; and there is a corresponding change throughout the zodiac. The more precise numerical methods of modern astronomy have, however, rendered the signs of the zodiac almost obsolete; but the rst point of Aries () and the rst point of Libra ( ), are still the recognized names for the equinoctial points.

57

(Boldface in the original.) The alteration of the position of the equator mentioned by Berry is due to the precession of the axis of rotation (and symmetry) of the Earth. The axis completes one cycle about the celestial Z -axis in about 26 thousand years. For this reason the heliocentric system is not quite xed, but the motion is suciently slow that we may take it as xed for our purposes. For some long-term or very precise measurements, however, the motion of the equinoxes must be taken into account.2

4.1.2

Geocentric Orbits

Consider again Figure 4.1. The Earths equatorial plane is tilted approximately 23.45 with respect to the plane of the ecliptic. The line of intersection of the two planes will run through the center of both the Earth and Sun on the equinoxes; for the rest of the Earths orbit it will be parallel to the equinoctial direction. (Another way to dene the equinoxes is when the line does pass though the centers of both bodies.) The line of intersection also serves as the X -axis for the geocentric equatorial coordinate system, so that this axis is always parallel to, and in the same direction as, the heliocentric X -axis. The Z -axis is then taken to be the axis of rotation of the Earth, positive northwards in the heliocentric frame, and the Y -axis again is chosen to complete a right-handed system.

4.2

The Classical Orbital Elements

The elements of an orbit are a set of parameters that fully describe the orbit, in the sense that given the orbital parameters, it is possible to sketch out the set of points in inertial space that are on the orbit. Another way to think of this is that the elements describe all possible positions of a spacecraft that is somewhere on that orbit. One additional orbital element is added to serve as a reference time, so that the position of that spacecraft can be rened to a specic point as a function of time. Such a full specication is equivalent to specifying the inertial position and velocity of the spacecraft at any point in time. As it takes three position coordinates and three corresponding velocity components to describe this, it is not surprising that it also requires six elements to fully describe the orbit. These six can be broken down into three subsets: one set (two elements) describes the size and shape of the orbit as
2 For more on the precession of the equinoxes, see Thomson [15], sec. 5.15, pp. 146 . Gurzadyan ( [7], chapter 1) has a brief overview of some of the history of Celestial Mechanics.

58

Chapter 4. Orbital Elements

it lies in the orbital plane; one set (three elements) denes the orientation of the orbit with respect to the inertial reference frame, and the nal set (one element) provides the time reference just mentioned. There are several sets of elements in use in the orbital mechanics community. We will concern ourselves with those that are often referred to as the classical orbital elements. These are not in fact the most widely used, due to some mathematical singularities, but they are the most straightforward, and will serve our purposes well. For other sets of elements, the reader is referred to Battin [2], or Danby [6].

4.2.1

Size and Shape of the Orbit

The size and shape of the orbit have been discussed at length in earlier chapters. These are dened by the eccentricity e and the semimajor axis a. Recall that the semi-latus rectum is fully dened by these two, so that any two of the three elements (a, e, p) would give the same information. It is most common to use a and e, though an occasional text will specify orbits through p and e. Because the eccentricity gives the most direct information about the orbits shape, it is always in the element set.

4.2.2

The Angular Elements

Three angular elements are needed to dene the orientation of the orbit with reZ spect to the inertial coordinate system. These elements coincide with Euler roh 00000000000000 11111111111111 tation angles (see Appendix B). They 00000000000000 11111111111111 00000000000000 11111111111111 00000000000000 11111111111111 00000000000000 11111111111111 are shown in Figure 4.2, and the names 00000000000000 11111111111111 u 00000000000000 11111111111111 00000000000000 11111111111111 00000000000000 11111111111111 of these elements are: 00000000000000 11111111111111 00000000000000 11111111111111 Periapsis 00000000000000000 11111111111111111 00000000000000 11111111111111 00000000000000000 11111111111111111 00000000000000 11111111111111 00000000000000000 11111111111111111 00000000000000 11111111111111 - The right ascension of the ascending 00000000000000000 11111111111111111 00000000000000 11111111111111 00000000000000000 11111111111111111 Y 00000000000000 11111111111111 00000000000000000 11111111111111111 00000000000000 11111111111111 00000000000000000 11111111111111111 00000000000000 11111111111111 node, 00000000000000000 11111111111111111 00000000000000 11111111111111 00000000000000000 11111111111111111 00000000000000 11111111111111 00000000000000000 11111111111111111 00000000000000 11111111111111 00000000000000000 11111111111111111 00000000000000000 11111111111111111 - inclination, and 00000000000000000 11111111111111111 00000000000000000 11111111111111111 00000000000000000 11111111111111111 00000000000000000 11111111111111111 - argument of periapsis. 00000000000000000 11111111111111111 00000000000000000 11111111111111111 00000000000000000 11111111111111111 X 00000000000000000 11111111111111111 Line of nodes 00000000000000000 11111111111111111 The line of intersection of the plane 00000000000000000 11111111111111111 00000000000000000 11111111111111111 00000000000000000 11111111111111111 00000000000000000 11111111111111111 of the orbit with the fundamental plane 00000000000000000 11111111111111111 00000000000000000 11111111111111111 00000000000000000 11111111111111111 00000000000000000 11111111111111111 of the axis system is known as the line of nodes. The nodes themselves are the Figure 4.2: Classical orbital elements points at which the orbit itself crosses the fundamental plane. The ascending node (sometimes, but not often, the rising node) is the node at which a spacecraft on the orbit is passing from the southern (z < 0) hemisphere to the northern hemisphere; it is ascending because z is positive. The other node, at which the craft would be going from north to south, is known as the descending node.

4.2: The Classical Orbital Elements

59

The term right ascension comes from the practice of describing the apparent positions of celestial bodies in terms of the angles describing the line of sight to them in terms of ascension and declination. (These terms are still used in some implementations, but with this one exception will not be needed in this book.) The right ascension of the ascending node is dened as the angle between the inertial X -axis and the line of nodes, measured positive about the positive Z -axis. The inclination of the orbit is the angle between the orbital plane and the fundamental plane. This is also the angle between the inertial Z -axis and the angular momentum vector of the orbit. The angle is measured about the line of nodes, positive by the right-hand rule, with the direction of the line of nodes being taken positive from the descending node towards the ascending node. The argument of periapsis is dened with respect to the Laplace vector, which we recall points from the focus to the periapsis of the orbit. is the angle from the line of nodes to the Laplace vector, measured in the direction of travel around the orbit (this is also positive about the angular momentum vector).

4.2.3

Position on the Orbit

There remains the specication of the position of the body on the orbit. This is dened in one of two ways. We can either use the true anomaly at some specied time, or the time of periapsis passage. From either of these bits of information (along with a and e), the position of the body along the orbit at any other known time can be found. The time of periapsis passage, which we will denote Tp , is simply the time at which the body on the orbit is at periapsis. This is clearly not unique for a closed orbit, since the body will cross periapsis an innite number of times. For an orbit with non-negative energy, there is only one possible value for Tp . When choosing to specify the true anomaly at a specied time, the anomaly at epoch is usually used. When referring to an observed set of data, the epoch is generally taken to be the time of the observation. For theoretical work, the epoch is an agreed-upon time, set by convention and committee. The current value for solar-system orbits is the precise time of vernal equinox of the year 2000. In 2025, the epoch will be changed to the equinox of 2050. In either case, the symbol we will use for the true anomaly at epoch is 0 .

4.2.4

Summary

As mentioned, there are many element sets from which to choose. Even within the classical elements, there are many possibilities for inclusion. For our purposes, the set we will normally use will be (a, e, , , , 0 ). Note that for the orbit itself, only the rst ve in the set are needed, so that when it is only the characteristics of the

60

Chapter 4. Orbital Elements

orbit that are of concern the last element might be dropped. Special cases, in which this set may not be valid, are the topic of the next section.

4.3

Special Cases and Singularities

As mentioned, there are many cases in which the classical orbital elements just described are insucient. This is because there are many common cases in which one or more of them are not dened. Consider the case of an orbit lying in the fundamental plane (in the geocentric system, this would be called an equatorial orbit). In this case, there is no line of nodes, so clearly there can be no right ascension of the ascending node. There can also be no denition of the argument of periapsis, since there is nothing from which to measure it. There can also be no argument of periapsis if there is no periapsis, which is the case for circular orbits. This is a case of special importance, as many orbits of interest in applications are circular or close to circular. For these cases, some extensions to the classical element set can be dened. Let u denote the argument of latitude of a point along the orbit, dened as the angle between the line of nodes and the ray from the focus to the point. In Figure 4.2, the value of u would be the sum u=+ For an inclined circular orbit, this value is still dened. We will let l denote the true longitude along an orbit, which we will dene as the sum l =++ =+u when these values are dened. For equatorial orbits, l is simply the angle from the X -axis to the line joining the focus to the body on the orbit. Note that this is not true for inclined orbits, because the angle is not in the same plane as the others. With these denitions, we can dene the following additional elements to describe an orbit: 1. : The longitude of periapsis. This is dened for all non-circular orbits. When the inclination is zero, = . 2. u0 : The argument of latitude at epoch; u0 = + 0 . This is dened for any orbit for which the argument of latitude is dened. 3. l0 : True longitude at epoch. l0 = + + 0 = + 0 = + u0 Although we have been using freely, in many cases the position of a body on an orbit is written using l. For general orbits, this is simply minus a constant, so that it makes little dierence in practice. However, the fact that is measured from

4.4: The Orbital Elements from Position and Velocity

61

periapsis can cause troubles in many applications, and so the true longitude is used for safety and full generality. While the use of any particular element set is a matter of convenience when all elements are constant, this is not the case when the elements can change, as is the case when a spacecraft res its motors. In the case of a small thrust, we would like the elements of the orbit to change by only a small amount. This is not necessarily the case with the classical elements, in particular when the eccentricity is nearly zero. This case will be explored more fully in the appropriate chapter.

4.4

The Orbital Elements from Position and Velocity

As mentioned earlier, specifying a complete set of orbital elements is the equivalent of specifying the position and velocity of the body on the orbit. It naturally follows that, given the position r and velocity v at some specied time, we should be able to compute the elements. In this section, we will show how this is done. Recall that we assume that the value of the gravitational parameter is known in advance. The following steps are valid for non-circular orbits. Deriving relationships for circular orbits and other special cases is left as an exercise for the reader. Note that these steps need not be followed in this order, and that this is not the only way to approach the problem. 1. From the vectors r and v, compute the mechanical properties of the orbit. That is, compute the the specic angular momentum vector, h = r v, its magnitude h, and the specic energy, C = v v/2 /r. 2. The energy provides the semimajor axis: a = /2C . 3. Compute the Laplace vector A = v h r/r. The eccentricity is then simply the magnitude A divided by the gravitational parameter: e = A /. (Alternatively, e = 1 + 2Ch2 /2 , but the Laplace vector will be useful later.)

4. The inclination angle can be immediately computed. Because the inclination is the angle between h and the inertial Z -axis, we have that = h cos hk h3 = = = cos1 h3 h

Recall that the inclination angle is always non-negative and less than or equal to 180 .

62

Chapter 4. Orbital Elements 5. Find the line of nodes. By denition, the line of nodes lies in the plane of the orbit, and in the fundamental plane. Thus it is normal to both h and to the Z -axis. Now, we know that a b produces a vector normal to both a and b. Thus we dene the vector n by h = h2 n=k + h1

The order of the cross product is chosen to produce a vector that points towards the ascending node. We need only the direction; we are not concerned with the magnitude. It might however be noted that if n is normalized to a unit vector at this point, it will be convenient for later computations involving the magnitude n.

6. The right ascension of the ascending node is the angle between X and the line of nodes. Thus n1 n = n cos = = cos1 n If n2 > 0, then is less than . 7. The argument of periapsis is the angle from n to A, so we have A n = An cos = = cos1 An An .

Recall that, measuring from n, the rst half of the orbit is north of the fundamental plane. Thus, if A3 > 0, then 0 < < . 8. The true anomaly at epoch 0 is the angle between A and r, so 0 = cos1 Ar Ar

where we have assumed the reference epoch to be the time of the observation. If r v > 0, then 0 is less than . Naturally, the above recipe fails for circular orbits, and assumes that the element set desired includes 0 rather than Tp . However, Keplers equation can be used to correct for the latter (assuming that the time of the observation is known), while the former requires the use of an appropriate alternate element set.

4.5

From Elements to Position and Velocity

We now know how to nd , , and the other elements, given r and v at some time t. The reverse problem is a touch more dicult. The approach requires us to rst dene one more coordinate frame. It will be simple to express r and v as a function of the classical elements in this new frame, and we then use the usual transformations to go from this frame to the inertial.

4.5: From Elements to Position and Velocity

63

4.5.1

The Perifocal Coordinate Frame

The Perifocal frame is dened by the orbit itself, using the angular momentum and Laplace vectors. The origin is at the focus of the orbit, and the X -axis points to periapsis; from this we get the name. The perifocal system is sketched in Figure 4.3, using the same orbit as in Z Figure 4.2. The axes of the perifocal system are identied with the subscript p. Yp Zp The easiest mathematical description of Xp the perifocal frame is to say that the Xp r axis is aligned with the Laplace vector, the Zp axis with the angular momentum vector, and the Yp axis is chosen Y to complete the right-handed orthogonal system. X Because the Zp axis is aligned with the angular momentum vector of the orbit, it is normal to the orbit plane. Keeping this in mind, and looking at Figure 4.3: The perifocal coordinate Figure 4.3, it is easy to see that frame. rp = r cos p + r sin p . (4.1) (The notation rp here means the vector r expressed in the frame subscripted p.) To nd vp , we dierentiate r to get p = (r vp = r cos r sin ) p + (r sin + r cos ) p (4.2)

Note that the perifocal frame is inertial, so that there is no need to take time derivatives of the unit vectors p and p . To evaluate the derivative, begin with the orbit equation p r= . 1 + e cos Recalling that h = r2 and p = h2 /, we dierentiate the orbit equation to get (after a fair bit of careful arithmetic): r = e sin p and r = (1 + e cos ) p

which we substitute into eqn. (4.2) to arrive at vp = [ sin p + (e + cos ) p ] p (4.3)

Now that we have r and v in the perifocal frame, we use our knowledge of coordinate transformations (see B for a review) to put them into the inertial frame.

64

Chapter 4. Orbital Elements

4.5.2

Perifocal to Inertial Transformation

The angular orbital elements serve the role of Euler rotation angles. To envision this, imagine a rigid wire ellipse in space, of the same size and shape as the orbit, with a wire pointer also running from the focus to the periapsis. Let it begin in the plane of the ecliptic, with the major axis along the inertial X -axis, so that the pointer runs in the positive X -direction. The focus of the ellipse is at the origin of the inertial system. Now rotate the ellipse about the Z -axis through the angle . This will result in the pointer being along what will be the line of nodes. Next, the ellipse is rotated about its major axis, through the inclination angle i. Finally, rotate the ellipse about its normal (the angular momentum vector of the orbit), through the angle . The ellipse is now exactly in the position and orientation of the orbit. This is a standard 3-1-3 Euler rotation. We let r be the position vector in the desired inertial frame of reference, and rp the position written in the perifocal frame. The two are related as rp = [ about Zp ][ about X ][ about Z ]r which is inverted as r = [ about Z ]T [ about X ]T [ about Zp ]T rp Filling in the matrices, we have cos sin 0 1 0 0 cos sin 0 r = sin cos 0 0 cos sin sin cos 0 rp 0 0 1 0 sin cos 0 0 1

which multiplies out to cos cos sin cos sin sin cos cos cos sin sin sin r = cos sin + sin cos cos sin sin + cos cos cos sin cos rp sin sin cos sin cos (4.4) Being lazy, we write this more compactly as r = LI p rp (4.5)

Similarly, the velocity vector is transformed as v = LI p vp and from the last section, we already know rp and vp . (4.6)

4.6: Alternate Element Sets

65

4.6

Alternate Element Sets

There is nothing magical about the classical set of orbital elements. They are convenient for many purposes because they have obvious and immediate meanings in terms of the size, shape, and orientation of the orbit. They are not suitable for use in many cases, however. Any set of six numbers could be used as elements, so long as knowing those numbers would enable us to compute the position and velocity of the body in orbit at any given time. For example, the position and velocity at velocity at some specied time could be used, eectively providing an initial condition for the orbiting body. There are several sets of elements used for varying purposes. One problem with the classical set is the large number of cases in which some are not dened. Even more troublesome for many uses is the fact that a small change in the velocity of the orbiting body does not necessarily result in a small change in the elements. In particular, consider a satellite in a nearly circular orbit. The argument of periapsis is well-dened. However, a tiny change in velocity may cause the eccentricity to go to zero, and the argument of periapsis is no longer dened. A second small change could make the orbit again elliptical, but the argument of periapsis will have no relation to its earlier value. While there are a number of alternative element sets, we will here present only two. The rst is closely related to the classical set, and has the advantage that it is singular only in very special cases. The second is derived from a dierent approach to the equations of motion, which we will not explore here.

4.6.1

Equinoctial Elements

The equinoctial element set can be dened in terms of the classical set. One of the elements in this set is the semimajor axis a. The other ve are dened as L=+M P1 = e sin P2 = e cos Q1 = tan sin Q1 = tan cos 2 2 where = + is known as the longitude of periapsis, and M is the mean motion as dened in section 3.1. This set of elements is singular only for the case of true retrograde orbits, where the inclination = . Note that the use of the mean motion in dening l causes this to no longer be constant. This is somewhat at odds with the description of the elements as a set of six constants. The deviation is due to convenience, as M incorporates both the time since periapsis passage and the mean motion. The element L is usually called the mean longitude. It should be noted that there is no xed set of symbols for the equinoctial elements.

66

Chapter 4. Orbital Elements

4.6.2

Delauney Elements

The Delauney elements are derived from the expression of the orbital equations of motion as a Hamiltonian system. This form of mechanics is common in investigations of celestial mechanics, but is not needed for the purposes of this text. The Delauney elements are constants of the motion derived from the Hamiltonian approach, just as we have derived six constants from the Newtonian approach we have taken, again with a modication to include the mean anomoly, bringing time into the element set. We adopt here Delauneys own notation, which is that most often used for his element set. Dening them in terms of the classical elements, the Delauney set is l=M L = a G= H= a(1 e2 ) cos a(1 e2 ) g= h = .

Here, M is again the mean anomaly, M = n(t tp ). The symbols L and h do not have their previous meanings, however. In fact, it is obvious that G is actually the magnitude of the angular momentum, and H is the component along the inertial Z axis. Note that the Delauney elements are also subject to singularities. The element g is undened when the eccentricity goes to zero, and L becomes unbounded as the orbit approaches being parabolic. As might be suspected, there is a corresponding set of non-singular elements. As with the equinoctial set, these lack the easily seen physical meaning of most of the Delauney set. They are useful for perturbation analyses in cases when the singularities cannot be condently avoided, but are well beyond the needs of this text. We will revisit the Delauney elements briey in a later chapter, when we discuss the eects of imperfections in the two-body assumption.

4.7

Problems

1. For each of the special cases (a) e = 0, sin = 0, (b) e = 0, sin = 0, and (c) e = 0, sin = 0, give a complete set of orbital elements, and show how each is computed from r and v. v= 2. What are the orbital elements if at some point on an orbit r = + 0.2k, . 0.9 + 0.123k 3. What are the orbital elements if r = ( 2/2) + ( 2/2) , v = (1/2) . v = 4. What are the orbital elements if r = k, .

4.7: Problems

67

5. Find (by logic and inspection) vectors r and v such that a circular geocentric orbit has = 90 , = 45 , and a period of 2 hours. Explain your selections. 6. In canonical units ( = 1), the following data is given. Determine a set of independent orbital elements for each case. (a) 3 2 3 2 k r= 2 2 6 v= 6 (b) 2 2 r= + 2 2 2 2 v= + 2 2 7. Given the orbital elements: p = 0.23, e = 0.82, = /2, = , = 260 , 0 = 190 . (a) Find r and v in the perifocal system. (b) Convert to the inertial system. 8. Find the inertial position and velocity vectors for the following cases in earth orbit: (a) a = 7016 km, (b) rp = 6678 km, i = 35 , e = 1.5, = 130 , = 115 , = 0 e = 0.05, i = 45 , = 0 , = 20 , = 10

Chapter 5

Impulsive Orbital Maneuvers

In previous chapters we have examined the two-body problem and discovered that the orbit of a body is constant for all time, at least to the accuracy of the two-body assumptions we used to nd the solution. In this chapter, we will consider the problem of changing the orbit by applying thrust to the body in orbit. The change in velocity (in magnitude or direction, or both) may change either or both of the energy and angular momentum, which we have seen control the size and shape or the trajectory. In this chapter, we will consider only impulsive maneuvers. The dening assumption is that the change in velocity occurs over so short a period as the be eectively instantaneous. Because the velocity is assumed to change in zero time, there is no change in position. This is never quite true, but is a close approximation to the case when chemical rockets are used. Rockets generally produce high thrust, and the time spent thrusting is generally so short in comparison with the time of the orbit that we can neglect it for most purposes.

5.1

Initial Considerations

The eect of an impulse is to alter the velocity vector of an orbit by some additive v. Let the velocity before the impulse be v1 and the velocity after be v2 = v1 + v Then the angular momentum after the impulse is h2 = r (v1 + v) = r v1 + r v = h1 + r v (5.1)

so that we see that the change in the angular momentum is zero only if the impulse is zero, or is applied purely in the radial direction. The change in orbital energy also can be zero even for a non-zero impulse, as it is possible to change the direction without changing the magnitude of the velocity.

69

70

Chapter 5. Impulsive Orbital Maneuvers

There are two basic types of problems we will consider in this chapter. One might be thought of as an orbit transition problem, in which we use an impulse to change some aspect of the orbit that a vehicle follows, such as the inclination. This may take only a single impulse, if a point can be found that is on both the initial and the desired orbit. An example of this kind of problem is the inclination change problem, which will be considered later in this chapter. We think of this as a transition problem rather than an orbit change problem because the impulse does not change the initial orbit. Rather, the impulse causes the vehicle to stop following the original orbit and begin following a dierent orbit, which also passes through the point of application of the impulse.1 The other kind of problem we will consider is the orbit transfer problem. In this problem, there is no point on the initial orbit that is shared with the desired orbit. An example of this type of problem is raising the orbit of a satellite from low-Earth to geosynchronous orbit. In this case, at least two impulses are required. The rst will put the vehicle on a transfer orbit, which intersects the initial orbit. The second will be applied at a point on the transfer orbit which is also on the desired orbit, and will serve to transition the vehicle onto that orbit. It can be shown that any orbit transfer problem can be solved as a two-impulse problem. Such a solution may not be the best solution, of course. It might be better for a number of reasons to have several intermediate orbits in the overall transfer, and to transition from one to another with the needed intermediate impulses. The usual criterion for best solution to an orbit transfer problem is that solution which requires the least total impulse usually referred to as the least delta vee. This is because the impulse will generally have to be provided by burning fuel. That fuel rst has to be lifted into space, along with tankage for it. We will see in later chapters that each pound lifted into orbit costs many times its own mass in fuel to launch it. For this reason, fuel is a precious resource in space vehicles, and it is not to be expended lightly.

5.2

Single-Impulse Changes

In some cases, the desire is not to transfer from one orbit to a completely dierent orbit. Instead, many of the parameters of the new orbit may be the same or nearly the same as in the original orbit, with only one or two modied.2 These maneuvers typically require only a single impulse.
While the dierence is purely pedagogical, it serves to make plain the dierence between the original orbit (before the impulse is applied) and the new orbit. 2 It is possible to change all of the elements of an orbit with a single impulse. The only thing that both pre- and post-impulse orbits must have in common is that both pass through the point of application of the impulse.
1

5.2: Single-Impulse Changes

71

5.2.1

Change of Inclination

One of the most common of such problems is the need to change the inclination of an orbit. This is a particular problem in the case of satellites intended for equatorial orbits; if they are launched from points away from the equator (such as all launch sites in the United States of America), their initial orbit will of necessity have an inclination at least equal to the latitude of the launch site.3 The inclination of an orbit is dened by h the angle between the inertial Z axis and the angular momentum vector h of the orbit. h1 Thus, changing the inclination is equivalent h2 to rotating h. Recalling eqn. (5.1), we know that v2 h = r v
v v1

The discussion here will apply to any rotation of h, which may not result in an equivalent change of the orbital element . Thus Figure 5.1: Rotating the angular mo- plane change, rather than change of inclination, might be a more accurate title. mentum vector. In general, a plane change might be combined with changes in other orbit parameters, and the necessary impulse will be driven by a combination of factors. However, in the special case of pure plane change of a circular orbit, the needed impulse can be easily computed. Because the orbit is to have the same energy after the impulse, only the direction of the velocity is to change, not its magnitude. Also, the resulting orbit is to be circular, so that the resulting velocity vector is normal to the radius vector. With these constraints, we can see from geometry that the angle between the initial and nal velocity vectors will equal that between the initial and nal angular momentum vectors. From the law of cosines, and noting that since v1 = v2 we can denote both magnitudes simply as v , we have
2 2 v 2 = v 1 + v2 2v1 v2 cos 2 = 2v (1 cos )

(5.2)

Using the identity we write this as 2 sin2 = 1 cos(2) v 2 = 4v 2 sin2


3

v = 2v sin

(5.3)

This is because the launch vehicle does not travel a great distance relative to the size of the Earth during launch, so that it is a good approximation to say that the initial orbit must travel above the launch site. The only way this can happen is for the orbit to have an inclination at least equal to the latitude of the site.

72

Chapter 5. Impulsive Orbital Maneuvers

Note that this means that to change the inclination of a circular orbit by 60 degrees would require an impulse equal in magnitude to the velocity of the orbit. Note also that the impulse is not normal to the original orbit plane, because to maintain the same total velocity, the impulse must reduce the tangential velocity as it provides a vertical component. This requires a component directed backwards along the original velocity direction. This analysis shows that inclination change is a very expensive operation. It explains why it is worthwhile to seek launch points on the equator for a satellite intended for equatorial orbit.

5.2.2

Changes in Orbital Elements

Note that the analysis of inclination change above discusses only the rotation of the anZ v gular momentum vector about some axis. We did not restrict the point of application of the impulse, which means that the change in inclination achieved is a relative change; that is, the new orbit plane is inclined to the original by . This is not necessarily Y the change in the orbital element , as deX ned in the last section, unless the impulse is applied when the vehicle is crossing the fundamental plane of the inertial axis frame Figure 5.2: Eect of impulse on orbital (that is, at the line of nodes). elements. Consider instead what happens when the impulse is applied ninety degrees from the line of nodes, as illustrated in Figure 5.2. The impulse is applied normal to the plane of the orbit. In this case, there will still be a rotation of the angular momentum, and there is also (unless the impulse is very small) a change in the inclination angle. However, most of the rotation of the angular momentum vector is reected in a rotation of the orbit plane about the inertial Z -axis, the new orbit will also have dierent node locations than the original orbit. Some such rotation is likely whenever the impulse is applied at any point other than the line of nodes. It is worth noting as well that the semimajor axis of the orbit changes, also. This is because the impulse adds energy to the orbit, so that the new orbit has more energy and hence a larger semimajor axis than the original orbit. Thus we see that it is a rare and well-designed impulse that eects only a single element.

5.2: Single-Impulse Changes

73

5.2.3

A Note on Coordinate Frames

As noted above, changing the plane of the orbit requires rotating the angular momentum vector. This will result in a change in the inclination, but as mentioned in the last section, the change in inclination is not necessarily the same as the rotation of the angular momentum vector. In fact, it would be more accurate to refer to the maneuver as plane change, rather than inclination change. The terminology is largely because the usual need for the change is to achieve a desired nal orbital inclination, so that the impulse is delivered at the line of nodes and the terms become equivalent. This points up the need for a coordinate system in which to describe the impulse. As the impulse is dened as the change in a vector quantity, it is itself a vector quantity. In some cases, the coordinate system will be dened by the problem. In others, such as those in the next several sections, it will not. The problem of raising the apoapsis of an orbit, for instance, is independent of the other elements of the orbit, so it can be dened without reference to a global coordinate frame. In general, a global frame will be dened only if the problem requires it. This leaves open the question of dening the impulse vector. When the coordinate system is not dened, we choose to dene the impulse in terms of its radial, tangential, and out-of-plane components. Formally, we dene the impulse as v = vi + vj + vk k where is in the direction from the orbit focus to the point of application of the is parallel to the angular momentum vector prior to the impulse, and impulse, k completes the right-handed system.
Example 5.1. A space vehicle is in circular orbit about the earth with inclination = 40 , right ascension = 45 , and semimajor axis a = 7000 km. Compute the impulse needed to increase inclination by 5 degrees, without changing the other orbital elements. What is the point of application of the impulse? Because the line of nodes is not to change, the impulse must be applied at one of the nodes. Which one does not matter, so we arbitrarily choose the rising node. The point of application is then r = a cos + a sin + 0k There are two approaches to computing the impulse. The rst, more complicated, and more generally applicable is to use the techniques of Chapter 4 to compute the velocity at the point of application, for both the pre-impulse and the post-impulse orbit. The other option is to use aspects unique to this problem to simplify the computations. In this case, the orbit is and remains circular, so that the velocity vector is normal to the position vector before and after the impulse. We also have the magnitude constant across the impulse, with km v = /a = 7.546 . s

74

Chapter 5. Impulsive Orbital Maneuvers


Because the point of application is at the rising node, we know that the orbit inclination is the same as the angle between the velocity vectors and the fundamental plane. So the velocity vectors before and after the impulse are v = v cos 45 cos(90 + 45 ) + v cos 45 sin(90 + 45 ) + v sin 45 k = 3.7730 + 3.7730 + 5.3359k

v+ = v cos 50 cos(90 + 45 ) + v cos 50 sin(90 + 45 ) + v sin 50 k = 3.4298 + 3.4298 + 5.7806k

(the superscript and + are used to mean immediately before impulse and immediately after impulse, respectively). The impulse is just the rst subtracted from the latter: km . v = v+ v = 0.343 0.343 + 0.445k s The magnitude of the impulse is v = 0.658 km/s. Note that this is the same value we would get using the analysis of section 5.2.1, v = 2v sin (/2) = 0.658 because this is also a pure inclination change problem. The direction of the impulse must be found from this more extensive approach, however.

5.3

Transfer Between Circular Orbits

In looking at orbit transfer, we will begin with the most straightforward problem, a two-impulse transfer from one circular orbit to another, with both orbits in the same plane. This transfer will thus rev2 quire no velocity component normal to the plane of the orbits, and can be considered a two dimensional problem. 00 11 Consider the situation as in Figure 00 11 00 11 v 1 5.3. We wish to transfer from the ina1 ner orbit with radius a1 , to the outer orbit with radius a2 . We will assume that a2 two impulses are sucient, one to transfer the craft from the he inner orbit to the transfer orbit, and a second to convert the transfer orbit to the outer cirFigure 5.3: Coplanar transfer between cir- cular orbit. Any orbit, such as the one cular orbits. shown in the gure, that intersects both

5.3: Transfer Between Circular Orbits

75

the inner and outer orbits is a candidate for the transfer orbit. Note that in general the transfer orbit need not be elliptic. As discussed earlier, it is usually desired to nd the orbit that requires the least total impulse. In a general coplanar transfer problem, we would have three variables to choose involving the rst impulse: the direction of the impulse, its magnitude, and the time of application. Because both orbits in this case are circular, however, the timing has no eect on the geometry of the transfer, and thus no eect on the total impulse required. We therefore ignore it, and have only two variables of concern. There is no such freedom when it comes to the second impulse, because the point of application is the point at which the transfer orbit crosses the nal orbit, and the components of v2 are dened by the need to transition to the nal orbit. Therefore, a complete mathematical description of the optimization problem would require minimization over two variables. Actually working out the solution requires some eort, but the result is well-known. It was conjectured by Hohmann in 1925, and is referred to as the Hohmann transfer. This transfer strategy is known to be the optimal two-impulse transfer between any two circular coplanar orbits.

5.3.1

Hohmann Transfer

The Hohmann transfer is illustrated in Figure 5.4, where the dashed line represents the transfer path between the inner and outer circular orbits. The most notable characteristic of the Hohmann transfer is that the impulse is applied in the same direction as the velocity vector of the circular orbit. The result is a transfer ellipse with periapsis a1 and apoapsis a2 . A result of knowing the direction of the impulses is that we need only compute their magnitudes, so that the computations become scalar rather than vector equations. We can compute the total energy required for the Hohmann transfer in a fairly straightforward manner. We know that for any orbit, C = v 2 /2 /r = /2a Now, circular orbit speed is vc = /a
v1 a2 a1 v2

and this will give us both the initial and the nal velocities. For the transfer orbit, we use the fact that for any ellipse, 2a = (rp +ra ), so that, denoting the semimajor axis of the transfer ellipse as at , we Figure 5.4: Hohmann transfer. have 1 at = (a1 + a2 ) 2

76

Chapter 5. Impulsive Orbital Maneuvers

Then, since the impulses are applied along the direction of the current velocity, the magnitude afterward the rst impulse is simply the sum of he circular orbit velocity and the magnitude of the impulse. Likewise, the second impulse is just the amount needed to increase the velocity from that of the apoapsis of the transfer orbit to the desired nal circular velocity. We thus have v 1 = v 2 = 2(Ct + /a1 ) /a2 /a1 (5.4) (5.5)

2(Ct + /a2 )

where Ct = /2at = /(a1 + a2 ). The total impulse required is simply the sum of the two. As the Hohmann transfer orbit is simply half an ellipse, the transfer time is simply tf = (a1 + a2 )3 8 (5.6)

It should be obvious that the transfer from a larger orbit to a smaller follows the same logic. In this case, the only dierence is that the impulses are used to slow the vehicle rather than to increase its speed.
Example 5.2. A satellite is carried to low-Earth orbit (altitude 200 km) by the Space Shuttle. Compute the impulses needed to take it into a twelve-hour orbit, using a Hohmann transfer. Compute the radii of the initial and nal orbits. The rst is the radius of the Earth plus 200, so a1 = 6578.145 km. The second is the radius of a twelve-hour Earth orbit: P = 12 60 60 = 2 The energy of the transfer orbit is Ct = = 12.01 km2 /s2 a1 + a2 a3 2 / = a2 = 26610 km

which allows us to compute its periapsis and apoapsis velocities vp = 2 Ct + a1 = 9.857 km s and va = 2 Ct + a2 = 2.437 km . s

The magnitudes of the initial and nal impulses are then v1 = vp vc1 = 9.857 7.784 = 2.073 v2 km s km = vc2 va = 3.870 2.437 = 1.434 s

(Note: Due to rounding errors, the numbers given in the example may not be completely consistent.)

5.3: Transfer Between Circular Orbits

77

Note that the Hohmann transfer is the minimum-fuel two-impulse transfer. It is also the two-impulse transfer with the longest time of ight. In many cases, this makes it unsuitable, and we will examine the more general two-impulse coplanar problem in the chapter on interplanetary ight.

5.3.2

Three-Impulse Transfer

While the Hohmann transfer is always the most ecient transfer using only twoimpulses, it is sometimes possible to achieve even greater eciency using three (or more) impulses. The rst impulse causes the vehicle to transition from the initial circular orbit to an elliptic orbit; the second impulse then causes transition from this to a second elliptic orbit. This technique is thus often referred to as a bi-elliptic transfer. One particular formulation of this transfer involves using intermediate orbits with very large semi-major axes. This is motivated by the observation that, regardless of the periapsis, the apoapsis velocity (if the eccentricity is suciently large), is very small. It is reasonable to conclude that it should therefore require only a very small impulse at apoapsis of a highly eccentric orbit to eect a large change in periapsis. Consider the limiting case, in which at ; that is, the intermediate orbit is parabolic. We will suppose that the impulse is applied in the direction of the velocity, so that we need deal only with the magnitude. The energy of any parabolic orbit is zero, so v2 Ct = 1 = 0 = v1 = 2/a1 2 a1 so that the velocity after the initial impulse is the local escape velocity. The velocity before it is the circular orbit velocity, which we know to be /a1 , so we have that v1 = ( 2 1) /a1 Ignoring the second impulse for now, we note that the third and nal impulse will cause the vehicle to go from the return orbit to a circular. Since the return orbit is a very large ellipse, the upper bound on the velocity when the vehicle arrives at the desired orbit is the escape velocity at a2 . By the logic just used, we can then say that the highest possible impulse needed for the transition is v3 = ( 2 1) /a2 The remaining impulse is that required for transition from the rst intermediate orbit to the second. Assuming that the impulse is applied at the common apoapsis of the orbits, we have that v 2 = 2 1 1 ra a2 + ra 2 1 1 ra a 1 + ra

78

Chapter 5. Impulsive Orbital Maneuvers

where ra is the apoapsis of the intermediate orbits. As ra , it is clear that v2 0. In this limiting case, then, the total impulse required for the transfer is vtotal = ( 2 1) /a1 + ( 2 1) /a2 If the ratio a2 /a1 is greater than about 12, this is lower than the impulse required for a Hohmann transfer. Naturally, it is not possible to make such a transfer, because it would require an innite amount of time. However, it can be shown that if a2 /a1 > 15.582, a bi-elliptic transfer is always more ecient, so long as the apoapsis of the intermediate orbits is greater than a2 .

5.3.3

Transfers Including Plane Change

It should be no surprise that when plane change is included in the problem, it becomes much more complicated. It is not sucient to rst rotate the orbit plane to the desired inclination, and then perform a Hohmann transfer. Nor is it sucient to perform the Hohmann transfer rst, and then alter the inclination.
Example 5.3. A satellite in a 90-kilometer low earth orbit with inclination 30 is to be sent to an equatorial orbit with period 12 hours. Both initial and nal orbits are circular. Compute the necessary impulses if (a) a Hohmann transfer is done rst, followed by the plane change, and (b) if the plane change is performed in the same impulse as the terminal thrust of the Hohmann transfer. Part (a) is simply a matter of summing the Hohmann impulses and the plane-change impulse from section 5.2.1. As the radii of the orbits are the same as in example 5.2, the impulses are the same: v1 = 2.073 and The inclination change requires v = 2vc2 sin(/2) = 2 3.870 sin(15 ) = 2.003 so that the total impulse for the transfer plus plane change is vtot = 5.510 km/s. Combining the last two impulses for (b) requires that the velocity vector on arrival be subtracted from the nal vector. The magnitude is
2 2 2 v = va +vc 2 2va vc2 cos

v2 = 1.434.

va vc2

km . v = 2.141 s

This is larger than either the nal Hohmann impulse or the plane-change impulse, but is much smaller than the sum of the two. The total for the entire maneuver is vtot = 4.214 km/s, for a savings of 1.296 km/s.

5.4: More General Transfer Problems

79

Note that, because the inclination change impulse is proportional to the velocity when it is applied, it would have made no sense to perform the plane change before the transfer. However, while applying it along with the terminal Hohmann impulse is much better, the optimal solution includes dividing the plane change between the two Hohmann impulses, with the majority in the later burn. The actual distribution of plane change depends on the particular problem. This is also a situation in which the three-impulse maneuver of section 5.3.2 is applicable. Because the plane rotation adds to the total impulse requirement, and is proportional to the velocity, it makes the savings possible even greater. It can be shown that if the required plane change of a circular orbit is greater than about 49 , it is more ecient to use a three-impulse maneuver and make the change along with the second impulse. This is even if there is no increase in the orbit radius.

5.4

More General Transfer Problems

The coplanar circular problem of the last section is one of the few orbit transfer problems that has a simple, well-dened solution. Typically, the solution must be worked out for each individual problem. Consider a general problem of transfer from an elliptic orbit to a second, inclined elliptic orbit.4 The variables describing the problem include the sizes and eccentricities of the orbits, their relative inclinations, and the orientations of their semi-major axes relative to their relative line of nodes (that is, the line of intersection of the two orbit planes). There are three variables in the two-impulse transfer problem in this case. The timing of the initial impulse is arbitrary. The impulse itself has three components, but the impulse must satisfy the constraint that the transfer ellipse intersect the desired nal orbit. Because of this, only two of the three components can be freely chosen, giving a total of three free variables. As with the coplanar problem, the particulars of the nal impulse will be dened by the velocity vector of the vehicle at the point at which it intercepts the target orbit. In a three-impulse problem, there are four additional variables, as all of the variables associated with the initial impulse recur for the intermediate impulse. Also, since the intermediate point can be chosen as desired, the constraint on the initial impulse is removed. In such general problems, a three-impulse solution may be desirable even when the outer orbit is not greatly larger than the initial orbit. This is due to the expense of inclination change. Because impulses to change inclination are proportional to the current velocity, it may make little sense to change inclination as part of the initial impulse, as this may be the point of maximum velocity of the transfer orbit.
4

The orbits need not be elliptic, but the problems are similar for transfer to hyperbolic orbits.

80

Chapter 5. Impulsive Orbital Maneuvers

5.4.1

Transfer to Coplanar Elliptic Orbit

Consider what appears to be a very simple problem, as outlined in Figure 5.5. A vehicle is originally on a circular orbit of radius a0 . The desired orbit is coplanar, highly elliptic, and has semi-major axis a. The eccentricity and energy of the target orbit are such that it intersects the original orbit. In this case, the possible solutions include a single-impulse strategy, as indicated by vsingle in the sketch. Another possibility is a two-impulse solution. One such is drawn in the sketch, which assumes that the impulses should v1 be planned such that the transfer orbit v2 a0 is an ellipse that is tangent to the target vsingle orbit at its periapsis. Note that while there is only one single-impulse solution, Figure 5.5: Coplanar transfer to elliptic or- there are innite possible two-impulse bit. solutions. The single impulse solution is straightforward to compute, given a and the eccentricity e of the target orbit. It is necessary only to compute the velocity vectors of the two orbits at their point of intersection. The velocity before the impulse is v0 = /a0

where we dene the coordinate system such that is radially outward at the point where the two orbits intersect, and is tangent to the circular orbit at that point. The corresponding velocity on the elliptic orbit is found from the energy and angular momentum of the orbit as v2 = 2 ; a0 a h= a(1 e2 ) = a0 v sin

where is the included angle between the radius and velocity vectors. From these, we can compute the necessary impulse.
Example 5.4. A spacecraft is on a clockwise circular orbit of radius a0 = 2 (in canonical units). What is the impulse required to transition to a orbit with the same energy, also clockwise, but eccentricity e = 0.25?

The velocity on the original orbit is v0 = velocity after the impulse will be v=

/a0 = 1/ 2. The magnitude of the 1 = 2

1 2 a0 a

5.4: More General Transfer Problems

81

(in this case, the velocity does not change, because the impulse does not change the energy). The angular momentum is h= a(1 e2 ) = a0 vj = vj = 0.6847

(note that vj = v sin ). The radial component of the post-impulse velocity is then vi = The impulse is then v = vi + ( v 0 vj ) = 0.177 0.023 (to three signicant digits).
2 = 0.1768. v 2 vj

For a two-impulse transfer, it makes sense to consider a Hohmann-like scenario. That is, the rst impulse is tangential to the original circular orbit and creates a transfer ellipse that is tangent to the target orbit at either periapsis or apoapsis. This leaves only two cases to analyze. To periapsis: This is the case illustrated in Figure 5.5. Letting rp and ra be the periapsis and apoapsis of the desired orbit, we can compute the characteristics of the transfer orbit as a0 + rp at = ; Ct = . (5.7) 2 a0 + rp The impulses are then v 1 = v 2 = a0 2 2 Ct + a0 2 Ct + rp (5.8) (5.9)

1 1 rp rp + ra

Note that because the periapsis of the desired orbit is less than the radius of the initial orbit, the rst impulse slows the vehicle. The second impulse will accelerate the vehicle, as the target orbit must have (given the problem statement) more energy than the transfer orbit. To apoapsis: The analysis for this case is eectively the same. Expressing the energy of the transfer ellipse in terms of a0 and ra , the impulses are v 1 = v 2 = 2 2 1 1 a0 ra + a 0 1 1 ra ra + a0 a0 2 1 1 ra ra + rp (5.10) (5.11)

In this case, the rst burn accelerates, and the second decelerates.

82

Chapter 5. Impulsive Orbital Maneuvers

Example 5.5. For the case of example 5.4, compute the total impulse required for transfer to (a) periapis, and (b) apoapsis of the desired orbit.

First, the periapsis and apoapsis of the target orbit are rp = a(1 e) = 1.5 and ra = a(1 + e) = 2.5 The velocities at periapsis and apoapsis of the transfer ellipse are vt,p = 2 1 1 + 2 + 1.5 1.5 = 0.8729; vt,a = 2 1 1 + 2 + 1.5 2 = 0.6547.

The impulses for case (a) are then v1 = v0 vt,a = 0.0525 and for a total impulse of .0925. For case (b), the velocities at the apses of the transfer ellipse are vt,p = 0.7454 and vt,a = 0.5963. The impulses are v1 = vt,p v0 = 0.0382; for a total of 0.0868. Note that both of these require much less total impulse than the 0.178 km/s required for the single-impulse approach. v2 = vt,a va = .04856 v2 = vp vt,p = 0.9129 0.8729 = 0.0400

In contrast, consider how much more complicated is the two-impulse approach in full generality.5 Restricting our discussion to elliptic transfer orbits, we continue the analysis of the last few paragraphs to construct a more general approach. It should be noted that there is no reason to expect that the solution of example 5.5 is not the optimum, and in fact the second is the best two-impulse solution. There are circumstances in which the optimal solution is not feasible, however. In Figure 5.5, the two-impulse transfer begins with a tangential impulse, and terminates at an apse of the ellipse. In Figure 5.6, the rst impulse occurs at point A and puts the craft on a transfer ellipse whose semimajor axis is not aligned with that of the target ellipse, and which crosses the target orbit in two places. Because point A can be chosen anywhere along the initial orbit, and because v1 is arbitrary, the characteristics of the transfer ellipse can be chosen to be whatever is desireable. The
5 To be completely rigorous, we should also include the possibility of out-of-plane motion. This can easily be ruled out in this case, however.

5.4: More General Transfer Problems

83

B A v1

v2

Figure 5.6: General transfer to coplanar elliptic orbit.

mission designer has three variables to choose from, as it is assumed that the impulse out of the plane of the orbits is zero. The second impulse must be applied when the transfer orbit crosses the target orbit. In Figure 5.6, there are two choices, points B and C . Whichever is chosen, v2 will be fully dened by the arrival velocity and the velocity that the craft must have after the impulse to be on the desired nal orbit. Computing the point of intersection is not straightforward, and must be done numerically. The time of transfer may then be computed using Lamberts equations. While the computations are laborious, and the problem has several variables, the problem is not conceptually dicult. The techniques needed to perv form the analysis have all been covered 11 00 in this and previous chapters. However, v nothing can be said in general about the solutions to general transfer problems. Each must be solved for the unique conditions of the particular mission requireFigure 5.7: Transfer between aligned ellip- ments. Some relatively simple versions of tic orbits. these problems will be seen later, when interplanetary missions are discussed.
1 2

5.4.2

Transfer from Ellipse to Ellipse

The last section discussed a relatively simple problem, because the initial orbit was circular. The general problem is even more dicult when both initial and nal orbits

84

Chapter 5. Impulsive Orbital Maneuvers

are elliptic. We will not explore this problem. However, in the special case of elliptic orbits with colinear major axes, the problem has a fairly straightforward solution. It can be shown that the minimum two-impulse solution to such problems always involves a transfer from apsis to apsis, as in Figure 5.7. In this case, the transfer shown is from the apoapsis of the inner orbit to the periapsis of the outer orbit. This is not always the case; in fact, is easy to construct situations in which the transfer will be periapsis to periapsis. It depends on the size of the two orbits, and whether the periapses are on the same side of the central body, as in Figure 5.7, or on opposite sides.

5.5

Changes Due to Small Impulse

To this point, we have considered only very large impulses. We will now consider the case of a single small impulse. This is a basic part of sensitivity analysis, in which the eect of small changes at one point of a process is considered, in light of the changes they cause later. The discussion begun here can also be considered the simplest investigation of orbital perturbations, which is a much broader topic, with a later section of this book devoted to it. As a simple case, consider the application of some very small impulse to an orbit. To make it even simpler, we will assume that the impulse is applied tangent to the orbit, so that the velocity is changed a small amount in magnitude, and not at all in direction. Beginning with the vis-viva equation, we have C = v 2 /2 /r = /2a = v 2 = (2/r 1/a) (5.12)

and this is valid for all orbits. We dierentiate this expression to get 2vdv = 2dr da + 2 . r2 a (5.13)

Since we are assuming a small impulse, we can also assume that the eects will be small.6 Thus we write v for dv , et cetera. This leads to 2vv = 2r a + 2 r2 a

Here, we use the lower case to dierentiate this small impulse from the possibly very large v that we often compute for maneuvers. Now, r is the change in the radius at the time of the impulse and due to the impulse. The impulse is essentially instantaneous, and in any case is an acceleration,
6 This is the essence of linearization, which will be used in detail in later chapters. It is presented more fully in Appendix D.

5.6: Problems

85

so that its rst-order eects are purely on the velocity. Thus we take r to be zero, and we have 2vv a/a2 2va2 = a = v =

(5.14)

Consider this a moment. We have said that the radius of the orbit at the point of application of the impulse does not change. The semi-major axis, however, clearly does. This means that if we apply the impulse at perigee, we raise the apogee. Conversely, an impulse applied at apogee raises the perigee. A more important result of the analysis is that the eect of a small impulse on the semimajor axis is dependent on the velocity at the point of application. Since the velocity is maximized at periapsis, it follows that the best place to apply thrust for the purpose of increasing the semimajor axis is at periapsis. Because a larger value of a is associated as well with greater orbital energy, this is also the optimal point at which to thrust to increast energy. Indeed, beginning with C = v 2 /2 /r, it is easy to show that C = vv. (5.15) Conversely, of course, this also means that any force that causes the spacecraft to decelerate has a greater eect (and drains the orbit of most energy) if applied at periapsis.

5.6

Problems

1. The space shuttle is in a circular orbit of radius R (about the Earth) when it launches a small projectile from its cargo bay. The initial velocity of the projectile relative to the shuttle has magnitude u, lies in the plane of the shuttles orbit, and points outward along the radius vector from the center of the Earth. If R = 20, 000 km, nd: (a) The minimum value of u for which the projectile will escape the gravitational eld of the Earth. (b) The semi-latus rectum of the orbit of the projectile. (c) The eccentricity of the orbit in terms of u/vs , where vs is the speed of the shuttle. 2. The lunar module (LM) lifts o from the moon and, at burnout, is at periselenium of an elliptic orbit at altitude 30 km above the lunar surface. At apselenium, it will rendezvous with the command module at an altitude of 250 km. The command module is in a circular orbit.

86

Chapter 5. Impulsive Orbital Maneuvers (a) What is the burnout velocity of the LM? (b) What is the time from burnout to rendezvous? (c) What impulse is needed to match speeds between the LM and the command module at rendezvous? 3. Compute the impulses needed to go from a low-Earth circular orbit of radius a = 6578 km inclined at 28 to a 12-hour circular equatorial orbit for the following cases. (a) The orbit is rst made equatorial, and then a Hohmann transfer is made to the outer orbit. (b) A Hohmann transfer to an inclined 12-hour is performed, and the orbit is then made equatorial. (c) The inclination change is combined with the second Hohmann impulse. 4. (a) Compute the impulses needed for a Hohmann transfer from a geocentric equatorial parking orbit at r1 = 7000 km to an orbit at r2 = 105000 km. (b) Compute the three impulses needed for a bi-elliptic transfer (with the same initial and nal orbits) in which the transfer ellipses have ra = 384000 km (c) Compute the time required for each approach. Which approach requires the smallest total impulse? What is the time penalty for the bi-elliptic approach? 5. Consider a satellite in an elliptic orbit with eccentricity e and angular momentum h. Calculate the magnitude of the impulse to be applied at point B such that the apse line is rotated 180 , but the eccentricity and angular momentum remain the same. Express your answer in terms of e and h.

1
Figure 5.8: Problem 5

5.6: Problems

87

6. Compute the minimum v required to transfer between two coplanar elliptical orbits which have their major axes aligned. The inner orbit has rp = 1.1, e = 0.290 (canonical units), and the outer has rp = 5.0, e = 0.412. Assume that the periapses of the orbits are on the same side of the focus. You may assume that the transfer will be from apse to apse, and that the impulses are Hohmann-like (that is, they are tangent to the orbit). 7. A spacecraft is in circular equatorial orbit about the earth, with altitude 700 km. Find the impulse required to change the orbit to one with inclination 30 , eccentricity e = 0.10, and argument of periapsis = 270 . Express the impulse vector in the geocentric coordinate frame. Assume that after the impulse, the rising node will be on the x-axis. 8. Consider a Hohmann transfer from one circular orbit with radius a1 and inclination 1 to a second circular orbit with radius a2 and inclination 2 . Part of the required inclination change 1 is done during the rst impulse, and the remainder ( 1 ) during the second. (a) Find an expression for the total v required. (b) Find a condition that minimizes the total v in terms of 1 (consider the radii of the two orbits, and the initial and nal inclinations, as givens in the problem). Show that 1 = 0 is not optimal. 9. It is desired to rotate the plane of a circular orbit. Consider the bi-elliptic approach, in which an impulse is rst used to transition the craft to a highly elliptic orbit. The plane change is performed at apoapsis of this ellipse, and when the craft gets to periapsis, a transistion is made back to a circular orbit. Find the total v needed for this approach, and show that in the limit (as the intermediate orbit becomes nearly parabolic), the bi-elliptic maneuver requires less fuel than a direct rotation for any change greater than approximately 49 . 10. Perform a sensitivity analysis to relate an error in the initial Hohmann impulse to the error in apoapsis radius. If the desired initial impulse is vnom , let the actual delivered impulse be v = vnom + v. Dene the error in the desired apoapsis radius of the transfer ellipse to be ra . Noting that ra = 2a rp , nd the ratio ra /v in terms of , a1 , and the periapsis velocity vp of the transfer orbit.

Chapter 6

Interplanetary Trajectories

Having gained an understanding of two-body orbits and basic orbital maneuvers, we are ready to investigate interplanetary trajectories. We will limit ourselves to transfers using high thrust over short duration, such as we expect using chemical rockets. We will also consider only travel within our solar system, though the principles we develop will of course be applicable to any other planetary system we might stumble across in the future.

6.1

Introduction Patched Conics

The solar system is a very large place, and as we have discussed earlier, it is generally reasonable to consider the planets to be alone within it. When we deal with the orbit of a planet, we concern ourselves only with that planet and with the sun; when dealing with the satellite of a planet, we ignore all bodies other than the satellite and said planet. In considering interplanetary travel, we will use this idea to divide the trajectory into three phases. In the initial phase, the spacecraft will be very close to the planet of departure, and so we will consider its motion to be dened by the gravitational eld of that planet, without concerning ourselves with any other bodies. For this reason, we will consider the motion of the craft relative to the planet, rather than relative to the solar system as a whole. We will place the craft on an escape orbit, thus allowing it to pass from the eld of the departure planet into that of the sun. A primary concept of this approach is that, though the craft has escaped the gravitational eld of the planet of departure, and thus is at innite distance from that planet, it actually has not moved very far in terms of interplanetary distance. Replace the word innite with very large. Then consider what very large means. We need the distance to be large enough that, when considering the eects of the sun and the planet on the crafts motion, the sun dominates. This leads to the

89

90

Chapter 6. Interplanetary Trajectories

idea of the Sphere of Inuence, which we will discuss further later. For now, simply accept that the craft can have moved very far from the perspective of the planet, while having moved barely at all with respect to the sun. Once the craft has escaped, we will consider it to be in orbit about the sun. We will note that, though the craft has escape velocity relative to the planet it left, it will generally be in an elliptical heliocentric orbit. We will calculate our escape such that when we nd ourselves on this heliocentric orbit, it will be one that intersects the orbit of our target planet, and does so when the target is there. Finally, upon reaching the target planet, we will either enter parking orbit or execute a planetary y-by.1 In either case, we will once again consider ourselves in orbit about the planet, and worry only about the gravitational eld of the planet, and motion relative to the planet. We will consider the sun again only if we execute a y-by and once again enter interplanetary space. We call this approach to interplanetary trajectories the method of Patched Conics, the name deriving from the fact that each section is itself a conic section. The term patch comes from the necessity to make the individual sections mate up to form a continuous whole; we do this by patching them together. Before we investigate more fully the patched conic approach, we will discuss the idea of planetary spheres of inuence. We will then examine each phase of the trajectory, beginning with the heliocentric, which denes the mission duration and the requirements of the departure phase. A discussion of departure and nally of planetary arrival follows.

6.2

The Sphere of Inuence

As mentioned, the solar system is a very large place indeed, and a body can be both very close to a planet in one sense, while also being very far from the planet from another point of view. In this section, we will get an idea of what very far is, at least for the purposes of the motion of the body. Consider a body in motion near a planet (we will, not surprisingly, consider only the body, the planet, and the sun, ignoring all other planets). We are concerned with the eects on its motion due to the gravitational forces of the sun and of the planet. If the sun dominates the motion, it makes the most sense to write the equations of motion in reference to the sun. If the planet dominates, it makes more sense to write the equations in reference to the planet, and we can ignore the sun. It is this decision that we need to make. The way we make the decision is to write the equations both ways, and see which one makes the most sense. To see how this is done, we need to go back to the basic mechanics of the system, as we did in investigating the two-body problem, only this
1 The source of the term y-by should be obvious. We will discuss such trajectories thoroughly in a later section.

6.2: The Sphere of Inuence

91

time we will have three bodies to consider. To keep our discussion general, we will use masses one and two, rather than planet and sun.

6.2.1

Equations of Motion
Z

Consider the situation as in Figure 6.1. We have three masses, m1 , m2 , and m. Each body causes a force on each of the others. The forces are given by F1 =
m
2

m1

Y r m X

Gmm1 (r r1 ) Gm1 m2 (r2 r1 ) + r r1 3 r2 r1 3 Gmm2 (r r2 ) Gm1 m2 (r1 r2 ) + F2 = r r2 3 r1 r2 3 Gmm1 (r1 r) Gmm2 (r2 r) F = + r1 r 3 r2 r 3

Figure 6.1: Three mass diagram

Now, we are concerned with the motion of the masses, and thus we use Newtons second law to write, for example, r= Gm1 (r1 r) Gm2 (r2 r) F = + m r1 r 3 r2 r 3

This gives the motion of body m in inertial space. We are interested in the motion with respect to the other two bodies, however. Considering the motion with respect to mass m1 , we are interested in the change in the vector joining them. Thus we write r r1 = Gm1 (r1 r) Gm2 (r2 r) Gm(r r1 ) Gm2 (r2 r1 ) + r1 r 3 r2 r 3 r r1 3 r2 r1 3 G(m + m1 )(r r1 ) Gm2 (r2 r) Gm2 (r2 r1 ) = + (6.1) r r1 3 r2 r 3 r2 r1 3 G(m + m2 )(r r2 ) Gm1 (r1 r) Gm1 (r1 r2 ) + r r2 3 r1 r 3 r2 r1 3

With respect to the second mass, we would have r r2 = (6.2)

This is all very well, but very theoretical. Lets specialize to the case of a star and a planet, such as in Figure 6.2. We will say that the mass of the star is mS , that of the planet is mP , and that of the vehicle mV . Our question is: Does it make more sense to consider the vehicle to be in motion about the planet, or moving separately from the planet in orbit about the star?

92

Chapter 6. Interplanetary Trajectories


Vehicle r rSV r Sun
SP PV

Planet

Figure 6.2: Simplied sphere of inuence system.

For motion about the star, eqn. (6.2) becomes rSV = and about the planet, rP V = G(mP + mV )rP V GmS 3 rP V rSP rSV 3 3 rSV rSP (6.4) G(mS + mV )rSV GmP 3 rSV rSP rP V + 3 3 rP V rSP (6.3)

Consider these two equations. The rst term on the right hand side of each might be considered the central acceleration term, and were there no further terms, the equations would be identical to the two-body equation, and lead to the standard orbit equation. The second term is the perturbing acceleration. If the central term is much larger than the perturbing term, we can safely ignore the perturbation and treat the system as a two-body system.

6.2.2

The Sphere of Inuence

For simplicity, rewrite the equations (6.3) and (6.4) as rSV = CS + PS and rP V = CP + PP Here, the terms CP and PP refer to the central and pertubing accelerations relative to the planet2 , and CS and PS are the same variables referenced to the Star. We are interested in nding the surface about the planet for which CS CP = PS PP
2 Note that the perturbing acceleration is relative to the planet, and due to the presence of the star. The subscript denotes the rst point.

6.2: The Sphere of Inuence

93

Inside this surface, the central term is more dominant in comparison to the perturbing term if the equation is written in terms of motion about the planet. Outside of the surface, the motion is more dominated by the Star, and the planetary attraction serves as the perturbing term. Tisserand showed in the 1880s that, because the distance rSV will always be much greater than rP V for equality, the surface is very nearly spherical, and its radius is given by 2/5 mP rSP (6.5) rI = mS Because of the shape, this surface is known as the sphere of inuence, also sometimes termed the sphere of activity. As an example, the sphere of inuence of the Earth in our solar system is given by 2/5 5.98 1024 rI = 149.5 106 = 924, 200km 30 1.99 10

It is important to note that we are not looking for the surface for which the forces exerted by the star and the planet are of equal magnitude. For example, at a point directly between the Sun and the Earth on the Earths sphere of inuence, the accelerations due to the Earth and the Sun are a = a 3.986 1011 = 4.67 104 m/s2 = 2 rI 9.242 9.242 1010 = = 6.00 103 m/s2 (r rI )2

However, these numbers do not include the cetripetal acceleration due to the orbital velocity of the Earth, which almost completely cancels out the attraction of the Sun at this point. This eect must be included when we relate the motion of the vehicle to that of the Earth. When this correction is made, it seems that the Earth dominates the system, rather than the Sun. It is possible to extend the analysis of this section by dening a small number and dening two spheres of inuence. The inner sphere would be dened as the surface inside of which the eects of the star are much less than those of the planet, satisfying PP CP The outer sphere, similarly, is the surface for which PS CS This analysis is useful in instances in which more accuracy is necessary than is obtained with our more standard denition. This is beyond the scope of our presentation. The interested reader is referred to Roy ( [13], pp. 168-171).

94

Chapter 6. Interplanetary Trajectories

6.3

The Heliocentric Phase

An interplanetary mission begins with the departure from the planet of origin. For all missions launched so far, this has of course been the Earth; however, this will possibly not be the case for much longer. Exploratory missions to other planets are being considered that will return samples to the Earth. These return ights will be interplanetary missions starting from other planets, but will be analyzed in the same manner as those from Earth. At the other end of the mission, the vehicle will enter into the sphere of inuence of the arrival planet. Here, either a parking orbit or some sort of y-by trajectory will be needed. If properly executed, a y-by trajectory can dramatically increase the heliocentric velocity of the vehicle, sending it onwards to still another planet, or out of the solar system entirely. Compared to these two phases, where it seems all the action is, the heliocentric trajectory that joins them may seem rather dull. But it is in this trajectory that the vehicle will spend almost all of its time, and it is into another such trajectory that a y-by would place the vehicle. It is also the requirements of this portion of the mission that dene the escape trajectory to be used, and that supply the arrival conditions. We therefore consider this phase rst.

6.3.1

Hohmann Transfers

The two main considerations when planning an interplanetary mission are time and fuel. Time is an issue because of the vast distances involved. Fuel is a critical concern because all of the fuel used in the interplanetary phases must rst be lifted as payload from the departure planet. This makes each pound of fuel very expensive (both in terms of money and of overall launch mass), and so it is imperative that the mission be planned to require the least fuel possible. Partly this is done by designing the vehicles to be as light as possible, but even more so this is done by choosing trajectories that require the lowest total impulse, in keeping with time constraints. In considering the heliocentric part of the mission, we will make the simplifying assumption that the orbits of the planets are circular. For most of the planets this a very good approximation. Only Mercury (e = 0.2056) and Pluto (e = 0.2583) have eccentricities larger than 0.1; the next largest is Mars at e = 0.0934. For preliminary examination, assuming circular orbits is good enough, and it greatly simplies the computations involved. We also assume, for similar reasons, that the orbits lie in the plane of the ecliptic.
Example 6.1. Consider a mission from the Earth to Mars. Assuming circular orbits, we know that the minimum impulse trajectory will be a Hohmann transfer. What is the time of ight on the transfer ellipse?

6.3: The Heliocentric Phase

95

This is a straightforward application of the Hohmann transfer. The perihelion distance of the transfer ellipse is just the average orbital radius of the Earth, and the aphelion distance is the orbital radius of Mars. Then the period of the ellipse is P = 2 a3 = 2 (a + a )3 8

The transfer time is half the period, so tf = P/2 = which comes out to 258.9 days. (a + a )3 = 2.237 107 s 8

Note the implicit use of the assumption that the vehicle does not move from a heliocentric standpoint in escaping the Earth. Were that to be taken into account, we would have to allow for variations in both the apses of the transfer orbit, and the fact that the transfer is not precisely half of the Hohmann ellipse. Preliminary time of ight budgeting is also done without considering the escape phase, as the few hours or less in the escape phase is lost in the noise compared to the hundreds of days spent in transit.

6.3.2

Non-Hohmann Transfers

For many missions, the Hohmann transfer is sucient. If we are sending a probe to Venus, the primary consideration will be fuel savings, as there is no harm to the probe in a long transfer time. In other cases, however, there may be restrictions on transfer time, or a particular velocity relative to the arrival planet may be required. Either of these cases would require a non-Hohmann transfer orbit.3 Non-Hohmann transfers are often referred to simply as fast transfers. Even in cases in which the overall transfer is not Hohmann, we will assume that the spacecraft enters the transfer orbit at perihelion or aphelion, as it would in the Hohmann case. This is to take maximal advantage of the orbital velocity of the departure planet itself.
Example 6.2. It is desired to send a probe from the Earth to Venus in 100 days. What is the eccentricity and aphelion velocity of the transfer orbit? This is a time-of-ight problem, with known values for radii, but unknown eccentricity and semi-major axis for the transfer orbit. The problem is sketched in Figure 6.3. The object is to nd an ellipse such as the dashed one shown such that the aphelion distance of the ellipse is ra = a , and the perihelion distance is less than or equal to the orbital radius of Venus. The constraint is that the time of ight from 1 = to
3

The reasons for a non-Hohmann transfer are not limited to these two examples, of course.

96
Venus orbit

Chapter 6. Interplanetary Trajectories


Trajectory

2
Sun

Earth orbit

Figure 6.3: Candidate Earth-to-Venus transfer ellipse (Example 6.2).


2 , the true anomaly when r of the transfer ellipse equals a , equals the required tf of 100 days. This appears to be a problem such as those solved using Lamberts Theorem in Chapter 3. Those results, however, required that we know not only r1 and r2 (the orbital radii of the Earth and Venus in this case), but also the distance between the points on the orbit. Here, we do not have the means to calculate that distance because we do not know the angle between the radius vectors to the two points. Instead, we can solve the problem using iteration on other equations to get the information we need. We will choose a value for a and use that to nd the eccentricity. We can then compute the time of ight from 2 to perihelion (we will not actually need to compute 2 in the process). Subtracting that number from half the period of the transfer ellipse will give the time of ight from aphelion to 2 , which is the value we are seeking. Depending on the answer, we increase or decrease our guess for a until the time of ight is what we need. As a starting guess for a, we note that a maximum value is the semi-major axis of the Hohmann transfer ellipse, aH = (a + a )/2, and a minimum value is a /2. Since the required time of ight is not very close to the Hohmann transfer time of 146 days, we will try a rst guess about halfway between the two values. Letting a = 1.0 108 km, we have ra a e= 1= 1 = 0.49598 a at after which r = a(1 e cos E ) gives cos E = and nally tf = P/2 a3 /(E e sin E ) a a = 0.2475 ae

6.3: The Heliocentric Phase


= a3 / [ (E e sin E )] = 4.507 106 s = 52.17 d

97

This is far too short a time, so we increase the guess at the semi-major axis to lengthen the ight time. A second guess of a = 1.15 108 produces the results a = 1.15 108 = e = 0.2313; E = 1.3124; tf = 6.949 106 s = 80.43 d A few iterations provides the result a = 1.2155 108 km = e = 0.1875; tf = 8.638 106 s = 99.98 d

Remark 1: As is mentioned in the sections on solving Keplers equation, there are many well-developed techniques for nding the solutions to iterative systems such as the one in the example. In practice, a computer code implementing such a technique would be used. For illustrative purposes (and for doing homework problems), the method of informed renement (that is to say, intelligent guesswork) is sucient. This example is slightly simpler than a corresponding outward transfer, because in this case it was certain that the trajectory would be a piece of an ellipse. This was due to the restriction that the impulse used to begin the transit would be tangential to the orbit of the Earth, so that a parabolic or hyperbolic trajectory would not go inside the orbit of the Earth. In doing Example 6.2 it was also implicitly assumed that a transfer ellipse existed that satised the stated requirements. This is not always the case. It is left as an exercise to nd the lower bound on the possible transfer time using this technique. There is no such lower bound if the velocity vector at departure is not restricted in direction.

6.3.3

Phase Angle at Departure

Interplanetary transfers are not just orbit change problems, but rendezvous problems. This means that the spacecraft has to arrive at its destination at the same time as some other body, in this case the target planet. This means that it is not sucient to compute the necessary transfer trajectory. It is also necessary to nd the correct time to depart. Consider the Earth-to-Mars mission of Example 6.1. While the spacecraft is traveling, so is Mars. It is necessary that the spacecraft leave the Earth at a time such that 258.9 days later, when it has reached the orbit of Mars, the planet Mars is at the same place as the spacecraft. This is sketched in Figure 6.4. Measuring angles counter-clockwise, and letting the Earth be at zero at departure, the rendezvous will occur at SC = = . During the ight, the planet Mars will have moved through an angle equal to 2 times the ight time over the period of the planets orbit: 2tf = 2.3677 = 2 a3 /

98

Chapter 6. Interplanetary Trajectories From this discussion, it is obvious that the angular motion of any planet during the trajectory is simply tf T = (6.6) 3/2 aT
Mars at arrival

Earth at arrival

where the subscript T denotes the target planet under consideration. In the Earth-Mars example, the initial position of Mars is simply Spacecraft trajectory , because the angles are being measured in relation to the initial Mars at departure position of the Earth. More generFigure 6.4: Planetary motion during Earth- ally, the initial position of the target planet in a Hohmann transfer can be Mars Hohmann transfer. written in terms of the initial position of the planet of departure as T,i = D,i + T That this is dened for a Hohmann transfer is evident from the presence of in the equation as the angle swept by the spacecraft during its trajectory. This is not always the case, as with Example 6.2, where the transfer time is less than the Hohmann transfer. In that case, Eqn. 6.6 can still be used to compute the angular motion of the target planet during the transfer, but the nal position must be computed by nding the true anomaly of the spacecraft on its trajectory when it reaches the orbit of the target planet. The relative angular positions of the planets is often known as the phase angle. In the case of the Hohmann transfer, the necessary value at departure is simply given by tf T,D = T D = T = 3/2 (6.7) aT This equation can be written directly in terms of the orbital radii of the departure and target planets by substituting for tf .

6.3.4

The Synodic Period and Mission Duration

Whatever the angle that is to be swept by the spacecraft during the trajectory, it is clear that the departure and arrival planets must be in the correct relationship at departure for the rendezvous to be successful. Regardless of the phase angle required, the time between occurrences will be the same. This time is known as the synodic period.

6.3: The Heliocentric Phase

99

Consider the phase angle as dened in Eqn. 6.7. The rate of change is computed d d d (T,D ) = (T ) (D ) = nT nD dt dt dt where n denotes the mean motion of the planet. This is a constant, so the change in the phase angle is T,D = (nT nD )t The value is positive or negative depending on whether the transfer is inward (to a planet with a smaller orbit and faster mean motion) or outward. For the planets to regain their initial relative position, the phase angle must go through some multiple of 2 . Since only the absolute value is important, we arrive at the expression for the synodic period Tsyn = 2 nT nD (6.8) as

As an example, the synodic period for Earth-Mars is Tsyn = 2 /a3


3 /a

= 6.739 107 s = 779.9 d

This period gives rise to the idea of the launch window, a period in which the planets are nearly in the correct relationship for the transfer. The duration of the window depends on the launch vehicle and spacecraft. All vehicles must have some extra propulsive capability, to overcome errors in calculations, ineciencies in the engines, and so forth. Some of that extra capability will also allow for a slightly nonoptimal transfer. However, should the optimal launch time be missed by too much, the launch window has been missed, and the next opportunity will not come around until the synodic period (minus the length of the launch window) has passed. The duration of a one-way mission is given by the time of ight. As mentioned earlier, however, some missions currently being planned include a sample return, and of course any eventual manned missions must return the astronauts to Earth. For these missions, a second ight must be planned. Assuming a similar return trajectory, the mission time will be at least doubled. This time is further increased by the need to wait for the planets to come into alignment for the return ight. The phase angle for the return ight is computed as it was for the initial ight. In the Earth-Mars Hohmann transfer example, the necessary phase angle is given by eqn. (6.7) as tf = 3/2 = 1.311 a In other words, Mars must be leading the Earth by about 75 degrees for the rendezvous to occur. Since the Earth is in fact leading Mars when the spacecraft rst arrives

100

Chapter 6. Interplanetary Trajectories

at Mars, there will be a waiting period before the phase angle reaches the necessary value. The methods already outlined can be used to compute the change in phase angle during the rst ight, and then the necessary waiting period before the return ight can commence. The total mission duration for such a mission will include the initial ight to the target planet, the waiting period, and the duration of the return ight. For the Earth-Mars example, the ight time is 258.9 days, the waiting period is 454.3 days, and the return ight will require another 258.9 days, giving a total mission time of about 972 days, or something over two and a half years.

6.4

Departure

Once the characteristics of the heliocentric transfer orbit have been established, it is possible to consider the departure and arrival phases. In departure, the vehicle is given the necessary velocity relative to the departure planet to enter the desired heliocentric orbit. In the patched conic approach, the departure is considered solely as a trajectory about the departure planet. The restriction is that the velocity vector as r with respect to the departure planet satises the patch conditions, which are derived from the characteristics of the heliocentric orbit. This is very similar to the orbit transfer problems considered in an earlier chapter.

6.4.1

Patch Conditions

Once the spacecraft has escaped from the departure planet, its velocity is added to that of the planet to provide the velocity vector of the craft with respect to the Sun.4 In Figure 6.5, the heliocentric velocity vector of a spacecraft (vSC/ the second subscript will serve to identify the frame of reference for the vector, so that this symbol might be read the velocity vector of the spacecraft in the heliocentric frame) is seen to be the vector sum of its velocity with respect to the Earth (vSC/ ) and the velocity vector of the Earth itself (v it is understood that the velocity vector of a planet will be considered with respect to the heliocentric frame, and so no second subscript is necessary). Because the sketch is drawn in the heliocentric frame, the spacecraft must be understood to have escaped the Earth, which means it is beyond the sphere of inuence. This is an eectively innite distance when considering the departure trajectory as an Earth-referenced conic section. From a Sun-centered perspective, however, the distance is so small that the craft is considered for all practical purposes to be at the same position as the planet itself.
4

We will often refer to this as the heliocentric velocity or heliocentric velocity vector.

6.4: Departure

101

Since the craft has escaped, any velocity it has must come from the hyperbolic excess velocity of To Sun the departure trajectory. Therefore the magnitude of vSC/ is just the hyperbolic excess speed of the VSC/ Earth-centered escape trajectory. The direction of VSC/ vSC/ is likewise dened by the characteristics of V the earth-centered orbit. When added to the velocity of the Earth (or other planet of departure; recall that the Earth is serving here only as an example), Figure 6.5: Heliocentric velocthe result must be the necessary heliocentric veloc- ity of the spacecraft. ity vector to begin the interplanetary transit phase of the mission. It is this necessity to have the terminal conditions of one conic section match the required initial conditions of the next that denes the patched conics approach.
Example 6.3. As in Example 6.1, a spacecraft is to go from the Earth to Mars along a Hohmann transfer trajectory. What is the hyperbolic excess speed necessary when the vehicle escapes from the Earth? As before, we calculate the energy of the Hohmann transfer ellipse as C= = 351.5 km2 /s2 = 2at a + a

from which we get the perihelion velocity to be vSC/ = 2(C + /a ) = 32.73 km/s

Because the transfer ellipse is tangent to the orbit of the Earth, the heliocentric velocity vector of the vehicle as it escapes will be aligned with the orbital velocity of the Earth. Therefore the magnitude of vSC/ is just the magnitude of vSC/ plus the orbital velocity of the Earth. The latter is found from the usual circular velocity equation: v = /a = 29.78 km/s

Finally, we have the necessary magnitude of the spacecraft velocity vector to be vSC/ = vSC/ v = 2.94 km/s (the numbers dont quite add up due to rounding error).

Note that the result of this is precisely the impulse needed for a Hohmann transfer starting from a circular orbit about the Sun with radius a , ending at an orbit with radius a . This is eectively what is being done, with the impulses dened in Section 5.3 generalized to the hyperbolic excess speed of the departure trajectory. As mentioned before, there is no reason to restrict ourselves to Hohmann transfers. Even for fast transfers, however, the departure is generally designed so that

102

Chapter 6. Interplanetary Trajectories

To Sun

Parking orbit

Earth velocity

Spacecraft trajectory
Figure 6.6: Escape trajectory starting from circular parking orbit.

the velocity vector of the spacecraft is aligned with that of the departure planet, in order to take best advantage of the planetary velocity. If this is not the case, the magnitudes of the velocities cannot be added simply, as they are in the example, and vector addition must be employed.

6.4.2

Departure Hyperbola

The departure hyperbola will generally begin from a parking orbit, which we will assume for now to be circular. The impulse needed to accelerate the vehicle and change its trajectory to an escape hyperbola will then be in the same direction as the spacecraft velocity when the impulse is applied. The magnitude and the timing of the impulse must be calculated. Figure 6.6 shows an escape trajectory from a low-Earth parking orbit.5 The angle in the gure measures the point of application of the impulse, in this case referenced to the velocity vector of the Earth. This is a convenient reference, as is then simply the angle of the asymptote for the hyperbolic orbit. We found this earlier as , so = = cos1 (1/e) (6.9)
5 The size of the Earth in the gure is reduced; a low parking orbit would not be distinguishable from the circle denoting the Earth were it drawn to scale.

6.4: Departure

103

It remains to nd the magnitude of the impulse required to convert the circular parking orbit to the required escape hyperbola. This is computed from the hyperbolic excess speed. At the parking orbit radius, the velocity on the escape trajectory is given by v 2 D v2 2D 2 + = v1 = v (6.10) C= = 1 2 2 r1 r1 where D is the gravitational parameter of the departure planet and r1 is the radius of the parking orbit. At this point, we have made no assumptions about the direction of the impulse. It will be most ecient to apply the impulse tangentially to the parking orbit, to make the best use of the circular orbit velocity, so the velocity after the impulse will then be in the same direction as the velocity before. The magnitude of the impulse is therefore found from simple subtraction as v = v 1 v c =
2 + v

2D r1

D r1

(6.11)

Because the Earth is inclined with respect to the plane of the ecliptic, a parking orbit would have to have an inclination of 23.45 in the usual geocentric frame to have zero inclination in the heliocentric frame. This would result in the departure hyperbola lying in the plane of the Earths orbit, as sketched in Figure 6.6. However, this is not necessary. Any circular parking orbit that intersects the velocity vector of the Earth can serve as a start for the hyperbola. There is a point on any such orbit at which an impulse can be applied that will result in a hyperbola with the proper energy, and whose asymptotic velocity will be aligned with that of the Earth.
Example 6.4. Continuing the Earth-Mars transfer of previous examples, assume the spacecraft begins in a parking orbit with 200 km altitude. What is the impulse needed for injection onto the escape hyperbola? The hyperbolic excess speed for injection into the Earth-Mars Hohmann transfer was computed in example 6.3 to be v = 2.94 km/s. From eqn. (6.10) we get v1 =
2 + v

2 = r1

(2.94)2 +

km 2 = 11.396 6578 s

The circular orbit velocity of the parking orbit is vc = /r1 = 7.784 km/s

and the impulse required for injection is then v = v1 vc = 3.611 km/s.

104

Chapter 6. Interplanetary Trajectories

6.5

Planetary Arrival and Capture

The nal phase of any interplanetary trajectory is the arrival of the spacecraft at the target planet. Once there, it may be desired to settle into an orbit around the planet, or to use the gravitational eld of the planet to modify the heliocentric direction and velocity of the craft. The rst of these ideas leads to the problem of Planetary Capture. It is common to misunderstand the term planetary capture to mean that the spacecraft is caught in the gravitational eld of the planet; in eect captured. This is not the case. Since the spacecraft is arriving from an eectively innite distance, it is on a hyperbolic trajectory with respect to the planet of arrival. Such trajectories are not closed orbits, and if nothing is done to change the orbital parameters, the spacecraft will pass by the planet and head back into interplanetary space, its heliocentric trajectory warped by the passage. Such a trajectory is known colloquially as a Fly-by. When the y-by results in some desired change in the heliocentric orbit, it is known more formally as a Gravity-assist trajectory. Thus the problem of planetary capture is that of the spacecraft capturing the planet, rather than vice-versa. And to accomplish this, it is necessary to alter the velocity of the spacecraft with respect to the planet. There are two parts to this process, and we will look at them separately. Whatever else we do, however, we must rst translate the heliocentric velocity of the craft to a planet-centered trajectory.

6.5.1

Equating the Heliocentric and Planetary Trajectories


j

Consider the situation of Figure 6.7. The gure shows the heliocentric velocity vectors of a spacecraft and a VP planet at the arrival phase of an outward interplanetary transfer. In such VSC a case, the velocity of the craft is i To Sun typically less than that of the planet. Thus, the spacecraft in fact is ahead of the planet, and the planet must catch up to it as it ies. The spacecraft is not drawn separately from the planet in Figure 6.7 Figure 6.7: Heliocentric view of planetary arbecause from the heliocentric point of rival. view, by the time it is within the planetary sphere of inuence it is eectively at the same point as the planet. From the planetary perspective, however, the craft is still at an innite distance. To compute the velocity vector with respect to the planet, we note that the heliocentric velocity of the craft can be thought of as the velocity with respect to the

6.5: Planetary Arrival and Capture planet, plus the velocity of the planet with respect to the Sun. Thus, we have vSC/ = vSC/P + vP

105

(6.12)

Since we know the characteristics of the heliocentric orbit, we can nd the velocity vector when the spacecraft reaches the planet. We need only dene the coordinate frame in which we wish to work. For the purposes of planetary departure and arrival, a good choice of coordiz nate frame is that sketched in Figure 6.8. y The unit vector, which gives the direcr tion of the x-axis, is aligned with the ra unit vector (the z x dius vector. The k axis) is parallel to the angular momenFigure 6.8: Coordinate frame. tum vector of the orbit of the target planet, and the unit vector is chosen to complete the right-handed orthogonal system. In this system, the velocity vector of the planet (assumed to be on a circular orbit) is purely in the direction, which simplies computation. The planet itself is at the origin of the frame. Because part of the patched-conic approximation is that the planet-centered trajectories take negligible time, the frame does not move during the approach trajectory. Therefore we consider it inertially xed for our purposes. Note that the same frame can be used for departure calculations, allowing a more general discussion than was given in Section 6.4. To compute the spacecraft velocity in this local frame, we need to compute the angle in the sketch. We can nd the value from the specic angular velocity: hSC/ = rSC/ vSC/ = hSC/ = rSC/ vSC/ sin The value of r is just the radius of the planetary orbit aP . We know the velocity of the craft from the energy of the transfer orbit. We can thus write the heliocentric spacecraft velocity in terms of its components as vSC/ = vSC/ cos i + vSC/ sin j The velocity of the craft with respect to the planet is then vSC/P = vSC/ cos i + (vSC/ sin vP ) j
Example 6.5. A spacecraft leaves the orbit of the Earth with a heliocentric velocity of 41 km/s, tangent to the Earths orbit. What is its speed relative to Saturn when it achieves rendezvous?

106
The energy of the heliocentric orbit is C=

Chapter 6. Interplanetary Trajectories

412 1.32715 1011 km2 v2 = = 46 . 64 2 a 2 149.6 106 s2

The velocity of the craft at the orbital radius of Saturn is then (a = 1427 106 km s ) v= 2 C+ a = 9.630 km s

We compute the specic angular momentum of the orbit as h = rp vp = (149.6 106 ) (41) = 6.134 109 The angle in 6.7 is then = sin1 h a v = 0.463 rad = 26.5 km2 s

We then have the heliocentric velocity vector of the craft to be v = v (cos i + sin j) km = 8.617 i + 4.300 j s To convert this to planet-centered coordinates, we need the velocity of Saturn with respect to the sun; this is simply the orbital velocity v = 9.654km/s, and is in the positive j direction. We then re-write eqn. (6.12) to get vSC/ = vSC/ v = 8.617 i + (4.300 9.654) j = 8.617 i 5.346 j Finally, we have vSC/ = vSC/ = 10.14 km s

6.5.2

Velocity Oset and the Impact Parameter

Now that the velocity vector with respect to the planet has been computed, the characteristics of the planetary orbit can be derived. Whether the spacecraft intends to alter the planet-centered orbit to a parking orbit or continue on, the parameter that denes the problem is the periapsis radius. This is in turn dened by the velocity oset of the of the spacecraft as it enters the sphere of the arrival planet. Consider Figure 6.9. The spacecraft has arrived at the planetary sphere of inuence with some velocity v with respect to the planet. It is always possible to

6.5: Planetary Arrival and Capture

107

Vinf y rp

Figure 6.9: Velocity oset at planetary arrival

nd a line parallel to the velocity vector that runs through the center of the planet. The distance, denoted y in the gure, between this line and the velocity vector is the oset distance. From the gure, it is not obvious that this value is very easily changed, and in fact requires a negligible amount of fuel to change. This is because it is not necessary to move the craft to change its oset distance. Instead, it is necessary only to rotate the velocity vector slightly. This will change which line running through the planetary center is parallel to v, and thus change the oset distance.

1111111111 0000000000 000000000000000000000000000000 0000000000000000000000000 1111111111111111111111111 0000000000 111111111111111111111111111111 1111111111 00 11 0000000000000000000000000 1111111111111111111111111 00 11 z 000000000000000000000000011 1111111111111111111111111 00 0 1 0 1 00 11 0 1 0 1 0 1 d 0 1111111111111111111 0000000000000000000 11111111111111111 0000000000000000 0 1
Figure 6.10: Adjusting the miss distance.

To get an idea of the minimal change necessary, consider the geometry of Figure 6.10. Here, the small change in the velocity vector has a very large eect on the closest approach to the target point, almost nulling out z . In fact, the angular change to completely null z is given by = tan1 (z/d) For the situation of planetary arrival, the value of d goes to innity (in the patched conic approximation) or at the minimum to the radius of the sphere of inuence (still extremely large). Thus, we assume that a negligible amount of thrust is needed to give whatever oset we require. In general, we care about the oset distance only in that it governs the periapsis of the planetocentric trajectory. To compute the periapsis, we will need the specic

108

Chapter 6. Interplanetary Trajectories

angular momentum of the trajectory. This cannot be computed using the equations derived earlier directly. Instead, recall that h = r v = rv sin = constant Now, if the distance r is nite, the value of r sin is just the oset distance y in Figure 6.9. Letting r causes 0, but the product r sin will remain constant, and the angular momentum becomes6 h = yv We can relate this to the periapsis radius from h = yv = rp vp = y= rp v p v (6.13)

Express the periapsis velocity in terms of rp and known quantities through


2 2 vp P v = 2 2 rp

vp =

2 + 2 /r v P p

Substitution of this result into the preceding expression yields y= rp v


2 + v

2P rp

(6.14)

By squaring both sides of this equation, we can derive a quadratic equation for the periapsis radius: 2 2 rp + 2P rp /v y2 = 0 A more immediately useful result comes by setting the periapsis radius rp to the radius of the planet rP in eqn. (6.14). The resulting oset distance is known as the impact parameter, and is the minimum oset needed to avoid collision with the planet. Denoting this as b, we have rP 2P 2 + b= v (6.15) v rP

6.5.3

Planetary Capture

As mentioned above, planetary capture does not refer to a wandering body being somehow snared by a planets gravitational eld. In the patched-conic approximation, this cannot happen. Instead, the problem we address is that of converting the
Note that this is the standard expression angular momentum is the velocity times the perpendicular distance familiar to all engineering and science freshmen. This should not be surprising; although h was derived originally as a constant of integration, its physical meaning is clear.
6

6.5: Planetary Arrival and Capture

109

hyperbolic approach trajectory into a closed orbit about the planet. This is very much like the problem of departure in reverse. Consider the problem of injecting into a circular parking orbit at the periapsis of the approach hyperbola. The equations used in this case are in fact the same as for the departure.
Example 6.6. Consider the approach trajectory dened in example 6.5. What impulse is required to put the vehicle into a circular parking orbit of 5, 000 km altitude? Recall that we have assumed that we can choose the periapsis of the approach hyperbola for negligible cost. We therefore assume that these adjustments have already been made, and we may assume that the periapsis will occur at the required altitude. The distance from the center of Saturn at periapsis will then be rp = 5000 + r = 65000 km The velocity of the spacecraft relative to Saturn was computed in example 6.5 to be vsc/ = 10.14 km. Because the spacecraft is still eectively an innite distance from the planet when this calculation is made (at least from the planet-centered viewpoint), this is the velocity at innity for the hyperbola about Saturn. The energy is then
2 C = v /2 = 51.42 km/s

The velocity at periapsis is found by equating the energy at r and rp :


2 2 vp v = 2 2 rp

vp =

2 + v

2 km = 35.64 rp s

Note that this is precisely eqn. (6.10). The necesary impulse to slow the vehicle and inject it into the parking orbit is just the dierence in the periapsis and circular orbit velocities. v = vp /rp = 35.64 24.16 = 11.48 km/s Note that this could be done directly with eqn. (6.11).

The impulse is found from scalar arithmetic in the example because we assumed that it would be applied at periapsis. This would be done so that the impulse is used most eciently, and is guaranteed to be possible due to the ease with which the periapsis radius can be adjusted while the spacecraft is still far from the planet. Again, there is no reason why an impulse could not be applied at some other point on the approach hyperbola, but this would waste fuel (unless it were done to correct an emergency, of course). We also assume that the inclination of the orbit about the target planet is chosen through the orientation of the approach. Because the vehicle can be aimed to pass over the equator, the poles, or anywhere in between, the choice of parking orbit is wide. The only restriction is that the orbit will intersect a line parallel to the direction of approach and passing though the center of the planet.

110

Chapter 6. Interplanetary Trajectories

6.6

Gravity Assist Trajectories

The colloquial term for a spacecraft ight that skirts a planet is for obvious reasons a y-by. A main reason for such trajectories is to obtain information about the planet, and such opportunities are rarely wasted. From the orbital mechanics standpoint, however, the utility of these orbits is the ability to use the planets gravity to alter the heliocentric orbit. In so doing, it is possible to alter not just the direction but also the energy of the trajectory, gaining several kilometers per second of velocity at little or no cost to the vehicle.7 The use of the gravitational eld of the planet to accomplish this leads to the term Gravity Assist.

6.6.1

Planar Fly-bys
The situation of Figure 6.11 shows a spacecraft on a hyperbolic trajectory past a planet. The gure is drawn from the planetary reference frame, so that the planet itself does not appear to move during the y-by. The vectors v1 and v2 in the gure are the velocities of the spacecraft before and after the y-by, respectively. The angle is the turning angle; the velocity vector rotates through this angle during the y-by. Recall that because energy is conserved throughout the y-by, the magnitude of the velocity (with respect to the planet) does not change. To compute , we note that + 2( ) = =

= 2 2 Since we already know that for a hyperFigure 6.11: Geometry of a y-by trajec- bolic trajectory cos = 1/e, and for any angle , cos( /2) = sin , this tory. leads us to sin 2 = 1 e (6.16)

00000000000000 11111111111111 000000000000000000 111111111111111111 00000000000000 11111111111111 000000000000000000 111111111111111111 00000000000000 11111111111111 000000000000000000 111111111111111111 00000000000000 11111111111111 000000000000000000 111111111111111111 00000000000000 11111111111111 000000000000000000 111111111111111111 0000000000 1111111111 00000000000000 11111111111111 0000000000 1111111111 11 00 00000000000000 11111111111111 0000000000 1111111111 (Turning 00000000000000 11111111111111 V 0 1 angle) 00000000000000 11111111111111 0 1 00000000000000 11111111111111 0 1 00000000000000 11111111111111 0 1 00000000000000 11111111111111 0 1 00000000000000 11111111111111 V 0 1 00000000000000 11111111111111 0 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 To Sun 0 1 0 1
1 2

At this point, it is necessary to assume some additional information. The energy of the orbit is dened by the magnitude of the velocity, but the eccentricity is dened
7 There is a cost, of course. The momentum gained by the spacecraft is balanced by an equal loss in the momentum of the planet. We ignore this as negligible.

6.6: Gravity Assist Trajectories

111

by the periapsis radius. Since we saw in the last section that this is easily controlled by the spacecraft, we will assume for now that rp is either given, or is to be computed from other given information (for example, the desired turning angle may be specied, and from this rp can be computed). To relate rp to the eccentricity, recall rp =
2 2 rp vp p h2 / = = 1+e 1+e (1 + e)

e=

2 rp v p 1

2 Noting that vp = 2(C + /rp ), this gives 2 2rp (v /2 + /rp ) 2 rp v + 2 = 1 2 rp v +1 =

e =

(6.17)

Thus, given , the necessary value of rp can be computed. The value of the gravity-assist trajectory lies in the heliocentric orbit that results from the y-by. The energy of the planetocentric orbit is constant with respect to the planet during the y-by, but this does not imply that the energy of the resulting heliocentric orbit is the same before and after. In Figure 6.12, the rotation of the spacecraft heliocentric velocity vector is shown. The dashed vectors represent the values before the y-by, and the solid vectors after (the velocity of the planet is of course not changed). Since the heliocentric velocity of the craft after the y-by is simply the sum of the velocity with respect to the planet + VSC/ and the velocity of the planet itself, the + VSC/P result will be a vector whose tip lies on the circle in Figure 6.12. If vSC/P before VP the y-by is nearly in the opposite direction of vP , the resulting rotation will likely result in a heliocentric vector with Figure 6.12: Rotation of spacecraft veloc- greater length, since the origin of the vector does not change. For a Hohmann ity vector during y-by. transfer, the spacecraft velocity vector at arrival is aligned with the velocity of the planet. Thus for an outbound Hohmann transfer, for which vSC/P is in the opposite direction of vP in the sketch, any y-by will result in a greater heliocentric velocity for the spacecraft. The situation for an inbound Hohmann transfer is precisely the opposite; since the spacecraft velocity before the transfer is in the same direction, any

112

Chapter 6. Interplanetary Trajectories

y-by will result in a heliocentric orbit with less energy. This may still be desirable as a way to produce large rotations in the vector. In general, the spacecraft can be expected to be going slower than the planet. As can be seen in the sketch, a y-by that passes on the sunward side of the planet and then behind it will produce a counter-clockwise rotation of the spacecraft velocity, and will thus increase the orbital energy. A y-by in front of the planet produces a clockwise rotation and a reduction in the energy. This is not always the case, of course; a large clockwise rotation of the velocity may still increase the energy, if the angle between vSC/P and vP was originally close to .
Example 6.7. Consider the situation of Example 6.5. The view from Saturn might be as seen in Figure 6.13. Here, vi is the initial velocity vector, and vf is the nal vector, as the spacecraft escapes after the heliocentric y-by. The value of the angle in the gure is easily computed as v i = tan1 v j Suppose that we desire a y-by trajectory such that the out-bound heliocentric velocity is unchanged, but the sign of the j component is reversed (note: this is purely for illustrative purposes; there is no general reason to perform such a maneuver). Find the nal heliocentric velocity vector and the energy of the new heliocentric orbit.

VSC

VSC

To Sun i

Figure 6.13: Fly-by from the planetary point of view


First, it is important to note that the magnitude of the heliocentric velocity vector will change; the problem does not require that the magnitude of the j component of the heliocentric velocity vector be the same, only that it has a dierent sign. For this problem, it is not necessary to compute many of the parameters of the y-by trajectory. Since the planets velocity with respect to the sun has no component in the i direction, the component of the heliocentric spacecraft velocity in that direction will

6.6: Gravity Assist Trajectories

113

be identical to that of the vector with respect to Saturn. Therefore, we already know that the nal value of the planetocentric velocity vector will be vSC/ = 8.627 i + 5.495 j The heliocentric velocity vector is then i + 15.15 j vSC/ = v/ + vSC/ = 8.627 The heliocentric velocity is now vSC/ = 17.43 km s , and the energy is C= 17.432 = 58.82 km s 2 a

which is greater than zero. The spacecraft is now on an escape trajectory, out of the solar system.

6.6.2

Fly-bys for Plane Change The Ulysses Mission

One use of gravity assist trajectories is to change the inclination of the heliocentric trajectory. Recall that the impulse needed for plane change is linearly dependent on the velocity; when the velocities are in the tens of kilometers per second, as they are in heliocentric trajectories, this is clearly impossible. However, the use of a y-by to generate the necessary impulse enables otherwise impossible missions. An example of this use is the Ulysses mission, which put a spacecraft in a nearly polar heliocentric orbit. In order to achieve this orbit, the craft performed a y-by of Jupiter. The craft was launched from the Shuttle on October 6, 1990, and on February 8, 1992 made its Jupiter y-by. The resulting orbit had an inclination of 80.2 , and reached perihelion on March 12, 19958 Since the y-by sent Ulysses into the southern hemisphere, the point of y-by became the descending node, and perihelion is very nearly at the ascending node. The next appearance of the craft at the ascending node was on May 25, 2001, giving an orbital period of about 2260 days. Such a trajectory is possible because it is possible to choose not just the periapsis of the y-by hyperbola, but also the position in the planet-centered frame of reference. It takes no more eort to direct the orbit slightly above or below the plane of the ecliptic than it does to change the radial components recall that we have eectively innite distance over which to let the eects of a small impulse grow.
Example 6.8. To perform an approximate analysis of the Ulysses y-by, assume that the craft was at aphelion afterwards. Making the assumption of a circular orbit for Jupiter, this means that r = 5.2026 AU . Taking 2260 days as the period, we have P = 86400 2260 = 2a a/ = a = 5.042 108 km.

8 The craft crossed the plane of the ecliptic the next day; the fact that the perihelion is so very close to the ecliptic speaks to the nearly perfect y-by performed.

114

Chapter 6. Interplanetary Trajectories

Figure 6.14: Ulysses Earth-to-Jupiter and rst solar orbit (from NASA/ESA website)

This provides a perihelion distance of rp = 1.54 AU , which is incorrect by about 0.15 AU (the actual value is 1.39). Going with our approximate number, we have that the magnitude of the velocity after the y-by is va = 8.822 km s1 . To compute the velocity vector in the local coordinate frame, recall that we have assumed aphelion, so there is no radial component. Therefore
+ = 1.502 = 0 + 8.783 cos(80.2 ) + 8.783 sin(80.2 )k + 8.693 k. vSC/

Allowing for the planets velocity, we have


+ + vSC/ + 8.693 k = vSC/ v = 11.557

so that the velocity relative to Jupiter is


+ 1 v = vSC/ = 14.461 km s .

To proceed, it is necessary to know the velocity vector prior to the gravity assist. This could be found by noting that the transfer time is 490 days, which allows the use of Keplers equation (assuming that the departure was at perihelion) to compute the true anomaly at Jupiter arriival, and from there the other orbital characteristics. Note that the Hohmann transfer time is 997 days, and the relative velocity at arrival would only 1 be vSC/ = 5.643 km s . This is much less than that required for the mission.

Note that this example is one situation in which the fast transfer is needed. The nal mission objective could not be achieved by executing a Hohmann transfer and a gravity assist.

6.7: A Fairly Involved Example

115

6.7

A Fairly Involved Example

In this section, the ideas presented in the chapter will be brought together through a single example. While each has been illustrated by examples as they were presented, this may help in seeing how they can be assembled in analyzing a particular mission from end to end. The problem is a transfer from Earth to Mars. The spacecraft is rst launched into a circular Earth orbit of radius 6600 km. The mission planners have determined that the transfer time is not to exceed 200 days. A gravity-assist trajectory around Mars will be determined such that maximizes the heliocentric velocity after the y-by. Find the characteristics of the transfer orbit The rst thing to be done is to determine the transfer phase of the mission. To do this, we rst make the assumption that the departure from Earth will be in the nature of a Hohmann orbit. That is to say, the velocity of the escape hyperbola will be aligned with that of the Earth as the spacecraft transits from Earth-dominated to interplanetary space. As a result of this, we know that departure occurs at perihelion of the transfer orbit. As in Example 6.2, determining the transfer orbit characteristics will require an iterative process. In this case, the problem is slightly more complex because we dont yet know the type of transfer orbit. Therefore the rst thing to do is to see if the transfer time on a parabolic orbit would be greater or less than the specied time. Recalling Barkers equation (3.31) t tp = h3 22 tan 1 + tan3 2 3 2

we see that we need both the angular momentum and the true anomaly at arrival to compute the transfer time. Easily enough, we calculate rp = p 1+e = p = 2rp = 2a

and then apply the relationship h = h=

p to nd that

2 a = 6.301 109 km2 /s

on the parabolic trajectory. True anomaly at arrival can then be computed from a (1 + cos ) = p = = 1.2529

and the time of ight is found to be Tpar = 69.9 d. Because the desired time of ight is longer, we conclude that an elliptic transfer orbit will serve. From this point, Example 6.2 can be used almost directly. Letting at denote the semimajor axis of the transfer orbit (we will use a t subscript to denote quantities

116

Chapter 6. Interplanetary Trajectories

referring to the transfer orbit throughout this example), we select a value for it. Then, again using rp = a , we compute eccentricity from a = at (1 e) and the eccentric anomaly then from a = at (1 e cos E ) Finally, Tt = a3 t / (E e sin E ) provides the transfer time, and we correct the estimate of at until this equals 200 days. Noting that the Hohmann transfer time is the longest of all two-impulse transfers, the associated semimajor axis is a lower bound. Trying at = 2aH produces a time of ight of only 93.07 d. Using bisection produces the following table at /aH 2.0000 1.5000 1.2500 1.1250 1.0625 1.03125 1.015625 1.0234375 1.0273438 1.0253906 1.0244141 e 0.6038 0.4716752 0.3660102 0.2955669 0.2541297 0.2315275 0.2197049 0.2256613 0.2286056 0.2271363 0.2263995 E 0.8548957 1.1446092 1.4777967 1.8215656 2.137743 2.4012447 2.6059719 2.4931648 2.445175 2.4686073 2.4807374 Tt (days) 93.075662 108.25451 128.21249 150.94981 173.57354 193.73236 210.32709 201.07405 197.21978 199.09583 200.0714

Several more iterations could be done to get an even closer answer. However, at some point the accuracy of the calculations becomes pointless in light of the simplications made in the Patched Conic approximation. Achieving eight digits of agreement to a model that is only accurate to a few per cent is pointless. We thus take the characteristics of the transfer orbit to be at = 1.9338 108 km and e = 0.2264

It is interesting to note that an increase in the semimajor axis of only two and a half per cent has decreased the transfer time by nearly 60 days relative to the Hohmann transfer. Looking farther up the table shows that a reduction of 49 more days (to 150.9 days) would require a further ten per cent increase.

6.7: A Fairly Involved Example

117

Compute the necessary departure impulse Having found the transfer orbit, it is now possible to compute the impulse needed to transition from the parking orbit to the escape hyperbola.9 Again, we have already seen a very similar problem, in this case in Example 6.4. Because the velocity vector as the spacecraft leaves Earths sphere of inuence (that is, as r on the departure hyperbola) is aligned with that of the Earth, the arithmetic can be done without recourse to vectors. From the results of the last section, we compute the velocity at perihelion of the transfer orbit, and subtract the velocity of the Earth to get the excess speed of the departure hyperbola. Ct = /2at = 343.15 km2 /s2 vp = = v = vp

2 (Ct + /a ) = 32.985 km/s /a = 32.985 29.785 = 3.200 km/s.

The velocity on the parking orbit is found from the specied radius a1 = 6600 km to be vc = /a1 = 7.771 km/s. The velocity on the departure hyperbola at r = a1 is computed from the energy relation as v= 2 (Ch + /a1 ) =
2 + 2 /a = 11.447 km/s. v 1

The necessary impulse is then simply v = v vc = 3.675 km/s. Find the heliocentric velocity at arrival Following Example 6.5, we compute the magnitude of the arrival velocity, and then the individual components. The magnitude can be computed as usual from the energy equation, = 21.867 km/s. vSC/ = 2 Ct + a The components may then be calculated from the angular momentum. As in Example 6.5, recall that h = rv sin at any point on any trajectory. At departure, this is just ht = rp vp = a vp = 4.934 109 km2 /s
We have not discussed launch directly from Earth surface into escape trajectory. This is possible, and in fact is often done, but the details of launch through the atmosphere make the problem beyond the scope of this chapter.
9

118

Chapter 6. Interplanetary Trajectories

(recall that vp was computed above). At arrival, the value of r is simply the orbital radius of Mars, so = sin1 The heliocentric velocity is then vSC/ = vSC/ [cos + sin ] = 3.088 + 21.65 km/s. (6.18) ht a vSC/ = 1.4291 (81.88 )

Considering (6.18), note that the component of the velocity in the direction is = vSC/ sin . vSC/ If we do not need the included angle for other purposes, we could then compute the components directly from vSC/ = ht /a and
2 2 )2 = vSC/ (vSC/ )2 = vSC/ (ht /a )2 . (vSC/

Compute the Mars-relative arrival velocity Continuing to follow 6.5, this is simply the heliocentric velocity of the spacecraft, minus the velocity of the planet. The radial component remains unchanged, and vSC/ + vSC/ vSC/ = vSC/ v


Figure 6.15: Martian y-by.

= 3.088 + 21.65

/a

= 3.088 2.482 km/s. The magnitude of the vector is the excess velocity of the arrival hyperbola (in this case, the y-by hyperbola, since we will not be staying): v = vSC/ = 3.962 km/s.

Find the gravity-assist trajectory that maximizes heliocentric velocity Recalling the patch conditions and especially Figure 6.12, it is clear that the maximum velocity after the y-by will be generated by most closely aligning the departure leg of the y-by hyperbola with the velocity of Mars. This particular

6.8: Planetary Ephemeris

119

case is illustrated in Figure 6.15. It is obvious that the post y-by velocity will be maximized if the turning angle is maximized (assuming it does not thereby rotate the velocity vector beyond the y -axis, which is unlikely), and that the vector should be rotated in the counter-clockwise direction. It can easily be shown that the turning angle is maximized when the periapsis of the y-by hyperbola is minimized. The radius of the planet is an obvious lower limit, so setting rp = r = 3393 km, we have from (6.17)
2 2 r vSC/ rp v + 1 = 2.2372 e= +1=

leading to a turning angle of = 2 sin1 From the sketch, we see that


+ vSC/ = v (cos + sin )

1 e

= 0.9268 (53.10 ).

and plugging in = , and noting that = tan1 2.482 3.088 = 0.6769 (38.78 ),

we have for the post-y-by velocity


+ vSC/ = 3.839 + 0.980

km . s

Adding the velocity of the planet, we have the heliocentric velocity now to be
+ = 3.839 + 25.11 vSC/

km , s

giving a magnitude of v = 25.40 km/s. This is about 3.53 km/s greater than before the y-by, though still well below the necessary velocity to escape from the solar system.

6.8

Planetary Ephemeris

Throughout this chapter, we have made extensive use of the approximation that the planetary orbits are circular. While this is certainly a good rst approximation, it is of course not completely true. However, once the inclination and eccentricity of the orbits are brought into the problem, solutions require the use of computer iteration.

120

Chapter 6. Interplanetary Trajectories

Sucient information for the use of patched conics for elementary analysis is included in the tables in Appendix E.3. However, those tables list only the mean characteristics of the orbits themselves, with no data from which to compute the position of the planets on the orbits. Further, because the solar system is much more complex than a two-body system of the type we have investigated in this book, the elements of the planetary orbits are not constant. The topic of variation of elements will be covered in detail in a later chapter. Here, we mention only that for accurate computation of positions, these variations must be taken into account. A table of planetary positions, or algorithms and constants from which position can be computed, is known as an ephemeris. Whether the positions are tabulated as functions of time, or the orbital elements are expressed as functions of time, there is a need to better dene the terms time and date themselves.

6.8.1

Julian Dates and Ephemeris Time

The everyday denition of time is the division of the solar day into hours, minutes, and seconds. This is perfectly good for most purposes of most people, but not quite sucient for astrodynamics. For most people as well, the calendar date is enough to know. However, astrodynamics deals with time spans of centuries, and over the centuries the calendars have changed. It is not enough to state that an event happened on a particular date; it must be known on which calendar the date is specied. To make matters even more confusing, various calendars were adopted at dierent times in dierent parts of the world. For astrodynamical work, the date is specied as the Julian day number. This is the number of days since January 1, 4713 BC. The Julian day begins at noon Greenwich mean time, so that for example midnight on January 1, 1999 (or 1999 Jan. 1.0) is jd 2451179.5. There are a few dierent algorithms for nding the Julian day number for a given calendar date. Meeus ( [10], chapter 7) gives the following for a date in the Gregorian calendar. Letting y be the year, m the month, and d the day (including decimal), proceed as Y = y if m > 2 y 1 if m 2 (6.19) (6.20) (6.21) (6.22) (6.23)

M =

m if m > 2 m + 12 if m 2 A = floor (Y /100) B = 2 + floor (A/4) A jd = floor(365.25(Y + 4716)) + floor(30.6001(M + 1)) + d + B 1524.5

6.8: Planetary Ephemeris The function floor above means to round towards negative innity; floor(7.5) = 7; floor(7.5) = 8.

121

If the calendar date is already in the Julian calendar, the term B in (6.22) above is taken to be zero. The Julian calendar was in use before the Gregorian, which was introduced in 1582. However, the calendar was not adopted uniformly; Great Britain had not fully adopted it until 1752, for example, and some countries did not accept it until the early 20th century. For dates from March 1, 1900 through February 28, 2100, B = 13. Some single-line formulae are available for dates within these limits. However, the method outlined above is valid for all dates after 4712 BC, so long as the Julian-Gregorian calendar distinction is kept in mind.
Example 6.9. Find the Julian day number for 9:00 a.m. GMT on August 8, 1971. August is the eighth month, so m = 8, and of course y = 1971. The day number is d = 8.375 (eight plus 9/24 days). Then Y M = 1971 = 8

A = floor(1971/100) = 19 jd = floor(365.25 6687) + floor(30.6001 9) + 8.375 13 1524.5 = 2441171.875 Note that the time of day (GMT), expressed as the fractional part of the day number, has shifted by 0.5 as a result of being measured from noon of the previous day, rather than midnight. B = 2 + floor(19/4) 19 = 2 + 4 19 = 13

Long periods of time are measured in Julian centuries, each of which is exactly 36525 mean solar days long. The matter of time is also more complicated than might be expected. Most people think of a day as the time between one noon and the next; noon in turn is thought of as the time during the day at which the sun is highest in the sky. This is good enough for most things,10 but not for astrodynamics. The rst consideration is that the solar day is longer than the sidereal day. The term sidereal means with respect to the stars, and a sidereal day is length of time between successive passages of a distant star over a particular longitude of the earth. The solar day is the usual day just discussed above. From Figure 6.16, it can be seen that the earth must rotate only 360 for the rst, but 360 plus its angular motion about to sun for the second (this is in the gure).
10

Including, almost always, government work

122

Chapter 6. Interplanetary Trajectories

Figure 6.16: Sidereal vs. solar motion of a planet.

Even allowing for this, the time from noon to noon is not constant. This is partly because the earths orbit is elliptic, so that it moves less per day near aphelion than at perihelion. Also, the earths rotation slows due to tidal friction. Finally, it has been discovered that the rotation is also subject to sudden slowdowns, for reasons not yet understood. For all these reasons, the astronomical community uses ephemeris time, which is a ctional time tied to a perfectly predictable earth. That is to say, ephemeris time is the independent variable in the astronomers models of celestial mechanics, and Greenwich mean Time is the time that people actually use, subject to all the unknown and unpredictable perturbations. Corrections are published after long series of observations. These very ne points of time-keeping are beyond our needs for the purposes of this book. Their existence is instructive, but for us, mean solar or sidereal time will be sucient. By denition, the mean solar day is 86400 mean solar seconds, and this is very nearly the average time from noon to noon across the year (nearly because, as mentioned above, this value changes). The sidereal day is close enough to 86164 mean solar seconds long.

6.8.2

Mean Planetary Elements

Because the solar system is much more complex than a two-body system of the type we have investigated in this book, the elements of the planetary orbits are not constant. The topic of variation of elements will be covered in detail in a later chapter. Here, we mention only that for accurate computation of positions, these variations must be taken into account. There are several methods for doing so. The most accurate is to use a highly accurate model of the solar system, and start with the best observed positions available at a particular time. From these initial conditions, and using a very good numerical integration scheme, the positions and velocities of the planets can be computed well into the future. Naturally, as mentioned in chapter 4, the motion of the axes themselves must be dealt with, as well.

6.8: Planetary Ephemeris

123

Somewhat less accurate is to express the elements as functions of time. Since there is no explicit function11 for the planetary elements, this approach requires that the function be approximated in some way. The theory of such approximations is outside the scope of this text. It will suce to say that the existing expressions have many terms, and that the greater the desired accuracy of the result, the more terms are required in the computation. One factor that complicates the matter is that there are both secular and periodic variations in the elements of the planetary orbits. Highly accurate formulae will include both of these terms; Meeus devotes 40 pages to the most important periodic terms12 The periodic variations are not, however, very large, and for our purposes we will consider only the mean values of the elements. These are approximate values of the elements of the orbit associated with the planetary position and velocity at a particular time. These values can be expressed to a sucient degree of accuracy by low-order polynomials in time. Low precision is more than sucient to assess propulsion needs and perform most mission analysis. Various authors present linear, quadratic, and cubic approximations. The independent variable is usually Julian centuries, and for any term beyond the linear correction to appear in the rst several decimal places generally requires at least ten to 100 centuries. Thus, we will content ourselves with a linear expression, as f (T ) = a0 + a1 T. Table 6.1 comes from the Solar System Dynamics Group at the Jet Propulsion Laboratory.13 gives the constant term and linear corrections for the nine planets. These are referenced to the heliocentric axis frame on the epoch J2000.0 (Julian day JD2451545.0). For each planet, the rst row gives the constant terms, and the second the corresponding centennial rates. The semimajor axis is given in astronomical units, and the angular elements in degrees. The rates of the angular elements are given as degrees per century. The table was computed to best t each element over the period 1800 to 2050 AD. Outside of this 250 year span, the accuracy is questionable, but for our purposes will suce. Note that the elements given are the usual for semimajor axis, eccentricity, inclination, and right ascension of the ascending node. The last two are the longitude of perihelion, which we recall is dened as = + , and the mean longitude. These were mentioned in section 4.6. The mean longitude is dened as L = + M.
In this case, we mean that there is no function that can be expressed as a nite series of simple polynomial or trigonometric terms, and that is completely accurate 12 [10], page 413. 13 Keplerian Elements for Approximate Positions of the Major Planets, E.M. Standish, Solar System Dynamics Group, JPL/Caltech, Pasadena, California. Online at http://ssd.jpl.nasa.gov/?planet pos.
11

124

Chapter 6. Interplanetary Trajectories

a 0.38709927 0.00000037 0.72333566 0.00000390 1.00000261 0.00000562 1.52371034 0.00001847 5.20288700 -0.00011607 9.53667594 -0.00125060 19.18916464 -0.00196176 30.06992276 0.00026291 39.48211675 -0.00031596

e 0.20563593 0.00001906 0.00677672 -0.00004107 0.01671123 -0.00004392 0.09339410 0.00007882 0.04838624 -0.00013253 0.05386179 -0.00050991 0.04725744 -0.00004397 0.00859048 0.00005105 0.24882730 0.00005170

7.00497902 -0.00594749 3.39467605 -0.00078890 -0.00001531 -0.01294668 1.84969142 -0.00813131 1.30439695 -0.00183714 2.48599187 0.00193609 0.77263783 -0.00242939 1.77004347 0.00035372 17.14001206 0.00004818

L 252.25032350 149472.67411175 181.97909950 58517.81538729 100.46457166 35999.37244981 -4.55343205 19140.30268499 34.39644051 3034.74612775 49.95424423 1222.49362201 313.23810451 428.48202785 -55.12002969 218.45945325 238.92903833 145.20780515

77.45779628 0.16047689 131.60246718 0.00268329 102.93768193 0.32327364 -23.94362959 0.44441088 14.72847983 0.21252668 92.59887831 -0.41897216 170.95427630 0.40805281 44.96476227 -0.32241464 224.06891629 -0.04062942

48.33076593 -0.12534081 76.67984255 -0.27769418 0.0 0.0 49.55953891 -0.29257343 100.47390909 0.20469106 113.66242448 -0.28867794 74.01692503 0.04240589 131.78422574 -0.00508664 110.30393684 -0.01183482

Table 6.1: Best-t planetary elements and centennial rates.

6.8: Planetary Ephemeris

125

Were the planetary orbits perfectly keplerian, the rate of the mean longitude would simply be the mean motion. It should also be noted that the values for the earth are actually those for the center of mass of the earth-moon pairing.
Example 6.10. What were the orbital elements of Jupiter on February 8, 1992, when the Ulysses spacecraft made its y-by? The Julian day number for February 8, 1992 is 2448660.5. This was 2448660.5 2451545.0 = 0.078973 36525 Julian centuries before epoch for the data in the table. Compute the semimajor axis, eccentricity, inclination, and right ascension of the ascending node directly from the table as a = 5.20288700 0.00011607(0.078973) = 2.20290 AU e = 0.04838624 0.00013253(0.078973) = 0.04840 = 1.30439695 0.00183714(0.078973) = 1.30454

= 100.47390909 + 0.20469106(0.078973) = 100.458 The argument of periapsis is computed from the denition of longitude of periapsis as =

= 14.72847983 + 0.21252668(0.078973) 100.458 = 85.7460

The true anomaly is found by rst computing the mean longitude: L = 34.39644051 + 3034.74612775(0.078973) = 205.267, and then the mean anomaly as M = L = 205.267 14.7119 = 219.978 which modulo 360 is M = 140.022 . Iteratively solving Keplers equation for eccentric anomaly gives E = 141.739 . Finally, the true anomaly is found from (3.5) to be = 143.424 .

For purposes of mission planning, the true longitude is also useful. Time of ight calculations depend on knowing the angle swept by the craft during the transfer. While this can be calculated from the combination of true anomaly, right ascension, and argument of periapsis, these are combined conveniently in the true longitude.

126

Chapter 6. Interplanetary Trajectories

6.9

Problems

1. Consider a mission from the Earth to Venus and back, assuming Hohmann transfer each way. Compute (a) The time of ight in each direction. (b) The synodic period. (c) The necessary waiting period before return, and the overall mission duration. 2. A spacecraft is in a circular parking orbit of a = 7000 km about the Earth. It is to be sent on a parabolic transfer trajectory to Mars, and will be at perihelion at Earth escape. (a) What is the necessary hyperbolic escape speed for the departure hyperbola from the Earth? (b) What is the impulse needed for injection into the escape orbit? (c) What is the time of ight to Mars on the heliocentric parabola? (d) What is the velocity of the craft when it reaches Mars orbit? (e) What is the necessary phase angle at departure to achieve rendezvous with Mars? 3. Consider a mission from the Earth to Mercury. After a Hohmann transfer, the vehicle will pass the planet on the far side from the Sun. The closest approach to the planet will be an altitude of 61 km (rp = 2500 km). Find: (a) The velocity oset required to achieve the specied periapsis radius. (b) The eccentricity and turning angle of the y-by hyperbola. (c) The semi-major axis and eccentricity of the heliocentric orbit resulting from the y-by. (d) The closest approach the vehicle will have to the Sun, and the time of ight after y-by to perihelion. Note that Mercury is quite small relative to the high velocity the craft will have after the transfer. 4. We have considered departure from a circular parking orbit by a single-impulse transition to the departure hyperbola. Hermann Oberth, however, proposes a two-impulse solution as in Figure 6.17. Suppose that we desire an excess velocity of v = 2.95 km s1 from the earth. Starting from a circular orbit with apark = 3.84 105 km,

6.9: Problems

127

v direct

apark v2

v1
Figure 6.17: Oberth two-impulse escape.

(a) Compute the required direct v for the single-impulse approach. (b) Consider the two-impulse approach. Use the rst impulse to put the craft on an ellipse with a perigee of rp = 6578 km. The second will transition the craft from perigee of the ellipse to perigee of an escape hyperbola. What is the total v required for this approach. The circular Lagrange points of the Earth-Moon system are at a = 3.84 105 km (this will be studied in a later chapter). 5. Consider an earth to Jupiter transfer that arrives with = 165 on the transfer orbit. (a) What is the transfer time? (b) What is the velocity at arrival relative to Jupiter? (c) Compute a y-by with a turning angle of 90 (that is, nd the eccentricity and periapsis radius). If the y-by is used to slow down the craft and turn it towards the Sun, what are the eccentricity, semimajor axis, and perihelion of the resulting trajectory? Use the circular orbit assumptions in doing this problem. 6. Consider the Ulysses mission as in example 6.8. Supposing a Hohmann transfer to Jupiter, what is the maximum inclination that could be achieved for a postencounter heliocentric trajectory? What would the eccentricity and perihelion

128

Chapter 6. Interplanetary Trajectories distance of the resulting trajectory be? (Note that the maximum inclination will be achieved for a post-encounter trajectory that has no radial velocity component.) Use the circular orbit assumptions in this problem.

7. Consider the example of section 6.7. Assume the y-by hyperbola skims the planetary surface, and ignore any eects due to the Martian atmosphere. What would be the eect on the nal heliocentric velocity of a 1 km/s impulse applied at periapsis of the y-by hyperbola? Assume the post y-by orbit remains in the plane of the ecliptic, and assume that the orbit of Mars is circular. 8. Again, consider the Ulysses mission from section 6.6.2. (a) Using the information from table 6.1, nd the true longitude of the Earth at launch of the Ulysses mission (recall the denition of true longitude from 4.6), and of Jupiter at arrival. (b) Ignoring inclination, take the dierence in true longitude as the angle swept during ight. Assuming launch at perihelion, nd the characteristics of the transfer orbit that achieves Jupiter arrival in 490 days. (c) What is the resulting velocity vector relative to Jupiter at arrival? Include the eects of eccentricity of Jupiters orbit (again, neglect inclination). (d) From the results of part 8c, can you create a y-by that achieves an 80 degree inclination heliocentric orbit? Give the details of the y-by hyperbola (rp and eccentricity), and the resulting helioentric orbit. Note that ignoring inclination will cause some discrepancy between the results and the actual mission. 9. Among the most famous and dramatic space probe missions are the grand tours of Voyager One and, especially, Voyager Two. Their heliocentric trajectories are shown in Figure 6.18. For a basic analysis, ignore the eects of inclination in this problem. (a) Find the Julian Day numbers for the Voyager Two encounters of Jupiter, Saturn, and Uranus. (b) Find the true longitudes of each planet, and their distances from the sun, on the corresponding days. (c) Find the characteristics of the transfer orbits from Jupiter to Saturn, and from Saturn to Uranus. (d) Find an approximate y-by trajectory about Saturn. Because ignoring the inclination causes small errors, an exact solution will not be possible. Instead, nd the y-by that achieves the correct turning angle. What is the error in the resulting heliocentric velocity?

6.9: Problems

129

Figure 6.18: Grand tours of Voyagers 1 and 2.

Note that to do this part of the problem it will be necessary to compute the Saturn-relative velocity of the craft before and after y-by. Include the eect of eccentricity in computing the velocity of Saturn. The approach of this problem makes the simplication that the planetary encounters take place at essentially a single point from the heliocentric point of view, as well as ignoring inclination. The resulting errors make it impossible to exactly match the actual mission. However, the results are suciently close to show that the methods laid out in this chapter provide an excellent rst approximation. 10. Consider a preliminary design of Voyager One, as in Figure 6.18. In this problem, we will not attempt to match the actual voyage. (a) Assume that the launch date has been established as September 5, 1977. On that date, the true longitude of the earth was 342.63 , and that of Jupiter was 62.23 . Using the usual patched conic approximations, compute a trajectory from the Earth that begins at perihelion, and will achieve Jupiter rendezvous.

130

Chapter 6. Interplanetary Trajectories (b) The true longitude of Saturn at launch was 141.59 . Assuming the dates of encounter in Figure 6.18 are correct, nd the semimajor axis and eccentricity of the transfer orbit from Jupiter to Saturn.

Chapter 7

Restricted Three-body Problem

The term three body problem is most correctly used for any problem involving three masses and their mutual attraction. The restricted three-body problem is much more specialized, and was rst formulated by Leonard Euler in 1772.

7.1

Formulation

The problem is usually formulated in canonical form as follows: There are two primary bodies, the smaller of mass and the larger of mass 1 . The distance between the bodies is taken to be unity, and is assumed constant. The time unit is dened such that one orbit of the two-body system requires 2 units. In this set of units, the rotational rate is 1 and the value of the gravitational constant G comes out to be unity, as well. To this system of two bodies we add a third. The third body has very small mass, so that it has no measurm able eect on the motion of the primaries (such a situation occurs in the r1 instance of a spacecraft in the vicinity r2 r of the Earth and Moon, for example). As the distance between the 1 primaries is constant, it must be that they describe simple circular or1 bits about each other and, therefore, about the center of mass. We will deFigure 7.1: Geometry of three-body problem. scribe the motion of the innitesimal mass m with respect to the rotating

131

132

Chapter 7. Restricted Three-body Problem

frame in which i lies on the line running through the centers of the two primaries, j is in the orbital plane, and k completes the right-handed system. This frame has a . rotational rate with respect to the inertial frame of = 1k The equations of motion of the system are derived from basic mechanics. Since they will be derived in a coordinate frame that is not inertial, we must allow for the rotation of the local frame. We have r= 2r r F + 2 +r= 2 t t m

where the partial derivatives are taken with respect to the rotating frame, and r = is the vector from the center of mass to the mass m. x i + y j + zk The gravitational attraction is found from Newtons law to be F= thus we have ) + 2k (x ) + k k (x ) = r1 (1 )r2 ( x i+y j+z k i+y j+z k i + y j + zk 3 3 r1 r2 (recall = 1 and G = 1). Now, r1 = r (1 ) i and r2 = r + i. Working out the cross-products, substituting for r1 and r2 , and equating the components gives x 2y x = (x 1 + ) (x + )(1 ) 3 3 r1 r2 y (1 )y y + 2x y = 3 3 r1 r2 z (1 )z z = 3 3 r1 r2 (7.1) (7.2) (7.3) Gmr1 (1 )Gmr2 3 3 r1 r2

What we have derived is a highly non-linear set of of coupled second-order equations (the coupling between the z equation and the other two is hidden in r1 and r2 , which are functions of x, y , and z ). They were rst derived over 200 years ago, and it has since been proven that there is no general closed-form solution. We can, however, derive a great deal of information about the system.

7.2

Equilibrium Points

If we cant solve the equations of motion, we can at least look for equilibrium points. An equilibrium point in a dynamical system is one at which all of the velocities and accelerations are zero.

7.2: Equilibrium Points

133

The denition of an equilibrium point makes looking for one straight-forward. We simply set all the velocities and accelerations to zero and solve the resulting equations for x, y, and z . Consider rst the equation for z . Setting the right hand side to zero gives (1 )z z = 3 3 r1 r2 For this to be true, either z = 0 or r2 r1
3

(7.4)

Since r1 and r2 are magnitudes of vectors, they are clearly positive, and since 0 < < 1, the right hand side of (7.4) is obviously negative. Thus, the only possible solution is z = 0, giving us that all equilibrium points lie in the xy plane. Consider the other two equations. Setting x =x =y =y = 0, we get x = (x 1 + ) (x + )(1 ) + 3 3 r1 r2 y (1 )y + y = 3 3 r1 r2 (7.5) (7.6)

With z set to zero, the expressions for r1 and r2 become r1 = r2 = (x 1 + )2 + y 2 (x + )2 + y 2

Euler made the further simplication of assuming y = 0, and looking for points along the x axis. With this assumption, (7.6) reduces to an identity, and (7.5) becomes x= (x 1 + ) (x + )(1 ) + x1+ 3 x+ 3

We can clear the denominators and arrive at a quintic equation for x. Unfortunately, quintics cannot be solved analytically in general. However, for 0 1, it turns out that the polynomial has only three real roots. It can be shown that the solution always reduces to the situation shown in gure Figure 7.2 below.
L L L

Figure 7.2: X-axis Lagrange points.

134

Chapter 7. Restricted Three-body Problem

Of these three roots, two bracket the smaller primary, while the third lies at about unit distance (depending on the value of ) on the other side of the larger primary. There are two other points of equilibrium, discovered by Lagrange, also in 1772. In his honor, all of the equilibrium points are denoted by Li , and generally referred to as Lagrange points (they are also sometimes known as libration points, though the term libration has a dierent meaning in astronomy). To nd these, return to our equilibrium equations (7.5) and (7.6). Now suppose r1 = r2 = 1. Then the equations reduce to x = (x 1 + ) + (x + )(1 ) y = y + (1 )y (7.7) (7.8)

Multiplying these equations out shows them to be identities they are automatically satised. This means that the two points shown in Figure 7.3, sometimes referred to as the triangular Lagrange points, are also equilibrium points. Since the orbital radius is dened to be unity in the problem formulation, L 5 this means that the triangular Lagrange points are the same distance from each primary as the primaries are from each other. In practice, is generally very small and the smaller primary is thought of as orbiting the larger. This means that the Lagrange points are in the orbit of the smaller primary, leading and following by sixty degrees of arc. There are at least two instances in our solar system in which the Lagrange points are important. The equilibrium points in the Earth-Moon system are naturally of interest. The other set is the Lagrange points of the Sun-Jupiter sysL tem. In this case, the L4 and L5 points 4 are populated by the Trojan asteroids. Figure 7.3: Triangular Lagrange points

7.3

Stability of the Lagrange Points

We will use the results of appendix D to analyze the stability of motion near the Lagrange points of our canonical system. Dening as in section D.1 x=x + y=y + (7.9)

7.3: Stability of the Lagrange Points

135

where ( x, y ) is a Lagrange point, we take derivatives and establish the relationships x = x = y = y = (7.10) (7.11)

We use Taylors theorem to expand the equations of motion (7.1,7.2,7.3) as x 2y = y + 2x = where f (x, y ) = x (x 1 + )[(x 1 + )2 + y 2 ]3/2 (1 )(x + )[(x + )2 + y 2 ]3/2 g (x, y ) = y y [(x 1 + )2 + y 2 ]3/2 (1 )y [(x + )2 + y 2 ]3/2 and we have used the fact that f ( x, y ) = g ( x, y ) = 0 to remove these terms from the 1 equations. Using the relationships (7.9,7.10,7.11) we can re-write this as where a= c= f x g x b=
x ,y

f x g x

+
x ,y

f y g y

x ,y

+
x ,y

x ,y

0 2 2 0

a b c d f y g y

x ,y

d=
x ,y

x ,y

And this we write in the generic form + Br r = Ar Without going into the algebra, we present the solution for L4 . The position of the equilibrium point is given by 3 1 y = r1 = r2 = 1 x = 2 2
Note that the full equations of motions have to be used when taking the derivatives. To be completely correct, we should also retain the z equation, and the dependence of the other equations on z . This would serve to increase the complexity without signicantly altering the results, however. The dominant considerations from the viewpoint of system stability occur in the xy plane.
1

136

Chapter 7. Restricted Three-body Problem

We take the indicated partial derivatives and plug in the values for L4 and get B=
3 3 2 3 4 1 2 3 3 2 1 2 9 4

The eigenvalue equation then becomes |2 I A B | =


3 3 2

2 0 0 2 2

2 3 3 2

0 2 2 0
3 4

3 3 2

3 4

1 2

1 2

9 4

1 2

1 2

2 3 2 3 2 9 4

=0

Solving for the determinant produces the quartic polynomial |2 I A B | = 4 + 2 27 ( 1) = 0 4

Treating this as a quadratic in 2 gives that the eigenvalues must satisfy 2 = 1 1 2 1 27(1 ) (7.12)

Now, we require for stability that all of the eigenvalues have non-positive real parts. Since this system is undamped (that is, it has no mechanism by which energy is dissipated), we cannot expect that it will be asymptotically stable. Instead, we will settle for neutral stability. Eigenvalues corresponding to neutral stability have zero real part, and thus when squared are negative real numbers. Since the equation we have derived is in 2 , we look for negative real roots of eqn. (7.12). To get such roots, it is necessary that 0 1 27(1 ) 1 since otherwise the value of 2 will be either imaginary or positive. Since and 1 are both positive, it is clear that the 1 27(1 ) 1 part will be satised. Whether the other requirement is satised depends upon the particular value of . To nd the critical values, we set 1 27(1 ) = 0 = 2 + 1/27 = 0

The roots of the quadratic are approximately = 0.03852 and = 0.96148. The largest value of for any two-body pair in our solar system is that of the Earth-Moon system; the Earth is 81.3 times the mass of the moon, and = 0.0121. Plugging in this value shows that the eigenvalues are purely imaginary, and thus the L4 Lagrange

7.4: Jacobis Integral

137

point is neutrally stable. Since the L5 point is an identical reection of the L4 , it too is stable. A similar analysis could be carried out on the three X -axis Lagrange points. It turns out that for any value of , these points are unstable. In writing the equations for the three-body problem, we ignored all other bodies. In general, this is not a terrible oversight; there are few truly massive bodies in the solar system, and the distances between them are huge. Also, most of the three-body systems would consist of the Sun, a planet, and some small mass. This is not the case for the Earth-Moon system, however, and in this case leaving out the eects of the Sun is a signicant error. It turns out that when the Sun is included, the Earth-Moon Lagrange points are not stable, and a body placed at one of them will not remain. This is why the Lagrange points of the Earth-Moon system are empty.

7.4

Jacobis Integral

In this section, we will derive an analogue of the energy equation of the two-body system, which will be valid for the three-body system. We will then be able to use this constant of the motion for qualitative analysis of the motion of the innitesimal body within the rotating frame of the three-body system.

7.4.1

A Constant of the Motion

Recall that the equations of motion of the negligible mass in the circular restricted three-body problem can be derived as (7.1-7.3) x 2y x = (x 1 + ) (x + )(1 ) 3 3 r1 r2 y (1 )y y + 2x y = 3 3 r1 r2 z (1 )z z = 3 3 r1 r2

We have already discussed the equilibrium solutions to these equations, and noted that the linear Lagrange points are unstable, while (for appropriate values of the mass ratio ) the triangular points are stable. These results are borne out by observation; there are asteroids at the L4 and L5 points in the Sun-Jupiter system. There is nothing at the triangular points of the Earth-moon system, but this is because the eects of the Sun (which are not included in the simplied model we have analyzed) render these points unstable.2 We consider now an additional approach to analyzing the system.
2 Recall that these points are only neutrally stable, so that it is relatively easy to render them unstable in actuality by imposing outside perturbations on bodies there.

138

Chapter 7. Restricted Three-body Problem

In looking at the two-body problem, it was instructive to to nd constants of the motion; that is, functions of the position and velocity of the body in orbit that remained constant at all points on the orbit. The specic energy and specic angular momentum are such constants, and provide much of the machinery for solving problems in two-body systems. It turns out that there is only one such constant for the three-body problem, discovered by Jacobi in 1836, and known as Jacobis Integral. There are several approaches to generating Jacobis integral, and we will take one with some mechanical justication. Recalling that d dt v2 2 = 1d 1 ) = ( +r , ( rr rr r) = rr 2 dt 2

multiply equations (7.1) through (7.3) by x , y , and z , respectively, and sum the results. The rst terms sum to the desired dot product, and performing the algebra and simplifying, we get 1 (x x xx yy zz ) + 3 (xx x yy zz ) 3 r1 r2 (7.13) 2 The terms on the left comprise the derivative of v /2, and the rst two terms on the right are the derivative of (x2 + y 2 )/2. The remaining terms are not so obvious, until one recalls that x x +y y +z z = xx + yy + r1 = (x 1 + )2 + y 2 + z 2
1/2

and

r2 = (x + )2 + y 2 + z 2

1/2

It then becomes obvious that these terms are the time derivatives of /r1 and (1 )/r2 , respectively. Assembling the results, and noting that there will also be a constant of integration, we have Jacobis integral C= v 2 x2 + y 2 1 . 2 2 r1 r2 (7.14)

This is the equivalent of the energy integral for this system. As with the energy for two-body orbits, this value (sometimes called the Jacobi constant) is constant for any particular trajectory in the three-body problem. Consider the terms in (7.14). The rst is clearly a kinetic energy term, while the two right-most are the same type of gravitational potential terms as were derived for the two-body problem (one for each of the two massive bodies). The additional term is the square of the distance from the axis about which the entire system is rotating. This term is grouped with the velocity term. Kinetic energy depends on the velocity relative to an inertial frame; the velocity term is relative to the rotating frame. Recalling that the frame rotates at one radian per second, it becomes clear that the second term is the correction to the velocity to convert it to inertial. Therefore, the Jacobi integral is precisely an energy integral, expressed in the rotating frame.

7.4: Jacobis Integral

139

7.4.2

Surfaces of Zero Velocity

Consider the Jacobi integral as it might apply to a vehicle in motion in the Earthmoon system. The mass of the moon is about 1/81.4 that of the earth, so + 81.4 = 1 = 0.01214

for this system.3 If the vehicle is very close to the surface of the earth, r2 is very small and r1 approaches one. Further, y must be very small (it can be no more than r2 ), and x is very nearly equal to (since the earth is distance to the left of the origin). To generate exact numbers, suppose the vehicle is at 7000 km from the center of the earth, and is exactly on the x axis between the earth and the moon. Then (recall the average radius of the orbit of the moon about the earth is 384,400 kilometers) r2 = 7000/384400 = 0.01821 r1 = 1 r2 = 0.98179 x = r2 = 0.00607 The last three terms in (7.14) can then be evaluated, and we have that C = v 2 /2 54.26. The question now becomes, given some value of the velocity, what is the value of C , and what does it mean? Converting velocity into the normalized system is straightforward: the distance unit is 384, 400 km, and the time unit is such that 2 time units equals the period of rotation of the system, which is (from tables) 27.322 days.4 The one DU/T U = 384400/375704 km/s = 1.0231 km/s. Suppose the velocity is a bit more than the circular orbit velocity, say 8 km/sec. Then we have C = -23.7. What does this mean? Mostly, it means that the vehicle will not be able to get very far from the earth. Just as the value of the energy in the two-body problem limits the maximum possible distance that a vehicle can travel, so too does the value of the Jacobi constant in the restricted three-body problem. To nd the limits of motion for any specic value of the Jacobi constant, we need only nd the surfaces for which the velocity equals zero. These surfaces will
The numbers from various sources disagree beyond the second signicant digit; these are from Gurzadyan, Theory of Interplanetary Flights, Gordon and Breach, 1996. 4 Recalling = 398601 km3 /s2 , we could compute this from TU = a3 /(GM ) = 3844003 . 398601(1 + 1/81.4)
3

Using this, the period actually comes out to 27.285 days. But, close enough.

140

Chapter 7. Restricted Three-body Problem

partition space into accessible regions, in which v 2 is positive, and the inaccessible regions in which velocity would have to be imaginary. To characterize these regions, we will investigate the situation in which z = 0. However, we should keep in mind that nothing in our analysis has restricted us to the x-y plane. Consider now the curves that are generated by setting v = 0 for the value of C that we have computed. Because the magnitude is very large, there are three ways that the value can be met: 1. r2 small. Note that if r2 is very small, then y must also be small (since it can be no larger than r2 ). Also, x must be small, since its largest possible magnitude is only + r2 . Since x and y are squared, we can ignore that term. r1 is nearly unity, because it is the distance from the vehicle to the other primary, which is by denition unit distance from the rst primary. Therefore, all of the terms other than (1 )/r1 are small. In fact, we can write C = 23.7 + (1 )/r2 This means that there will be a nearly circular curve of small radius about the earth, on which v = 0 for the given value of C . Inside this circle the velocity is positive, and thus the circle encloses an accessible region. 2. r1 small. The analysis follows much as in the last case, except that here, x 1 as well. Still, the term (x2 + y 2 )/2 will be about one-half, and since both (1 ) and r1 will be nearly unity, their ratio will be about one. Thus, C = 23.7 1/2 + 1 + /r1 . Since is less than 1/80 times (1 ), it follows that there is a circle of much smaller radius about the moon, inside of which is also an accessible region. 3. Finally, it could be that the sum x2 + y 2 is large. In this case, both r1 and r2 will also be large, and since they are divided into terms of less than one, those terms have small magnitude. Thus, there is a curve of large radius about the center of the system, outside of which is a third accessible region. For the Earth-moon system, using the values we have chosen, the rst two accessible regions are those for which r2 < 0.0417 DU about the Earth, or for which r1 < 0.00055 DU about the moon. The third is for distances greater than about 6 DU from the center (note that none of these curves are actually circles, so these radii are only rough approximations). The important point is that since the vehicle started out inside the rst accessible region, it will stay within that region. A vehicle cannot cross a surface of zero velocity without applying thrust, so all points of any unpowered trajectory trajectory must lie within the same accessible region.

7.4: Jacobis Integral

141

1.2

2.0 1.7

3.0

0.8 1.7 2.0 0.4 1.51 3.0 5.0 0.0

1.495 1.5

1.51 1.594

1.7

1.594 1.51 0.4 1.5 1.495 0.8 1.7 2.0 1.2 0.8 0.4 0.0 0.4 0.8 1.2 1.6 3.0 2.0 1.7 2.0

1.2

Figure 7.4: Curves of zero velocity for the Earth-moon system.

142

Chapter 7. Restricted Three-body Problem

Figure 7.4 shows several curves of zero velocity for the Earth-moon system. These curves are all in the x-y plane. The lowest value represented is 5; the highest is -1.495. The earth is the shaded ball near the origin, and is slightly larger than true scale. The moon is too small to see, at about x = 0.99 on the x axis. The ve black dots, three on the x axis and two at about x = 0.49, y = 0.87 are the Lagrange points. As a point of interest, the positions of the x-axis Lagrange points are about x1 = 1.156, x2 = 0.837, x3 = 1.005.

It is easy to see the way the curves move as the value of C increases towards zero. For C = 5 (a large magnitude), the outer curve mentioned as case 3 above is too large to t in the sketch. The curve for case 2, about the moon, is too small to discern. The curve for case 1, about the Earth, is of substantial size. The accessible regions are inside those curves, and outside the large, o-the-chart curve. As the value gets larger, the radius of the outer curve shrinks, so that at C = 3 it just cuts inside the corners of the graph on the right. The curve about the earth is larger, as is the one about the moon, though that one is still too small to see clearly. The trend continues as C increases to 2 and 1.7. The curves for C = 1.59411 are shown dash-dot to set them o from the rest. Note that the curves about the earth and moon for this value meet at the L2 point. This is the lowest value for which the two curves meet, and therefore it is the lowest value for which a craft can go from the earth to the moon; above this value, the curves merge and the earth and moon are in a single accessible region. Note that the outer curve for this value has moved in quite close, but still does not meet the inner curves; this means that a vehicle could not escape the earth-moon system. The next lowest value shown is C = 1.51, and this curve is dashed. At this value, the outermost curve has blended with the inner. Note that the inaccessible region now includes both triangular points, and curves around the earth on the side away from the moon. The path to deep space is open, but it leads past the moon. As C increases still more, this region splits into two, each of which collapses towards one of the triangular Lagrange points. The value of C at these points is -1.494. This is the highest possible value for this system; to have a higher value of the Jacobi integral, a vehicle must have a non-zero velocity. The analysis above is done purely in the plane, but the vehicle is not restricted to the plane (recall that z is included in the formulation). When out-of-plane motion is included, the curves become surfaces of zero velocity; the analysis is the same, but we discuss spheres rather than circles. For example, the accessible regions for large negative values of C become the inside volumes of spheres about the earth and moon, and the volume of space outside a large sphere surrounding the entire system.

Chapter 8

Gravitational Presence of the Earth

To this point, we have considered all of the planets to be perfect, radially homogenous spheres. As such, we have considered them to be point masses. This is an approximation, more or less correct for dierent planets. In this chapter, we will explore a more accurate description of the Earth. As a result, we will also consider some corrections to the earlier results for motion about the Earth, due to terms we ignored in that earlier work. The rst section of this chapter briey reviews the theory of conservative elds and potential functions, with emphasis on gravitational potentials. This can be skipped if desired; while it is useful for a deeper understanding of gravitational elds, it is not necessary for the implementation that follows. The second section will introduce the use of potentials for the derivation of the equations of motion. Again, this is not completely necessary for consideration of the eects of the disturbing potentials that are described in following sections.

8.1

Gravitational Potentials

The energy integral has played a large part in the analyses discussed in this book. The constant energy is the sum of two terms, and in Chapter 2 one of these terms was described as the potential energy. In this section, we will expand upon this idea. We will start by dening more rigorously the ideas of conservative elds and potentials. Then, we will discuss the gravitational potential of real, nite bodies.

8.1.1

Conservative Force Fields

The idea of a force eld is simple enough. Given some position r in three-dimensional space, suppose that f (r) is the force that would be experienced by a particle with a

143

144

Chapter 8. Gravitational Presence of the Earth

unit amount of some characteristic quantity at that point (this can be referred to as the specic force in this application). If in fact the force is always known so long as the position is known, then this is a force eld. The coordinate system does not have to be inertial, and in general is not. Such elds are obviously not uncommon. Gravity is an obvious example. In this case, the characteristic quantity is mass. In an electric eld, the characteristic quantity is the charge on the particle. In a magnetic eld, the characteristic quantity would be related to the material of which the particle is made. An important point to note is that the force at some point must be independent of the path leading to that point. Also, the force must depend only on the position itself, not on the velocity of the particle. That is, the force depends on where it is, not how it got there or how fast its going. In general, the total force on a body is the sum of many individual forces. Some of these may be due to force elds, some of them not. As an example, the forces on a ball falling through the atmosphere include weight, which is due to gravity and therefore a eld force, and air resistance, which is dependent on the velocity, and is not a eld force. A force eld is conservative if the amount of work done by the eld in moving from one point to another is independent of the path the particle takes. To make this clear, consider a couple of examples.
Example 8.1. Consider the usual at-earth approximation of motion near the surface of the planet. For motion that is small relative to the diameter of the planet, we model the surface as a plane, and model gravity as a constant, acting normal to the plane. The force on a particle of mass m is then f (r) = mg k Where g is the gravitational acceleration. Now consider the work done by the eld on a particle as it moves from point a to b. This is a simple integral,
b b

(8.1)

Wab =

f (r) dr = mg

dz = mg (ak bk ).

(8.2)

Clearly, the result is dependent only on the dierence heights of a and b, and is therefore independent of the path taken. Example 8.2. Consider a two-dimensional example. Let the force be dened by the function f (r) = ky (8.3) where k is some non-zero constant. Consider now two possible paths from the origin to the point (1, 1). In the rst, we travel along the x axis from (0, 0) to (1, 0) and then in the direction from (1, 0) to

8.1: Gravitational Potentials


(1, 1). The work done by the eld is
(1,0) (1,1)

145

W1 = k

(0,0)

0 k

(1,0)

y = 0.

(8.4)

Note that the rst term is zero because y = 0 along that segment, so that the magnitude of the force is zero. The second path will run from the origin along the y axis to (0, 1), and then in the direction to (1, 1). The work done by the eld along this path is
(1,0) (1,1)

W2 = k

(0,0)

y k

(1,0)

(1) = k.

(8.5)

The two paths yield dierent values of work, so the eld is not conservative.

There are a few ways to mathematically express characteristics of a eld that guarantee that it is conservative. One is that the work along any closed path is zero; this is a simple consequence of path independence, and dicult to prove directly. But a very simple way is to use the integral of the work along any path from some arbitrary point r1 to another, r2 . Let the force be dened as f (r) = fi + fj + fk k. , we have Letting dr be written as dx + dy + dz k f (r) dr = fi dx + fj dy + fk dz. Now suppose that we can nd a scalar function (r) such that fi = , x fj = , y fk = . z (8.8) (8.7) (8.6)

Then the work integral becomes


r2 r2

W =
r1

f (r) dr =

r1

dx + dy + dz x y z

= (r2 ) (r1 ).

(8.9)

We see therefore that if such a function exists, then the eld must be conservative, since the value of the integral will always be dened by the values of the function at the endpoints, regardless of the path traversed. Recalling section 2.3.1, we see that this is very similar to the potential energy dened there. In fact, we have
r2 R R

(r2 ) (r1 ) =

r1

f (r) dr =

r1

f (r) dr

r2

f (r) dr = U (r1 ) U (r2 ). (8.10)

146

Chapter 8. Gravitational Presence of the Earth

This implies that the change in over any path is the negative of the change in the potential energy. Now, we note that adding a constant value to will not change its derivatives. We can therefore choose (r) to be the negative of the potential energy function U (r). In summary, we can state from this analysis that a conservative force eld is one for which a potential energy function U (r) can be dened, and further, the specic force on a particle due to the eld is given by the negative gradient of the potential energy, as U U U fi = , fj = , fk = . (8.11) x y z The function (r) = U (r) is often known as the potential function. In many cases, the distinction between the two is ignored, and U is referred to as the potential function. We further make the declaration that the existence of a potential function (equivalently, a potential energy function) is sucient to prove that a eld is conservative. It has been easy to show that the existence of a potential function is enough to prove that the eld is conservative. It is also true that if such a function does not exist, then the eld is not conservative.
Example 8.3. Recall the eld of example 8.2, f (x, y ) = ky . Show that it is not conservative. Recall that for any function (x, y ) that is twice dierentiable, it must be true that 2 2 = . xy yx Therefore, if there is a potential function for this eld, then it must be (from the denition) true that and fj = , fi = x y and therefore it follows that fj fi = . y x In this case, this requires 0 (ky ) = y x = k = 0

which is clearly nonsense. Therefore no potential function exists, and the eld is thus not conservative.

Since the force is the gradient of the potential function, the eld is fully dened by the potential function.

8.1: Gravitational Potentials

147

8.1.2

Gravitational elds

To begin, realize that Newtons law of gravitation applies only to particles. There is no consideration of the size of the body, or of the distribution of mass within the body. To extend the law of gravitation to a body of nite size and possibly non-uniform mass distribution, it is necessary to integrate over the body. In this section, we will assume that the bodies have uniform mass distribution. First, however, consider the potential of a particle. We have from earlier that (r) = r (8.12)

Letting the elements of r be x, y , and z , and noting that r= we have x2 + y 2 + z 2 ,

just as we require. Here, we have replaced the specic force, in this case the force per unit mass, with the acceleration. Now suppose we have two massive particles providing gravitational potentials. The total potential is just the sum of the two: a= P hi 1 2 = (1 + 2 ) = + . r r r r

x /x r 1 y /y = 3 a = = = 3 r r z /z

(8.13)

It follows that we can integrate over an innite number of potentials, which provides the justication for what follows in this section.

b C r a A

Figure 8.1: Integration over annulus. As an example, consider the gravitational potential that would be created by a uniform hollow spherical shell, at a point A outside of the shell. Let the shell have

148

Chapter 8. Gravitational Presence of the Earth

uniform density , radius b, thickness db and be centered at the point C . At any point on the annulus sketched in Figure 8.1, the dierential mass is dm = (bd)(b sin d)db. The distance from the point on the annulus to A is r2 = a2 + b2 2ab cos , where a is the distance from C to A. The potential of the annulus at A is then the integral over of the potential of the dierential mass: Gb2 sin ddb A = 2 (a + b2 2ab cos )1/2
2

d =
0

2Gb2 sin ddb . r

The potential of the shell is the integral of the potential of the annuluis over from 0 to . Noting that the mass of the shell is Ms = 4b2 db, this is S = 2Gb2 db
0

(a2

b2

This is precisely the potential of a particle of mass Ms located at C . It is clear that to nd the potential of a uniform sphere, we would integrate the result in (8.14) over the radius, and the result would be the same. Thus, from any point outside of a sphere, the gravitational potential is the same as that of a point containing all of the mass of the sphere, located at the center of the sphere. This result holds as well for a sphere in which the density varies as a function of distance from the center, so long as it is homogenous within any shell. Note that this is a formal justication of the approximation made in chapter 2, where we treated all bodies as points. To the extent that the Earth and other planets can be treated as radially symmetric spheres, the approximation is valid, even for points very close to the surface. This is a very good approximation for most purposes, though the errors introduced are signicant in many applications.

sin d GMS = . 1 / 2 2ab cos ) a

(8.14)

8.2

Equations of Motion using Potentials

In this section, the equation of motion of a mass in the presence of more than one additional mass will he derived, from the point of view of gravitational potentials. This will be similar to the sphere of inuence (section 6.2), from a dierent and more general approach.

8.2: Equations of Motion using Potentials

149 Figure 8.2 shows a body m in space with three others.1 Following the results of the last section, the potential at m due to the other bodies is () = Gm0 Gm1 Gm2 + + r0 r1 r2 (8.15)

m2 r1 r1,0 m0 r0 r2 m

m1

Since the force per unit mass on m is the gradient of the potential, and the acceleration is (from the second law of motion) also the force per unit mass, we have dv = . dt (8.16)

Figure 8.2: The n-body problem.

For simplicity, consider the motion in the direction. Recalling that

2 r0 = ( 0 )2 + ( 0 )2 + ( 0 )2

0 1 = 3 r0 r0 with similar results for r1 and r2 . Taking the partial derivative of the potential function then gives = = Gm0 ( 0 ) Gm1 ( 1 ) Gm2 ( 2 ) 3 3 3 r0 r1 r2 (8.17)

we have

are very similar. The expressions for and Consider the motion of mass m0 . The potential at m0 does not include a contribution due to m0 itself, but does include a contribution due to m. Thus 0 () = Gm Gm1 Gm2 + + r0 r1,0 r2,0 (8.18)

where we use ri,j to mean the vector from mi to mj . Clearly, 0 = 0 = Gm0 (0 ) Gm1 (0 1 ) Gm2 (0 2 ) . 3 3 3 0 r0 r1 r2 ,0 ,0 (8.19)

To examine the motion of m relative to m0 , dene a new coordinate system centered at m0 , with axes parallel to those of the intertial frame. Dene x = 0 , y = 0 , z = 0 .

1 Three is a sucient approximation to many in this case; the results we will derive will easily extend.

150 0 , and Then we have x = x =

Chapter 8. Gravitational Presence of the Earth

Gm0 ( 0 ) Gm1 ( 1 ) Gm2 ( 2 ) 3 3 3 r0 r1 r2 Gm(0 ) Gm1 (0 1 ) Gm2 (0 2 ) + + + 3 3 3 r0 r1 r2 ,0 ,0

(8.20)

Re-arranging the terms provides G(m0 + m)( 0 ) 1 1 0 2 2 0 Gm1 + Gm2 + 3 3 3 3 3 r0 r1 r1,0 r2 r2 ,0 (8.21) We will henceforth make the usual identication = G(m0 + m). Now, we wish to replace the variables , 0 , et cetera with variables dened in the new coordinate frame. To do so, dene x = x1 = 1 0 , y1 = 1 0 , z1 = 1 0 , x2 = 2 0 , y2 = 2 0 , z2 = 2 0 . x = Noting that r1 = we have that (x x1 )2 + (y y1 )2 + (z z1 )2 x x1 = 3 r1 x x1 = 3 r1,0 x x + 3 r x xx1 1 3 r1 r1,0 xx1 3 r1 ,0 and r1,0 = 1 r1 .
2 2 x2 1 + y1 + z1 ,

x1 x2 x x x1 x x2 Gm + Gm + 1 2 3 3 3 3 r3 r1 r1,0 r2 r2 ,0

(8.22)

(8.23)

and

(8.24)

Combining these with similar results for terms involving x2 leads to x = + x xx2 1 3 r2 r2,0 (8.25)

The corresponding relations for y and z are y = and z = y + 3 r y 1 yy1 3 r1 r1,0 zz1 1 3 r1 r1,0 + y z 1 yy2 3 r2 r2,0 zz2 1 3 r2 r2,0 . (8.26)

z + 3 r z

(8.27)

8.3: Gravitational Potential of the Earth Combining equations (8.25), (8.26), and (8.26) provides r= where R1 = r R1 R2 + + 3 r r r 1 xx2 + yy2 + zz2 . r2 r2,0

151

(8.28)

xx1 + yy1 + zz1 1 ; r1 r1,0

R2 =

It should be clear that if there are any further planets, each adds a term to (8.28) of the form Rk 1 xx1 + yy1 + zz1 . (8.29) = r r r1 r1,0 The functions Rk in the equations above are known as disturbing potentials. Without them, the equation of motion is simply r= , r r (8.30)

and the term on the right hand side is recognized as the potential due to a spherical planet. In general, the equation of motion when the system does not meet the perfect two-body assumptions can be written as r= r R + 3 r r (8.31)

where R is the sum of all disturbing potentials. In the early chapters of this text, planets were considered to be in two-body motion about the Sun. This produced an equation of motion such as (8.30). A complete equation of motion would be of the form (8.31), where the other planets make themselves known through R.

8.3

Gravitational Potential of the Earth

At this point, we have shown that any force eld caused by gravity can be expressed as the gradient of a potential function. In the last section, the potential function due to the presence of several bodies was derived, holding the assumption that the bodies can be modelled as discrete points. This resulted in (8.31), with a disturbing potential due to the existence of additional planets. Here, we consider a disturbing potential arising from a dierent source. In deriving the equations of motion about a planet, we have made the assumption that the planet was spherical and radially symmetric. This is not perfectly true for

152

Chapter 8. Gravitational Presence of the Earth

any planet, though it is close.2 The planet about which we know most, of course, and about which we are primarily concerned, is the earth. In this section, we examine the gravitational eld of the earth. We have said that the approximation of the earth as a sphere is a good one, and this is true. A better approximation is that it is an oblate spheroid, which is to say that it is a solid body with an axis of radial symmetry; any plane normal to the axis that cuts through the body results in a circular section, and any cutting plane that includes the axis results in an elliptic cross section. A further renement has the spheroid slightly larger below the equator than above. And the renements continue. The actual shape and mass distribution of the planet is important for many purposes, but from the point of view of spaceight only its appearance as a source of gravitational attraction is signicant.3 The gravitational eld is generally described by its associated potential function. In this section, we discuss the usual forms of that this description takes. Note: In many commercial satellite simulation computer programs, the gravitational eld is actually expressed as a table based on latitude and longitude. This is specic to the purpose, however, and not germane to the current discussion.

8.3.1

Legendre Polynomials

The usual expression of the potential function of the Earth involves Legendre Polynomials. These polynomials are one of several sets of functions

8.3.2

Expression of the Gravitational Field

As mentioned, the gravitational eld of the earth is expressed through the potential function. Given enough terms, the potential of any body, of whatever shape, can be expressed to an arbitrarily high level of precision using Legendre polynomials.

In 1995, a body larger than Pluto was found to be orbiting the Sun at a distance of approximately 35AU (the body was eventually given the name Xena). In the years since, several more were found. In response, the International Astronomical Society adopted a formal denition of a planet. Part of this denition is that the body must have sucient mass that its own gravity forces it to be spherical. Thus, all planets will be close to perfectly spherical. 3 With the obvious caveat that the body in ight have sucient altitude to avoid the surface.

Chapter 9

Satellite Clusters and Relative Motion

The idea of relative motion in orbit has been of interest ever since the space age truly began in the 1950s. From the beginning, there was interest in creating space stations, and in missions to the Moon. In each of these cases, there would be a need for small craft to maneuver to, and dock with, larger craft (for lunar missions, a command module was placed in orbit around the moon, and the lander had to rendezvous with it after lifting o from the lunar surface to begin the return trip). More recently, in the last decade of the 20th century, interest has grown in satellite clusters. Ever since the rst satellites were put in earth orbit, they have been being made larger and more complex. At the same time, the communication and navigation systems for the satellites have been made more and more capable. At some point, it became obvious that in some cases, it would make more sense to divide the duties among several smaller, less complex, less expensive satellites, which would remain near each other as they orbited the earth. There are also missions which are possible only with such clusters, and which require very precise knowledge of the relative positions of the satellites within the cluster. We will not explore the nature of these missions, which largely involve highly accurate imaging of the earth, or of distant stars. Instead, taking it as given that clusters are desireable, we will concentrate on the nature of the relative motion of satellite in clusters. To begin, however, we will consider the most basic use of relative motion, the docking of a craft with an orbiting space station.

9.1

Rendezvous in Circular Orbit

In this section, we explore relative motion by considering the motion of a powered body attempting rendezvous with a second, unpowered body. The next section will describe the overall approach, which will then be specialized to the case in which the

153

154

Chapter 9. Satellite Clusters and Relative Motion

unpowered body is in circular orbit. We will think of these as a spacecraft and an orbiting space station. All of the following analysis will assume that the planet is spherical. The additional accelerations due to planetary oblateness, or the presence of other bodies (such as the moon) would be nearly identical for both the spacecraft and the space station, and so very nearly cancel out. While some work has been done that includes some of these eects, they are not needed for an introduction to the problem.

9.1.1

Equations of Relative Motion


I 2

In general, the equation of motion of a spacecraft in orbit is r d ( r ) = +f (9.1) dt2 r3 where f is the acceleration due to thrust.1 This equation assumes that the planet is spherical, and that there are no other forces or accelerations present. The prexed I in (9.1) notes that the derivative is taken with respect to a xed, or at least non-rotating, reference frame.2 When the problem being considered is how the spacecraft moves when near the space staSpacecraft tion, it makes sense to think of the position of the craft relative to the station (or the astro naut relative to the shuttle, or whatever). To r Station that end, dene the position of the craft as
rs

r = rs +

(9.2)

where rs is the position vector of the space station, r is the position of the spacecraft, and is the relative position vector. The relative motion equation is then
I 2 I 2 d d d ( ) = ( r ) (rs ) . (9.3) 2 2 dt dt dt2 Now, using (9.1), and assuming that the station is not thrusting, so that its orbit is Keplerian, we have I 2

Figure 9.1: Spacecraft in ight near space station.

I 2

( rs ) rs d () = +f 3 . 2 3 dt rs rs

(9.4)

f should not be taken to be the force from the spacecraft motors. It might be better to use a which more obviously means acceleration, but the letter a appears too often as semimajor axis to give it another meaning. Instead, the reader should simply take f as the specic force, or force per unit mass, which appears as acceleration in application. 2 Recall that the equation of motion for the two-body problem has the same form in both the inertial frame and in a non-rotating frame attached to one of the bodies.

9.1: Rendezvous in Circular Orbit

155

Now, because the size of the vector is much less than that of rs , we expand the motion of the spacecraft about that of the station. There are several ways to approach the expansion. We will use Taylor expansion, which is covered briey in Appendix C. Renaming the terms in (9.4) and writing it as d () = F(r) + f F(rs ), (9.5) dt2 we expand the expression for unthrusted motion of the spacecraft as d (9.6) F(r) = F(rs ) + F(rs ) + dr The terms beyond those listed involve higher powers of , and so will be considered negligible. Since the rst term after the equal sign is identical to the nal term in (9.5), the two cancel. The result is a rst-order approximation to the equations of relative motion I 2 d d () = F(rs ) + f . (9.7) 2 dt dr This is also known as a linear approximation to the equations of motion, and it is said that the system has been linearized. To write out the equations, we let the elements of be {, , } so that x xs y ys r= = + . (9.8) z zs Recalling that F(r) can be written as F(r) = (x2 + y2 + z 2 )3/2 x y z
I 2

the partial derivatives of the rst of these becomes

3x2 3x2 F1 = + + = x r3 r5 (x2 + y 2 + z 2 )3/2 (x2 + y 2 + z 2 )5/2 F1 3xy 3xy = = 5 5 / 2 2 2 2 y r (x + y + z ) 3xz 3xz F1 = = 5 . 5 / 2 z r (x2 + y 2 + z 2 ) Continuing the derivation produces the system d (9.9) F(rs ) = 5 3rrT I3 r2 dr r (Here, I3 means the 3 3 identity matrix). At this point, the only simplications have been to adopt the two-body assumptions.

156

Chapter 9. Satellite Clusters and Relative Motion

9.1.2

Specialization to Circular Orbits

In this section we assume that the space station is in a circular orbit about the k earth. We also dene a reference frame in which to dene the vectors rs and . As shown in Figure 9.2, we let the frame r be centered at the space station, and take the direction to be radially out from the rs unit vector is parallel to earth. The k the angular momentum vector of the stations orbit, and is chosen to complete the right-handed coordinate system. Reference Orbit The dened coordinate system rotates at the angular rate at which the Figure 9.2: Spacecraft in ight near space station is moving. Because the frame is rotating, we have the acceleration in the station. moving frame to be
I 2

d () 2 dt2

(9.10)

(the usual subtraction of the acceleration of the origin of the local frame is not necessary, as this was done in deriving eqn. 9.5). Invoking the assumption of a circular orbit for the space station, the angular velocity vector becomes 0 0 = (9.11) n where n is the mean motion. This is constant, so the term involving its time derivative in (9.10) disappears. Further, we have rs 0 rs = . (9.12) 0 = 3n2 + 2n + fi + fj = 2n = n2 + fz .

Evaluating (9.9) at r = rs , and using the result along with (9.11) and (9.7) in (9.10), provides the result (9.13) (9.14) (9.15)

3 has been used to simplify some of the terms. These are known where n = /rs variously as Hills equations, the Clohessy-Wiltshire equations, and occasionally as

9.1: Rendezvous in Circular Orbit

157

the Clohessy-Wiltshire-Hill equations. They are very similar to equations derived by Hill in 1878, in his studies of the motion of the moon.3 Clohessy and Wiltshire rederived them in the context of orbital rendezvous in 1960.4 We will refer to them as the CW equations, due to the context in which we are using them. These equations describe the motion of a body relative to another, where the second is in a circular reference orbit. Since the presence of the space station does not actually enter the dynamics, it need not actually exist. All that is required is that the position of the origin of hte local frame be dened as a point moving along a Keplerian reference orbit. The term reference orbit will often be used to describe the motion of the local origin.

9.1.3

Natural Motion

The CW equations cannot be solved in general unless the thrust f (t) is known. However, if impulsive maneuvers are assumed, the thrust is zero during motion. With this assumption, the equations can be solved in closed form. Motion without thrust is also known as the natural motion of the satellite. The easiest equation to solve is (9.15), as neither nor appear. It is thus decoupled from the other two equations. With f = 0, we have = n2 = (t) = c1 sin nt + c2 cos nt (9.16)

where the constants c1 and c2 depend on the initial conditions. Solving in terms of these, we have 0 /n sin nt + 0 cos nt. (t) = (9.17) The other equations are only slightly more dicult. Integrating (9.14) gives = 2n + k where k is a constant of integration. This is substituted into (9.13) to get = n2 + 2nk which is solved directly to get (t) = c3 sin nt + c4 cos nt + 2k/n. We substitute (9.20) into (9.18) and integrate to get (t) = 2c3 cos nt 2c4 sin nt 3kt + c5 .
3 4

(9.18)

(9.19)

(9.20)

(9.21)

G. W. Hill, Researches in the Lunar Theory, American Journal of mathematics, vol. 1, 1878. W. H. Clohessy and R. S. Wiltshire, Terminal Guidance Systems for Satellite Rendezvous, Journal of the Aerospace Sciences, Sept. 1960

158

Chapter 9. Satellite Clusters and Relative Motion

Solving for the constants in terms of the initial conditions produces nally 0 2 2 0 sin nt + 30 cos nt + ( 0 + 2n0 ) (9.22) n n n 0 0 2 4 0 2 cos nt + + 60 sin nt 3 ( + 0 (9.23) 0 + 2n0 ) t (t) = n n n (t) = It is interesting to consider certain special cases of the motion. From (9.23), for instance, it can be seen that if all initial conditions other than 0 are zero, then (t) 0 and (t) 0 . This means that it is possible to simply oset the spacecraft by some distance ahead of or behind the space station, and it will remain in that relative position. It is not possible, however, to do the same with a non-zero value of 0 . Any oset in the radial direction will result in motion relative to the station. Another point of interest is the term multiplying time in (9.23). A term in the solution of a physical system that grows with time is called a secular term. In this case, the coecient is 3( 0 + 2n0 ). If this term is non-zero, then the spacecraft will drift away from the station in the along-track direction, pulling ahead or falling behind depending on the sign of the coecient. Note also that if this coecient is zero, the constant term in (9.22) is also zero. Considering (9.22) more closely, it is seen to be a simple oscillation, with a constant oset from zero. If the secular term is zero, then this oset is also zero. How does this relate to what we already know about orbits? If the spacecraft continually pulls ahead of the station, then in the time the station makes one orbit, the craft will already have completed an orbit. This means that the orbital period of the craft is dierent, which in turn means that the orbital energy and the semimajor axis are dierent. It is this dierence in the semimajor axis that causes the oset in the radial direction. We refer to the requirement that 0 + 2n0 = 0 (9.24)

for no secular growth to occur as the energy matching condition. If the energy matching condition is satised, the equations simplify. Using (9.24), the coecients of cos nt in (9.22) and of sin nt in (9.23) become 0 and 20 , respectively. Equations (9.22) and (9.23) then simplify to 0 /n) sin nt + 0 cos nt (t) = ( 0 /n) cos nt 20 sin nt 2( 0 /n) + 0 . (t) = 2( (9.25) (9.26)

The constant 0 has no eect on the form of the motion, and can be safely ignored for now. Recall that the form of the equation in (9.25) can be re-written as k1 sin nt k2 cos nt = A cos(nt + ).

9.1: Rendezvous in Circular Orbit where a is the amplitude and a constant phase angle. Also, recalling that cos = sin( /2); the equations become (t) = A cos(nt + ) 0 /n) + 0 . (t) = 2A cos(nt + /2) 2( sin = cos( /2)

159

(9.27) (9.28)

It becomes clear that the relations for (t) and (t) are sinusoids that are ninety degrees out of phase, with the amplitude of twice that of . In other words, the motion in the -eta plane is an ellipse, twice as long as it is deep. The period of this motion is the same as that of the orbit of the space station. The out-of-plane motion is also a sinusoid, with the same period. Because it is independent of the - motion, its magnitude and the phase dierence are arbitrary, 0 . depending only on the initial conditions 0 and
Example 9.1. The space station is in orbit 600 km above the earth. It is desired to put a satellite in orbit near the station, to keep watch for signs of damage. At its apogee, the satellite is at 610 km altitude, and the station is between it and the earth. Take this point to be the initial condition. 1. What is the value of 0 if the energy matching condition is met (recall that it must be met, or the satellite will drift away from the station over several orbits). 2. The out of plane motion is to have the same magnitude as the radial motion. 0 ? What is the value of Since the station is between the satellite and the earth at the initial condition, we have 0 = 10 km; The mean motion of the station orbit is n= From (9.24), we have km . s The out of plane motion is given by (9.17). Since 0 = 0, in this case we have 0 = 2n0 = 0.02166 0 /n) sin nt (t) = ( 0 /n when sin nt = 1. Therefore, which will be maximized at = 0 = 10n = 0 = 0.01083 km . s /a3 = 398601 = 0.001083 s1 (6378 + 600)3 0 = 0 = 0.

160

Chapter 9. Satellite Clusters and Relative Motion


The relative motion of the satellite about the space station is shown in Figure 9.3. The radial vs. along track motion is precisely the two-by-one ellipse that is expected. The projection of the out-of-pane motion vs. along track motion appears as a line. This is an edge view of the actual elliptic motion.
10 out of plane 5 10 5 0 5 10 20

radial

0 5 10 20

15

10

10

15

20

15

10

10

15

20

along track

along track

Figure 9.3: Relative motion of satellite (example 9.1).

9.1.4

The Transition Matrix

The natural motion of the craft relative to the space station is fully dened by the six initial conditions. Re-writing (9.17), (9.22), and (9.23) to separate the eect of each initial value, and writing the result in matrix-vector form, provides 4 3 cos nt 0 0 (t) 0 (t) 0 0 = 6 sin nt 6nt 1 (t) 0 0 cos nt 0 0 sin nt 2 2 cos nt 0 1 0 0 . (9.29) + 2 cos nt 2 4 sin nt 3nt n 0 0 sin nt 0

These two equations are now combined into one. (t) (t) =

The derivatives of the motion are written easily enough by taking the derivative of (9.29) with respect to time. Assembling the results as above gives (t) 3n sin nt 0 0 0 0 (t) 0 = 6n (cos nt 1) 0 0 0 n sin nt 0 (t) 0 cos nt 2 sin nt 0 0 0 + 2 sin nt 4 cos nt 3 . (9.30) 0 0 cos nt 0 11 (t) 12 (t) 21 (t) 22 (t) 0 0 (9.31)

9.1: Rendezvous in Circular Orbit

161

The matrix in (9.31) is called a transition matrix. Note that any arbitrary time t1 can be taken to be the initial time. Then the state at any time t2 is written (t2 ) (2 ) = 11 (t2 t1 ) 12 (t2 t1 ) 21 (t2 t1 ) 22 (t2 t1 ) (t1 ) (t1 ) (9.32)

A set of canonical units for this problem can be derived based on the nominal orbit radius rs . Since = 1 in canonical units, this results in n = 1 and the period Ps of the reference orbit as 2 time units. Such a unit set allows the time to be specied easily in terms of fractions of an orbit.

9.1.5

Impulsive Rendezvous

The transition matrix of the last section can be used to solve for impulsive rendezvous in circular orbit.
Example 9.2. An astronaut is ten kilometers above and ten kilometers behind his spacecraft, which is on a circular orbit of radius 10000 kilometers about the earth. If his instantaneous velocity relative to the craft is 10 meters per second in the local x direction, what are the thrusts required to return to the craft, arriving when nt = /2? The problem can be solved in either canonical or standard units. Choosing standard units, we have 3 = 6.3135 1004 s1 . n = /rs Note that the time appears in (9.29) and (9.30) only multiplied by n, so the actual value of the nal time is not needed. The initial conditions are 0 10 km 0 = 10

and

These conditions will not result in the desired rendezvous, so the problem is to compute the relative velocity after the impulse so that rendezvous will happen. Recalling that an impulse does not change the position, only the velocity, we use (9.29) to solve as
1 + 0 = 12 (/2) [(/2) 11 (/2)0 ]

10 km 0 = 0 . s 0

(9.33)

where as usual the superscript + denotes the velocity immediately after the impulse. Now, the desired nal position is (/2) = 0. Inserting this, along with the initial position, into (9.33) and solving produces 3.841 1003 km 3.841 m 03 + 1.920 = . 0 = 1.920 10 s s 0 0

162

Chapter 9. Satellite Clusters and Relative Motion


The impulse needed is this value minus that before the impulse. Thus, + 0 = 13.84 v1 = + 1.92 m/s. 0 We now use (9.30) to compute the velocity of the astronaut when he arrives at the spacecraft. Since he will want a velocity of zero relative to the craft, he will use an impulse to negate the arrival velocity. Therefore, + v2 = 21 0 + 22 0 . Solving this produces m/s. v2 = 3.841 1.920 + 6.314k The relative motion is shown in Figure 9.4.
0.5 0 8 0.5 10

(9.34)

out of plane

radial

1 1.5 2

2 2.5 3 10 0 10

along track

along track

Figure 9.4: Motion of astronaut relative to spacecraft (example 9.2).

Equations (9.33) and (9.34) are valid in general for impulsive rendezvous problems in the Clohessy-Wiltshire formulation. However, they work only if the submatrix 12 is invertible. It almost always is, but there are certain values of nt for which it is not. In the example, the astronaut was heading to the origin of the local system. The only eect this has on the problem is to set the desired nal value of to zero. As a nal note of caution, it should be remembered that the Clohessy-Wiltshire equations are approximations. They are very close to reality for motion that is quite close to the reference circular orbit. As the motion gets larger, the accuracy decays. A shortcoming of this approach is that it does not provide a way to estimate the size of the error.

9.2: Natural Motion and Orbital Elements

163

9.2

Natural Motion and Orbital Elements

In the last section, the dierential equations for the motion of a spacecraft relative to another body were derived. Those equations are valid in general, but could only be solved if the thrust is known. It was a fairly straightforward process, however, to solve for the natural relative motion, when the reference orbit was circular. The equations for relative motion can be derived as in section 9.1 for non-circular reference orbits as well, but the results are not so easily solved. As interest in satellite clusters increased, many investigators addressed these problems, and transition matrices for elliptic orbits were derived. In this section, we take a dierent approach, expanding the orbital elements directly. This will allow an examination of the qualitative motion of the spacecraft relative to the station.

9.2.1

Expansions of the Orbital Elements

In general, the idea of series expansion is that a quantity is expanded about a nominal value of that quantity, in terms of some (usually small) parameter. In the last section, the equations of motion were expanded by taking a Taylor expansion of the equations of motion. The nominal value of the motion was the circular reference orbit. The expansion parameter was the vector joining the body of interest to the origin of the local frame; this was small relative to the radius of the reference orbit. In this section, the motion of a body in orbit is expanded about the mean anomaly, and the expansion parameter will be the orbital eccentricity. The techique used is the Lagrange method, whic is presented in Appendix C.2. It is not the only technique available, but it works well in expanding orbital elements, because it is well-suited to relations of the form of the eccentric anomaly. The eccentric anomaly is expanded in the appendix, as an example of the method. Using the same technique to expand cos E provides cos E = cos M e 3e2 (1 cos 2M ) + (cos 3M cos M ) + 2 8 (9.35)

This can then be substituted into equation (3.11) r = 1 e cos E a to get e2 3e3 r = 1 e cos M + (1 cos 2M ) (cos 3M cos M ) + a 2 8 (9.36)

Clearly, if e is small, the terms in the expansion become small very quickly. The series is valid for fairly large values of eccentricity, as well, though it fails to converge for some values of M when e is greater than about e = 0.663.

164

Chapter 9. Satellite Clusters and Relative Motion

Finding an expansion for the true anomaly is somewhat more dicult, and the exercise does not provide increased understanding of the result,5 and so will be neglected here. In the end, the result is the expansion = M + 2e sin M + e3 5e2 sin 2M + (13 sin 3M 3 sin M ) + 4 12 (9.37)

Note that these expansions are for the motion of an individual spacecraft; nothing has been said about a second vehicle. However, as was noted in section 9.1, the only way for two vehicles in natural motion to remain near each other is if their orbital energies match. If the energies match, it follows as well that the semimajor axes and the mean motions match. In such a case, the mean anomaly of one vehicle minus that of the other will be a constant.

9.2.2

An Order Analysis

Consider the motion of one satellite relative to another, when the second satellite, which we take as the reference, is in an orbit that is signicantly elliptic. If the second satellite is to be close to the reference at all points on the orbit, they must have very similar arguments of periapsis. Since the mean anomaly is measured from periapsis, it follows that M = M0 + M, (9.38) where M is the mean anomaly of the satellite in which we are interested, M0 is the mean anomaly of the reference satelite, the dierence M is small. Applying (9.37) to each satellite and taking the dierence, we have 0 = M + 2(e0 + e) sin(M0 + M ) + 5 sin M0 + 2e0 sin M0 e2 4 0 5 (e0 + e)2 sin (2M0 + 2M ) 4 (9.39)

Invoking some well-known trigonometric identities provides 0 = M +2e0 (sin M0 cos M + cos M0 sin M ) 2e0 sin M0 +2e (sin M0 cos M + cos M0 sin M ) 5 + e2 (sin 2M0 cos 2M + cos 2M0 sin 2M ) 4 0 5 (9.40) + e0 e (sin 2M0 cos 2M + cos 2M0 sin 2M ) 2 5 5 + e2 (sin 2M0 cos 2M + cos 2M0 sin 2M ) e2 sin 2M0 + 4 4 0
5

Though it is enjoyable as a mathematical exercise.

9.2: Natural Motion and Orbital Elements

165

To perform an order analysis means to identify terms that have the same order in small parameters. In the last section, we noted that the magnitude of was very small relative to rs . Any term that included higher powers of were therefore considered negligible. In this case, things are not quite as simple, as there are more variables to consider. Consider the along-track relative motion. The distance that the satellites are apart will be very nearly r0 ( 0 ). If this is considered small relative to r0 , it is clear that 0 is the small term. In (9.39), M appears by itself, so if 0 is small, so must be M . Now think about the radial distance. Since the orbits must be similar, the satellites must have similar periapsis radii. Recall that rp = a(1 e) so that the dierence in periapsis radii is rp rp,0 = ae. Since a is not small, we have that e must be small. We now have two small parameters. Note: It is possible one of e or M is zero, or at least very small relative to the other. However, the important thing for our analysis is that neither is larger than small. Consider the second term on the right hand side of (9.40). Because M is a small parameter, it is necessary to expand the trigonometric terms containing it. Recalling that 3 5 + + 6 5! 2 4 cos = 1 + + 2 4! sin = shows that sine terms will contribute a rst-order term, while cosine terms contribute a zeroth- and a second-order term. Both also contribute higher-order terms, but for now it is sucient to consider only orders zero through two. Expanding the second term through second order in M (there are no terms containing e) provides. 2e0 (sin M0 cos M + cos M0 sin M ) = 2e0 sin M0 1 M 2 /2 + 2e0 cos M0 M (9.41) The term 2e0 sin M0 on the right hand side will be subtracted out by the third term in (9.40), eliminating the zeroth-order term. The only remaining term of rst-order is 2e0 cos M0 M .

Chapter 10

Rocket Performance

The only feasible method so far devised for achieving the velocities necessary to orbit satellites, or to send objects (and people) beyond the gravity well of the Earth, is the chemical rocket. Many other schemes have been proposed, most impossible, but none have so far been implemented. Some of these other schemes have to their credit great imagination, and some the possibility of great achievements. Several authors over the last few centuries have proposed ying to the moon in balloons, or in vehicles harnessed to birds. Jules Vernes idea of using a mammoth cannon to re a shell to the moon is also arresting, and equally impossible (at least with any chance of survival for passengers). To get goods from the moon to the Earth, the lunar inhabitants in Robert Heinleins The Moon is a Harsh Mistress use a catapult. This is more feasible, since the gravity of the moon is much less than that of the Earth. And once in space, there are proposals for solar sails, which are basically huge mirrors, that use the very small momentum of light to accelerate arbitrarily large bodies to interstellar speeds. All of these ideas and more are beyond the scope of this text. In this chapter, we will restrict ourselves to a basic discussion of the chemical rocket. We will rst derive the most basic equations for describing rocket performance outside of the atmosphere and gravity of a planet, and then discuss the forces due to these factors. We will look at the sounding rocket. Finally, we will discuss single- and multi-stage rocket performance in terms of mass fractions, the non-dimensional factors often used to describe vehicles.

10.1

Thrust of a Rocket

For our purposes, a rocket will be any device that creates thrust by expelling only stored mass. This denition excludes jet engines, which primarily use their stored fuel to increase the energy of the air stream that ows through them. The only such rockets that achieve sucient thrust to enable launch from the surface of the

167

168

Chapter 10. Rocket Performance

u-v

Figure 10.1: Sketch of rocket and exhaust

Earth use chemical fuel, though electrical thrusters have been developed for use in low-thrust applications once in orbit. A chemical rocket works by burning stored fuel (typically referred to as propellant) and allowing the exhaust gases to escape through a nozzle at the back. These gases are owing at a velocity u relative to the rocket, which itself is traveling at a velocity v . To compute the resulting thrust, consider the rocket at time t, and at some later time t + t. At time t, the rocket has mass m, and is moving at some velocity v . After the interval t, the mass has decreased because some m of propellant has been expelled, and the velocity has increased by some increment v . Thus the momentum at times t and t + t are Mt = mv Mt+t = (m mp )(v + v ) + m(v u) Since momentum is conserved, these two can be set equal to each other to get mv = (m mp )(v + v ) = mv = ump v mp Dividing through by t and taking the limit as t 0 gives mv = um p (10.4) (10.1) (10.2)

(10.3)

The term on the left is the instantaneous change in the momentum of the rocket, and thus is equal to the thrust imposed upon it by the expulsion of propellant. We thus have that T = um p (10.5) This equation tells us that the thrust of a rocket is dependent both upon how much mass ows through the nozzle, and on the speed it attains relative to the rocket as it leaves. The two obvious ways to improve the performance of a rocket are then to increase the mass ow rate m p , or the exhaust velocity u. The rst of these can be easily enough accomplished by building a bigger rocket motor, but this requires more

10.2: The Sounding Rocket

169

structural mass and may defeat the purpose. The second requires using very highenergy fuels, or changing the basic technology of the rocket. Propellant chemists work at increasing the energy derived from chemical propellants. The goal of technologies such as nuclear and electrical rockets is to achieve exhaust velocities well above those of chemical rockets.

10.1.1

The Rocket Equation

The mass ow rate in eqn. (10.4) is positive because it refers to the mass of propellant. The mass of the rocket, which includes all unburnt propellant, changes as well, and the rate of change has the opposite sign. We can thus re-write (10.4) as mv = um We can solve this equation: m dm dv = u dt dt = mdv = udm = dv = u dm m

We make the assumption that u is a constant (which is in fact quite a good assumption), which allows us to integrate:
v v0

dv = u

dm m0 m v v0 = u ln(m0 /m)

Replacing v v0 with the commonly used v gives us v = u ln(m0 /m) (10.6)

This is one form of the rocket equation, which describes the ideal performance of a rocket. In various forms, this equation will be the main component of the rest of this chapter.

10.2

The Sounding Rocket

The simplest case of a rocket trajectory under gravity is the sounding rocket1 . These rockets are launched vertically, usually carrying an instrument package that will then be allowed to come back to earth on a parachute. In the ideal case, a sounding rocket will travel a straight line directly upwards. In reality, of course, winds, thrust misalignment, and asymmetries in the rocket and
1 The term is an analogue of the naval term. Sounding refers to lowering a weight on a rope over the side of a sea-going vessel to determine the depth of the water.

170

Chapter 10. Rocket Performance

payload will cause a small variation from this, but we will ignore these eects in this analysis. The sounding trajectory will consist of two parts, the boost phase and the coast phase. During the rst of these, the rocket motor is burning and the vehicle is accelerating against gravity. During the second, the motor has burned out, and the rocket will coast until the kinetic energy it had at the end of the boost is gone and the velocity has gone to zero.

10.2.1

Powered Boost

The boost phase of the trajectory will typically last several seconds. During this phase, the vehicle will gain velocity and altitude. The altitude gained will be suciently small that we may ignore the variation of gravitational attraction with altitude, and we will take gravity to be constant. We will also ignore any eects due to the atmosphere. The equations of motion are therefore simple. The vertical acceleration is the dierence between the eects of thrust and of gravity, and the velocity is the rate of change of altitude. Maintaining our assumption of constant thrust and exhaust velocity, we also have a constant mass ow rate from the motor, and the equations are um v = g (10.7) m = v h (10.8) m = k (10.9) Here, the constant k is the mass ow rate of the burned fuel from the motor2 . We will begin by considering the nal equation, (10.9). This equation can be written dm = kdt m(t) = m0 kt This will be useful in evaluating the other two expressions. Consider eqn. (10.7). We can re-write this as mv = um mg The rst term on the right is simply the equation we derived at the beginning of the chapter, and thus gives us the rocket equation when we integrate. The second term is evaluated as v = g = v = gt
2 Liquid-fueled motors, or solid motors with varying burn rates, will not have a constant ow rate. Most sounding rockets are made with simple, inexpensive solid fuel motors, however.

which allows us to write the mass as a function of time as

(10.10)

10.2: The Sounding Rocket (ignoring any initial velocity). Thus the equation as a whole gives v = v (0) + u ln(m0 /m) gt

171

(10.11)

This is correct, but the use of both t and m as independent variables on the right hand side is needlessly confusing. Instead, we use eqn. (10.10) to write time as t= m0 m k

Now we can write the velocity as a function of either mass or time as v (t) = v (0) + u ln m0 m0 kt gt m0 m k (10.12) (10.13)

v (m) = v (0) + u ln(m0 /m) g

Note that, until we wrote mass as a function of time, we had made no assumptions of constant ow rate in deriving the velocity. Thus eqn. (10.11) is valid so long as the exhaust velocity is constant, while the last two are valid only if the mass ow rate is also constant. Solving for the altitude is somewhat more complicated. We can again choose to solve for a function of mass or time, and we will choose to solve for mass. Thus we re-write eqn. (10.8) using (10.13) as dh = v (0)dt + u ln(m0 /m)dt g Then using t= we get dh = v (0)dm u + ln k k m m0 dm g m m0 dm k k (10.14) m0 m k = dt = dm k m0 m dt k

We integrate each term on the right hand side of (10.14) separately. Up to this point, we have not worried about the notation of integrating a function of mass using limits of integration that are themselves functions of mass. To make it clear that the values used as limits of integration are specic values, rather than functions, we will use m as the limit of integration, and replace it with m at whatever point is convenient when we are nished. The rst term integrates simply to
m m0

v (0)dm (m0 m )v (0) = k k

172 The second we integrate3 as u k The third becomes


m

Chapter 10. Rocket Performance

ln
m0

m m0

dm =

u m ln k

m m0

+ m0 m

m g (m m0 )2 g (m m0 )dm = k 2 m0 2k 2 The altitude as a function of mass is then

(m0 m)v (0) u h= m ln + k k

m m0

g (m m0 )2 + m0 m 2k 2

(10.15)

In the usual case, the initial velocity will be zero. We can then nd the velocity when the motor burns out from hbo = u mbo ln k mbo m0 + m0 mbo g (mbo m0 )2 2k 2 (10.16)

10.2.2

Coast Phase

After the motor burns out, the rocket will continue to coast upwards until all of the kinetic energy it attained during the boost phase is converted to potential energy. In this phase, the rocket will cover sucient altitude that the variation in gravity has a signicant eect. Therefore, we replace our constant gravity with a function of altitude: R2 g (h) = g0 (10.17) (R + h)2 In this equation, R is the radius of the Earth (or the moon, or whatever body the rocket is being launched from), h is the altitude above the surface, and g0 is the gravitational acceleration at the surface. Since the gravitational force eld is conservative, we can equate the kinetic energy of the rocket at burnout to the work done by gravity during the coast. We then have
hbo +hc 2 R2 dh mbo vbo = mbo g0 2 (R + h)2 hbo 2 vbo g0 R 2 g0 R 2 = = 2 R + hbo R + hbo + hc

which, after some simplication, gives the coasting altitude to be hc =


2 vbo (R + hbo )2 2 (R + hbo ) 2g0 R2 vbo

(10.18)

Adding this altitude to the altitude at burnout will give the nal altitude of the sounding rocket.
3

We nd in tables that

ln x dx = x ln x x.

10.2: The Sounding Rocket

173

Example 10.1. Given a sounding rocket with a payload of 12 pounds,4 8 pounds of structure, and 12 pounds of propellant with an exhaust velocity of 10000 feet per second, what is the nal altitude achieved if the burn time is 12 seconds? How does it change if the burn time is only 6 seconds? Solution: The equations we just derived give us v (tbo ) = u ln(m0 /mbo ) gt =

vbo = 10000 ln[(12 + 8 + 12)/(12 + 8)] 12 32.174 = 4700 386.1 = 4314 ft/s

Note that we went ahead and used pounds as a unit of mass here. We can get away with this because we are using it in a ratio, where the conversion factor will cancel out. This is in general dangerous the notion of 12 pounds of propellant is nonsensical anywhere but on the surface of the Earth. The equation for burnout altitude also has mass only used in ratios, since the masses inside the brackets in eqn. (10.16) are divided by the mass ow rate outside. Thus, so long as we are careful to use consistent units, we can set k = 12lb-m/12 seconds and compute hbo = 32.174 10000 [20 ln(20/32) + 32 20] (20 32)2 = 2.370 104 ft 1 2

The additional altitude due to the coast phase is given by eqn. (10.18) to be (using R = 2.0925673 107 feet) hc = 43142 209493552 = 2.939 105 ft 2 32.174R2 43142 (20949355)

The total altitude of the rocket thus comes out to be 317,600 feet. If the burn time is shortened to 6 seconds, k becomes 2 lb-m per second, and the numbers become vbo = 4507 ft/s hbo = 1.286 104 ft for a total altitude of 333,800 feet. hc = 3.209 105 ft

The results of this example help to explain the optimal use of propellant in sounding rockets. In is well-known to rocket designers that the most ecient use of propellant when thrusting against gravity is to burn as quickly as possible. It is clear
When the term pound is used as a unit of mass, it is understood that the unit is the lb-mass, dened as the mass in slugs times gc (giving us the amount of mass that weighs n pounds under normal gravity). While this is not a generally accepted unit of mass, it is still often used.
4

174

Chapter 10. Rocket Performance


Atmospheric Pressure

Exhaust Pressure

Figure 10.2: Atmospheric pressure on a rocket

from eqn. (10.12) that this results in the highest burnout velocity. It is not as clear that the reduced altitude at burnout is more than recovered during the coast. Obviously, the limiting case of the impulsive burn is not possible, as it results in innite acceleration. The structural limits of the rocket and payload often limit the acceleration, and in this way dene the burn time. Even when this is not a problem, however, the faster-is-better rule may not hold when the eects of atmospheric drag are included in the equations.

10.3

Aerodynamic Eects on Rocket Performance

While a key use of rockets, and the use that makes them of interest to us, is their ability to function in space, the majority of rockets actually spend most or all of their working lives in the atmosphere. Missiles, for example, are commonly powered by solid rocket motors. The sounding rocket also generally does all of its thrusting while still well in the dense lower atmosphere.

10.3.1

Pressure Thrust

The two primary eects of the atmosphere are drag and an additional term in the thrust equation. We will look at the second of these rst. Consider a rocket in the atmosphere. The arrows in Figure 10.2 represent the atmospheric pressure, which is evenly distributed around the body and acts normal to the surface everywhere, with the exception of the exit plane. The exit plane is the area of the rocket nozzle at the end, and the pressure acting on the rocket at this point is the exhaust pressure Pe . Summing the pressure forces about the rocket body, we see that all of the forces other than those involving the exhaust pressure cancel themselves out, and we are left with a term in the direction of rocket travel that is due to the dierence in exhaust

10.3: Aerodynamic Eects on Rocket Performance pressure and atmospheric pressure: Tex = (Pe Patm )Ae

175

(10.19)

That is, thrust due to the atmosphere is equal to the dierence in exhaust and ambient pressure, times the area of the exit plane. The pressure in the atmosphere varies with altitude. To rst order, a good approximation is the exponential atmosphere, in which the pressure is given by Patm = P0 eh/P (10.20)

Here, P0 is the pressure at sea level, and P is a proportionality constant. If h is measured in feet, the value of P is usually taken to be 23200. Because of this reduction in pressure with increasing altitude, the amount of thrust due to exhaust pressure increases as the rocket gains altitude. The exhaust pressure is determined by two primary factors. One is the pressure in the combustion chamber of the rocket. Most chemical propellants burn faster at higher pressures. Also, the density of the exhaust gas is higher at higher pressure, so that more mass can ow through a smaller nozzle. For these reasons, small highperformance rocket motors often will have high combustion chamber pressures. The exhaust pressure is given by the chamber pressure times the second major factor, which is determined by the expansion ratio of the nozzle. The expansion ratio is simply the exit area of the nozzle divided by the cross-sectional area at the smallest part, or throat, of the nozzle. While it might seem that the ideal rocket would have a very high pressure thrust term, this is not the case. The exhaust pressure decreases as the exhaust gases ow through the expanding nozzle, so that a high pressure also tends to mean small area. More importantly, the gases attain higher velocity as they expand (an explanation of this is far beyond the scope of this book), and it is this velocity that gives the majority of the thrust. It turns out that the ideal case, yielding maximum thrust, is when the exhaust pressure exactly matches the ambient pressure. Due to design constraints and the change of pressure with altitude, matching the pressures is seldom possible. For air-to-air missiles, which must t under the wing of an aircraft, the nozzle size is limited and the performance must be very high, so the exhaust pressure can be in excess of 100 psi. For the space shuttle boosters, however, the chamber pressure is much lower, and the nozzles can be much larger. For this class of vehicle, the pressure term may actually be negative at launch, and become positive only as the atmospheric pressure gets low.

10.3.2

Drag

Aerodynamic drag is the force impeding the motion of a body through the atmosphere. The eect is proportional to the square of the velocity, so that while it is experienced

176

Chapter 10. Rocket Performance

by any body moving through the air, it is a far greater eect for high-speed vehicles such as aircraft and rockets. There are several components of drag, from the friction of the air rubbing over the body to the energy required to push the air out of the path of the body. It is beyond our needs to investigate these thoroughly. Instead, we will simply note that drag on a rocket is usually characterized by the equation 1 D = v 2 Sref CD 2 (10.21)

In eqn. (10.21), is the density of the atmosphere, and v is the velocity of the rocket relative to the atmosphere. The term Sref (often simply S ) is the reference area, which for a rocket is usually dened to be the cross-sectional area of the main rocket body, looking down the longitudinal axis. Since a rocket typically has a circular cross-section, this is simply d2 /4, where d is the diameter of the rocket. The term CD is called the Drag coecient. It is dened for a particular body through eqn. (10.21). That is, the drag is measured, and divided by v 2 Sref /2 to get the coecient. The value of CD varies with velocity (more precisely, with Mach number), but is nearly constant for most rockets above three times the speed of sound, where the vehicle spends most of its operating lifetime. Thus, for initial design work, CD is usually taken to be a constant. For typical rockets, the value of CD at high Mach numbers will be in the range 0.12 to 0.16. The lower value will be for a very clean, sleek design. If we include drag in the dierential equation of motion, we have before burnout v = 1 m 1 um + v 2 Sref CD 2

(recall that m is negative). After burnout, the thrust term of course goes to zero. This is a non-linear equation, and no closed-form solution exists for it. Instead, it is solved numerically on a case-by-case basis.
Example 10.2. Consider the sounding rocket of example 10.1. This rocket might typically have a diameter of four inches, and a drag coecient of 0.14. We will assume the exponential atmosphere, which denes the density as a function of altitude to be (h) = 0 eh/ where the proportionality constant = 23800 feet. While this situation does not lead to analytical solutions, we can set up the equations of motion and integrate them numerically. For this example, we will ignore the thrust due to exhaust pressure, but include the gravity term. The equations of motion are 0 t tbo : t > tbo : m = k v = (uk D)/m g =v h m =0 v = D/m g =v h

10.4: Specic Impulse

177

Using 12 seconds as the burn time, we have a maximum drag of 79.4 pounds at burnout, which is the point of maximum velocity. The thrust is T = uk = 10000(1/gc ) = 311 pounds, so the drag is fully one-fourth of the thrust. The burnout velocity is reduced by the drag to 3686 ft/s, and the altitude to 21650 ft. The nal altitude is drastically reduced, with the vehicle attaining a maximum of only hf = 1.673 105 feet.

Note that in this example, the mass and mass ow rate cannot be in poundsmass, because the term D/m would not come out correctly. In this case, a consistent set of units must be used. The problem of attaining the maximum altitude with a sounding rocket is known as the Goddard problem, after Robert Goddard, an American pioneer of rocketry. Too high an acceleration creates high velocities while still in the lower atmosphere, where the relatively great atmospheric density leads to large drag forces. Too low an acceleration, however, allows gravity too much time to act against the rocket. The mathematics of the tradeo are very complicated. The solutions are unique to each rocket, and generally require a variable-thrust motor to implement.

10.4

Specic Impulse

Rocket fuels are often compared based on a number called the Specic Impulse. The specic impulse of a particular fuel is dened as the thrust obtained by burning a pound of fuel, multiplied by the amount of time required to burn the fuel. The higher this number, the better the fuel. Under ideal circumstances, the thrust is given as seen in section 10.1 by the equation T = um p Thus we can relate the specic impulse to the exhaust velocity as
t t

Isp =
0

T dt =
0

dmp dt dt

Switching the variable of integration from time to mass, we note that one pound of fuel is 1/gc slugs, so we get
t

u
0

dmp dt = u dt

1/gc

dm = u/gc
0

We therefore have the result that u = gc Isp (10.22)

Plugging this into eqn. (10.6), we get a second version of the rocket equation, v = gc Isp ln(m0 /m) (10.23)

178

Chapter 10. Rocket Performance

The unit of Isp is seconds, and thus eqn. (10.23) works in all unit systems in which the time is measured in seconds. In the metric system, for instance, gc = 9.80665 m/s2 and the exhaust velocity will be in meters per second. As was seen in section 10.3, the actual thrust seen by a rocket will depend on several factors beyond the chemical content of the fuel itself. In order to make eqn. (10.23) meaningful, these factors must in some way be accounted for. This is typically done by using a value for specic impulse that is a bit lower than the ideal. The ideal specic impulse is that which multiplies the burn time by the highest possible thrust. This thrust is that attained if the motor is burned in a vacuum (so that Patm is zero) and the exhaust nozzle allows the exhaust gas to fully expand. This value is often known as the vacuum specic impulse, and is thus denoted Ispv . A value often used in preliminary design is the delivered specic impulse. Since the delivered thrust depends on the circumstances under which the fuel is burned, this value can change quite a bit. The rocket designer can deduce a good value by starting with the vacuum specic impulse and accounting for the other eects. She then has a value of Ispd which can be used in further preliminary design and analysis work. As seen is section 10.1, the thrust developed by a rocket motor is determined in the main by the velocity of the exhaust gases. For a high velocity, the amount of energy must be high, implying a high combustion temperature, and the molecular weight of the gases must be low. One of the highest possible values of specic impulse is attained by burning a combination of liquid hydrogen and liquid oxygen, which results in Isp = 457 s. The space shuttle main engine achieves a delivered Isp of 455 s, for an eciency of over 99.5%.

10.5

Single Stage Performance and Mass Fractions

We saw in our earlier derivation of the rocket equation that the velocity capacity of a rocket is determined by the fraction m0 /mbo . This value is known as the mass ratio. This ratio, along with the specic impulse of the motor, denes the performance of the vehicle, at least when the eects of atmosphere and gravity are ignored. Since many of these eects can be taken into account by judiciously altering the value of Isp assumed delivered, this is actually a good approximation, and gives an excellent rst approximation of the requirements and performance of the rocket engines needed for a mission. The total mass of a single stage rocket vehicle is divided into three categories: the payload mass P , the propellant mass mp , and the structural mass ms . Of these, only the payload is protable mass; the structural mass is dead weight that must be carried along to house the propellant and provide it a place in which to burn and a nozzle through which to exit. The overall mass fraction of the vehicle is determined in part by the technology of the rocket motor, and in part by the payload. The motor technology can be described

10.5: Single Stage Performance and Mass Fractions

179

separately from the payload by the structural factor ,5 dened as the structural mass over the mass of structure plus the mass of propellant: = ms ms + mp (10.24)

We add to the motor the mass of the payload, giving us the total mass of the vehicle, and the part of that mass which is useful payload. The fraction of initial mass that consists of the payload is the payload mass fraction (or payload fraction or payload ratio), P P = = (10.25) m0 P + ms + mp These are the two mass fractions we will use to investigate vehicle performance. Others can be dened; in particular, the propellant mass fraction is often used. It is usually dened as the mass of propellant divided by the initial mass of the vehicle. The equation used to estimate the performance is the rocket equation, which we recall is v = gc Isp ln(m0 /mbo ) We write this in terms of the structural factor and the payload fraction: ms + P ms + P + (mp mp ) mbo = = m0 ms + mp + P ms + mp + P mp ms + mp mp mp =1 =1 =1 ms + mp + P m0 m0 ms + mp Now, ms + mp m0 P = =1 m0 m0 mp + ms ms mp = =1 ms + mp ms + mp mbo = 1 (1 )(1 ) = + (1 ) m0

and

so that Finally, we have

v = gc Isp ln(m0 /mbo ) = gc Isp ln[ + (1 ) ]

(10.26)

Example 10.3. Suppose we have a structural factor of = 0.08 and a delivered specic impulse of Isp = 230. What is the maximum velocity (assuming a start from zero) that can be attained?
5 Various authors use dierent symbols for the mass fractions. There does not seem to be an agreed-upon standard.

180
Solution: Using eqn. (10.26), we have

Chapter 10. Rocket Performance

v = gc Isp ln[ + (1 ) ] which implies that the velocity is maximized when = 0. Then the velocity is simply v = gc Isp ln[ ] 230gc ln[0.08] = 581gc which comes out to 18690 feet per second or 5697 meters per second.

Note that this example implies that such a rocket could never be used to attain orbit from the Earths surface, since a low-Earth orbit requires about 7.7 km/s of orbital speed (and about another 1.5 km/s that gets lost in overcoming drag and gravity). This leads to the multi-stage rocket, the subject of the next section.

10.6

Multi-Stage Rockets

The diculty with using single-stage rockets to achieve space is the amount of structural mass that must be carried along. To alleviate this problem, it is advisable to discard excess mass when possible. This is accomplished through staging, in which several rocket motors are used in turn, each being discarded when its fuel is exhausted. In this way, the structural mass of the expended motor is also done away with, and the necessity to further accelerate it eliminated. In this section, we will consider only the most basic form of staging, in which only one motor is red at a time, and is discarded before the next is ignited. More general forms of staging are often used. The space shuttle, for example, res both the main engine and the strap-on boosters during launch. The boosters are dropped when their fuel is expended, but the main engine remains. The external fuel tank is later dropped, as well. Thus the shuttle has in a sense three stages (since it has dierent structural mass in each phase), but one engine is active in all three.

10.6.1

Two-Stage Rockets

Consider a rocket with only two stages. The stages are stacked one atop the other, and numbered in the order in which they are red, so that stage one is the lower stage. In this simplest case, the multi-stage rocket can be considered two single-stage vehicles. The second stage is a usual rocket, with structural mass ms,2 , propellant mp,2 , and payload P . The rst stage has structural and propellant masses ms,1 and mp,1 , and its payload is everything it carries. Thus the payload of the rst stage is the total initial mass of the second stage. The payload mass fractions of the two stages are then m0,2 P 1 = ; 2 = m0,1 m0,2

10.6: Multi-Stage Rockets

181

The overall payload fraction of the vehicle is the payload over the initial mass of the vehicle. The initial mass of the vehicle is the initial mass of the rst stage (since its initial mass includes its payload, which in turn includes all other stages), so = P m0,2 P = = 1 2 m0,1 m0,1 m0,2 (10.27)

We can use the same equations to nd the performances of the two stages that we had for the single-stage rocket. Given the payload fractions and structural factors of the two stages, we use eqn. (10.26) for each stage to get v1 = gc Isp1 ln[1 + (1 1 )1 ] v2 = gc Isp2 ln[2 + (1 2 )2 ] and the total velocity achieved is simply v = v1 + v2 In general, there is no reason to expect that the mass fractions of the two stages will be the same. Quite often, the lowest stage of a vehicle will be a solid rocket, with the topmost stage having a liquid motor.
Example 10.4. We wish to accelerate a payload P to a velocity of 9000 m/s. We will use a booster technology which has a structural factor = 0.08, and fuel with specic impulse Isp = 230 seconds. Using a two-stage vehicle, how much propellant will be needed (in terms of P )? Solution: Since the specic impulse and structural factors are the same, the optimal solution will have the payload fractions and v s the same for the two stages. Therefore v2 = 4500 = 9.80665 230 ln[0.08 0.92 ] = 0.08 + 0.92 = e1.9951 = 0.136 For the upper stage, we have m0,2 = P/ Now, we know that ms = so we have that P + mp + 0.0870mp = 16.43P = 1.0870mp = 15.43P = mp = 14.2P (10.28) mp 1 = ms = 0.0870mp = m0,2 = 16.43P

= 0.0609

182

Chapter 10. Rocket Performance


For the lower stage, the analysis is identical, except that the payload is the entire upper stage. Thus we substitute P/ for P in eqn. (10.28) to get mp,1 = 14.2(P/ ) = 233.2P Finally, we sum the propellant in the two stages to get a total of mp = 14.2P + 233.2P = 247.4P

It may seem that almost 250 kg of propellant for every kg of payload is quite a lot. We saw in example 10.3, however, that the job is not possible at all for a single-stage vehicle.

10.6.2

The n-Stage Rocket

A rocket with any given number n of stages is treated by extending the results for a two-stage rocket. The total velocity of the rocket is the sum of the stage v s:
n

V = gc

i=1

Ispi ln[i + (1 i )i ]

(10.29)

and the overall mass fraction is the product of the stage mass fractions:
n

=
i=1

(10.30)

Again, there is no reason that the stages of the rocket should be identical, particularly given the diering technologies that may be used for them. In the very special case in which all of the stages use the same technology, so that the structural factors and specic impulses are the same, it turns out that the most ecient way to construct the rocket is to have the same payload fraction for each stage, and thus the same v for each stage. By most ecient, we mean that the rocket will achieve the greatest velocity for a given nal payload and specied amount of fuel.
Example 10.5. For a specied payload P , compute the amount of fuel required to achieve a velocity of 8 km/sec for 1. a two-stage rocket with Isp = 210 for each stage 2. a three-stage rocket with Isp = 210 for each stage Assume that and v are the same for each stage, and use = 0.08 for all stages. : Solution Since v is the same for all stages, we can write for (a) v1 = v2 = 4000 = Isp gc ln[ + (1 ) ] = e1.9423 = 0.08 + 0.92 =

= 0.06888

10.6: Multi-Stage Rockets


Now, we know =

183

ms ms + mp

ms =

Since = k = Pk /m0,k for the k -th stage, we can use this to get m0 = ms + mp + P = P/ = which in this case gives mp,k = 12.436Pk For the rst stage, P1 = m0,2 = P/ = 14.518P , so mp = mp,1 + mp,2 = 12.436P1 + 12.436P2 = 12.436(1/ + 1)P = 193P For part (b), everything is the same, except that we have three stages. Then v = 8000/3 = 2667 And (1/ 1)P = (1 + /(1 ))mp = mp,k = 3.444Pk Since we have three stages, we have three terms in the total mass: mp = 3.444(1/ 2 + 1/ + 1)P = 97.3P Note that the amount of propellant required per unit mass of payload has been greatly reduced in the 3-stage vehicle. km s = = 0.2108 (1 + /(1 ))mp,k = Pk /k

mp 1

From example 10.5, we can see that increasing the number of stages greatly increases the eciency of a rocket. Adding a fourth stage would reduce the propellant requirement still further. The limiting case would be a rocket made up of an innite number of innitely small boosters. While this case is clearly infeasible, it is interesting to consider. The dening characteristic of such a rocket is that the excess mass is discarded eectively continuously. Thus, we can think of the structure as being burned along with the fuel.6 As such, we can examine the performance by adding the mass of the structure to that of the expelled fuel, without increasing the thrust. Recalling that ms = mp 1 we can dene the mass of propellant and structure, calling it m p , as m p = ms + mp = 1 mp 1

6 Such designs have in fact been proposed; so far as the author is aware, none have been developed beyond preliminary examination.

184

Chapter 10. Rocket Performance

We also have that T = um p , and from the denition of specic impulse that u = Isp gc . Therefore T = gc Isp m p Since the thrust will be unchanged even though the mass ow rate is increased to sp : include the structure, we can adjust the specic impulse to get an eective value I p sp m sp m p = gc I T = gc Isp m p = gc I 1 = sp = (1 )Isp I

We can then use the rocket equation, with = 0, to derive a maximum possible value for v . Carrying out the arithmetic, we get sp ln[ ] v = gc I If we also took the payload ratio to zero, we would get innite velocity; this is not only ludicrous, but useless.

10.7

Problems

1. At t = 0 the engine of a single-stage rocket ignites. Due to a design aw, the thrust is only half the initial weight of the rocket. In terms of Isp , when will the rocket actually lift o? (Assume constant mass ow rate.) 2. The total v required for transfer from low earth orbit with 28 inclination to geosynchronous equatorial orbit is 4.49 km/s. A space tug with a restartable engine has Isp = 453 s, 16,000 kg of fuel, and structural mass of 1300 kg. (a) How much payload can the tug deliver to geosynchronous orbit? (b) Can the tug make the round trip without payload? If so, how much payload can it carry to geosynchronous orbit and still make it back (after leaving the payload in orbit) to low earth orbit? 3. Consider the case of a vehicle with two strap-on boosters, such as the Space Shuttle. The solid fuel gives the boosters a specic impulse of Isp = 287 s, and the booster technology is such that the structural factor = 0.148. The main vehicle has a liquid fuel engine with Isp = 455 s that generates 1.41 million pounds of thrust. (a) If the burn time for the boosters is tb = 120 s and each produces 2.65 million pounds of thrust, what is the weight of the boosters, both initially (before ignition), and after the fuel is expended?

10.7: Problems

185

(b) When the vehicle is used, the boosters re along with the main engine. When the boosters burn out, their empty shells are dropped while the main engine continues to re. If the main engines re for a total of eight minutes, and the vehicle weighs 375,000 lb. when they shut o, what is the total v capability of the vehicle? 4. A two-stage sounding rocket in which both stages have Isp = 282 s carries 1167 kg of propellant in the rst stage and 415 kg in the second. The structural masses of the two stages are 113 kg and 41 kg, and the vehicle carries a payload of 250 kg. Ignoring the eects of burn time (see note below), (a) Calculate the approximate burnout speed, and the nal altitude, of the vehicle. (b) Instead of the usual strategy, assume instead that we allow the second stage to coast after the rst stage burns out, until it reaches its maximum altitude, and then we ignite the second stage. What is the resulting nal altitude? Note: To ignore the eects of burn time, simply ignore the gt term in the sounding rocket boost equation. This is equivalent to assuming that the altitude gained during burn is negligible compared to the altitude gained during coast.

Chapter 11

Solar Sailing

So far, this is just a ller.

187

Bibliography

[1] Bate, R.R., Mueller, D.D., and White, J. E., Fundamentals of Astrodynamics, Dover Publications, New York, 1971. [2] Battin, Richard H., An Introduction to the Mathematics and Methods of Astrodynamics, Revised edition, AIAA Education Series (J. S. Przemieniecki, Series Ed.), AIAA, Inc, Reston, Virginia, 1999. [3] Berry, Arthur, A Short History of Astronomy: From Earliest Times through the Nineteenth Century, Dover Publications, New York, 1961 (republication of work orginally published by John Murray, 1898). [4] Brouwer, Dirk, and Gerald M. Clemence, Methods of Celestial Mechanics, Academic Press, New York, NY, 1961. [5] Curtis, Howard D., Orbital Mechanics for Engineering Students, Elsevier Butterworth Heinemann, Jordan Hill, Oxford, 2005. [6] Danby, J.M.A., Fundamentals of Celestial Mechanics, 2nd ed., Willman-Bell, Inc, Richmond, Virginia, 1992. [7] Gurzadyan, G.A., Theory of Interplanetary Flights, Gordon and Breach Publishers, The Netherlands, 1996. [8] Kaplan, Marshall H., Modern Spacecraft Dynamics and Control, John Wiley and Sons, New York, 1976. [9] Kellogg, Oliver Dimon, Foundations of Potential Theory, Dover Publications Inc., 1954. [10] Meeus, Jean, Astronomical Algorithms, 2nd edition, Willman-Bell, Richmond, Virginia, 1998. [11] Moulton, F.R., An Introduction to Celestial Mechanics, 2nd revised ed., Dover, New York, 1970 (republication of work originally published by the MacMillan Company, 1914).

189

190

Bibliography

[12] Prussing, J.E., and Conway, B. A., Orbital Mechanics, Oxford, New York, 1993. [13] Roy, A.E., Orbital Motion, 3rd ed., Adam Hilger, Bristol, 1988. [14] Sutton, George P., Rocket Propulsion Elements: An Introduction to the Engineering of Rockets, 5th edition, John Wiley and Sons, New York, 1986. [15] Thomson, William Tyrrell, Introduction to Space Dynamics, Dover Publications, Inc., New York, 1986. [16] Vallado, David A., Fundamentals of Astrodynamics and Applications, 2nd edition, Space Technology Library Volume 12, Microcosm Press, El Segindo, California, 2001. [17] Wiesel, William E., Spaceight Dynamics, McGraw-Hill, Inc., New York, 1989.

Appendix A

Sidereal Time and Various Years

There is nothing in orbital mechanics, at least not to the extent that it is covered in this book, that makes the passage of time confusing. However, the telling of time is not always straightforward when it comes to dening days and years.

A.1

Solar vs. Sidereal Days

The rst diculty in dening time comes from dening exactly what is a day. Classically, this was not a problem, as the Earth was thought to be the center of the universe and the Sun revolved about it. A day was then the length of time between noons; that is, from when the Sun was highest in the sky until it was next highest in the sky. This was simple enough and more than sucient until it became known that the Earth in fact moved around the Sun.1 This leads to the use of the xed stars as the frame of reference, as in Chapter 4. In this frame, the day is more reasonable dened as the time between the highest points of some xed star or other celestial object. In fact, the sidereal day is dened as the time between successive passages of the rst point of Aries over the observers longitude. This is a slightly shorter length of time than the solar day discussed above. The term sidereal means with respect to the stars. The fact is that the Earth does not rotate about 365 and one-fourth times per year. Instead, it rotates once more, when viewed with respect to the xed stars. This is because during each day the Earth also moves about the Sun by about one degree, and so the Earth must rotate that much further to bring a point on its surface once again between the center
It is of course equally true to say that the Sun revolves around the Earth, as it is now known that there can be no true center dened to the universe. However, the other planets do not revolve around the Earth, and it is certainly more convenient to view the Sun as the center.
1

191

192

Appendix A: Sidereal Time and Various Years

of the Earth and the Sun. As the Earth does not move detectably with respect to the distant stars, there is no corresponding increase in the rotation needed to bring one of the stars back to an overhead position. Therefore the Earth rotates 360 degrees for each sidereal day, and approximately 360 + 360/365.25 degrees in the average solar day.

A.1.1

The Mean Solar Day

Because the Earth moves in an elliptic orbit, it does not move through the same angle every day. The solar day thus has a slightly dierent length at dierent times of the year. While the variation is small, it is measurable. This leads to the denition of the mean solar day, which is the average length of a day during one sidereal year.2 This is more important than it may at rst seem, because the second is dened as 1/86400 of a mean solar day.3 This is the usual second. There is also a sidereal second, which is analogous to the mean solar second; it is 1/86400 of a sidereal day.

A.2

Tropical and Sidereal Years

The year as usually understood is the sidereal year, which is to say when the Sun has regained its position with respect to the xed stars (as seen from the Earth). Recall that the heliocentric frame is dened in terms of the vernal equinox direction (sec 4.1), and that this direction is not quite constant. Therefore, the sidereal year is not quite the same as the tropical year, which is the time between successive passages of the Sun across the vernal equinox. Still another year is the anomalistic year, which is the time from perihelion to perihelion of the Earths orbit; due to various third-body eects, this is not quite constant.

This variation is on the order of seconds. The Earth moves just under 1.02 during a mean solar day at perihelion, versus a bit over 0.953 at aphelion, and it takes about 16 seconds for the Earth to rotate through the additional angle. 3 Actually, it is now dened as some number of wavelengths of some radiation of something, which can be precisely measured in an atomic clock. But the nember was chosen to come out to 1/86400 of a mean solar day, as usually understood.

Appendix B

Review of Vectors and Kinematics

In this section we present a basic review of vector arithmetic and some results from kinematics. This is not intended to be a complete treatment of the subject. For a more complete review, see Thomson [15], Introduction to Space Dynamics, chapters 1 through 3.

B.1

Vector Arithmetic

For our purposes, a vector will usually denote a quantity in 3 dimensions, such as a position or velocity in Cartesian coordinates. Thus the review given here will be conned to nite-dimensional vectors in general, and in some cases specically to 3-dimensional vectors. Except when specically noted, the same denitions and operations apply to all nite-dimensional vectors.

B.1.1

Notation

We will in general use the bold face lower-case Roman character to denote a vector, and the subscripted character to denote a member of the vector. The plain character without subscript will be used to denote the magnitude of the vector (the denition of magnitude will be given later). The zero vector is the vector consisting of all zeroes. It is usually written 0, but it is not uncommon to see notation such as a = 0, as well.

We will consider column vectors, and our examples will be 3-dimensional, such as a1 b1 a2 b2 a= b= a3 b3

193

194

Appendix B: Review of Vectors and Kinematics

B.1.2

Addition and Scalar Multiplication

Note that each of these operations is commutative; that is, a + b = b + a and k a = ak .

The multiplication of a vector by a scalar is also dened entry-by-entry as follows: ka1 ka2 ka = ka3

The addition of vectors is dened as the addition of the entries in the vectors. Thus we have a1 + b 1 c 1 a2 + b 2 c2 c=a+b= = a3 + b 3 c3

B.1.3

Magnitude

The magnitude (or norm) of a vector means, in simple terms, its length. For our purposes, the magnitude is the Euclidean norm, dened for a vector of arbitrary nite dimension as the square root of the sum of the squares of the entries in the vector. Using a to denote the magnitude of the vector a, we thus have
n

a=
i=1

a2 i

where n is the number of entries in the vector. For a vector with three entries, this is often seen written as 1 2 2 2 a = a2 1 + a2 + a3 A common notation for the norm of a vector is also a . It should be remembered that the denition we will use is not the only correct denition of norm. In a more general mathematical sense, a norm need only satisfy some given requirements. This is both beyond the scope of this review and of little immediate use.

B.1.4

Expression through Unit Vectors

A unit vector is simply a vector with length 1. A unit vector can be derived from any vector a (except the zero vector), simply by dividing the vector by its magnitude: = a/a a

B.1: Vector Arithmetic

195

The expression to the right of the second equals sign expresses a as the sum of the three component values a1 , a2 , and a3 , each multiplied by a unit vector that causes the component to take its proper place in a. The set of unit vectors shown is usually . denoted , , and k When the vector is expressed in a rectangular coordinate frame, these three unit vectors point along the three coordinate axes.

The result is a vector of unit length, pointing in the same direction as the original . (The use of the circumex (hat) to vector. It is immediately obvious that a = aa denote a unit vector is fairly standard.) From the last section, it is clear that we can write any three-dimensional vector a as a1 a1 0 0 0 0 1 a2 0 0 1 a2 0 0 a= = + + a3 + + a2 = a1 a3 1 0 0 0 0 a3

B.1.5

Dot and Cross Products

The dot product (sometimes referred to as the scalar product) of two vectors is dened as a b = ab cos where is the angle between the two vectors. It is immediate that a a = aa cos 0 = a2 are aligned with the coordinate axes, the angles Because the unit vectors , , and k between them are 90 , and we have that = =0 = k k From previous sections, we can write b1 a b = a1 + a2 + a3 k + b2 + b3 k which multiplies out to give + a2 b 1 a b = a1 b1 + b2 + b3 k + b2 + b3 k k +a3 b1 k + b2 k + b3 k More generally, for vectors of length n, we write
n

= a1 b 1 + a2 b 2 + a3 b 3

ab=

ai b i
i=1

196

Appendix B: Review of Vectors and Kinematics

From this, it is immediately obvious that the dot product is commutative, and distributive over addition. That is, ab=ba as a b = (ab sin ) c is a unit vector perpendicular to both a and b and positive by the rightwhere c hand rule. The right-hand rule is dened according to the order of the vectors: the is obtained by rst aligning the ngers of the right hand with a, holding direction of c is then in the direction the hand so that the ngers can curl into the direction of b. c of the thumb. in opposite directions for a b and b a, we see Since the cross product gives c that the cross product is not commutative, and in fact that a b = b a. The cross product is, however, distributive over addition. It is obvious from the denition that a a = 0. and (a + b) c = a c + b c

The cross (or vector) product is dened only for vectors of length 3. It is dened

B.2

Coordinate Transformations

In this section, we will consider only three-dimensional coordinate frames, and will assume that the origin of each frame is at the same point. We are interested in the ways that a single vector may be expressed in each frame, and how the two expressions are related.

B.2.1
Y
B

Single-axis Rotations
YA r

XB XA

Consider the situation of Figure B.1. The sketch shows two coordinate frames, with a common origin, both with the Z -axis out of the page. The two are related by the rotation angle ; the frame superscripted B can be obtained by rotating the frame superscripted A through about the Z -axis. Suppose that the vector r is expressed in the original frame as
A A A A A A k . r = r1 + r2 + r3

Figure B.1: Rotation about the Z -axis.

The same vector will be expressed in the rotated frame as


B B B B B B k r = r1 + r2 + r3

B.2: Coordinate Transformations

197

where of course the new set of unit vectors are aligned with the rotated axes. Now, B by denition r1 = r B . We can thus write
A A A A A A B A A A A A A k B k B = r1 B + r2 B + r3 r1 = r1 + r2 + r3

(B.1)

and recalling the denition of the dot product, we can express this as
B A A A r1 = r1 cos + r2 cos(/2 ) + r3 cos(/2) A A A = r1 cos + r2 sin + r3

Similarly, we have
B r2 = A A A A A A k B r1 + r2 + r3

A A A A A A = r1 B + r2 B + r3 k B A A A = r1 cos(/2 + ) + r2 cos + r3 A A A = r1 sin + r2 cos + r3 B A And obviously r3 = r3 . Putting these into matrix form, we have

Using the notation rA to mean the vector r expressed in relation to the unit vectors of frame A, we can write rB = LBA rA A to those where the rotation matrix LBA transforms the coecients of A , A , and k B . Note that despite the dierent way of expressing it, the matrix multiplying B , B , k clearly must be the same in both (B.2) and (B.3). Now, a vector described in the rotated frame B can be related to the unit vectors of frame A in an identical fashion. To use the form of (B.2), we have A B A B A r1 A B A B r = A 2 A A A r3 B k B k B B k A r1 B B k A r2 B B k A r3 k

B A B r1 B A r = B 2 B B r3 A k cos sin = 0

A A B r1 A B k A A B A B r k 2 A A B A B r3 k k k A sin 0 r1 A cos 0 r2 A r3 0 1

(B.2)

(B.3)

(B.4)

It is obvious that the matrix in (B.4) is the transpose of that in (B.2). We could come to the same conclusion by noting that the tranformation from B to A would require

198

Appendix B: Review of Vectors and Kinematics for in (B.3) produces B 0 r1 B r 0 2 B r3 1

a rotation through about Z. Substituting A cos sin r1 A cos r = sin 2 A 0 0 r3

which again is the transpose of the earlier matrix.

B.2.2

Euler Rotations

Consider any two rectilinear coordinate systems with a common origin. Euler showed that the axes of one of them can be rotated to align with those of the other by not more than three rotations, each about a unique axis. This result is usually implemented by dening the three axis in advance. The values of the three rotation angles then fully describe the orientation of one coordinate system with respect to the other. The angles are often referred to as the Euler angles.

B.3

Kinematics

We now review some basic facts and dentions from kinematics, the study of objects in motion. We will generally consider motion in three dimensions, but will also often restrict the motion to plane. Planar motion can be considered a special case of threedimensional motion, and all general statements about three-dimensional motion apply to the simpler case.

B.3.1

Position and Velocity


Z

Y X

For our purposes, we will consider motion in a rectangular coordinate frame, and we will usually denote the axes as X, Y, and Z . The position of some point in the frame is described by a vector of length three, as r1 x r2 y r= = = x + y + zk r3 z where the three entries are the displacement of the point from the origin along the X , Y , and Z axes, respectively.1

Figure B.2: Position vector.

1 The notation for position and velocity is variable. It is common to see rx to mean the xcomponent of r, and ri for the same quantity (the i from the unit vector ) is equally common.

B.3: Kinematics

199

The velocity of the point in the frame is by denition the time rate of change of the position: dr =r v= dt Recalling the usual denition of a derivative, we can write this as v = lim r + r t0 t

where t is a scalar. Since division of a vector by a scalar is done element-by-element, it follows that derivatives are taken element-by-element, as well. , it is not true that v = r Note that while v = r . The acceleration is dened as the time rate of change of velocity.

B.3.2

Motion in a Rotating Frame


ZA

YD XD

YA

XA

ZD

Z Z

F Y Y A X XA
1 1

In our kinematics review, we occasionally expressed the same vector in dierent coordinate systems. This is what a coordinate transformation does. We will explore the topic in general now. Suppose we have two coordinate systems, say A and D, and we will assume that they are coincident - the origins are at the same point. It can be shown that the axes of D can be rotated into alignment with A and vice versa, through a series of 3 angular rotations. There are many possible rotations we could use. Also, even using the same rotations, the order is important; dierent orders produce dierent results. Suppose that we rst rotate about the vertical axis through an angle . Alternatively, consider the frame XB YB Z B , in which ZB is collinear with Z. Let r be some vector, which we write as A r = r1 i A + r2 jA + r3 k

Figure B.3: PLEASE FIND A NAME How would r be expressed in the he X 1 Y 1 Z 1 system? We know that we can write FOR ME 1 )k 1 r = (r i1 ) i1 + (r j 1 ) j 1 + (r k

200

Appendix B: Review of Vectors and Kinematics

Consider the rst of these. Expanding we have A r = (r1 iA i1 ) i1 + (r2 jA i1 ) i1 + (r3 k i1 ) i1 + . . . A is normal to Now, iA i1 = | iA | | i1 | cos . Similarly, jA i1 = + sin . And, since k 1 1 i , we get kA i = 0. 1 terms gives Expanding for j 1 and k 1 + r2 sin r = r1 cos I i1 1 r2 sin j 1 + r2 cos j 1 + r3 k and this is, of course, the expression of r in the B coordinate system. We can write this in vector-matrix form as cos sin 0 r1A r1B r2B Br = = sin cos 0 r2A = LBA rA r3B 0 0 0 r3A

(The subscript on L means A to B.) LAB is the transformation matrix from A to B. Suppose that instead of rotating about Z, we had rotated through an angle about Y. Our sketch might look like and through manipulation similar to ZB those we just performed, we nd that cos 0 sin ZC r rB = rC = LCB rB 1 0 rc = 0 sin 0 cos A rotation about an X axis, say through an angle , would give the transformation YB YC 1 0 0 rD = 0 cos sin rC = LDC rC 0 sin cos

Now XB consider two successive rotations, as with a radar dish or an aircraft. First, we XC Z through an angle . Then we rotate about the moved Y axis through rotate about the angle . (In actuality of course, we would do the rotations simultaneously to Figure a B.4: I AM NOTHING WITH follow target. But we can do them OUT separately for the purpose of initial aiming.) A NAME Now we have a target at some point rC in the radar frame. Looking at the process of moving from one frame (the inertial) to another, we can dene an intermediate frame,B, after the rst rotation. It is in this intermediate frame that the second rotation is performed.

B.3: Kinematics

201

Since we have an intermediate frame, we can express r in that frame. Thus we can write rC = LCB rB But we also have rB = LBI rI so rc = LCB LBI rI 1 = rI = (LCB L BI rC This continues through a third rotation, if necessary. Thus, we can dene the complete transformation matrix from A to D through rD = LCD LCB LBA rA = LDA = LDC LCB LBA It must be stressed that the rotations need not be about and X,Y, and Z axis. We could just as easily use, say about ZA , about YB , and about ZC . The only rule is that we cannot use the same axis twice, since a rotation about an axis does not move that axis.

B.3.3

Direction Cosines
B )k B rB = (r iB ) iB + (r jB ) jB + (r k

We noted before that any vector can be written as

Examining LBA and LAB , and recalling that a b = b a, we see by inspection that LBA = (LT AB

Each entry in this matrix is the cosine of some angle, and these are thus called direction cosines. Note that the transformation from B to A is accomplished by a very similar matrix: B iA iB iA jB iA k B = LAB rB rA = jA iB jA jB jA k A A A k B k iB k jB k

and, in particular, that this can be expanded into the form A iB iA iB jA iB k A rB = jB iA jB jA jB k B B B k A jA k k iA k

202

Appendix B: Review of Vectors and Kinematics

B.3.4

Inversion
rA = LAB rB = rB = (LAB )1 rA = LBA rA

Now, we know that

assuming that LAB is invertible, which it always is. However, we just showed that LAB = LT AB , so we have that (LAB )1 = (LA B )T a result that makes life much easier in many cases.

Appendix C

Series Expansions

This chapter presents a brief review of two expansion techniques that will be of use in the text. The rst, often known as Taylors Expansion, is likely to be well-known to the reader. The second technique, attributed to Lagrange, is not is not as well known in general.

C.1

Taylor Expansions

Taylor Expansions is the term often used to refer to applications of Taylors theorem to create a polynomial representation of a function. Begin by reviewing the denition and the simplest properties of the expansion for functions of one variable. Denition C.1: Consider the function f (z ). Let z = w + , where w is xed. If f (z ) has an nth derivative at w its Taylor expansion of degree n about w is the polynomial in written as P () = f (w) + f (w) + 1 1 f (w)2 + . . . + f (n) (w)n . 2! n! (C.1)

The expansion is valid for any so long as all the needed derivatives exist on the interval between w and w + . More formally, Taylors theorem can be stated as Theorem C.1: If f has a continuous nth derivative in a neighborhood of w, then in that neighborhood f (z ) = f (w) + where Rn = 1 1 f (w)(z w) + . . . + f (n) (w)(z w)n + Rn , 1! n!
z w

(C.2)

1 (n 1)!

(z t)n1 f (n) (t) f (n) (w) dt.

(C.3)

203

204

Appendix C: Series Expansions

This is often known as the Integral remainder form of Taylors theorem, and other similar names. The importance of Taylors theorem lies in the remainder term, which can be shown to go to zero as goes to zero, and to do so faster than any of the terms of the polynomial. The proof of the theorem is not dicult, but is not necessary for its application, so we omit it.

C.2

Lagrange Expansions

The Lagrange expansion is similar to the Taylor expansion in that it uses the derivatives of the function in the expression. Consider the function F (z ) where z = w + (z ). Then F can be expressed as a function of w and as F (z ) = F (w) + (w)F (w) + + + Here, F (z ) = dF dz = F (w) = dF dz .
z =w

(C.4)

n+1 n {(w)}n+1 F (w) + n (n + 1)! w

2 {(w)}2 F (w) 2 w (C.5)

This allows the function to be expressed as an expansion in about its value at w. It can of course be shown that the Lagrange expansion produces the same expression as does the Taylor expansion, since there can only be one polynomial expansion for any function. It can naturally also be shown that the expansion converges for suciently small. The Lagrange expansion is particularly useful in astrodynamics because the eccentric anomaly is conveniently expressed as E = M + e sin E (C.6)

so that M takes the place of w, e is the small parameter ( in eqn. C.5) and (z ) = sin E . Then F (w) = M F () = 1 and (w) = sin M. The expansion for eccentric anomaly about the mean anomaly then becomes E = M + e sin M + e2 e3 2 sin3 M + sin2 M + 2 M 6 M 2 (C.7)

C.3: Some Trigonometric Identities Taking the derivatives, and appealing to some trigonometric identities, produces sin2 M = 2 sin M cos M = sin 2M M 2 3 sin3 M = 6 sin M cos2 M 3 sin3 M = (3 sin 3M sin M ) 2 M 4 The rst few terms of the expansion are then E = M + e sin M +

205

(C.8) (C.9)

e2 e3 sin 2M + (3 sin 3M sin M ) + (C.10) 2 8 The amount of work (largely manipulation of trigonometric identities) increases rapidly with each term in the expansion. However, for small eccentricities the series converges rapidly, and only a few terms are needed.

C.3

Some Trigonometric Identities

Here, we list some trigonometric identities that are used elsewhere in the text. We also give the expansions of some common functions.

C.3.1

Identities

The following are often useful in simplifying expressions involving many trigonometric terms. sin2 + cos2 = 1 sin( + ) sin( ) cos( + ) cos( ) = = = = sin cos + cos sin sin cos cos sin cos cos sin sin cos cos + sin sin

sin2 = (1 cos 2)/2 cos2 = (1 + cos 2)/2

C.3.2

Expansions
ex = 1 + x + x2 /2 + x3 /3! + + xk /k ! + sin x = x x3 /3! + x5 /5! + + (1)k+1 x2k+1 /(2k + 1)! cos x = 1 x2 /2 + x4 /4! + + (1)k x2k /(2k )!

In each of the following series, k = 0, 1, 2, . . ..

Appendix D

A Brief Review of System Stability

This chapter provides a brief review of some of the most basic ideas of the stability of dynamical systems. It will in no way serve as a complete introduction to the topic, but should serve to refresh memories and to make comprehensible the ideas that will be presented concerning the stability of the Lagrange points. We will consider systems of two variables, x and y . These variables will be known as the state variables, as they describe the state of the system. Two variables are enough to show the basic ideas of stability, which are applicable to systems of any nite size, while allowing the variables to be dealt with explicitly.

D.1

Problem Formulation and Equilibrium Points

Consider the dynamical system described by the two rst-order dierential equations x = f (x, y ) y = g (x, y ) (D.1) (D.2)

We will be concerned solely with the stability of autonomous systems. The term autonomous means that the independent variable t does not appear explicitly in the equations of motion. The usual spring-mass-damper system is an example of an autonomous system; a rocket, in which the mass changes explicitly with time, is an example of a non-autonomous system. We will assume that there exists a point ( x, y ) such that the functions f and g above are zero; that is, f ( x, y ) = g ( x, y ) = 0

207

208

Appendix D: A Brief Review of System Stability

This clearly means that the derivatives of the state variables x and y are zero. This is what is meant by the term equilibrium point, a point at which the states of the system are constant.

D.2

Motion Near an Equilibrium Point

Supposing we can nd a point ( x, y ) as described above, we would like to know how the system behaves in the neighborhood of this point. This information is generally necessary because the mathematical equations we use to describe physical systems are seldom exact, so that we only know the equilibrium points to a good approximation, or because we lack the ability to place the system precisely at the equilibrium point, or any combination of these and a host of other reasons.

D.2.1

Linearizing the System

To begin our investigation of the system near the equilibrium point, we linearize the system. This is a process by which the behavior of the nonlinear system is described in the vicinity of the equilibrium point through a linear approximation of the system. Given the point ( x, y ), we make the denitions x = x + y = y + Taking the derivatives of these equations gives = f (x, y ) = f ( x = x + , y + ) y = = g (x, y ) = g ( x + , y + ) and y do not exist because the values x where the terms x and y are constants, and thus do not have derivatives. We now use Taylors Theorem to write the functions f and g as expansions about the equilibrium point (we ignore questions of dierentiability of the functions at the equilibrium point, tacitly making whatever assumptions are necessary to complete the analysis). We have for f , for example, f (x, y ) = f ( x + , y + ) f f = f ( x, y ) + + x x y ,y + 1 2 2f x2 2 +
x ,y

x ,y

2f y 2

2 +
x ,y

2f xy

x ,y

D.2: Motion Near an Equilibrium Point

209

Now, we are interested in the behavior of the system only near the equilibrium point, so we can safely assume that and are small. Since multiplying two small things together gives an even smaller result, we say that all higher-order terms, meaning terms in which or are squared, or multiplied by each other, are so small as to be negligible. Thus, we ignore all terms beyond the rst derivative in the expansion we have just performed. A second simplication can also be done. Recall that the denition of the point ( x, y ) is that f ( x, y ) = 0. Thus, another term drops out of the expansion, and when we have we plug it into our expression for = f x Similarly, the equation for is = g x +
x ,y

+
x ,y

f y

x ,y

g y

x ,y

The important thing about this formulation is that the derivatives are evaluated at the known point ( x, y ). This means that they are known constants, so we can write the dierential equations for and in matrix form as where a= f x f y = a b c d g x g y

b=
x ,y

c=
x ,y

d=
x ,y

x ,y

and we write this in the generic form = Ar r (D.3)

D.2.2

Solution of the Linearized System

Linearizing the system about the equilibrium point has produced what is called a constant-coecient linear system of equations. This is the vector version of the wellknown scalar equation x = ax (D.4) The solution to (D.4) is x(t) = x eat , where x is the initial condition. Knowing this, we decide to try a solution to (D.3) of the form r(t) = ret (D.5)

210

Appendix D: A Brief Review of System Stability

where is a constant to be determined. Plugging (D.5) into (D.3) gives d ( ret ) = A ret dt ret = A ret (A I ) ret = 0

(D.6)

Consider (D.6). We know that et = 0 for all values of and t, because the exponential is non-zero for all nite numbers, real or complex. Thus. if our equation is to be satised, we require (A I ) r=0 (D.7) Further, r = 0 is the trivial case, in which r(t) = 0 for all t. This is a valid solution, but not an interesting one; it basically agrees that if the system is at zero to start with, it will stay there. Now we recall something we learned in linear algebra class. That is, Theorem D.1: Given that M x = 0, where M is a square matrix and x is a vector of compatible dimension, either or both of the following must be true: 1. The vector x = 0, or 2. The matrix M is singular; that is, the determinant of M is zero.

We have already decided that we dont want to bother with r = 0, so that leaves the second possibility; we can satisfy (D.7) if the determinant |A I | is zero. The values of for which |A I | = 0 are known as the eigenvalues of the matrix A. We will not go into great detail about eigenvalues here, though they are of great import in the study of matrices, linear systems, and linear algebraic systems of all types. Instead, we simply note a couple of facts about eigenvalues that apply to linear systems of the type we are considering here. 1. A square matrix with n rows has exactly n eigenvalues. 2. Eigenvalues are in general complex. The rst of these facts means that for a system of two equations such as the one we are considering, there will be two solutions. The second means that the solutions will generally be described by the exponentials of complex numbers. For a given eigenvalue, we can return to eqn. (D.7) and nd a vector r that satises the equation. Such a vector is known as the eigenvector corresponding to the given eigenvalue. Because the matrix A I is singular, r is not quite unique; the values of the entries in the vector relative to each other are xed, but the magnitude

D.2: Motion Near an Equilibrium Point

211

of the vector itself is arbitrary. To see this, consider some eigenvector multiplied by some non-zero scalar , and plug this into eqn. (D.7): (A I )( r) = (A I ) r Since we already know that (A I ) r = 0, this is satised for any value of . Since there are n eigenvalues, it follows that there are also n corresponding eigenvectors, and that each may have some arbitrary magnitude. Consider some general combination of the eigenvalue/eigenvector pairs, each with its scalar multiplier: r(t) = 1 r1 e1 t + 2 r2 e2 t + + n rn en t (D.8)

If we plug this back into the original equation (D.3), we have (after taking derivatives and re-arranging) 1 (1 r1 A r1 )e1 t + 2 (2 r2 A r2 )e2 t + + n (n rn A rn )en t = 0 (D.9)

We already know that each of the terms in parentheses is equal to zero, because each r is a solution to the original dierential equation. Thus eqn. (D.9) is satised, and the linear combination (D.8) is a solution to the dierential equation (D.3).

D.2.3

Motion Near the Equilibrium Point

Consider now what the solution of our linear system means. We wrote the system as = Ar, where r r= What this means is that r is the vector from the equilibrium point ( x, y ) to the nearby point ( x + , y + ). We would like to know what happens as time passes; that is, how does r(t) change as t ? Each possible solution, we said, is of the form r(t) = ret , and any linear combination of these solutions is also a solution. It can also be shown that for a system of this type, all possible solutions are contained in the set of such solutions. Because this system is second-order, there are two solutions, and any possible solution can be written in the form r(t) = 1 r1 e1 t + 2 r2 e2 t (D.10) There are two terms on the right side of eqn. (D.10). Each term describes a particular mode of the system. The discussion of the system modes is beyond our needs here. It is sucient to say that the general motion is a combination of the separate modes. The contribution of each mode to the motion changes with time, as shown my the exponential term. The values of the constants 1 and 2 are dened by the initial conditions. Since in general we cannot specify the initial conditions, we need to be concerned with all modes.

212

Appendix D: A Brief Review of System Stability

The analysis of any particular mode is identical to that of any other, so instead of worrying about which eigenvalue we are dealing with, we will use and it will be understood that any of the values of that cause (D.7) to be satised can be used. Since in general may be complex, we write it as = p + qi where p and q are real and i = 1. Then

r(t) = ret = rept eqit Now, eqit is a complex term, and always has magnitude one in the complex plane. r is constant, so it does not contribute to the behavior in any meaningful way. Thus, the term with which we are most concerned is ept .

D.2.4

Stability

By stability, we mean the property that, if we begin with (x, y ) near the equilibrium point ( x, y ), it stays near the equilibrium point. For this simple two-dimensional system, this can be expressed mathematically as follows. Denition D.1: The system x = f (x, y ) y = g (x, y ) is said to be stable at the point ( x, y ) if for every > 0, there exists some > 0 such that x(0) x + y (0) y < for all time t > 0. = x(t) x + y (t) y <

Note that the denition does not require that both x and y go back to x and y . The extension of the denition to larger systems is obvious. The reason for concern with ept is because it is the only term in the description of r(t) that may grow. Suppose p > 0. Then as t , pt . But the magnitude of r(t) is, we have argued, proportional to ept . And if pt gets large, the exponential of pt is not only going to get large, it is going to get large very quickly indeed. But suppose p < 0. Then t causes pt , and therefore ept 0. That means that both and are approaching zero as time increases, which in turn means that the point (x, y ) = ( x + , y + ) ( x, y ). What this means in English is that we started near the equilibrium point, and as time goes on we get closer to it. The nal possibility is that p might be identically zero. Then ept = e0 = 1, and the vector r(t) is an oscillator with constant magnitude. This isnt as good as going

D.2: Motion Near an Equilibrium Point

213

to zero, but at least it means that r doesnt grow, so that the pair (x, y ) stays in the vicinity of ( x, y ). Since the general solution of the problem includes all the eigenvalues and eigenvectors, the only way we can be certain of stability is to have all of the eigenvalues satisfy the requirements. In general, we say that a linear system is asymptotically stable if all of the eigenvalues have negative real parts, neutrally stable if most of them have negative real parts and the others have zero real part, and unstable if any of the eigenvalues have positive real parts. Note that since the system is normalized such that the equilibrium point is at r = 0, asymptotic stability means that eventually the state of the system will converge to zero. Neutral stability does not make this guarantee; since some parts of the solution will oscillate forever, neutral stability only implies that the state of the system will not get arbitrarily large.
Example D.1. Consider the second-order system = k cos c w (k , c, and w constants that describe the physical system). By letting x = , y = ,

this can be re-written as the two rst-order equations x = y y = k cos x cy w Find the equilibrium points of the system. Linearize the system and construct the eigenvalue equation. Then, taking the values k = 2, w = 1, show that the equilibrium points are ( x, y ) = (/4, 0) and (/4, 0). Analyze these points, showing that ( x, y ) = (/4, 0) is unstable for all values of c, and that ( x, y ) = (/4, 0) is stable for all c 0. Setting the velocities to zero gives x =0=y = = y =0

as a requirement for any equilibrium point. Using this in the second equation gives 0 = k cos x w cos x = w/k

Therefore the equilibrium points are any pair ( x, y ) = (cos1 (w/k ), 0). To linearize, write the system in the form x = f (x, y ) y = g (x, y )

214

Appendix D: A Brief Review of System Stability


and express the variables x and y as x=x + and y=y + .

Then we expand about the equilibrium points, keeping only the rst terms in the expansion (the linear terms), to get =
f x g x f y g y


( x,y )

In this case, the functions and their partial derivatives are f (x, y ) = y f /x = 0 f /y = 1 The eigenvalue equation is then |I A| = 0 0 0 1 k sin x c = ( + c) + k sin x =0 g (x, y ) = k cos x cy w g/x = k sin x g/y = c

For the given values of k and w, the equation for y (recalling y must always be zero) becomes y = 2 cos x 1 = 0 = cos x = 2/2 The possible values for x are /4, as requested in the problem statement. Consider the equilibrium point (/4, 0). The eigenvalue equation becomes 2 + c + 2 sin(/4) = 2 + c 1 = 0. Solving gives 1 c c2 + 4 . 2 Considering the term under the radical, we see that it is always positive, and that = c2 + 4 > c2 = c 2 + 4 > |c |.

Since the value under the radical is positive, both of the eigenvalues are real. Due to the magnitude of the term under the radical, one of the eigenvalues is certain to be positive, regardless of the value of c. Therefore, since a positive real part to an eigenvalue means that the system is unstable, this is an unstable equilibrium point. For the equilibrium point ( x, y ) = (/4, 0), we have the eigenvalue equation for this equilibrium point to be 2 + c + 1 = 0 from which the eigenvalues are = 1 c 2 c2 4 .

D.3: Second-Derivative Systems

215

The system is unstable if there is a value of c for which one of the two roots is real and positive. Consider the three possibilities (a) c2 < 4 (b) c2 4 (c) c = 0 In the rst case, the term under the radical is negative, so the sign of the real part depends only on the sign of c. This will be negative if c > 0, so at least for c < 2, the system is stable for positive c. In the second case, the term under the radical is positive (or zero), and has magnitude less than c2 . Therefore, both roots will be real and have the same sign, and will be negative if c > 0. Finally, in the third case, the roots are = i, which is purely imaginary. In this case, the system is neutrally stable. Summarizing these results, the system is stable if c 0.

D.3

Second-Derivative Systems

The system we looked at in the last several pages was described by a pair of rstorder equations; that is, there were only rst derivatives with respect to time involved. For various reasons, we will want to examine systems of second-order equations; in particular, the equations of the three-body problem will be analyzed in their secondorder form. In systems described by second-order equations, we have both velocity and acceleration for each of the state variables. With the two states x and y , this leads to the system x = f (x, x, y, y ) y = g (x, x, y, y ) (D.11) (D.12)

An equilibrium point again requires that the states of the system be constant. Therefore the velocities and the accelerations will be zero. 1 Assume as before that such a point is found. As in the last section, we consider motion near the point by expanding about it, but in this case we allow for small velocities as well as small displacements. Again using and , we have x = x + y = y + = = ; x = y = ; x = y =

1 In many applications, it is enough to nd a point of pseudo-equilibrium, at which only the accelerations are required to be zero. This will not be the case in the applications we consider.

216

Appendix D: A Brief Review of System Stability

Proceeding as for systems of rst-order equations, we take the Taylor expansion about the equilibrium point, drop the higher-order terms, and eliminate the values that we know are zero. The only complication is that we must take partial derivatives with respect to the velocities as well as the positions. This is not dicult, however. When put into matrix form, the resulting equations are =
f x g x f y g y

x ,y

f x g x

f y g y

x ,y

(D.13)

We are now looking at systems of the form + Br r = Ar (D.14)

Again, we will assume a solution of the form r(t) = ret Plugging this into (D.14), we have ret = A ret + B ret 2 (2 I A B ) r=0 Again, we are interested in the values of for which the determinant of the matrix is zero; |2 I A B | = 0 (D.15) In the case we looked at previously, we said that for a system of two equations, there would always be two eigenvalues. In this case, the system may still have only two equations, but each equation is second order. That means that each equation contributes two eigenvalues, and there are a total of four values of for which (D.15) is satised. Since the solutions are of the same form as those for the rst-derivative system, the stability analysis is identical. Again, we arrive at the requirement that the eigenvalues all have non-positive real parts for stability.

D.3.1

Motion near the Equilibrium Point

As with the system of rst-order equations, the response of the second derivative system can be described as a linear combination, as in (D.8). Recall that each of the constant vectors describes a particular mode of motion. In this case, the modes will include both the states and and their derivatives. We include all of these in an augmented state vector, which we will denote R to avoid confusion. We have r = R= r

D.3: Second-Derivative Systems

217

Using this denition in (D.14), we have a system of four rst-derivative equations written as = 0 I R (D.16) R B A The matrix in this equation is 4 by 4, and thus has four eigenvalues. It can be shown that the eigenvalues are the same as those computed from (D.15). Further, the eignevectors associated with this system are those for the second-derivative system in its original form. Note that in general there will be four such eigenvectors for this , and system. This reects the fact that there are four variables ( , , ) for which initial values may separately be specied.

D.3.2

Stability of Non-dissipative Systems

One nal comment has to be made, this one concerning systems in which there are no friction or damping forces. Such forces are known as dissipative, because they dissipate energy from the system. To achieve asymptotic stability for a linear system, some such dissipative mechanism must exist. Remember that for asymptotic stability, the state of the system goes to zero. As an example, this might relate to a pendulum ceasing its oscillations and hanging still. In this case, both the potential energy and the kinetic have gone to zero. The primary dissipative mechanism is air resistance. If there is no dissipation, the total energy of the system can never decrease. What this means in practice is that the system either has kinetic energy, so that it is moving, or it has potential energy, and thus is about to move, or both. Orbital systems, in the idealized analysis, in general have no dissipative mechanism. In the restricted three-body problem, for instance, there is no dissipative term in the equations for motion near the equilibrium point. Therefore, there is no way to get asymptotic stability. The mathematical result of this is that the eigenvalues will not have negative real parts, which would imply such stability. Instead, the best that can be hoped for is neutral stability, with all eigenvalues having zero real part.

Appendix E

Tables

This appendix contains lists of physical constants and planetary information. The tables in various textbooks and sources disagree, often in the fourth or fth decimal place, sometimes in the third. The values given here are representative, and are the ones used in the examples in the text. Some of the information in planetary tables may seem inconsistent. For example, the period of a planets orbit calculated from the tabulated data may not agree precisely with the tabulated period. This will be due to both the rounding of the values listed in the table, and the fact that the two-body equations derived in the text do not allow for many very small perturbations that occur in fact. These are too small to be included in the calculations (and are unnecessary for our purposes), but are large enough to appear in the precisely measured values in the tables. The tables included here do not have the period listed; when needed, we will assume that the two-body calculation is suciently close. The sources of this information include several of the listed references. Also, some is taken from on-line sources such as: 1) Wikipedia A Free Encyclopedia at http://www.wikipedia.org

219

220

Appendix E: Tables

E.1

Basic Solar System Units


AU Gravitational parameter of the Sun 1.32715 1011 km3 /s2 Astronomical unit 1.495979 108 km

Note: The astronomical unit is no longer dened as the semimajor axis of the Earths orbit. It is now dened as the radius of the orbit of a particle of negligible mass about the Sun, with period one Gaussian year (365.2568983 mean solar days; this was the value calculated by Gauss as the length of the sidereal year). The dierence appears in the sixth signicant digit.

E.2

Earth Physical Data


r Gravitational parameter Equatorial radius Inclination of equator Sidereal year Mean solar day Sidereal day 3.98601 105 km3 /s2 6378.145 km 23.45 365.256 mean solar days 86400 mean solar seconds 23h 56m 04s mean solar time

Notes: 1. The polar radius of the Earth is slightly (about 20 km) less than the equatorial. 2. In addition to the mean solar second (dened as 1/86400 mean solar days), there is also a sidereal second, which is 1/86400 sidereal days. One mean solar second = 1.0027379093 sidereal seconds.

E.3
E.3.1

Planetary Data
Orbital Characteristics

The following data is taken from Danby [6], with a couple of corrections from Vallado [16]. With the exception of the data for Pluto, It is referenced to the epoch January 1, 2000. Pluto data is taken from the Astronomical Almanac for 1995.

E.4: The Moon Planet semi-major eccentricity inclination axis (AU) (degrees) 0.3871 0.7233 1.0000 1.5237 5.2026 9.5381 19.1833 30.0551 39.5376 0.2056 0.0068 0.0167 0.0934 0.0485 0.0555 0.0463 0.0090 0.2509 7.005 3.395 0.000 1.850 1.303 2.489 0.773 1.770 17.13

221


E.3.2

Mercury Venus Earth Mars Jupiter Saturn Uranus Neptune Pluto

Physical Data
Planet Equatorial Mass radius (Earth = 1) (km) 2439 6051 6378 3393 71400 60000 25400 25295 1500 0.056 0.817 1.000 0.108 318.0 95.2 14.6 17.3 0.0022

Mercury Venus Earth Mars Jupiter Saturn Uranus Neptune Pluto

E.4

The Moon
r a Radius 1738 km Semimajor axis 384400 km Mass 0.0123 Earth mass

Index

angular momentum, 8, 24 apoapsis, 14 Aries, rst point of, 56 ascending node, 58 Astronomical symbols, 3 atmospheric pressure, 173 Barkers equation, 41 bi-elliptic transfer, 77 bisection, 42 Brahe, Tycho, 1 conservative eld, 141 Copernicus, Nicolaus, 1 disturbing potential, 149 drag, 173 eccentric anomaly, 39 eccentricity, 13, 49, 58 energy, 7, 16 ephemeris, 119 epoch, 59 equilibrium point, 202 equinox autumnal, 56 vernal, 56 ight path angle, 15 force eld, 141 gravitational eld, 144 Gregorian calendar, 120 Heinlein, Robert, 165 Hohmann transfer, 74

hyperbolic cosine, 46 hyperbolic sine, 45, 46 hyperbolic tangent, 46 impulsive maneuver, 69 inclination, 58 Jacobi, 135 Jacobis Integral, 135 Julian day, 120 Kepler, Johannes, 1 kinetic energy, 7, 11 Lagrange points, 132 Laplace vector, 9 latitude (argument of), 60 latus rectum, 13 Libra, rst point of, 56 Libration points, see Lagrange points longitude, true, 60 Mars, 26, 45 mass fraction, 176 payload ratio, 177 structural factor, 176 mean motion, 33, 38 Newton Newton, Isaac, 2 Second law, 2 Newton-Raphson, 43 orbit equation, 10 orbital elements classical, 57

222

Index Delauney elements, 66 equinoctial, 65 mean planetary, 122 special cases, 60 orbital plane, 9 order analysis, 163 periapsis, 14 argument of, 58 Time of p. passage, 59 Phobos, 26 potential energy, 11, 144 potential function, 144 Ptolemy, 1 rocket multi-stage rocket, 178 single stage, 176 sounding rocket, 167 rocket equation, 167 Runge-Lenz vector, see Laplace vector semi-latus rectum, 13, 58 semimajor axis, 14, 47, 58 semiminor axis, 14 sensitivity analysis, 84 solstice, 56 specic energy, see energy specic impulse, 175, 182 Sphere of Activity, 93 Sphere of Inuence, 92 Taylor expansion, 43, 197 thrust, 166, 182 pressure thrust, 172 Tisserand, 93 true anomaly, 11 Ulysses, 113 Universal gravitation, law of, 2 Voyager, 127 zodiac, 56

223

You might also like