750 likes | 978 Views
Basic Concepts in Control. 393R: Autonomous Robots Peter Stone. Slides Courtesy of Benjamin Kuipers. Good Afternoon Colleagues. Are there any questions?. Logistics. Reading responses Next week’s readings - due Monday night Braitenberg vehicles Forward/inverse kinematics
E N D
Basic Concepts in Control 393R: Autonomous Robots Peter Stone Slides Courtesy of Benjamin Kuipers
Good Afternoon Colleagues • Are there any questions?
Logistics • Reading responses • Next week’s readings - due Monday night • Braitenberg vehicles • Forward/inverse kinematics • Aibo joint modeling • Next class: lab intro (start here)
Controlling a Simple System • Consider a simple system: • Scalar variables x and u, not vectors x and u. • Assume x is observable: y = G(x) = x • Assume effect of motor command u: • The setpoint xset is the desired value. • The controller responds to error: e = x xset • The goal is to set u to reach e = 0.
The intuition behind control • Use action u to push back toward error e = 0 • error e depends on state x (via sensors y) • What does pushing back do? • Depends on the structure of the system • Velocity versus acceleration control • How much should we push back? • What does the magnitude of u depend on? Car on a slope example
Velocity or acceleration control? • If error reflects x, does u affect x or x ? • Velocity control: u x (valve fills tank) • let x = (x) • Acceleration control: u x (rocket) • let x = (x v)T
The Bang-Bang Controller • Push back, against the direction of the error • with constant action u • Error is e = x - xset • To prevent chatter around e = 0, • Household thermostat. Not very subtle.
Bang-Bang Control in Action • Optimal for reaching the setpoint • Not very good for staying near it
Hysteresis • Does a thermostat work exactly that way? • Car demonstration • Why not? • How can you prevent such frequent motor action? • Aibo turning to ball example
Proportional Control • Push back, proportional to the error. • set ub so that • For a linear system, we get exponential convergence. • The controller gain k determines how quickly the system responds to error.
Velocity Control • You want to drive your car at velocity vset. • You issue the motor command u = posaccel • You observe velocity vobs. • Define a first-order controller: • k is the controller gain.
Proportional Control in Action • Increasing gain approaches setpoint faster • Can leads to overshoot, and even instability • Steady-state offset
Steady-State Offset • Suppose we have continuing disturbances: • The P-controller cannot stabilize at e = 0. • Why not?
Steady-State Offset • Suppose we have continuing disturbances: • The P-controller cannot stabilize at e = 0. • if ub is defined so F(xset,ub) = 0 • then F(xset,ub) + d 0, so the system changes • Must adapt ub to different disturbances d.
Adaptive Control • Sometimes one controller isn’t enough. • We need controllers at different time scales. • This can eliminate steady-state offset. • Why?
Adaptive Control • Sometimes one controller isn’t enough. • We need controllers at different time scales. • This can eliminate steady-state offset. • Because the slower controller adapts ub.
Integral Control • The adaptive controller means • Therefore • The Proportional-Integral (PI) Controller.
Nonlinear P-control • Generalize proportional control to • Nonlinear control laws have advantages • f has vertical asymptote: bounded error e • f has horizontal asymptote: bounded effort u • Possible to converge in finite time. • Nonlinearity allows more kinds of composition.
Stopping Controller • Desired stopping point: x=0. • Current position: x • Distance to obstacle: • Simple P-controller: • Finite stopping time for
Derivative Control • Damping friction is a force opposing motion, proportional to velocity. • Try to prevent overshoot by damping controller response. • Estimating a derivative from measurements is fragile, and amplifies noise.
Derivative Control in Action • Damping fights oscillation and overshoot • But it’s vulnerable to noise
Effect of Derivative Control • Different amounts of damping (without noise)
Derivatives Amplify Noise • This is a problem if control output (CO) depends on slope (with a high gain).
The PID Controller • A weighted combination of Proportional, Integral, and Derivative terms. • The PID controller is the workhorse of the control industry. Tuning is non-trivial. • Next lecture includes some tuning methods.
PID Control in Action • But, good behavior depends on good tuning! • Aibo joints use PID control
Habituation • Integral control adapts the bias term ub. • Habituation adapts the setpoint xset. • It prevents situations where too much control action would be dangerous. • Both adaptations reduce steady-state error.
Types of Controllers • Open-loop control • No sensing • Feedback control (closed-loop) • Sense error, determine control response. • Feedforward control (closed-loop) • Sense disturbance, predict resulting error, respond to predicted error before it happens. • Model-predictive control (closed-loop) • Plan trajectory to reach goal. • Take first step. • Repeat. Design open and closed-loop controllers for me to get out of the room.
Dynamical Systems • A dynamical system changes continuously (almost always) according to • A controller is defined to change the coupled robot and environment into a desired dynamical system.
Time plot (t,x) Phase portrait (x,v) Two views of dynamic behavior
Phase Portrait: (x,v) space • Shows the trajectory (x(t),v(t)) of the system • Stable attractor here
In One Dimension • Simple linear system • Fixed point • Solution • Stable if k < 0 • Unstable if k > 0
In Two Dimensions • Often, we have position and velocity: • If we model actions as forces, which cause acceleration, then we get:
The Damped Spring • The spring is defined by Hooke’s Law: • Include damping friction • Rearrange and redefine constants
The Wall Follower (x,y)
The Wall Follower • Our robot model: u = (v)Ty=(y )T 0. • We set the control law u = (v)T = Hi(y)
The Wall Follower • Assume constant forward velocity v = v0 • approximately parallel to the wall: 0. • Desired distance from wall defines error: • We set the control law u = (v)T = Hi(y) • We want e to act like a “damped spring”
The Wall Follower • We want a damped spring: • For small values of • Substitute, and assume v=v0 is constant. • Solve for
The Wall Follower • To get the damped spring • We get the constraint • Solve for . Plug into u. • This makes the wall-follower a PD controller. • Because:
Tuning the Wall Follower • The system is • Critical damping requires • Slightly underdamped performs better. • Set k2 by experience. • Set k1 a bit less than
An Observer for Distance to Wall • Short sonar returns are reliable. • They are likely to be perpendicular reflections.
Alternatives • The wall follower is a PD control law. • A target seeker should probably be a PI control law, to adapt to motion. • Can try different tuning values for parameters. • This is a simple model. • Unmodeled effects might be significant.
Ziegler-Nichols Tuning • Open-loop response to a unit step increase. • d is deadtime. T is the process time constant. • K is the process gain. K d T
Tuning the PID Controller • We have described it as: • Another standard form is: • Ziegler-Nichols says:
Ziegler-Nichols Closed Loop • Disable D and I action (pure P control). • Make a step change to the setpoint. • Repeat, adjusting controller gain until achieving a stable oscillation. • This gain is the “ultimate gain” Ku. • The period is the “ultimate period” Pu.