Chapter 15 — Robot Learning

Inverted helicopter hovering

An example of simulation-based optimization using a learned forward model. This brief video shows a successful application of reinforcement learning to the design of a controller for sustained inverted flight of an autonomous helicopter. The authors began by learning a stochastic, nonlinear forward model of the helicopter’s dynamics. Then, a reinforcement learning algorithm was applied to automatically learn a controller for autonomous inverted hovering. The video illustrates Section 15.2.5 -- Applications of Model Learning, Springer Handbook of Robotics, 2nd ed (2016); Reference: A.Y. Ng, A. Coates, M. Diel, V. Ganapathi, J. Schulte, B. Tse, E. Berger, E. Liang: Autonomous inverted helicopter flight via reinforcement learning, IX Int. Symp. Exp. Robot. 2004, Springer Tract. Adv. Robot. 21, 363-372 (2006)
Pieter Abbeel
352
Andrew Ng and Pieter Abbeel