An example of simulation-based optimization using a learned forward model.
This brief video shows a successful application of reinforcement learning to the design of a controller for sustained inverted flight of an autonomous helicopter. The authors began by learning a stochastic, nonlinear forward model of the helicopter’s dynamics. Then, a reinforcement learning algorithm was applied to automatically learn a controller for autonomous inverted hovering.
The video illustrates Section 15.2.5 -- Applications of Model Learning, Springer Handbook of Robotics, 2nd ed (2016);
Reference: A.Y. Ng, A. Coates, M. Diel, V. Ganapathi, J. Schulte, B. Tse, E. Berger, E. Liang: Autonomous inverted helicopter flight via reinforcement learning, IX Int. Symp. Exp. Robot. 2004, Springer Tract. Adv. Robot. 21, 363-372 (2006)