MPI-INF Logo
Campus Event Calendar

Event Entry

What and Who

Information-Geometric Policy Search for Learning Versatile, Reusable Skills

Gerhard Neumann
University of Lincoln
Talk
AG 1, AG 2, AG 3, AG 4, AG 5, RG1, SWS, MMCI  
Public Audience
English

Date, Time and Location

Tuesday, 12 June 2018
10:00
60 Minutes
E1 5
029
Saarbrücken

Abstract

In the future, autonomous robots will be used for various applications such as autonomous farming, handling dangerous materials as for example decommissioning nuclear waste, health care or autonomous transportation. For such complex scenarios, it is inevitable that autonomous robots are equipped with sophisticated learning capabilities which enable it to learn from human teachers as well as from self-improvement.


In this talk, I will present our work on information-geometric policy search methods for learning complex motor skills. Our algorithms use information-geometric insights to exploit curvature and path information in order to perform efficient local search at the level of single elemental motions, also called movement primitives. Simultaneously to local search, the algorithms search on a global level by selecting between distinct solutions, allowing us to represent a versatile solution space with high quality solutions. Our algorithms can be used to efficiently learn motor skills, generalize these motions to different situations, learn reactive skills that can react to perturbations and select and learn when to switch between these motions. I will also briefly show how to extend our algorithms to learn from preference-based feedback instead of a numeric reward signal, enabling a human expert to guide the learning agent without the need for manual reward tuning. While I will use dynamic motor games, such as table tennis, as motivation throughout my talk, I will also shortly present how to apply similar methods for robot grasping and manipulation tasks.

Contact

Connie Balzert
--email hidden
passcode not visible
logged in users only

Connie Balzert, 06/08/2018 12:20 -- Created document.