Max-Planck-Institut für Informatik
max planck institut
informatik
mpii logo Minerva of the Max Planck Society
 

MPI-INF or MPI-SWS or Local Campus Event Calendar

<< Previous Entry Next Entry >> New Event Entry Edit this Entry Login to DB (to update, delete)
What and Who
Title:Information-Geometric Policy Search for Learning Versatile, Reusable Skills
Speaker:Gerhard Neumann
coming from:University of Lincoln
Speakers Bio:
Event Type:Talk
Visibility:D1, D2, D3, D4, D5, RG1, SWS, MMCI
We use this to send out email in the morning.
Level:Public Audience
Language:English
Date, Time and Location
Date:Tuesday, 12 June 2018
Time:10:00
Duration:60 Minutes
Location:Saarbr├╝cken
Building:E1 5
Room:029
Abstract
In the future, autonomous robots will be used for various applications such as autonomous farming, handling dangerous materials as for example decommissioning nuclear waste, health care or autonomous transportation. For such complex scenarios, it is inevitable that autonomous robots are equipped with sophisticated learning capabilities which enable it to learn from human teachers as well as from self-improvement.

In this talk, I will present our work on information-geometric policy search methods for learning complex motor skills. Our algorithms use information-geometric insights to exploit curvature and path information in order to perform efficient local search at the level of single elemental motions, also called movement primitives. Simultaneously to local search, the algorithms search on a global level by selecting between distinct solutions, allowing us to represent a versatile solution space with high quality solutions. Our algorithms can be used to efficiently learn motor skills, generalize these motions to different situations, learn reactive skills that can react to perturbations and select and learn when to switch between these motions. I will also briefly show how to extend our algorithms to learn from preference-based feedback instead of a numeric reward signal, enabling a human expert to guide the learning agent without the need for manual reward tuning. While I will use dynamic motor games, such as table tennis, as motivation throughout my talk, I will also shortly present how to apply similar methods for robot grasping and manipulation tasks.

Contact
Name(s):Connie Balzert
Video Broadcast
Video Broadcast:NoTo Location:
Tags, Category, Keywords and additional notes
Note:
Attachments, File(s):

Created:
Connie Balzert/MPI-INF, 06/08/2018 12:20 PM
Last modified:
Uwe Brahm/MPII/DE, 06/12/2018 07:01 AM
  • Connie Balzert, 06/08/2018 12:20 PM -- Created document.