Probabilistic model-based imitation learning. (October 2013)
- Record Type:
- Journal Article
- Title:
- Probabilistic model-based imitation learning. (October 2013)
- Main Title:
- Probabilistic model-based imitation learning
- Authors:
- Englert, Peter
Paraschos, Alexandros
Deisenroth, Marc Peter
Peters, Jan - Abstract:
- Efficient skill acquisition is crucial for creating versatile robots. One intuitive way to teach a robot new tricks is to demonstrate a task and enable the robot to imitate the demonstrated behavior. This approach is known as imitation learning. Classical methods of imitation learning, such as inverse reinforcement learning or behavioral cloning, suffer substantially from the correspondence problem when the actions (i.e. motor commands, torques or forces) of the teacher are not observed or the body of the teacher differs substantially, e.g., in the actuation. To address these drawbacks we propose to learn a robot-specific controller that directly matches robot trajectories with observed ones. We present a novel and robust probabilistic model-based approach for solving a probabilistic trajectory matching problem via policy search. For this purpose, we propose to learn a probabilistic model of the system, which we exploit for mental rehearsal of the current controller by making predictions about future trajectories. These internal simulations allow for learning a controller without permanently interacting with the real system, which results in a reduced overall interaction time. Using long-term predictions from this learned model, we train robot-specific controllers that reproduce the expert's distribution of demonstrations without the need to observe motor commands during the demonstration. The strength of our approach is that it addresses the correspondence problem in aEfficient skill acquisition is crucial for creating versatile robots. One intuitive way to teach a robot new tricks is to demonstrate a task and enable the robot to imitate the demonstrated behavior. This approach is known as imitation learning. Classical methods of imitation learning, such as inverse reinforcement learning or behavioral cloning, suffer substantially from the correspondence problem when the actions (i.e. motor commands, torques or forces) of the teacher are not observed or the body of the teacher differs substantially, e.g., in the actuation. To address these drawbacks we propose to learn a robot-specific controller that directly matches robot trajectories with observed ones. We present a novel and robust probabilistic model-based approach for solving a probabilistic trajectory matching problem via policy search. For this purpose, we propose to learn a probabilistic model of the system, which we exploit for mental rehearsal of the current controller by making predictions about future trajectories. These internal simulations allow for learning a controller without permanently interacting with the real system, which results in a reduced overall interaction time. Using long-term predictions from this learned model, we train robot-specific controllers that reproduce the expert's distribution of demonstrations without the need to observe motor commands during the demonstration. The strength of our approach is that it addresses the correspondence problem in a principled way. Our method achieves a higher learning speed than both model-based imitation learning based on dynamics motor primitives and trial-and-error-based learning systems with hand-crafted cost functions. We successfully applied our approach to imitating human behavior using a tendon-driven compliant robotic arm. Moreover, we demonstrate the generalization ability of our approach in a multi-task learning setup. … (more)
- Is Part Of:
- Adaptive behavior. Volume 21:Number 5(2013)
- Journal:
- Adaptive behavior
- Issue:
- Volume 21:Number 5(2013)
- Issue Display:
- Volume 21, Issue 5 (2013)
- Year:
- 2013
- Volume:
- 21
- Issue:
- 5
- Issue Sort Value:
- 2013-0021-0005-0000
- Page Start:
- 388
- Page End:
- 403
- Publication Date:
- 2013-10
- Subjects:
- Animal behavior -- Periodicals
Animals -- Adaptation -- Periodicals
Adaptability (Psychology) -- Periodicals
Adaptation, Psychological -- Periodicals
Artificial intelligence -- Periodicals
591.5 - Journal URLs:
- http://adb.sagepub.com ↗
http://www.uk.sagepub.com/home.nav ↗ - DOI:
- 10.1177/1059712313491614 ↗
- Languages:
- English
- ISSNs:
- 1741-2633
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23239.xml