Reinforcement learning of motor skills using Policy Search and human corrective advice. (December 2019)

Record Type:: Journal Article
Title:: Reinforcement learning of motor skills using Policy Search and human corrective advice. (December 2019)
Main Title:: Reinforcement learning of motor skills using Policy Search and human corrective advice
Authors:: Celemin, Carlos
Maeda, Guilherme
Ruiz-del-Solar, Javier
Peters, Jan
Kober, Jens
Abstract:: Robot learning problems are limited by physical constraints, which make learning successful policies for complex motor skills on real systems unfeasible. Some reinforcement learning methods, like Policy Search, offer stable convergence toward locally optimal solutions, whereas interactive machine learning or learning-from-demonstration methods allow fast transfer of human knowledge to the agents. However, most methods require expert demonstrations. In this work, we propose the use of human corrective advice in the actions domain for learning motor trajectories. Additionally, we combine this human feedback with reward functions in a Policy Search learning scheme. The use of both sources of information speeds up the learning process, since the intuitive knowledge of the human teacher can be easily transferred to the agent, while the Policy Search method with the cost/reward function take over for supervising the process and reducing the influence of occasional wrong human corrections. This interactive approach has been validated for learning movement primitives with simulated arms with several degrees of freedom in reaching via-point movements, and also using real robots in such tasks as "writing characters" and the ball-in-a-cup game. Compared with standard reinforcement learning without human advice, the results show that the proposed method not only converges to higher rewards when learning movement primitives, but also that the learning is sped up by a factor of 4–40 … (more)
Is Part Of:: International journal of robotics research. Volume 38:Number 14(2019)
Journal:: International journal of robotics research
Issue:: Volume 38:Number 14(2019)
Issue Display:: Volume 38, Issue 14 (2019)
Year:: 2019
Volume:: 38
Issue:: 14
Issue Sort Value:: 2019-0038-0014-0000
Page Start:: 1560
Page End:: 1580
Publication Date:: 2019-12
Subjects:: Reinforcement learning -- policy search -- learning from demonstrations -- interactive machine learning -- movement primitives -- motor skills
Robots -- Periodicals
Robots, Industrial -- Periodicals
629.89205
Journal URLs:: http://ijr.sagepub.com/ ↗
http://www.uk.sagepub.com/home.nav ↗
DOI:: 10.1177/0278364919871998 ↗
Languages:: English
ISSNs:: 0278-3649
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 11935.xml