Learning from Demonstrations and Human Evaluative Feedbacks: Handling Sparsity and Imperfection Using Inverse Reinforcement Learning Approach. (13th January 2020)
- Record Type:
- Journal Article
- Title:
- Learning from Demonstrations and Human Evaluative Feedbacks: Handling Sparsity and Imperfection Using Inverse Reinforcement Learning Approach. (13th January 2020)
- Main Title:
- Learning from Demonstrations and Human Evaluative Feedbacks: Handling Sparsity and Imperfection Using Inverse Reinforcement Learning Approach
- Authors:
- Mourad, Nafee
Ezzeddine, Ali
Nadjar Araabi, Babak
Nili Ahmadabadi, Majid
- Other Names:
- Li, Yangmin (Academic Editor)
- Abstract:
- Programming by demonstrations is one of the most efficient methods of knowledge transfer for developing advanced learning systems, provided that teachers deliver abundant and correct demonstrations and learners perceive them correctly. Nevertheless, demonstrations are sparse and inaccurate in almost all real-world problems, so complementary information is needed to compensate for these shortcomings. In this paper, we target programming by a combination of nonoptimal and sparse demonstrations and a limited number of binary evaluative feedbacks, where the learner uses its own evaluated experiences as new demonstrations in an extended inverse reinforcement learning method. This provides the learner with broader generalization and less regret, as well as robustness in the face of sparsity and nonoptimality in demonstrations and feedbacks. Our method relieves teachers of the unrealistic burden of providing optimal and abundant demonstrations. Evaluative feedback, which is easy for teachers to deliver, offers the opportunity to correct the learner's behavior in an interactive social setting without requiring teachers to know and use an accurate reward function. Here, we enhance inverse reinforcement learning (IRL) to estimate the reward function from a mixture of nonoptimal and sparse demonstrations and evaluative feedbacks. Our method, called IRL from demonstration and human's critique (IRLDC), has two phases. The teacher first provides some demonstrations for the learner to initialize its policy. Next, the learner interacts with the environment and the teacher provides binary evaluative feedbacks. Taking into account possible inconsistencies and mistakes in issuing and receiving feedbacks, the learner revises the estimated reward function by solving a single optimization problem. IRLDC is devised to handle errors and sparsity in demonstrations and feedbacks and can generalize over different combinations of these two sources of expertise. We apply our method to three domains: a simulated navigation task, a simulated car-driving problem with human interactions, and a navigation experiment with a mobile robot. The results indicate that IRLDC significantly enhances the learning process where standard IRL methods fail and learning from feedbacks (LfF) methods incur high regret. IRLDC also works well at different levels of sparsity and optimality of the teacher's demonstrations and feedbacks, where other state-of-the-art methods fail.
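The two-phase idea described in the abstract — initialize a reward from teacher demonstrations, then fold the learner's own binary-labeled episodes back in as additional signed evidence — can be sketched as follows. This is an illustrative toy only (a regularized, signed feature-matching objective over a linear reward r(s) = w·φ(s)), not the paper's actual optimization problem; every name here (`fit_reward`, `phi`, the trajectory lists) is invented for the sketch.

```python
import numpy as np

def fit_reward(phi, demo_trajs, fb_trajs=(), fb_labels=(),
               lam=0.5, lr=0.1, iters=100):
    """Fit a linear reward r(s) = w . phi[s] so that demonstrated and
    positively-evaluated trajectories score high and negatively-evaluated
    ones score low (toy objective, not the paper's formulation).

    phi        : (n_states, n_features) feature matrix
    demo_trajs : teacher demonstrations, each a list of state indices
    fb_trajs   : learner's own trajectories, judged by the teacher
    fb_labels  : binary evaluative feedback for each entry of fb_trajs
    """
    # Feature counts of each trajectory; demos count +1, feedback
    # trajectories count +1 (praised) or -1 (criticized).
    mus = [phi[list(t)].sum(axis=0) for t in demo_trajs] + \
          [phi[list(t)].sum(axis=0) for t in fb_trajs]
    signs = [1.0] * len(demo_trajs) + \
            [1.0 if y else -1.0 for y in fb_labels]
    w = np.zeros(phi.shape[1])
    for _ in range(iters):
        # Gradient of sum_i sign_i * (w . mu_i) - lam * ||w||^2
        grad = sum(s * mu for s, mu in zip(signs, mus)) - 2 * lam * w
        w += lr * grad
    return w

# 4 states with one-hot features (hypothetical toy domain)
phi = np.eye(4)
w = fit_reward(phi,
               demo_trajs=[[0, 1]],      # teacher demonstration
               fb_trajs=[[2], [3]],      # learner's own episodes
               fb_labels=[True, False])  # teacher's binary critique
```

After fitting, states visited by demonstrations or praised episodes receive a higher reward than criticized ones, which is the mechanism that lets sparse demonstrations and cheap binary feedback substitute for one another.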
- Is Part Of:
- Journal of robotics. Volume 2020(2020)
- Journal:
- Journal of robotics
- Issue:
- Volume 2020(2020)
- Issue Display:
- Volume 2020, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 2020
- Issue:
- 2020
- Issue Sort Value:
- 2020-2020-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-01-13
- Subjects:
- Robotics -- Periodicals
Robotics
Periodicals
629.892
- Journal URLs:
- https://www.hindawi.com/journals/jr/
- DOI:
- 10.1155/2020/3849309
- Languages:
- English
- ISSNs:
- 1687-9600
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 12905.xml