Least-squares policy iteration algorithms for robotics: Online, continuous, and automatic. (August 2019)

Record Type:: Journal Article
Title:: Least-squares policy iteration algorithms for robotics: Online, continuous, and automatic. (August 2019)
Main Title:: Least-squares policy iteration algorithms for robotics: Online, continuous, and automatic
Authors:: Friedrich, Stefan R.
Schreibauer, Michael
Buss, Martin
Abstract:: Reinforcement learning (RL) is a general framework to acquire intelligent behavior by trial-and-error and many successful applications and impressive results have been reported in the field of robotics. In robot control problem settings, it is oftentimes characteristic that the algorithms have to learn online through interaction with the system while it is operating, and that both state and action spaces are continuous. Least-squares policy iteration (LSPI) based approaches are therefore particularly hard to employ in practice, and parameter tuning is a tedious and costly enterprise. In order to mitigate this problem, we derive an automatic online LSPI algorithm that operates over continuous action spaces and does not require an a-priori, hand-tuned value function approximation architecture. To this end, we first show how the kernel least-squares policy iteration algorithm can be modified to handle data online by recursive dictionary and learning update rules. Next, borrowing sparsification methods from kernel adaptive filtering, the continuous action-space approximation in the online least-squares policy iteration algorithm can be efficiently automated as well. We then propose a similarity-based information extrapolation for the recursive temporal difference update in order to perform the dictionary expansion step efficiently in both algorithms. The performance of the proposed algorithms is compared with respect to their batch or hand-tuned counterparts in a …
Is Part Of:: Engineering applications of artificial intelligence. Volume 83(2019)
Journal:: Engineering applications of artificial intelligence
Issue:: Volume 83(2019)
Issue Display:: Volume 83, Issue 2019 (2019)
Year:: 2019
Volume:: 83
Issue:: 2019
Issue Sort Value:: 2019-0083-2019-0000
Page Start:: 72
Page End:: 84
Publication Date:: 2019-08
Subjects:: Reinforcement learning -- Policy iteration -- Continuous actions -- Robotics -- Sparsification
Engineering -- Data processing -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Ingénierie -- Informatique -- Périodiques
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
Artificial intelligence
Engineering -- Data processing
Expert systems (Computer science)
Periodicals
620.00285
Journal URLs:: http://www.sciencedirect.com/science/journal/09521976
http://www.elsevier.com/journals
DOI:: 10.1016/j.engappai.2019.04.001
Languages:: English
ISSNs:: 0952-1976
Deposit Type:: Legal deposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - 3755.704500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 10931.xml