Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system. Issue 25 (2018)

Record Type:: Journal Article
Title:: Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system. Issue 25 (2018)
Main Title:: Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system
Authors:: Kim, Jong Woo
Park, Byung Jun
Yoo, Haeun
Lee, Jay H.
Lee, Jong Min
Abstract:: Abstract: Reinforcement learning (RL) can be used to obtain an approximate numerical solution to the Hamilton-Jacobi-Bellman (HJB) equation. Recent advances in machine learning community enable the use of deep neural networks (DNNs) to approximate high-dimensional nonlinear functions as those that occur in RL, accurately without any domain knowledge. In the standard RL setting, both system and cost structures are unknown, and the amount of data needed to obtain an accurate approximation can be impractically large. Meanwhile, when the structures are known, they can be used to solve the HJB equation efficiently. Herein, the model-based globalized dual heuristic programming (GDHP) is proposed, in which the HJB equation is separated into value, costate, and policy functions. A particular class of interest in this research is finite horizon optimal tracking control (FHOC) problem. Additional issues that arise, such as time-varying functions, terminal constraints, and delta-input formulation, are addressed in the context of FHOC. The DNN structure and training algorithm suitable for FHOC are presented. A benchmark continuous reactor example is provided to illustrate the proposed approach.
Is Part Of:: IFAC-PapersOnLine. Volume 51:Issue 25(2018)
Journal:: IFAC-PapersOnLine
Issue:: Volume 51:Issue 25(2018)
Issue Display:: Volume 51, Issue 25 (2018)
Year:: 2018
Volume:: 51
Issue:: 25
Issue Sort Value:: 2018-0051-0025-0000
Page Start:: 257
Page End:: 262
Publication Date:: 2018
Subjects:: Reinforcement learning -- Approximate dynamic programming -- Deep learning -- Globalized dual heuristic programming -- Optimal control -- Optimal tracking
Automatic control -- Periodicals
629.805
Journal URLs:: https://www.journals.elsevier.com/ifac-papersonline/ ↗
http://www.sciencedirect.com/ ↗
DOI:: 10.1016/j.ifacol.2018.11.115 ↗
Languages:: English
ISSNs:: 2405-8963
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 8751.xml