A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system. (March 2020)

Record Type:: Journal Article
Title:: A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system. (March 2020)
Main Title:: A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system
Authors:: Kim, Jong Woo
Park, Byung Jun
Yoo, Haeun
Oh, Tae Hoon
Lee, Jay H.
Lee, Jong Min
Abstract:: Highlights: A model-based deep reinforcement learning (DRL) algorithm, which solves the Hamilton–Jacobi–Bellman equation for finite-horizon optimal control of nonlinear control-affine system is developed. Deep neural networks (DNNs) are implemented to approximate the value function, its first-order derivative (i.e., costate function), and the policy function. State-of-the-art DRL methods are incorporated to efficiently train the DNNs. The use of DNNs has allow for the application of the algorithm to the high-dimensional problem, and is shown to improve the performance of a learned policy in the presence of uncertainty. Examples involving the batch chemical reactor and a diffusion-convection-reaction system are used to demonstrate the statements. Abstract: The Hamilton–Jacobi–Bellman (HJB) equation can be solved to obtain optimal closed-loop control policies for general nonlinear systems. As it is seldom possible to solve the HJB equation exactly for nonlinear systems, either analytically or numerically, methods to build approximate solutions through simulation based learning have been studied in various names like neurodynamic programming (NDP) and approximate dynamic programming (ADP). The aspect of learning connects these methods to reinforcement learning (RL), which also tries to learn optimal decision policies through trial-and-error based learning. This study develops a model-based RL method, which iteratively learns the solution to the HJB and its associated equations. … (more)
Is Part Of:: Journal of process control. Volume 87(2020)
Journal:: Journal of process control
Issue:: Volume 87(2020)
Issue Display:: Volume 87, Issue 2020 (2020)
Year:: 2020
Volume:: 87
Issue:: 2020
Issue Sort Value:: 2020-0087-2020-0000
Page Start:: 166
Page End:: 178
Publication Date:: 2020-03
Subjects:: Reinforcement learning -- Approximate dynamic programming -- Deep neural networks -- Globalized dual heuristic programming -- Finite horizon optimal control problem -- Hamilton–Jacobi–Bellman equation
Process control -- Periodicals
Fabrication -- Contrôle -- Périodiques
Process control
Periodicals
Electronic journals
660.281
Journal URLs:: http://www.sciencedirect.com/science/journal/09591524 ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.jprocont.2020.02.003 ↗
Languages:: English
ISSNs:: 0959-1524
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 5042.645000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 18025.xml