Reinforcement learning based optimal control of batch processes using Monte-Carlo deep deterministic policy gradient with phase segmentation. (4th January 2021)

Record Type:: Journal Article
Title:: Reinforcement learning based optimal control of batch processes using Monte-Carlo deep deterministic policy gradient with phase segmentation. (4th January 2021)
Main Title:: Reinforcement learning based optimal control of batch processes using Monte-Carlo deep deterministic policy gradient with phase segmentation
Authors:: Yoo, Haeun
Kim, Boeun
Kim, Jong Woo
Lee, Jay H.
Abstract:: Highlights: Design of reward function is suggested for the general economic process control. Phase segmentation approach is proposed to address distinct characteristics of various phases of a batch run. DDPG algorithm is modified with Monte-Carlo learning for stable agent training. Suggested algorithm is applied to a batch polymerization process control problem. Abstract: Batch process control represents a challenge given its dynamic operation over a large operating envelope. Nonlinear model predictive control (NMPC) is the current standard for optimal control of batch processes. The performance of conventional NMPC can be unsatisfactory in the presence of uncertainties. Reinforcement learning (RL) which can utilize simulation or real operation data is a viable alternative for such problems. To apply RL to batch process control effectively, however, choices such as the reward function design and value update method must be made carefully. This study proposes a phase segmentation approach for the reward function design and value/policy function representation. In addition, the deep deterministic policy gradient algorithm (DDPG) is modified with Monte-Carlo learning to ensure more stable and efficient learning behavior. A case study of a batch polymerization process producing polyols is used to demonstrate the improvement brought by the proposed approach and to highlight further issues.
Is Part Of:: Computers & chemical engineering. Volume 144(2021)
Journal:: Computers & chemical engineering
Issue:: Volume 144(2021)
Issue Display:: Volume 144, Issue 2021 (2021)
Year:: 2021
Volume:: 144
Issue:: 2021
Issue Sort Value:: 2021-0144-2021-0000
Page Start:
Page End:
Publication Date:: 2021-01-04
Subjects:: Batch process -- Reinforcement learning -- Optimal control -- Actor-Critic
Chemical engineering -- Data processing -- Periodicals
660.0285
Journal URLs:: http://www.sciencedirect.com/science/journal/00981354 ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.compchemeng.2020.107133 ↗
Languages:: English
ISSNs:: 0098-1354
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.664000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 14934.xml