Asynchronous learning for actor–critic neural networks and synchronous triggering for multiplayer system. (October 2022)

Record Type:: Journal Article
Title:: Asynchronous learning for actor–critic neural networks and synchronous triggering for multiplayer system. (October 2022)
Main Title:: Asynchronous learning for actor–critic neural networks and synchronous triggering for multiplayer system
Authors:: Wang, Ke
Mu, Chaoxu
Abstract:: Abstract: In this paper, based on actor–critic neural network structure and reinforcement learning scheme, a novel asynchronous learning algorithm with event communication is developed, so as to solve Nash equilibrium of multiplayer nonzero-sum differential game in an adaptive fashion. From the point of optimal control view, each player or local controller wants to minimize the individual infinite-time cost function by finding an optimal policy. In this novel learning framework, each player consists of one critic and one actor, and implements distributed asynchronous policy iteration to optimize decision-making process. In addition, communication burden between the system and players is effectively reduced by setting up a central event generator. Critic network executes fast updates by gradient-descent adaption while actor network gives event-induced updates using the gradient projection. The closed-loop asymptotic stability is ensured along with uniform ultimate convergence. Then, the effectiveness of the proposed algorithm is substantiated on a four-player nonlinear system, revealing that it can significantly reduce sampling numbers without impairing learning accuracy. Finally, by leveraging nonzero-sum game idea, the proposed learning scheme is also applied to solve the lateral-directional stability of a linear aircraft system, and is further extended to a nonlinear vehicle system for achieving adaptive cruise control. Highlights: A novel asynchronous learning algorithm … (more)
Is Part Of:: ISA transactions. Volume 129(2022)Part B
Journal:: ISA transactions
Issue:: Volume 129(2022)Part B
Issue Display:: Volume 129, Issue 2022 (2022)
Year:: 2022
Volume:: 129
Issue:: 2022
Issue Sort Value:: 2022-0129-2022-0000
Page Start:: 295
Page End:: 308
Publication Date:: 2022-10
Subjects:: Nonzero-sum differential game -- Neural network -- Actor–critic -- Asynchronous learning -- Synchronous triggering -- Event-triggered communication
Engineering instruments -- Periodicals
Engineering instruments
Periodicals
Electronic journals
629.805
Journal URLs:: http://www.sciencedirect.com/science/journal/00190578 ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.isatra.2022.02.007 ↗
Languages:: English
ISSNs:: 0019-0578
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 4582.700000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 24095.xml