Solving the scalarization issues of Advantage-based Reinforcement Learning algorithms. (June 2021)

Record Type:: Journal Article
Title:: Solving the scalarization issues of Advantage-based Reinforcement Learning algorithms. (June 2021)
Main Title:: Solving the scalarization issues of Advantage-based Reinforcement Learning algorithms
Authors:: Galatolo, Federico A.
Cimino, Mario G.C.A.
Vaglini, Gigliola
Abstract:: Abstract: In this research, some of the issues that arise from the scalarization of the multi-objective optimization problem in the Advantage Actor–Critic (A2C) reinforcement learning algorithm are investigated. The paper shows how a naive scalarization can lead to gradients overlapping. Furthermore, the possibility that the entropy regularization term can be a source of uncontrolled noise is discussed. With respect to the above issues, a technique to avoid gradient overlapping is proposed, while keeping the same loss formulation. Moreover, a method to avoid the uncontrolled noise, by sampling the actions from distributions with a desired minimum entropy, is investigated. Pilot experiments have been carried out to show how the proposed method speeds up the training. The proposed approach can be applied to any Advantage-based Reinforcement Learning algorithm. Graphical abstract: Highlights: Issues of the Advantage Actor–Critic (A2C) gradient-based optimization are addressed. The proposed variant of the A2C avoids gradient overlapping and controls the entropy. The speedup with respect to A2C is better for increasing environment complexity.
Is Part Of:: Computers & electrical engineering. Volume 92(2021)
Journal:: Computers & electrical engineering
Issue:: Volume 92(2021)
Issue Display:: Volume 92, Issue 2021 (2021)
Year:: 2021
Volume:: 92
Issue:: 2021
Issue Sort Value:: 2021-0092-2021-0000
Page Start:
Page End:
Publication Date:: 2021-06
Subjects:: Reinforcement learning -- Actor–critic -- Deep learning -- Gradient-based optimization
Computer engineering -- Periodicals
Electrical engineering -- Periodicals
Electrical engineering -- Data processing -- Periodicals
Ordinateurs -- Conception et construction -- Périodiques
Électrotechnique -- Périodiques
Électrotechnique -- Informatique -- Périodiques
Computer engineering
Electrical engineering
Electrical engineering -- Data processing
Periodicals
Electronic journals
621.302854
Journal URLs:: http://www.sciencedirect.com/science/journal/00457906/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.compeleceng.2021.107117 ↗
Languages:: English
ISSNs:: 0045-7906
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.680000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 17229.xml