Solving the scalarization issues of Advantage-based Reinforcement Learning algorithms. (June 2021)
- Record Type:
- Journal Article
- Title:
- Solving the scalarization issues of Advantage-based Reinforcement Learning algorithms. (June 2021)
- Main Title:
- Solving the scalarization issues of Advantage-based Reinforcement Learning algorithms
- Authors:
- Galatolo, Federico A.
Cimino, Mario G.C.A.
Vaglini, Gigliola - Abstract:
- Abstract: In this research, some of the issues that arise from the scalarization of the multi-objective optimization problem in the Advantage Actor–Critic (A2C) reinforcement learning algorithm are investigated. The paper shows how a naive scalarization can lead to gradients overlapping. Furthermore, the possibility that the entropy regularization term can be a source of uncontrolled noise is discussed. With respect to the above issues, a technique to avoid gradient overlapping is proposed, while keeping the same loss formulation. Moreover, a method to avoid the uncontrolled noise, by sampling the actions from distributions with a desired minimum entropy, is investigated. Pilot experiments have been carried out to show how the proposed method speeds up the training. The proposed approach can be applied to any Advantage-based Reinforcement Learning algorithm. Graphical abstract: Highlights: Issues of the Advantage Actor–Critic (A2C) gradient-based optimization are addressed. The proposed variant of the A2C avoids gradient overlapping and controls the entropy. The speedup with respect to A2C is better for increasing environment complexity.
- Is Part Of:
- Computers & electrical engineering. Volume 92(2021)
- Journal:
- Computers & electrical engineering
- Issue:
- Volume 92(2021)
- Issue Display:
- Volume 92, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 92
- Issue:
- 2021
- Issue Sort Value:
- 2021-0092-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-06
- Subjects:
- Reinforcement learning -- Actor–critic -- Deep learning -- Gradient-based optimization
Computer engineering -- Periodicals
Electrical engineering -- Periodicals
Electrical engineering -- Data processing -- Periodicals
Ordinateurs -- Conception et construction -- Périodiques
Électrotechnique -- Périodiques
Électrotechnique -- Informatique -- Périodiques
Computer engineering
Electrical engineering
Electrical engineering -- Data processing
Periodicals
Electronic journals
621.302854 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00457906/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compeleceng.2021.107117 ↗
- Languages:
- English
- ISSNs:
- 0045-7906
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.680000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17229.xml