An indirect reinforcement learning based real-time energy management strategy via high-order Markov Chain model for a hybrid electric vehicle. (1st December 2021)
- Record Type:
- Journal Article
- Title:
- An indirect reinforcement learning based real-time energy management strategy via high-order Markov Chain model for a hybrid electric vehicle. (1st December 2021)
- Main Title:
- An indirect reinforcement learning based real-time energy management strategy via high-order Markov Chain model for a hybrid electric vehicle
- Authors:
- Yang, Ningkang
Han, Lijin
Xiang, Changle
Liu, Hui
Li, Xunmin - Abstract:
- Abstract: This paper proposes a real-time indirect reinforcement learning based strategy to reduce the fuel consumption. In order to improve the real-time performance and achieve learning online, the simulated experience from environment model is adopted for the learning process, which is called indirect reinforcement learning. To establish an accurate environment model, a high-order Markov Chain is introduced and detailed, which is more precise than a widely used first-order Markov Chain. Corresponding with the model, how the reinforcement learning algorithm learns from the simulated experience is illustrated. Furthermore, an online recursive form of the transition probability matrix is derived, through which the statistical characteristics from the practical driving conditions can be collected. The induced matrix norm is chosen as a criterion to quantify the differences between the transition probability matrices and to determine the time for updating the environment model and triggering the recalculation of the reinforcement learning algorithm. Simulation results demonstrate that, compared with the direct RL, the proposed strategy can effectively reduce the learning time while maintains satisfied fuel economy. Furthermore, a hardware-in-the-loop experiment verifies its real-time capability and actual applicability. Highlights: A parsimonious high order Markov Chain model is introduced and utilized. An indirect reinforcement learning algorithm is developed. The powerAbstract: This paper proposes a real-time indirect reinforcement learning based strategy to reduce the fuel consumption. In order to improve the real-time performance and achieve learning online, the simulated experience from environment model is adopted for the learning process, which is called indirect reinforcement learning. To establish an accurate environment model, a high-order Markov Chain is introduced and detailed, which is more precise than a widely used first-order Markov Chain. Corresponding with the model, how the reinforcement learning algorithm learns from the simulated experience is illustrated. Furthermore, an online recursive form of the transition probability matrix is derived, through which the statistical characteristics from the practical driving conditions can be collected. The induced matrix norm is chosen as a criterion to quantify the differences between the transition probability matrices and to determine the time for updating the environment model and triggering the recalculation of the reinforcement learning algorithm. Simulation results demonstrate that, compared with the direct RL, the proposed strategy can effectively reduce the learning time while maintains satisfied fuel economy. Furthermore, a hardware-in-the-loop experiment verifies its real-time capability and actual applicability. Highlights: A parsimonious high order Markov Chain model is introduced and utilized. An indirect reinforcement learning algorithm is developed. The power transition probability of the Markov Chain is updated online and recursively. The introduced matrix norm is used to determine the update of the control policy. The proposed strategy improves fuel economy and operates in real time. … (more)
- Is Part Of:
- Energy. Volume 236(2021)
- Journal:
- Energy
- Issue:
- Volume 236(2021)
- Issue Display:
- Volume 236, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 236
- Issue:
- 2021
- Issue Sort Value:
- 2021-0236-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-12-01
- Subjects:
- Hybrid electric vehicle -- Real-time energy management -- Indirect reinforcement learning -- High-order Markov chain
Power resources -- Periodicals
Power (Mechanics) -- Periodicals
Energy consumption -- Periodicals
333.7905 - Journal URLs:
- http://www.elsevier.com/journals ↗
- DOI:
- 10.1016/j.energy.2021.121337 ↗
- Languages:
- English
- ISSNs:
- 0360-5442
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3747.445000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19355.xml