A novel double-mGBDT-based Q-learning. (23rd March 2022)
- Record Type:
- Journal Article
- Title:
- A novel double-mGBDT-based Q-learning. (23rd March 2022)
- Main Title:
- A novel double-mGBDT-based Q-learning
- Authors:
- Fu, Qiming
Ma, Shuai
Tian, Dawei
Chen, JianPing
Gao, Zhen
Zhong, Shan - Abstract:
- This paper proposes a novel double-mGBDT-based Q-learning algorithm. Compared with traditional deep reinforcement learning, the proposed algorithm uses the mGBDT to replace the DNN, where the mGBDT is introduced as the function approximator. In the learning process, based on the state, we use the Bellman equation to construct the target value, which is used to train the mGBDT in an online manner. Like DQN, we also adopt two mGBDT frameworks, which are used to address the problem of easy divergence. To verify performance, we apply the proposed algorithm DQN and mGBDT to the traditional benchmark problems in CartPole and MountainCar. The results show that the proposed algorithm can converge to the optimal policy, and compared with DQN, the proposed algorithm's stability is much better after convergence.
- Is Part Of:
- International journal of modelling, identification and control. Volume 37:Number 3/4(2021)
- Journal:
- International journal of modelling, identification and control
- Issue:
- Volume 37:Number 3/4(2021)
- Issue Display:
- Volume 37, Issue 3/4 (2021)
- Year:
- 2021
- Volume:
- 37
- Issue:
- 3/4
- Issue Sort Value:
- 2021-0037-NaN-0000
- Page Start:
- 232
- Page End:
- 239
- Publication Date:
- 2022-03-23
- Subjects:
- deep learning -- reinforcement learning -- mGBDT
Engineering -- Methodology -- Periodicals
Science -- Methodology -- Periodicals
001.42 - Journal URLs:
- http://www.inderscience.com/browse/index.php?journalID=176 ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1746-6172
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 20806.xml