Deep reinforcement learning for shared control of mobile robots. Issue 4 (1st December 2021)
- Record Type:
- Journal Article
- Title:
- Deep reinforcement learning for shared control of mobile robots. Issue 4 (1st December 2021)
- Main Title:
- Deep reinforcement learning for shared control of mobile robots
- Authors:
- Tian, Chong
Shaik, Shahil
Wang, Yue
- Other Names:
- Zhang Yu (guest editor)
Gao Fei (guest editor)
Sun Yuxiang (guest editor)
Hovakimyan Naira (guest editor)
Fang Zheng (guest editor)
- Abstract:
- Shared control of mobile robots integrates manual input with auxiliary autonomous controllers to improve the overall system performance. However, prior work that seeks to find the optimal shared control ratio needs an accurate human model, which is usually challenging to obtain. In this study, the authors develop a shared control framework based on an extended Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm, TD3X, that learns to assist a human operator in teleoperating mobile robots optimally. The robot's states, the shared control ratio from the previous time step, and the human's control input are used as inputs to the reinforcement learning (RL) agent, which then outputs the optimal shared control ratio between human input and autonomous controllers without knowing the human model. Noisy softmax policies are developed to make the TD3X algorithm feasible under the constraint of a shared control ratio. Furthermore, to accelerate the training process and protect the robot, a navigation demonstration policy and a safety guard are developed. A neural network (NN) structure is developed to maintain the correlation of sensor readings among heterogeneous input data and improve the learning speed. In addition, an extended DAGGER (DAGGERX) human agent is developed for training the RL agent to reduce human workload. Robot simulations and experiments with humans in the loop are conducted. The results show that the DAGGERX human agent can simulate real human inputs in the worst-case scenarios with a mean square error of 0.0039. Compared to the original TD3 agent, the TD3X-based shared control system reduced the average number of collisions from 387.3 to 44.4 in a simple environment and from 394.2 to 171.2 in a more complex environment. The maximum average return increased from 1043 to 1187, with faster convergence, in the simple environment, while the performance is equally good in the complex environment because of the use of an advanced human agent. In the human subject tests, participants' average perceived workload was significantly lower under shared control than under exclusively manual control (26.90 vs. 40.07, p = 0.013).
- Is Part Of:
- IET cyber-systems and robotics. Volume 3:Issue 4(2021)
- Journal:
- IET cyber-systems and robotics
- Issue:
- Volume 3:Issue 4(2021)
- Issue Display:
- Volume 3, Issue 4 (2021)
- Year:
- 2021
- Volume:
- 3
- Issue:
- 4
- Issue Sort Value:
- 2021-0003-0004-0000
- Page Start:
- 315
- Page End:
- 330
- Publication Date:
- 2021-12-01
- Subjects:
- control -- deep learning -- human robot interaction -- mobile robots -- reinforcement learning
Robotics -- Periodicals
Cybernetics -- Periodicals
Cybernetics
Robotics
Periodicals
629
- Journal URLs:
- https://ietresearch.onlinelibrary.wiley.com/journal/26316315
https://digital-library.theiet.org/content/journals/iet-csr
http://resolver.macewan.ca/macewan?url_ver=Z39.88-2004&ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&rfr_id=info:sid/sfxit.com:opac_856&url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx&sfx.ignore_date_threshold=1&rft.object_id=4100000008486984&svc_val_fmt=info:ofi/fmt:kev:mtx:sch_svc&
http://resolver.library.ualberta.ca/resolver?ctx_enc=info:ofi/enc:UTF-8&ctx_ver=Z39.88-2004&rfr_id=info:sid/ualberta.ca:opac&rft.genre=journal&rft.object_id=4100000008486984&rft.issn=&rft.eissn=&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx&url_ver=Z39.88-2004
https://resolver.ebscohost.com/Redirect/PRL?EPPackageLocationID=570.20128740.48720848&epcustomerid=s3011414
https://ieeexplore.ieee.org/servlet/opac?punumber=8566027
http://search.ebscohost.com/login.aspx?direct=true&site=edspub-live&scope=site&type=44&db=edspub&authtype=ip, guest&custid=ns011247&groupid=main&profile=eds&bquery=AN%2020128740
http://ieeexplore.ieee.org/Xplore/home.jsp
http://imp-primo.hosted.exlibrisgroup.com/openurl/44IMP/44IMP_services_page?u.ignore_date_coverage=true&rft.mms_id=991000469600701591
- DOI:
- 10.1049/csy2.12036
- Languages:
- English
- ISSNs:
- 2631-6315
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 26181.xml