A pretrained proximal policy optimization algorithm with reward shaping for aircraft guidance to a moving destination in three-dimensional continuous space. (4th February 2021)

Record Type:: Journal Article
Title:: A pretrained proximal policy optimization algorithm with reward shaping for aircraft guidance to a moving destination in three-dimensional continuous space. (4th February 2021)
Main Title:: A pretrained proximal policy optimization algorithm with reward shaping for aircraft guidance to a moving destination in three-dimensional continuous space
Authors:: Wang, Zhuang
Li, Hui
Wu, Zhaoxin
Wu, Haolin
Abstract:: To enhance the performance of guiding an aircraft to a moving destination in a certain direction in three-dimensional continuous space, it is essential to develop an efficient intelligent algorithm. In this article, a pretrained proximal policy optimization (PPO) with reward shaping algorithm, which does not require an accurate model, is proposed to solve the guidance problem of manned aircraft and unmanned aerial vehicles. Continuous action reward function and position reward function are presented, by which the training speed is increased and the performance of the generated trajectory is improved. Using pretrained PPO, a new agent can be trained efficiently for a new task. A reinforcement learning framework is built, in which an agent can be trained to generate a reference trajectory or a series of guidance instructions. General simulation results show that the proposed method can significantly improve the training efficiency and trajectory performance. The carrier-based aircraft approach simulation is carried out to prove the application value of the proposed approach.
Is Part Of:: International journal of advanced robotic systems. Volume 18:Number 1(2021)
Journal:: International journal of advanced robotic systems
Issue:: Volume 18:Number 1(2021)
Issue Display:: Volume 18, Issue 1 (2021)
Year:: 2021
Volume:: 18
Issue:: 1
Issue Sort Value:: 2021-0018-0001-0000
Page Start:
Page End:
Publication Date:: 2021-02-04
Subjects:: Aircraft guidance -- deep reinforcement learning -- PPO -- reward shaping
Robotics -- Periodicals
Robotics
Periodicals
629.892
Journal URLs:: http://arx.sagepub.com/ ↗
http://search.epnet.com/direct.asp?db=bch&jid=13CR&scope=site ↗
http://www.intechweb.org/journal.php?id=3 ↗
http://www.uk.sagepub.com/home.nav ↗
DOI:: 10.1177/1729881421989546 ↗
Languages:: English
ISSNs:: 1729-8806
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 15110.xml