HARVARD Citation
Wang, Z. et al. (2021). A pretrained proximal policy optimization algorithm with reward shaping for aircraft guidance to a moving destination in three-dimensional continuous space. International Journal of Advanced Robotic Systems. [Online].