Aerial combat maneuvering policy learning based on confrontation demonstrations and dynamic quality replay. (May 2022)