A novel policy gradient algorithm with PSO-based parameter exploration for continuous control. (April 2020)