Reducing Entropy Overestimation in Soft Actor Critic Using Dual Policy Network. (10th June 2021)