Reinforcement Learning Guided by Double Replay Memory. (29th April 2021)