Reinforcement learning and dynamic programming using function approximators. (2017)