Reinforcement learning and dynamic programming using function approximators. ([2010])