A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system. (March 2020)