A Finite Horizon Markov Decision Process Based Reinforcement Learning Control of a Rapid Thermal Processing system. (August 2018)