A reinforcement learning framework for optimal operation and maintenance of power grids. (1st May 2019)