A novel reinforcement learning method for improving occupant comfort via window opening and closing. (October 2020)