A competitive Markov decision process model and a recursive reinforcement-learning algorithm for fairness scheduling of agile satellites. (July 2022)