Reinforcement-learning-based optimal trading in a simulated futures market with heterogeneous agents. (April 2022)