Aligning individual and collective welfare in complex socio-technical systems by combining metaheuristics and reinforcement learning. (March 2019)