This is an interim version of our Electronic Legal Deposit Catalogue-eJournals and eBooks while we continue to recover from a cyber-attack.
No-Regret Learning for Coalitional Model Predictive Control⁎This work is partially supported by the Spanish Training program for Academic Staff (FPU17/02653), the H2020 ADG-ERC project OCONTSOLAR (ID 789051), the MINECO-Spain project DPI2017-86918-R, and the US National Science Foundation (NSF) under grant ECCS-1847056 and CNS-1544782. Issue 2 (2020)
Record Type:
Journal Article
Title:
No-Regret Learning for Coalitional Model Predictive Control⁎This work is partially supported by the Spanish Training program for Academic Staff (FPU17/02653), the H2020 ADG-ERC project OCONTSOLAR (ID 789051), the MINECO-Spain project DPI2017-86918-R, and the US National Science Foundation (NSF) under grant ECCS-1847056 and CNS-1544782. Issue 2 (2020)
Main Title:
No-Regret Learning for Coalitional Model Predictive Control⁎This work is partially supported by the Spanish Training program for Academic Staff (FPU17/02653), the H2020 ADG-ERC project OCONTSOLAR (ID 789051), the MINECO-Spain project DPI2017-86918-R, and the US National Science Foundation (NSF) under grant ECCS-1847056 and CNS-1544782.
Abstract: In this paper, we introduce a learning approach for the controller structure in coalitional model predictive control (MPC) schemes. In this context, the local control entities can dynamically perform in a decentralized manner or assemble into groups of controllers that coordinate their control actions, i.e., coalitions. Such control strategy aims at maximizing system performance while reducing the coordination and computation burden. In this paper, we pose a multi-armed bandit problem where the arms are a set of possible controller structures and the player performs as a supervisory layer that can periodically change the composition of the coalitions. The goal is to use real-time observations to progressively learn the controller structure that best suits the needs of the system. A heuristic learning algorithm and illustrative results are provided.