Using Linear Stochastic Bandits to extend traditional offline Designed Experiments to online settings. (January 2018)
- Record Type:
- Journal Article
- Title:
- Using Linear Stochastic Bandits to extend traditional offline Designed Experiments to online settings. (January 2018)
- Main Title:
- Using Linear Stochastic Bandits to extend traditional offline Designed Experiments to online settings
- Authors:
- Sudarsanam, Nandan
Ravindran, Balaraman - Abstract:
- Highlights: Linear Bandits are used to extend traditional Factorial designs to online settings. A probabilistic meta-model built on real data is used as a simulation test-bed. A theoretical derivation of performance for Factorial designs is provided. A statistics inspired Bandit algorithm is proposed and outperforms the baselines. Abstract: A designed experiment is typically followed by a statistical analysis of the results, using which the preferred settings of the inputs are selected for operation. In this paper, we motivate real-world scenarios, where it could be advantageous to succeed the experiment with continued exploration upon deployment in the online context. We propose the use of Linear Bandits to conduct sequential experiments in the online setting. The linear bandit algorithms, which utilize results from the designed experiment as an initial seed, are then used to select a treatment combination in each step or trial. Specifically, the study analyzes two linear bandit algorithms and compares them to three different baselines. The two linear bandit algorithms are OFUL, which is shown in literature to have one of the best theoretical performances, and LGUCBand, a novel contribution of this research, which uses the statistical concept of upper confidence bands for linear models. The baselines are different designs and data analyses without any form of online experimentation. The results are compared using simulations of a model built on meta-data from publishedHighlights: Linear Bandits are used to extend traditional Factorial designs to online settings. A probabilistic meta-model built on real data is used as a simulation test-bed. A theoretical derivation of performance for Factorial designs is provided. A statistics inspired Bandit algorithm is proposed and outperforms the baselines. Abstract: A designed experiment is typically followed by a statistical analysis of the results, using which the preferred settings of the inputs are selected for operation. In this paper, we motivate real-world scenarios, where it could be advantageous to succeed the experiment with continued exploration upon deployment in the online context. We propose the use of Linear Bandits to conduct sequential experiments in the online setting. The linear bandit algorithms, which utilize results from the designed experiment as an initial seed, are then used to select a treatment combination in each step or trial. Specifically, the study analyzes two linear bandit algorithms and compares them to three different baselines. The two linear bandit algorithms are OFUL, which is shown in literature to have one of the best theoretical performances, and LGUCBand, a novel contribution of this research, which uses the statistical concept of upper confidence bands for linear models. The baselines are different designs and data analyses without any form of online experimentation. The results are compared using simulations of a model built on meta-data from published experiments on real engineering applications. An analytical derivation of the default baselines is also an important contribution of this research and is intended to provide theoretical validation of the simulation results. The results indicate that, across different environments, substantial long-term improvements can be made by following designed experiments with linear bandits, with minimal short term costs. … (more)
- Is Part Of:
- Computers & industrial engineering. Volume 115(2018)
- Journal:
- Computers & industrial engineering
- Issue:
- Volume 115(2018)
- Issue Display:
- Volume 115, Issue 2018 (2018)
- Year:
- 2018
- Volume:
- 115
- Issue:
- 2018
- Issue Sort Value:
- 2018-0115-2018-0000
- Page Start:
- 471
- Page End:
- 485
- Publication Date:
- 2018-01
- Subjects:
- Design of Experiments -- Linear Stochastic Bandits -- Linear Bandits -- Online experimentation
Engineering -- Data processing -- Periodicals
Industrial engineering -- Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03608352 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cie.2017.11.030 ↗
- Languages:
- English
- ISSNs:
- 0360-8352
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.713000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7025.xml