Statistical and machine learning models for optimizing energy in parallel applications. (November 2019)
- Record Type:
- Journal Article
- Title:
- Statistical and machine learning models for optimizing energy in parallel applications. (November 2019)
- Main Title:
- Statistical and machine learning models for optimizing energy in parallel applications
- Authors:
- Endrei, Mark
Jin, Chao
Dinh, Minh Ngoc
Abramson, David
Poxon, Heidi
DeRose, Luiz
de Supinski, Bronis R - Other Names:
- Dongarra Jack guest-editor.
Tourancheau Bernard guest-editor. - Abstract:
- Rising power costs and constraints are driving a growing focus on the energy efficiency of high performance computing systems. The unique characteristics of a particular system and workload and their effect on performance and energy efficiency are typically difficult for application users to assess and to control. Settings for optimum performance and energy efficiency can also diverge, so we need to identify trade-off options that guide a suitable balance between energy use and performance. We present statistical and machine learning models that only require a small number of runs to make accurate Pareto-optimal trade-off predictions using parameters that users can control. We study model training and validation using several parallel kernels and more complex workloads, including Algebraic Multigrid (AMG), Large-scale Atomic Molecular Massively Parallel Simulator, and Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics. We demonstrate that we can train the models using as few as 12 runs, with prediction error of less than 10%. Our AMG results identify trade-off options that provide up to 45% improvement in energy efficiency for around 10% performance loss. We reduce the sample measurement time required for AMG by 90%, from 13 h to 74 min.
- Is Part Of:
- International journal of high performance computing applications. Volume 33:Number 6(2019)
- Journal:
- International journal of high performance computing applications
- Issue:
- Volume 33:Number 6(2019)
- Issue Display:
- Volume 33, Issue 6 (2019)
- Year:
- 2019
- Volume:
- 33
- Issue:
- 6
- Issue Sort Value:
- 2019-0033-0006-0000
- Page Start:
- 1079
- Page End:
- 1097
- Publication Date:
- 2019-11
- Subjects:
- Energy efficiency -- performance -- regression modeling -- machine learning -- high performance computing
High performance computing -- Periodicals
Supercomputers -- Periodicals
004.1105 - Journal URLs:
- http://hpc.sagepub.com ↗
http://www.uk.sagepub.com/home.nav ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1177/1094342019842915 ↗
- Languages:
- English
- ISSNs:
- 1094-3420
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11258.xml