Chip‐level and multi‐node analysis of energy‐optimized lattice Boltzmann CFD simulations. (18th May 2015)
- Record Type:
- Journal Article
- Title:
- Chip‐level and multi‐node analysis of energy‐optimized lattice Boltzmann CFD simulations. (18th May 2015)
- Main Title:
- Chip‐level and multi‐node analysis of energy‐optimized lattice Boltzmann CFD simulations
- Authors:
- Wittmann, Markus
Hager, Georg
Zeiser, Thomas
Treibig, Jan
Wellein, Gerhard - Other Names:
- Olabarriaga Silvia Delgado guestEditor.
Wilkins‐Diehr Nancy guestEditor.
Smari Waleed W. guestEditor.
Bakhouya Mohamed guestEditor.
Fiore Sandro guestEditor.
Aloisio Giovanni guestEditor. - Abstract:
- Summary: Memory‐bound algorithms show complex performance and energy consumption behavior on multicore processors. We choose the lattice Boltzmann method on an Intel Sandy Bridge cluster as a prototype scenario to investigate if and how single‐chip performance and power characteristics can be generalized to the highly parallel case. First, we perform an analysis of a sparse‐lattice lattice Boltzmann method implementation for complex geometries. Using a single‐core performance model, we predict the intra‐chip saturation characteristics and the optimal operating point in terms of energy‐to‐solution as a function of implementation details, clock frequency, vectorization, and number of active cores per chip. We show that high single‐core performance and a correct choice of the number of active cores per chip are the essential optimizations for the lowest energy‐to‐solution at minimal performance degradation. Then we extrapolate to the Message Passing Interface (MPI)‐parallel level and quantify the energy‐saving potential of various optimizations and execution modes, where we find these guidelines to be even more important, especially when communication overhead is non‐negligible. In our setup, we could achieve energy savings of 35% in this case, compared with a naive approach. We also demonstrate that a simple non‐reflective reduction of the clock speed leaves most of the energy‐saving potential unused. Copyright © 2015 John Wiley & Sons, Ltd.
- Is Part Of:
- Concurrency and computation. Volume 28:Number 7(2016)
- Journal:
- Concurrency and computation
- Issue:
- Volume 28:Number 7(2016)
- Issue Display:
- Volume 28, Issue 7 (2016)
- Year:
- 2016
- Volume:
- 28
- Issue:
- 7
- Issue Sort Value:
- 2016-0028-0007-0000
- Page Start:
- 2295
- Page End:
- 2315
- Publication Date:
- 2015-05-18
- Subjects:
- energy optimization -- ECM performance model -- lattice Boltzmann method
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cpe.3489 ↗
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 2186.xml