Scalable critical-path analysis and optimization guidance for hybrid MPI-CUDA applications. (November 2017)
- Record Type:
- Journal Article
- Title:
- Scalable critical-path analysis and optimization guidance for hybrid MPI-CUDA applications. (November 2017)
- Main Title:
- Scalable critical-path analysis and optimization guidance for hybrid MPI-CUDA applications
- Authors:
- Schmitt, Felix
Dietrich, Robert
Juckeland, Guido
- Abstract:
- The use of accelerators in heterogeneous systems is an established approach in designing petascale applications. Today, Compute Unified Device Architecture (CUDA) offers a rich programming interface for GPU accelerators but requires developers to incorporate several layers of parallelism on both the CPU and the GPU. From this increasing program complexity emerges the need for sophisticated performance tools. This work contributes by analyzing hybrid MPI-CUDA programs for properties based on wait states, such as the critical path, a metric proven to identify application bottlenecks effectively. We developed a tool to construct a dependency graph based on an execution trace and the inherent dependencies of the programming models CUDA and Message Passing Interface (MPI). Thereafter, it detects wait states and attributes blame to responsible activities. Together with the property of being on the critical path, we can identify activities that are most viable for optimization. To evaluate the global impact of optimizations to critical activities, we predict the program execution using a graph-based performance projection. The developed approach has been demonstrated with suitable examples to be both scalable and correct. Furthermore, we establish a new categorization of CUDA inefficiency patterns ensuing from the dependencies between CUDA activities.
- Is Part Of:
- International journal of high performance computing applications. Volume 31:Number 6(2017)
- Journal:
- International journal of high performance computing applications
- Issue:
- Volume 31:Number 6(2017)
- Issue Display:
- Volume 31, Issue 6 (2017)
- Year:
- 2017
- Volume:
- 31
- Issue:
- 6
- Issue Sort Value:
- 2017-0031-0006-0000
- Page Start:
- 485
- Page End:
- 498
- Publication Date:
- 2017-11
- Subjects:
- GPGPU -- CUDA -- MPI -- wait states -- critical-path analysis -- performance analysis -- performance optimization
High performance computing -- Periodicals
Supercomputers -- Periodicals
004.1105
- Journal URLs:
- http://hpc.sagepub.com
http://www.uk.sagepub.com/home.nav
http://firstsearch.oclc.org
- DOI:
- 10.1177/1094342016661865
- Languages:
- English
- ISSNs:
- 1094-3420
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 8027.xml