Work stealing for GPU‐accelerated parallel programs in a global address space framework. (6th January 2016)
- Record Type:
- Journal Article
- Title:
- Work stealing for GPU‐accelerated parallel programs in a global address space framework. (6th January 2016)
- Main Title:
- Work stealing for GPU‐accelerated parallel programs in a global address space framework
- Authors:
- Arafat, Humayun
Dinan, James
Krishnamoorthy, Sriram
Balaji, Pavan
Sadayappan, P.
- Abstract:
- Summary: Task parallelism is an attractive approach to automatically load balance the computation in a parallel system and adapt to dynamism exhibited by parallel systems. Exploiting task parallelism through work stealing has been extensively studied in shared and distributed‐memory contexts. In this paper, we study the design of a system that uses work stealing for dynamic load balancing of task‐parallel programs executed on hybrid distributed‐memory CPU‐graphics processing unit (GPU) systems in a global‐address space framework. We take into account the unique nature of the accelerator model employed by GPUs, the significant performance difference between GPU and CPU execution as a function of problem size, and the distinct CPU and GPU memory domains. We consider various alternatives in designing a distributed work stealing algorithm for CPU‐GPU systems, while taking into account the impact of task distribution and data movement overheads. These strategies are evaluated using microbenchmarks that capture various execution configurations as well as the state‐of‐the‐art CCSD(T) application module from the computational chemistry domain. Copyright © 2016 John Wiley & Sons, Ltd.
- Is Part Of:
- Concurrency and computation. Volume 28:Number 13(2016)
- Journal:
- Concurrency and computation
- Issue:
- Volume 28:Number 13(2016)
- Issue Display:
- Volume 28, Issue 13 (2016)
- Year:
- 2016
- Volume:
- 28
- Issue:
- 13
- Issue Sort Value:
- 2016-0028-0013-0000
- Page Start:
- 3637
- Page End:
- 3654
- Publication Date:
- 2016-01-06
- Subjects:
- GPU -- partitioned global address space -- task parallelism
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/cpe.3747
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 910.xml