Accelerating high‐order stencils on GPUs. (22nd August 2021)
- Record Type:
- Journal Article
- Title:
- Accelerating high‐order stencils on GPUs. (22nd August 2021)
- Main Title:
- Accelerating high‐order stencils on GPUs
- Authors:
- Sai, Ryuichi
Mellor‐Crummey, John
Meng, Xiaozhu
Zhou, Keren
Araya‐Polo, Mauricio
Meng, Jie - Other Names:
- Wright Steven A. guestEditor.
Solak Serdar guestEditor.
Kilimci Zeynep Hilal guestEditor.
Eken Süleyman guestEditor.
Fernandes Steven guestEditor.
Zhang Yu‐Dong guestEditor.
Tavares João Manuel R.S. guestEditor. - Abstract:
- Summary: Finite‐difference methods based on high‐order stencils are commonly used for modeling of seismic wave propagation, weather forecasting, computational fluid dynamics, convolutional neural networks, and others. Nowadays, the community commonly employs graphics processing units (GPUs) to accelerate such stencil computations. As a result, knowing how to write efficient stencil computations for GPUs is of significant interest. While high‐performance, low‐order stencils on GPUs have been studied extensively in the literature, not all proposed approaches work well for high‐order stencils. Furthermore, coping with boundary conditions used with stencils for seismic modeling makes it challenging to efficiently exploit thread‐level parallelism on GPUs. In this article, we describe several implementations of a 25‐point stencil. We evaluate our stencil code shapes, memory hierarchy usage, data access patterns, and other performance attributes on several modern GPUs and compare them with machine rooflines. On average, our top‐performing kernels achieve six times the performance of a 25‐point stencil code developed in C and mapped to GPUs using OpenACC. Several of our implementations have excellent performance portability across multiple generations of both NVIDIA and AMD GPUs.
- Is Part Of:
- Concurrency and computation. Volume 34:Number 20(2022)
- Journal:
- Concurrency and computation
- Issue:
- Volume 34:Number 20(2022)
- Issue Display:
- Volume 34, Issue 20 (2022)
- Year:
- 2022
- Volume:
- 34
- Issue:
- 20
- Issue Sort Value:
- 2022-0034-0020-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2021-08-22
- Subjects:
- boundary condition -- GPU -- high‐order -- HPC -- stencil computation
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cpe.6467 ↗
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 23523.xml