Adaptation of fluid model EULAG to graphics processing unit architecture. (14th October 2014)
- Record Type:
- Journal Article
- Title:
- Adaptation of fluid model EULAG to graphics processing unit architecture. (14th October 2014)
- Main Title:
- Adaptation of fluid model EULAG to graphics processing unit architecture
- Authors:
- Rojek, Krzysztof Andrzej
Ciznicki, Milosz
Rosa, Bogdan
Kopta, Piotr
Kulczewski, Michal
Kurowski, Krzysztof
Piotrowski, Zbigniew Pawel
Szustak, Lukasz
Wojcik, Damian Karol
Wyrzykowski, Roman
Hussain, Farookh Khadeer
Wyrzykowski, Roman
Tudruj, Marek
- Abstract:
- The goal of this study is to adapt the multiscale fluid solver EULerian or LAGrangian framework (EULAG) to future graphics processing unit (GPU) platforms. The EULAG model has a proven record of successful applications, and excellent efficiency and scalability on conventional supercomputer architectures. Currently, the model is being implemented as the new dynamical core of the COSMO weather prediction framework. Within this study, two main modules of EULAG, namely the multidimensional positive definite advection transport algorithm (MPDATA) and the variational generalized conjugate residual (GCR) elliptic pressure solver, are analyzed and optimized. In this paper, a method is proposed that ensures a comprehensive analysis of resource consumption, including registers, shared memory, and global memory. This method allows us to identify bottlenecks of the algorithm, including data transfers between host and global memory and between global and shared memories, as well as GPU occupancy. We put the emphasis on providing a fixed memory access pattern, padding, and organizing computation in the MPDATA algorithm. The testing and validation of the new GPU implementation have been carried out based on modeling decaying turbulence of a homogeneous incompressible fluid in a triply-periodic cube. Simulations performed using the standard version of EULAG and its new GPU implementation give similar solutions. Preliminary results show a promising increase in terms of computational efficiency. Copyright © 2014 John Wiley & Sons, Ltd.
- Is Part Of:
- Concurrency and computation. Volume 27:Number 4(2015:Mar.)
- Journal:
- Concurrency and computation
- Issue:
- Volume 27:Number 4(2015:Mar.)
- Issue Display:
- Volume 27, Issue 4 (2015)
- Year:
- 2015
- Volume:
- 27
- Issue:
- 4
- Issue Sort Value:
- 2015-0027-0004-0000
- Page Start:
- 937
- Page End:
- 957
- Publication Date:
- 2014-10-14
- Subjects:
- Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/cpe.3417
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 4349.xml