Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms. (January 2020)
- Record Type:
- Journal Article
- Title:
- Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms. (January 2020)
- Main Title:
- Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms
- Authors:
- Benatia, Akrem
Ji, Weixing
Wang, Yizhuo
Shi, Feng - Other Names:
- Cecilia José M guest-editor.
- Abstract:
- Sparse matrix–vector multiplication (SpMV) kernel dominates the computing cost in numerous applications. Most of the existing studies dedicated to improving this kernel have been targeting just one type of processing units, mainly multicore CPUs or graphics processing units (GPUs), and have not explored the potential of the recent, rapidly emerging, CPU-GPU heterogeneous platforms. To take full advantage of these heterogeneous systems, the input sparse matrix has to be partitioned on different available processing units. The partitioning problem is more challenging with the existence of many sparse formats whose performances depend both on the sparsity of the input matrix and the used hardware. Thus, the best performance does not only depend on how to partition the input sparse matrix but also on which sparse format to use for each partition. To address this challenge, we propose in this article a new CPU-GPU heterogeneous method for computing the SpMV kernel that combines between different sparse formats to achieve better performance and better utilization of CPU-GPU heterogeneous platforms. The proposed solution horizontally partitions the input matrix into multiple block-rows and predicts their best sparse formats using machine learning-based performance models. A mapping algorithm is then used to assign the block-rows to the CPU and GPU(s) available in the system. Our experimental results using real-world large unstructured sparse matrices on two different machines showSparse matrix–vector multiplication (SpMV) kernel dominates the computing cost in numerous applications. Most of the existing studies dedicated to improving this kernel have been targeting just one type of processing units, mainly multicore CPUs or graphics processing units (GPUs), and have not explored the potential of the recent, rapidly emerging, CPU-GPU heterogeneous platforms. To take full advantage of these heterogeneous systems, the input sparse matrix has to be partitioned on different available processing units. The partitioning problem is more challenging with the existence of many sparse formats whose performances depend both on the sparsity of the input matrix and the used hardware. Thus, the best performance does not only depend on how to partition the input sparse matrix but also on which sparse format to use for each partition. To address this challenge, we propose in this article a new CPU-GPU heterogeneous method for computing the SpMV kernel that combines between different sparse formats to achieve better performance and better utilization of CPU-GPU heterogeneous platforms. The proposed solution horizontally partitions the input matrix into multiple block-rows and predicts their best sparse formats using machine learning-based performance models. A mapping algorithm is then used to assign the block-rows to the CPU and GPU(s) available in the system. Our experimental results using real-world large unstructured sparse matrices on two different machines show a noticeable performance improvement. … (more)
- Is Part Of:
- International journal of high performance computing applications. Volume 34:Number 1(2020)
- Journal:
- International journal of high performance computing applications
- Issue:
- Volume 34:Number 1(2020)
- Issue Display:
- Volume 34, Issue 1 (2020)
- Year:
- 2020
- Volume:
- 34
- Issue:
- 1
- Issue Sort Value:
- 2020-0034-0001-0000
- Page Start:
- 66
- Page End:
- 80
- Publication Date:
- 2020-01
- Subjects:
- Sparse matrix–vector multiplication (SpMV) -- CPU-GPU heterogeneous platforms -- sparse matrix partitioning
High performance computing -- Periodicals
Supercomputers -- Periodicals
004.1105 - Journal URLs:
- http://hpc.sagepub.com ↗
http://www.uk.sagepub.com/home.nav ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1177/1094342019886628 ↗
- Languages:
- English
- ISSNs:
- 1094-3420
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 12035.xml