Two sparsities are better than one: unlocking the performance benefits of sparse–sparse networks. Issue 3 (1st September 2022)
- Record Type:
- Journal Article
- Title:
- Two sparsities are better than one: unlocking the performance benefits of sparse–sparse networks. Issue 3 (1st September 2022)
- Main Title:
- Two sparsities are better than one: unlocking the performance benefits of sparse–sparse networks
- Authors:
- Hunter, Kevin
Spracklen, Lawrence
Ahmad, Subutai - Abstract:
- Abstract: In principle, sparse neural networks should be significantly more efficient than traditional dense networks. Neurons in the brain exhibit two types of sparsity; they are sparsely interconnected and sparsely active. These two types of sparsity, called weight sparsity and activation sparsity, when combined, offer the potential to reduce the computational cost of neural networks by two orders of magnitude. Despite this potential, today's neural networks deliver only modest performance benefits using just weight sparsity, because traditional computing hardware cannot efficiently process sparse networks. In this article we introduce Complementary Sparsity, a novel technique that significantly improves the performance of dual sparse networks on existing hardware. We demonstrate that we can achieve high performance running weight-sparse networks, and we can multiply those speedups by incorporating activation sparsity. Using Complementary Sparsity, we show up to 100× improvement in throughput and energy efficiency performing inference on FPGAs. We analyze scalability and resource tradeoffs for a variety of kernels typical of commercial convolutional networks such as ResNet-50 and MobileNetV2. Our results with Complementary Sparsity suggest that weight plus activation sparsity can be a potent combination for efficiently scaling future AI models.
- Is Part Of:
- Neuromorphic computing and engineering. Volume 2:Issue 3(2022)
- Journal:
- Neuromorphic computing and engineering
- Issue:
- Volume 2:Issue 3(2022)
- Issue Display:
- Volume 2, Issue 3 (2022)
- Year:
- 2022
- Volume:
- 2
- Issue:
- 3
- Issue Sort Value:
- 2022-0002-0003-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-09-01
- Subjects:
- sparsity -- FPGA -- deep learning -- convolutional networks -- neocortex -- DNNs -- ResNet
Neural networks (Computer science) -- Periodicals
Neural computers -- Periodicals
Neuromorphics -- Periodicals
006.3 - Journal URLs:
- http://www.iop.org/ ↗
https://iopscience.iop.org/journal/2634-4386 ↗ - DOI:
- 10.1088/2634-4386/ac7c8a ↗
- Languages:
- English
- ISSNs:
- 2634-4386
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22548.xml