Evaluating and analyzing the energy efficiency of CNN inference on high‐performance GPU. (21st October 2020)
- Record Type:
- Journal Article
- Title:
- Evaluating and analyzing the energy efficiency of CNN inference on high‐performance GPU. (21st October 2020)
- Main Title:
- Evaluating and analyzing the energy efficiency of CNN inference on high‐performance GPU
- Authors:
- Yao, Chunrong
Liu, Wantao
Tang, Weiqing
Guo, Jinrong
Hu, Songlin
Lu, Yijun
Jiang, Wei
- Abstract:
- Summary: Convolutional neural network (CNN) inference usually runs on high‐performance graphics processing units (GPUs). Because GPUs draw a large amount of power, deep learning tasks sharply increase energy consumption. The energy efficiency of CNN inference depends not only on the software and hardware configurations but also on the application requirements of the inference tasks. However, this energy efficiency is not yet well understood on GPUs. In this paper, we conduct a comprehensive study of the model‐level and layer‐level energy efficiency of popular CNN models. The results point out several opportunities for further optimization. We also analyze the parameter settings (i.e., batch size, dynamic voltage and frequency scaling) and propose a revenue model to allow an optimal trade‐off between energy efficiency and latency. Compared with the default settings, the optimal settings can improve revenue by up to 15.31×. We obtain the following main findings: (i) GPUs do not exploit the parallelism from the model depth and small convolution kernels, resulting in low energy efficiency. (ii) Convolutional layers are the most energy‐consuming CNN layers. However, due to the cache, the power consumption of all layers is relatively balanced. (iii) The energy efficiency of TensorRT is 1.53× that of TensorFlow.
- Is Part Of:
- Concurrency and computation. Volume 33:Number 6(2021)
- Journal:
- Concurrency and computation
- Issue:
- Volume 33:Number 6(2021)
- Issue Display:
- Volume 33, Issue 6 (2021)
- Year:
- 2021
- Volume:
- 33
- Issue:
- 6
- Issue Sort Value:
- 2021-0033-0006-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2020-10-21
- Subjects:
- CNNs -- energy efficiency -- high‐performance GPU -- inference
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/cpe.6064
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 15758.xml