A hardware-efficient computing engine for FPGA-based deep convolutional neural network accelerator. (October 2022)
- Record Type:
- Journal Article
- Title:
- A hardware-efficient computing engine for FPGA-based deep convolutional neural network accelerator. (October 2022)
- Main Title:
- A hardware-efficient computing engine for FPGA-based deep convolutional neural network accelerator
- Authors:
- Li, Xueming
Huang, Hongmin
Chen, Taosheng
Gao, Huaien
Hu, Xianghong
Xiong, Xiaoming - Abstract:
- Abstract: Deep convolutional neural networks (DCNNs) have recently emerged as a promising approach for computer vision tasks with many new DCNN architectures proposed to further improve their performance. However, the significant computation workload limits the deployment of such networks on embedded devices. Research on accelerating DCNN inference usually employs field-programmable gate arrays (FPGAs) due to their programmability. However, hardware efficiency and reconfigurability do not often receive sufficient attention. This paper proposes an efficient accelerator to support multiple DCNNs and improve the hardware utilization from three perspectives. First, a bandwidth-based tiling algorithm is used to improve the data transfer efficiency for direct memory access (DMA). Second, three parallel strategies are proposed to improve the utilization of the computing units (CUs). Third, a configurable CU is designed to improve the digital signal processor (DSP) utilization. The proposed accelerator is implemented on the Xilinx ZYNQ-7 ZC706 Evaluation Board at 200 MHz. The accelerator reaches 163 Giga Operations Per Second (GOPS) and 0.36 GOPS/DSP on the VGG-16 while consuming only 448 DSPs. A 0.24 GOPS/DSP is achieved with ResNet50 and 0.27 GOPS/DSP with YOLOv2-tiny. The experimental results demonstrate that this design achieves a better trade-off between hardware resource consumption, performance, and reconfigurability over previous works.
- Is Part Of:
- Microelectronics journal. Volume 128(2022)
- Journal:
- Microelectronics journal
- Issue:
- Volume 128(2022)
- Issue Display:
- Volume 128, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 128
- Issue:
- 2022
- Issue Sort Value:
- 2022-0128-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-10
- Subjects:
- DCNN accelerator -- Hardware efficiency -- Parallel computing -- FPGA -- Reconfigurability
Microelectronics -- Periodicals
Microélectronique -- Périodiques
Microelectronics
Electronic journals
Journals - contents and abstracts
Periodicals
621.3805 - Journal URLs:
- http://catalog.hathitrust.org/api/volumes/oclc/5877621.html ↗
http://www.sciencedirect.com/science/journal/00262692 ↗
http://www.intute.ac.uk/sciences/cgi-bin/fullrecord.pl?handle=lesa.1012319367 ↗
http://www.elsevier.com/journals ↗
http://www.elsevier.com/homepage/elecserv.htt ↗ - DOI:
- 10.1016/j.mejo.2022.105547 ↗
- Languages:
- English
- ISSNs:
- 0959-8324
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5758.973000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23865.xml