Simulating quantized inference on convolutional neural networks. (October 2021)
- Record Type:
- Journal Article
- Title:
- Simulating quantized inference on convolutional neural networks. (October 2021)
- Main Title:
- Simulating quantized inference on convolutional neural networks
- Authors:
- Finotti, Vitor
Albertini, Bruno - Abstract:
- Abstract: Mobile and embedded applications of convolutional neural networks (CNNs) use quantization to reduce model size and increase computational efficiency. However, working with quantized networks often implies using non-standard training and execution methods, as modern frameworks offer limited support to fixed-point operations. We propose a quantization approach simulating the effects of quantization in CNN inference without needing to be re-implemented using fixed-point arithmetic, reducing overhead and complexity in evaluating existing networks' responses to quantization. The proposed method provides a fast way of performing post-training quantization with different bit widths in activations and weights. Our experimental results on ImageNet CNNs show a model size reduction of more than 50%, while maintaining classification accuracy without a need for retraining. We also measured the relationship between classification complexity and tolerance to quantization, finding an inverse correlation between quantization level and dataset complexity. Highlights: Simulation of fixed-point quantization inference in convolutional neural networks. Quantization of convolutional neural network inference in PyTorch. Model size reduction on ImageNet architectures by more than 50% without accuracy loss. CNN architectures classifying simple datasets are more tolerant to quantization.
- Is Part Of:
- Computers & electrical engineering. Volume 95(2021)
- Journal:
- Computers & electrical engineering
- Issue:
- Volume 95(2021)
- Issue Display:
- Volume 95, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 95
- Issue:
- 2021
- Issue Sort Value:
- 2021-0095-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-10
- Subjects:
- Convolutional neural networks -- Post-training quantization -- Fixed-point arithmetic
Computer engineering -- Periodicals
Electrical engineering -- Periodicals
Electrical engineering -- Data processing -- Periodicals
Ordinateurs -- Conception et construction -- Périodiques
Électrotechnique -- Périodiques
Électrotechnique -- Informatique -- Périodiques
Computer engineering
Electrical engineering
Electrical engineering -- Data processing
Periodicals
Electronic journals
621.302854 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00457906/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.compeleceng.2021.107446 ↗
- Languages:
- English
- ISSNs:
- 0045-7906
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.680000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 19347.xml