Training high-performance and large-scale deep neural networks with full 8-bit integers. (May 2020)
- Record Type:
- Journal Article
- Title:
- Training high-performance and large-scale deep neural networks with full 8-bit integers. (May 2020)
- Main Title:
- Training high-performance and large-scale deep neural networks with full 8-bit integers
- Authors:
- Yang, Yukuan
Deng, Lei
Wu, Shuang
Yan, Tianyi
Xie, Yuan
Li, Guoqi
- Abstract:
- Deep neural network (DNN) quantization, which converts floating-point (FP) data in the network to integers (INT), is an effective way to shrink the model size for memory saving and to simplify operations for compute acceleration. Recently, research on DNN quantization has developed from inference to training, laying a foundation for online training on accelerators. However, existing schemes that leave batch normalization (BN) untouched during training are mostly incomplete quantizations that still adopt high-precision FP in some parts of the data paths. Currently, there is no solution that can use only low-bit-width INT data during the whole training process of large-scale DNNs with acceptable accuracy. In this work, by decomposing all the computation steps in DNNs and fusing three special quantization functions to satisfy the different precision requirements, we propose a unified complete quantization framework termed "WAGEUBN" to quantize DNNs across all data paths, including W (Weights), A (Activation), G (Gradient), E (Error), U (Update), and BN. Moreover, the Momentum optimizer is also quantized to realize a completely quantized framework. Experiments on ResNet18/34/50 models demonstrate that WAGEUBN can achieve competitive accuracy on the ImageNet dataset. For the first time, the study of quantization in large-scale DNNs is advanced to the full 8-bit INT level. In this way, all the operations in training and inference can be bit-wise operations, pushing towards faster processing speed, decreased memory cost, and higher energy efficiency. Our thorough quantization framework has great potential for future efficient portable devices with online learning ability.
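The record itself contains no code, but as a rough illustration of the kind of 8-bit integer quantization the abstract describes, a minimal, generic sketch in Python/NumPy follows. This is not the authors' WAGEUBN quantization functions; the function names, symmetric range, and float scale factor are illustrative assumptions. In a genuinely full-INT pipeline the scale would typically itself be constrained (e.g., to a power of two) so that rescaling reduces to bit shifts.

```python
import numpy as np

def quantize_int8(x, num_bits=8):
    """Uniform symmetric quantization of a float array to signed 8-bit integers.

    Generic linear quantizer for illustration only; WAGEUBN defines its own
    quantization functions for W, A, G, E, U, and BN, which are not shown here.
    """
    qmax = 2 ** (num_bits - 1) - 1               # 127 for 8 bits
    max_abs = float(np.max(np.abs(x)))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale                              # INT codes plus an FP scale

def dequantize(q, scale):
    """Map the integer codes back to approximate floating-point values."""
    return q.astype(np.float32) * scale

# Round-trip a small weight tensor and report the worst-case error.
w = np.random.randn(4, 4).astype(np.float32)
q, s = quantize_int8(w)
print(np.max(np.abs(w - dequantize(q, s))))
```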
- Is Part Of:
- Neural networks. Volume 125(2020)
- Journal:
- Neural networks
- Issue:
- Volume 125(2020)
- Issue Display:
- Volume 125, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 125
- Issue:
- 2020
- Issue Sort Value:
- 2020-0125-2020-0000
- Page Start:
- 70
- Page End:
- 82
- Publication Date:
- 2020-05
- Subjects:
- Neural network quantization -- 8-bit training -- Full quantization -- Online learning device
Neural computers -- Periodicals
Neural networks (Computer science) -- Periodicals
Neural networks (Neurobiology) -- Periodicals
Nervous System -- Periodicals
Ordinateurs neuronaux -- Périodiques
Réseaux neuronaux (Informatique) -- Périodiques
Réseaux neuronaux (Neurobiologie) -- Périodiques
Neural computers
Neural networks (Computer science)
Neural networks (Neurobiology)
Periodicals
006.32
- Journal URLs:
- http://www.sciencedirect.com/science/journal/08936080
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.neunet.2019.12.027
- Languages:
- English
- ISSNs:
- 0893-6080
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 6081.280800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 13422.xml