Training Compact DNNs with ℓ1/2 Regularization. (April 2023)
- Record Type:
- Journal Article
- Title:
- Training Compact DNNs with ℓ1/2 Regularization. (April 2023)
- Main Title:
- Training Compact DNNs with ℓ1/2 Regularization
- Authors:
- Tang, Anda
Niu, Lingfeng
Miao, Jianyu
Zhang, Peng - Abstract:
- Highlights: We propose a network compression model based on ℓ 1 / 2 regularization. To the best of our knowledge, it is the first work utilizing non-Lipschitz continuous regularization to compress DNNs. We strictly prove the correspondence between ℓ p ( 0 < p < 1 ) and Hyper-Laplacian prior. Based on this prior, we suggest utilizing ℓ 1 / 2, as the single regularizer, to sparsify the connections and neurons of the network simultaneously. We give a closed-form, threshold solution to the proximal operator of ℓ 1 / 2, and consequently design a stochastic proximal gradient algorithm to train the resulting model. We conduct experiments to validate the performance of the proposed method. The results demonstrate that our method outperforms benchmark methods in terms of accuracy, computation and memory costs. Abstract: Deep neural network(DNN) has achieved unprecedented success in many fields. However, its large model parameters which bring a great burden on storage and calculation hinder the development and application of DNNs. It is worthy of compressing the model to reduce the complexity of the DNN. Sparsity-inducing regularizer is one of the most common tools for compression. In this paper, we propose utilizing the ℓ 1 / 2 quasi-norm to zero out weights of neural networks and compressing the networks automatically during the learning process. To our knowledge, it is the first work applying the non-Lipschitz continuous regularizer for the compression of DNNs. The resulting sparseHighlights: We propose a network compression model based on ℓ 1 / 2 regularization. To the best of our knowledge, it is the first work utilizing non-Lipschitz continuous regularization to compress DNNs. We strictly prove the correspondence between ℓ p ( 0 < p < 1 ) and Hyper-Laplacian prior. Based on this prior, we suggest utilizing ℓ 1 / 2, as the single regularizer, to sparsify the connections and neurons of the network simultaneously. We give a closed-form, threshold solution to the proximal operator of ℓ 1 / 2, and consequently design a stochastic proximal gradient algorithm to train the resulting model. We conduct experiments to validate the performance of the proposed method. The results demonstrate that our method outperforms benchmark methods in terms of accuracy, computation and memory costs. Abstract: Deep neural network(DNN) has achieved unprecedented success in many fields. However, its large model parameters which bring a great burden on storage and calculation hinder the development and application of DNNs. It is worthy of compressing the model to reduce the complexity of the DNN. Sparsity-inducing regularizer is one of the most common tools for compression. In this paper, we propose utilizing the ℓ 1 / 2 quasi-norm to zero out weights of neural networks and compressing the networks automatically during the learning process. To our knowledge, it is the first work applying the non-Lipschitz continuous regularizer for the compression of DNNs. The resulting sparse optimization problem is solved by stochastic proximal gradient algorithm. For further convenience of calculation, an approximation of the threshold-form solution to the proximal operator with ℓ 1 / 2 is given at the same time. Extensive experiments with various datasets and baselines demonstrate the advantages of our new method. … (more)
- Is Part Of:
- Pattern recognition. Volume 136(2023)
- Journal:
- Pattern recognition
- Issue:
- Volume 136(2023)
- Issue Display:
- Volume 136, Issue 2023 (2023)
- Year:
- 2023
- Volume:
- 136
- Issue:
- 2023
- Issue Sort Value:
- 2023-0136-2023-0000
- Page Start:
- Page End:
- Publication Date:
- 2023-04
- Subjects:
- Deep neural networks -- Model compression -- ℓ1/2 Quasi-norm -- Non-Lipschitz regularization -- Sparse optimization
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2022.109206 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25681.xml