Dynamic hard pruning of Neural Networks at the edge of the internet. (April 2022)
- Record Type:
- Journal Article
- Title:
- Dynamic hard pruning of Neural Networks at the edge of the internet. (April 2022)
- Main Title:
- Dynamic hard pruning of Neural Networks at the edge of the internet
- Authors:
- Valerio, Lorenzo
Nardini, Franco Maria
Passarella, Andrea
Perego, Raffaele - Abstract:
- Abstract: Neural Networks (NN), although successfully applied to several Artificial Intelligence tasks, are often unnecessarily over-parametrized. In edge/fog computing, this might make their training prohibitive on resource-constrained devices, contrasting with the current trend of decentralizing intelligence from remote data centres to local constrained devices. Therefore, we investigate the problem of training effective NN models on constrained devices having a fixed, potentially small, memory budget. We target techniques that are both resource-efficient and performance effective while enabling significant network compression. Our Dynamic Hard Pruning (DynHP) technique incrementally prunes the network during training, identifying neurons that marginally contribute to the model accuracy. DynHP enables a tunable size reduction of the final neural network and reduces the NN memory occupancy during training. Freed memory is reused by a dynamic batch sizing approach to counterbalance the accuracy degradation caused by the hard pruning strategy, improving its convergence and effectiveness. We assess the performance of DynHP through reproducible experiments on three public datasets, comparing them against reference competitors. Results show that DynHP compresses a NN up to 10 times without significant performance drops (up to 3.5% additional error w.r.t. the competitors), reducing up to 80% the training memory occupancy.
- Is Part Of:
- Journal of network and computer applications. Volume 200(2022)
- Journal:
- Journal of network and computer applications
- Issue:
- Volume 200(2022)
- Issue Display:
- Volume 200, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 200
- Issue:
- 2022
- Issue Sort Value:
- 2022-0200-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-04
- Subjects:
- Artificial neural networks -- Pruning -- Compression -- Resource-constrained devices
Microcomputers -- Periodicals
Computer networks -- Periodicals
Application software -- Periodicals
Micro-ordinateurs -- Périodiques
Réseaux d'ordinateurs -- Périodiques
Logiciels d'application -- Périodiques
Application software
Computer networks
Microcomputers
Periodicals
004.05
004 - Journal URLs:
- http://www.sciencedirect.com/science/journal/10848045 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.jnca.2021.103330 ↗
- Languages:
- English
- ISSNs:
- 1084-8045
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5021.410600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21074.xml