Stochastic batch size for adaptive regularization in deep network optimization. (September 2022)
- Record Type:
- Journal Article
- Title:
- Stochastic batch size for adaptive regularization in deep network optimization. (September 2022)
- Main Title:
- Stochastic batch size for adaptive regularization in deep network optimization
- Authors:
- Nakamura, Kensuke
Soatto, Stefano
Hong, Byung-Woo
- Abstract:
- Highlights: Adaptive regularization for deep network optimization via parameter-wise batch size. The stochastic batch size reflects local and global properties of each parameter. Beneficial for practical studies where the number of training examples is small.
Abstract: We propose a first-order stochastic optimization algorithm incorporating adaptive regularization for pattern recognition problems in a deep learning framework. The adaptive regularization is imposed by a stochastic process that determines the batch size for each model parameter at each optimization iteration. The stochastic batch size is determined by the update probability of each parameter, following a distribution of gradient norms that takes into account their local and global properties in the neural network architecture, where the range of gradient norms may vary within and across layers. We empirically demonstrate the effectiveness of our algorithm on an image classification task, using conventional network models applied to commonly used benchmark datasets. The quantitative evaluation indicates that our algorithm outperforms state-of-the-art optimization algorithms in generalization while being less sensitive to the selection of batch size, which often plays a critical role in optimization, thus achieving more robustness to the selection of regularity.
- Is Part Of:
- Pattern recognition. Volume 129(2022)
- Journal:
- Pattern recognition
- Issue:
- Volume 129(2022)
- Issue Display:
- Volume 129, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 129
- Issue:
- 2022
- Issue Sort Value:
- 2022-0129-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-09
- Subjects:
- Deep network optimization -- Adaptive regularization -- Stochastic gradient descent -- Adaptive mini-batch size
00-01 -- 99-00
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4
- Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203
http://www.sciencedirect.com/
- DOI:
- 10.1016/j.patcog.2022.108776
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 22275.xml