Stochastic distributed learning with gradient quantization and double-variance reduction. (2nd January 2023)
- Record Type:
- Journal Article
- Title:
- Stochastic distributed learning with gradient quantization and double-variance reduction. (2nd January 2023)
- Main Title:
- Stochastic distributed learning with gradient quantization and double-variance reduction
- Authors:
- Horváth, Samuel
Kovalev, Dmitry
Mishchenko, Konstantin
Richtárik, Peter
Stich, Sebastian - Abstract:
- ABSTRACT: We consider distributed optimization over several devices, each sending incremental model updates to a central server. This setting is considered, for instance, in federated learning. Various schemes have been designed to compress the model updates in order to reduce the overall communication cost. However, existing methods suffer from a significant slowdown due to additional variance ω > 0 coming from the compression operator and as a result, only converge sublinearly. What is needed is a variance reduction technique for taming the variance introduced by compression. We propose the first methods that achieve linear convergence for arbitrary compression operators. For strongly convex functions with condition number κ, distributed among n machines with a finite-sum structure, each worker having less than m components, we also (i) give analysis for the weakly convex and the non-convex cases and (ii) verify in experiments that our novel variance reduced schemes are more efficient than the baselines. Moreover, we show theoretically that as the number of devices increases, higher compression levels are possible without this affecting the overall number of communications in comparison with methods that do not perform any compression. This leads to a significant reduction in communication cost. Our general analysis allows to pick the most suitable compression for each problem, finding the right balance between additional variance and communication savings. Finally, weABSTRACT: We consider distributed optimization over several devices, each sending incremental model updates to a central server. This setting is considered, for instance, in federated learning. Various schemes have been designed to compress the model updates in order to reduce the overall communication cost. However, existing methods suffer from a significant slowdown due to additional variance ω > 0 coming from the compression operator and as a result, only converge sublinearly. What is needed is a variance reduction technique for taming the variance introduced by compression. We propose the first methods that achieve linear convergence for arbitrary compression operators. For strongly convex functions with condition number κ, distributed among n machines with a finite-sum structure, each worker having less than m components, we also (i) give analysis for the weakly convex and the non-convex cases and (ii) verify in experiments that our novel variance reduced schemes are more efficient than the baselines. Moreover, we show theoretically that as the number of devices increases, higher compression levels are possible without this affecting the overall number of communications in comparison with methods that do not perform any compression. This leads to a significant reduction in communication cost. Our general analysis allows to pick the most suitable compression for each problem, finding the right balance between additional variance and communication savings. Finally, we also (iii) give analysis for arbitrary quantized updates. … (more)
- Is Part Of:
- Optimization methods and software. Volume 38:Number 1(2023)
- Journal:
- Optimization methods and software
- Issue:
- Volume 38:Number 1(2023)
- Issue Display:
- Volume 38, Issue 1 (2023)
- Year:
- 2023
- Volume:
- 38
- Issue:
- 1
- Issue Sort Value:
- 2023-0038-0001-0000
- Page Start:
- 91
- Page End:
- 106
- Publication Date:
- 2023-01-02
- Subjects:
- Distributed optimization -- federated learning -- stochastic optimization -- communication compression -- variance reduction -- gradient methods
90C06 -- 90C15
Mathematical optimization -- Periodicals
Algorithms -- Periodicals
519.7 - Journal URLs:
- http://www.tandfonline.com/toc/goms20/current ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/10556788.2022.2117355 ↗
- Languages:
- English
- ISSNs:
- 1055-6788
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6275.120000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 26062.xml