Mixed precision s‐step Lanczos and conjugate gradient algorithms. Issue 3 (23rd November 2021)
- Record Type:
- Journal Article
- Title:
- Mixed precision s‐step Lanczos and conjugate gradient algorithms. Issue 3 (23rd November 2021)
- Main Title:
- Mixed precision s‐step Lanczos and conjugate gradient algorithms
- Authors:
- Carson, Erin
Gergelits, Tomáš
Yamazaki, Ichitaro - Abstract:
- Abstract: Compared to the classical Lanczos algorithm, the s ‐step Lanczos variant has the potential to improve performance by asymptotically decreasing the synchronization cost per iteration. However, this comes at a price; despite being mathematically equivalent, the s ‐step variant may behave quite differently in finite precision, potentially exhibiting greater loss of accuracy and slower convergence relative to the classical algorithm. It has previously been shown that the errors in the s ‐step version follow the same structure as the errors in the classical algorithm, but are amplified by a factor depending on the square of the condition number of the O ( s ) ‐dimensional Krylov bases computed in each outer loop. As the condition number of these s ‐step bases grows (in some cases very quickly) with s, this limits the s values that can be chosen and thus can limit the attainable performance. In this work, we show that if a select few computations in s ‐step Lanczos are performed in double the working precision, the error terms then depend only linearly on the conditioning of the s ‐step bases. This has the potential for drastically improving the numerical behavior of the algorithm with little impact on per‐iteration performance. Our numerical experiments demonstrate the improved numerical behavior possible with the mixed precision approach, and also show that this improved behavior extends to mixed precision s ‐step CG. We present preliminary performance results onAbstract: Compared to the classical Lanczos algorithm, the s ‐step Lanczos variant has the potential to improve performance by asymptotically decreasing the synchronization cost per iteration. However, this comes at a price; despite being mathematically equivalent, the s ‐step variant may behave quite differently in finite precision, potentially exhibiting greater loss of accuracy and slower convergence relative to the classical algorithm. It has previously been shown that the errors in the s ‐step version follow the same structure as the errors in the classical algorithm, but are amplified by a factor depending on the square of the condition number of the O ( s ) ‐dimensional Krylov bases computed in each outer loop. As the condition number of these s ‐step bases grows (in some cases very quickly) with s, this limits the s values that can be chosen and thus can limit the attainable performance. In this work, we show that if a select few computations in s ‐step Lanczos are performed in double the working precision, the error terms then depend only linearly on the conditioning of the s ‐step bases. This has the potential for drastically improving the numerical behavior of the algorithm with little impact on per‐iteration performance. Our numerical experiments demonstrate the improved numerical behavior possible with the mixed precision approach, and also show that this improved behavior extends to mixed precision s ‐step CG. We present preliminary performance results on NVIDIA V100 GPUs that show that the overhead of extra precision is minimal if one uses precisions implemented in hardware. … (more)
- Is Part Of:
- Numerical linear algebra with applications. Volume 29:Issue 3(2022)
- Journal:
- Numerical linear algebra with applications
- Issue:
- Volume 29:Issue 3(2022)
- Issue Display:
- Volume 29, Issue 3 (2022)
- Year:
- 2022
- Volume:
- 29
- Issue:
- 3
- Issue Sort Value:
- 2022-0029-0003-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2021-11-23
- Subjects:
- avoiding communication -- error analysis -- finite precision -- Krylov subspace methods -- Lanczos -- mixed precision
Algebras, Linear -- Periodicals
512.5 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/nla.2425 ↗
- Languages:
- English
- ISSNs:
- 1070-5325
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 6184.692750
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21218.xml