A comparison of the Cray XMT and XMT‐2. (9th August 2012)
- Record Type:
- Journal Article
- Title:
- A comparison of the Cray XMT and XMT‐2. (9th August 2012)
- Main Title:
- A comparison of the Cray XMT and XMT‐2
- Authors:
- Bokhari, Shahid H.
Bokhari, Saniyah S. - Abstract:
- <abstract abstract-type="main" id="cpe2909-abs-0001"> <title>SUMMARY</title> <p id="cpe2909-para-0001">We explore the comparative performance of the Cray XMT and XMT‐2 massively multithreaded supercomputers. We use benchmarks to evaluate memory accesses for various types of loops. We also compare the performance of these machines on matrix multiply and on three previously implemented dynamic programming algorithms. It is shown that the relative performance of these machines is dependent on the size (number of processors) of the configuration, as well as the size of the problem being evaluated. In particular, small configurations of the original XMT can sometimes show slightly better performance than larger configurations of the XMT‐2, for the same problem size. We note that, under heavy memory load, performance of loops can saturate well before the maximum number of processors available. This suggests that it may not always be useful to use the maximum number of processors for a specific run. We also show that manual restructuring of nested loops, including <italic>decreasing</italic> the parallelism, can result in major improvements in performance. The results in this paper indicate that careful exploration of the space of problem sizes, number of processors used, and choices of loop parallelization can yield substantial improvements in performance. These improvements can be very significant for production codes that run for extended periods of time. Copyright © 2012 John<abstract abstract-type="main" id="cpe2909-abs-0001"> <title>SUMMARY</title> <p id="cpe2909-para-0001">We explore the comparative performance of the Cray XMT and XMT‐2 massively multithreaded supercomputers. We use benchmarks to evaluate memory accesses for various types of loops. We also compare the performance of these machines on matrix multiply and on three previously implemented dynamic programming algorithms. It is shown that the relative performance of these machines is dependent on the size (number of processors) of the configuration, as well as the size of the problem being evaluated. In particular, small configurations of the original XMT can sometimes show slightly better performance than larger configurations of the XMT‐2, for the same problem size. We note that, under heavy memory load, performance of loops can saturate well before the maximum number of processors available. This suggests that it may not always be useful to use the maximum number of processors for a specific run. We also show that manual restructuring of nested loops, including <italic>decreasing</italic> the parallelism, can result in major improvements in performance. The results in this paper indicate that careful exploration of the space of problem sizes, number of processors used, and choices of loop parallelization can yield substantial improvements in performance. These improvements can be very significant for production codes that run for extended periods of time. Copyright © 2012 John Wiley &amp; Sons, Ltd.</p> </abstract> … (more)
- Is Part Of:
- Concurrency and computation. Volume 25:Number 15(2013:Oct.)
- Journal:
- Concurrency and computation
- Issue:
- Volume 25:Number 15(2013:Oct.)
- Issue Display:
- Volume 25, Issue 15 (2013)
- Year:
- 2013
- Volume:
- 25
- Issue:
- 15
- Issue Sort Value:
- 2013-0025-0015-0000
- Page Start:
- 2123
- Page End:
- 2139
- Publication Date:
- 2012-08-09
- Subjects:
- Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cpe.2909 ↗
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 4109.xml