Dynamic group communication for large-scale parallel data mining. (September 2013)
- Record Type:
- Journal Article
- Title:
- Dynamic group communication for large-scale parallel data mining. (September 2013)
- Main Title:
- Dynamic group communication for large-scale parallel data mining
- Authors:
- Katti, Amogh
Di Fatta, Giuseppe - Other Names:
- Fortino Giancarlo guest-editor.
- Abstract:
- Exascale systems are the next frontier in high-performance computing and are expected to deliver a performance of the order of 10 18 operations per second using massive multicore processors. Very large- and extreme-scale parallel systems pose critical algorithmic challenges, especially related to concurrency, locality and the need to avoid global communication patterns. This work investigates a novel protocol for dynamic group communication that can be used to remove the global communication requirement and to reduce the communication cost in parallel formulations of iterative data mining algorithms. The protocol is used to provide a communication-efficient parallel formulation of the k-means algorithm for cluster analysis. The approach is based on a collective communication operation for dynamic groups of processes and exploits non-uniform data distributions. Non-uniform data distributions can be either found in real-world distributed applications or induced by means of multidimensional binary search trees. The analysis of the proposed dynamic group communication protocol has shown that it does not introduce significant communication overhead. The parallel clustering algorithm has also been extended to accommodate an approximation error, which allows a further reduction of the communication costs. The effectiveness of the exact and approximate methods has been tested in a parallel computing system with 64 processors and in simulations with 1024 processing elements.
- Is Part Of:
- Concurrent engineering, research and applications. Volume 21:Number 3(2013)
- Journal:
- Concurrent engineering, research and applications
- Issue:
- Volume 21:Number 3(2013)
- Issue Display:
- Volume 21, Issue 3 (2013)
- Year:
- 2013
- Volume:
- 21
- Issue:
- 3
- Issue Sort Value:
- 2013-0021-0003-0000
- Page Start:
- 227
- Page End:
- 234
- Publication Date:
- 2013-09
- Subjects:
- Extreme-scale computing -- dynamic group communication -- parallel data mining -- clustering -- k-means
Production engineering -- Periodicals
Concurrent engineering -- Periodicals
621.39 - Journal URLs:
- http://cer.sagepub.com/ ↗
http://www.uk.sagepub.com/home.nav ↗
http://firstsearch.oclc.org ↗
http://firstsearch.oclc.org/journal=1063-293x;screen=info;ECOIP ↗ - DOI:
- 10.1177/1063293X13495551 ↗
- Languages:
- English
- ISSNs:
- 1063-293X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24585.xml