BSP cost and scalability analysis for MapReduce operations. (7th October 2015)
- Record Type:
- Journal Article
- Title:
- BSP cost and scalability analysis for MapReduce operations. (7th October 2015)
- Main Title:
- BSP cost and scalability analysis for MapReduce operations
- Authors:
- Senger, Hermes
Gil‐Costa, Veronica
Arantes, Luciana
Marcondes, Cesar A. C.
Marín, Mauricio
Sato, Liria M.
da Silva, Fabrício A.B.
- Other Names:
- Silla Federico guestEditor.
Fröning Holger guestEditor.
Senger Hermes guestEditor.
Geyer Claudio guestEditor.
- Abstract:
- Summary: Data abundance poses the need for powerful and easy‐to‐use tools that support processing large amounts of data. MapReduce has been increasingly adopted for over a decade by many companies, and more recently, it has attracted the attention of an increasing number of researchers in several areas. One main advantage is that the complex details of parallel processing, such as complex network programming, task scheduling, data placement, and fault tolerance, are hidden in a conceptually simple framework. MapReduce is supported by mature software technologies for deployment in data centers such as Hadoop. As MapReduce becomes popular for high‐performance applications, many questions arise concerning its performance and efficiency. In this paper, we formally demonstrate lower bounds on the isoefficiency function for MapReduce applications, when these applications can be modeled as BSP jobs. We also demonstrate how communication and synchronization costs can be dominant for MapReduce computations and discuss the conditions under which such scalability limits are valid. To our knowledge, this is the first study that demonstrates scalability bounds for MapReduce applications. We also discuss how some MapReduce implementations such as Hadoop can mitigate such costs to approach linear, or near‐linear, speedups. Copyright © 2015 John Wiley & Sons, Ltd.
- Is Part Of:
- Concurrency and computation. Volume 28:Number 8(2016)
- Journal:
- Concurrency and computation
- Issue:
- Volume 28:Number 8(2016)
- Issue Display:
- Volume 28, Issue 8 (2016)
- Year:
- 2016
- Volume:
- 28
- Issue:
- 8
- Issue Sort Value:
- 2016-0028-0008-0000
- Page Start:
- 2503
- Page End:
- 2527
- Publication Date:
- 2015-10-07
- Subjects:
- MapReduce -- Hadoop -- scalability -- BSP
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/cpe.3628
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 954.xml