Using adaptive resource allocation to implement an elastic MapReduce framework. (1st April 2016)
- Record Type:
- Journal Article
- Title:
- Using adaptive resource allocation to implement an elastic MapReduce framework. (1st April 2016)
- Main Title:
- Using adaptive resource allocation to implement an elastic MapReduce framework
- Authors:
- Zhao, Jiaqi
Xue, Changlong
Tao, Xinlin
Zhang, Shugong
Tao, Jie - Other Names:
- Ranjan Rajiv guestEditor.
Wang Lizhe guestEditor.
Jayaraman Prem Prakash guestEditor.
Mitra Karan guestEditor.
Georgakopoulos Dimitrios guestEditor. - Abstract:
- Summary: Today, we are observing a transition of science paradigms from the computational science to data‐intensive science. With the exponential increase of input and intermediate data, more applications are developed using the MapReduce programming model, which is regarded as an appropriate programming model for analysing large data sets. A MapReduce framework runs its applications on a cluster, where the computing capacity allocated to the applications is limited and may not fill their runtime resource demand. In this case, the Map/Reduce tasks have to wait in a queues, and the applications suffer from a poor performance. This work develops an autonomic resource manager within the Hadoop MapReduce framework. The manager is capable of getting aware of the overloading or under‐loading situations with the resources allocated to its user community. For the former, it takes an action of requesting more resources from, for example, the batch system of a High Performance Computing (HPC) cluster or Computing Clouds and integrates the additional resources, in case of acquisition, into the Hadoop MapReduce runtime. For the latter, the manager gives the free resources back to its source. We extended the existing Hadoop MapReduce resource manager to implement the proposed strategy and validated the concept on an HPC cluster with standard benchmark applications. Experimental results show a significant performance gain, for example, an up to 45% improvement in execution time forSummary: Today, we are observing a transition of science paradigms from the computational science to data‐intensive science. With the exponential increase of input and intermediate data, more applications are developed using the MapReduce programming model, which is regarded as an appropriate programming model for analysing large data sets. A MapReduce framework runs its applications on a cluster, where the computing capacity allocated to the applications is limited and may not fill their runtime resource demand. In this case, the Map/Reduce tasks have to wait in a queues, and the applications suffer from a poor performance. This work develops an autonomic resource manager within the Hadoop MapReduce framework. The manager is capable of getting aware of the overloading or under‐loading situations with the resources allocated to its user community. For the former, it takes an action of requesting more resources from, for example, the batch system of a High Performance Computing (HPC) cluster or Computing Clouds and integrates the additional resources, in case of acquisition, into the Hadoop MapReduce runtime. For the latter, the manager gives the free resources back to its source. We extended the existing Hadoop MapReduce resource manager to implement the proposed strategy and validated the concept on an HPC cluster with standard benchmark applications. Experimental results show a significant performance gain, for example, an up to 45% improvement in execution time for running multiple applications. Copyright © 2016 John Wiley & Sons, Ltd. … (more)
- Is Part Of:
- Software, practice & experience. Volume 47:Number 3(2017)
- Journal:
- Software, practice & experience
- Issue:
- Volume 47:Number 3(2017)
- Issue Display:
- Volume 47, Issue 3 (2017)
- Year:
- 2017
- Volume:
- 47
- Issue:
- 3
- Issue Sort Value:
- 2017-0047-0003-0000
- Page Start:
- 349
- Page End:
- 360
- Publication Date:
- 2016-04-01
- Subjects:
- adaptive resource management -- elastic computing -- cluster computing -- big data -- Hadoop MapReduce framework
Computer software -- Periodicals
Computer programming -- Periodicals
Computer programs -- Periodicals
005.3 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/spe.2398 ↗
- Languages:
- English
- ISSNs:
- 0038-0644
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8321.453000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 1666.xml