Thermal‐aware task assignments in high performance computing clusters. (2nd August 2017)
- Record Type:
- Journal Article
- Title:
- Thermal‐aware task assignments in high performance computing clusters. (2nd August 2017)
- Main Title:
- Thermal‐aware task assignments in high performance computing clusters
- Authors:
- Taneja, Shubbhi
Kulkarni, Sanjay
Zhou, Yi
Qin, Xiao - Abstract:
- Summary: Cluster‐level thermal management has gained much attention over the past decade due to rising cooling costs associated with data centers. In this research, we propose and implement a static scheduler called SSched and a dynamic one named DSched. These 2 algorithms schedule jobs based on CPU and disk temperatures of a Hadoop cluster's nodes. Our schedulers rely on a monitoring mechanism to keep track of CPU and disk utilization, maintaining CPU and disk temperatures below a threshold through thermal‐aware scheduling decisions. To facilitate the design of SSched and DSched, we classify jobs into the CPU‐intensive and disk‐intensive categories. When a job arrives, SSched retrieves the utilization stats from a profiled log, estimates the thermal behavior, and places the job on NodeManager to minimize thermal impacts. Unlike SSched, DSched improves thermal efficiency of Hadoop clusters through dynamic load balancing. DSched keeps track of the coolest and hottest nodes in the cluster; tasks are migrated from hot nodes into cool ones if any hot spot is detected. To evaluate the effectiveness of our schedulers, we keep track of average CPU and disk temperatures in a node, managing an optimal outlet temperature across a cluster. We demonstrate that compared with the traditional Hadoop scheduler, SSched and DSched achieve approximately 15% savings in terms of cooling cost with little performance overhead.
- Is Part Of:
- Concurrency and computation. Volume 29:Number 18(2017)
- Journal:
- Concurrency and computation
- Issue:
- Volume 29:Number 18(2017)
- Issue Display:
- Volume 29, Issue 18 (2017)
- Year:
- 2017
- Volume:
- 29
- Issue:
- 18
- Issue Sort Value:
- 2017-0029-0018-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2017-08-02
- Subjects:
- benchmarking -- CPU‐intensive -- data centers -- disk‐intensive job -- Hadoop -- MapReduce -- task assignment -- thermal‐aware job scheduler -- thermal profiling
Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cpe.4206 ↗
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 8272.xml