Cost‐efficient and network‐aware dynamic repartitioning‐based algorithms for scheduling large‐scale graphs in cloud computing environments. (26th July 2018)
- Record Type:
- Journal Article
- Title:
- Cost‐efficient and network‐aware dynamic repartitioning‐based algorithms for scheduling large‐scale graphs in cloud computing environments. (26th July 2018)
- Main Title:
- Cost‐efficient and network‐aware dynamic repartitioning‐based algorithms for scheduling large‐scale graphs in cloud computing environments
- Authors:
- Heidari, Safiollah
Buyya, Rajkumar
- Abstract:
- Summary: Large amounts of data generated by Internet and enterprise applications are stored in the form of graphs. Graph processing systems are broadly used in enterprises to process such data. With the rapid growth in mobile and social applications and complicated connections of Internet websites, massive concurrent operations need to be handled. On the other hand, the intrinsic structure and the size of real‐world graphs make distributed processing of graphs more challenging. Distributed graph analytics frameworks should offer balanced communication and computation, low preprocessing overhead, low memory footprint, and scalability. Moreover, the effects of network factors such as bandwidth and traffic, the monetary cost of processing such large‐scale graphs, and the mutual impact of these elements have been less studied. To address these issues, we proposed two dynamic repartitioning algorithms that consider the network factors affecting public cloud environments in order to decrease the monetary cost of processing. A new classification of graph algorithms and processing is also introduced, which will be used to choose the best strategy for processing at any operation. We plugged these algorithms into our extended graph processing system (iGiraph) and compared them with those supported in other graph processing systems such as Giraph and Surfer on the Australian National Cloud Infrastructure. We observed that our algorithms achieve up to 30% faster execution time, up to 50% network traffic decline, and more than 50% cost reduction compared to a framework such as the popular Giraph.
- Is Part Of:
- Software, practice & experience. Volume 48:Number 12(2018)
- Journal:
- Software, practice & experience
- Issue:
- Volume 48:Number 12(2018)
- Issue Display:
- Volume 48, Issue 12 (2018)
- Year:
- 2018
- Volume:
- 48
- Issue:
- 12
- Issue Sort Value:
- 2018-0048-0012-0000
- Page Start:
- 2174
- Page End:
- 2192
- Publication Date:
- 2018-07-26
- Subjects:
- cloud computing -- cost saving -- graph processing -- network‐aware processing
Computer software -- Periodicals
Computer programming -- Periodicals
Computer programs -- Periodicals
005.3
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/spe.2623
- Languages:
- English
- ISSNs:
- 0038-0644
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 8321.453000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 8482.xml