Adaptive Remus: adaptive checkpointing for Xen-based virtual machine replication. Issue 4 (4th July 2017)
- Record Type:
- Journal Article
- Title:
- Adaptive Remus: adaptive checkpointing for Xen-based virtual machine replication. Issue 4 (4th July 2017)
- Main Title:
- Adaptive Remus: adaptive checkpointing for Xen-based virtual machine replication
- Authors:
- da Silva, Marcelo Pereira
Obelheiro, Rafael Rodrigues
Koslovski, Guilherme Piegas - Abstract:
- Abstract : With the ever increasing dependence on computers and networks, many systems are required to be continuously available in order to fulfil their mission. Virtualization technology enables high availability to be offered in a convenient, cost-effective manner: with the encapsulation provided by virtual machines (VMs), entire systems can be replicated transparently in software, obviating the need for expensive fault-tolerant hardware. Remus is a VM replication mechanism for the Xen hypervisor that provides high availability despite crash failures. Replication is performed by checkpointing the VM at fixed intervals. However, there is an antagonism between processing and communication regarding the optimal checkpoint interval: while longer intervals benefit processor-intensive applications, shorter intervals favour network-intensive applications. Thus, any chosen interval may not always be suitable for the hosted applications, limiting Remus usage in many scenarios. This work introduces Adaptive Remus, a proposal for adaptive checkpointing in Remus that dynamically adjusts the replication frequency according to the characteristics of running applications. Experimental results indicate that our proposal improves performance for applications that require both processing and communication, without harming applications that use only one type of resource. Graphical Abstract: Adaptive Remus quantifies VM metrics to infer the current hosted application load. With thisAbstract : With the ever increasing dependence on computers and networks, many systems are required to be continuously available in order to fulfil their mission. Virtualization technology enables high availability to be offered in a convenient, cost-effective manner: with the encapsulation provided by virtual machines (VMs), entire systems can be replicated transparently in software, obviating the need for expensive fault-tolerant hardware. Remus is a VM replication mechanism for the Xen hypervisor that provides high availability despite crash failures. Replication is performed by checkpointing the VM at fixed intervals. However, there is an antagonism between processing and communication regarding the optimal checkpoint interval: while longer intervals benefit processor-intensive applications, shorter intervals favour network-intensive applications. Thus, any chosen interval may not always be suitable for the hosted applications, limiting Remus usage in many scenarios. This work introduces Adaptive Remus, a proposal for adaptive checkpointing in Remus that dynamically adjusts the replication frequency according to the characteristics of running applications. Experimental results indicate that our proposal improves performance for applications that require both processing and communication, without harming applications that use only one type of resource. Graphical Abstract: Adaptive Remus quantifies VM metrics to infer the current hosted application load. With this information, the mechanism adjusts the checkpointing frequency between two modes. (I) networking mode: increases the checkpointing frequency whenever output traffic is detected on the VM interface; and (II) processing mode: when there is no output traffic in the VM interface, the mechanism reduces the checkpointing frequency, increasing the VM execution time. This approach improves application performance by dynamically adapting the checkpoint interval. … (more)
- Is Part Of:
- International journal of parallel, emergent and distributed systems. Volume 32:Issue 4(2017)
- Journal:
- International journal of parallel, emergent and distributed systems
- Issue:
- Volume 32:Issue 4(2017)
- Issue Display:
- Volume 32, Issue 4 (2017)
- Year:
- 2017
- Volume:
- 32
- Issue:
- 4
- Issue Sort Value:
- 2017-0032-0004-0000
- Page Start:
- 348
- Page End:
- 367
- Publication Date:
- 2017-07-04
- Subjects:
- Replication -- Xen -- virtual machines -- primary-backup -- checkpointing -- Remus
Parallel computers -- Periodicals
Electronic data processing -- Distributed processing -- Periodicals
Computer algorithms -- Periodicals
004.35 - Journal URLs:
- http://www.tandfonline.com/toc/gpaa20/current ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/17445760.2016.1162302 ↗
- Languages:
- English
- ISSNs:
- 1744-5760
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.441300
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 1344.xml