Error recovery mechanism for grid-based workflow within SLA context. (15th November 2007)
- Record Type:
- Journal Article
- Title:
- Error recovery mechanism for grid-based workflow within SLA context. (15th November 2007)
- Main Title:
- Error recovery mechanism for grid-based workflow within SLA context
- Authors:
- Quan, Dang Minh
- Abstract:
- Service Level Agreements (SLAs) serve as a foundation for reliable and predictable job execution at remote grid sites. In this paper, we describe an error recovery mechanism for workflows within the SLA context that copes with catastrophic failure, in which one or several High Performance Computing Centers (HPCCs) are detached from the grid system. We propose an algorithm to detect all affected sub-jobs when the error occurs and an algorithm to remap those sub-jobs to the remaining healthy HPCCs while optimising the makespan. The experimental results show that our mechanism finds a higher-quality solution in less time than existing methods.
- Is Part Of:
- International journal of high performance computing and networking. Volume 5: Number 1/2 (2007)
- Journal:
- International journal of high performance computing and networking
- Issue:
- Volume 5: Number 1/2 (2007)
- Issue Display:
- Volume 5, Issue 1/2 (2007)
- Year:
- 2007
- Volume:
- 5
- Issue:
- 1/2
- Page Start:
- 110
- Page End:
- 121
- Publication Date:
- 2007-11-15
- Subjects:
- grid computing -- service level agreements -- SLA -- error recovery -- workflow -- mapping -- high performance computing
High performance computing -- Periodicals
Computer networks -- Periodicals
High performance computing
Periodicals
004.05
- Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijhpcn
http://www.metapress.com/openurl.asp?genre=journal&issn=1740-0562
http://www.inderscience.com/
- Languages:
- English
- ISSNs:
- 1740-0562
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 8667.xml