Computational reproducibility of scientific workflows at extreme scales. (September 2019)
- Record Type:
- Journal Article
- Title:
- Computational reproducibility of scientific workflows at extreme scales. (September 2019)
- Main Title:
- Computational reproducibility of scientific workflows at extreme scales
- Authors:
- Pouchard, Line
Baldwin, Sterling
Elsethagen, Todd
Jha, Shantenu
Raju, Bibi
Stephan, Eric
Tang, Li
Van Dam, Kerstin Kleese - Other Names:
- Mascagni Michael guest-editor.
- Abstract:
- We propose an approach for improved reproducibility that includes capturing and relating provenance characteristics and performance metrics. We discuss two use cases: scientific reproducibility of results in the Energy Exascale Earth System Model (E3SM—previously ACME) and performance reproducibility in molecular dynamics workflows on HPC platforms. To capture and persist the provenance and performance data of these workflows, we have designed and developed the Chimbuko and ProvEn frameworks. Chimbuko captures provenance and enables detailed single workflow performance analysis. ProvEn is a hybrid, queryable system for storing and analyzing the provenance and performance metrics of multiple runs in workflow performance analysis campaigns. Workflow provenance and performance data output from Chimbuko can be visualized in a dynamic, multilevel visualization providing overview and zoom-in capabilities for areas of interest. Provenance and related performance data ingested into ProvEn is queryable and can be used to reproduce runs. Our provenance-based approach highlights challenges in extracting information and gaps in the information collected. It is agnostic to the type of provenance data it captures so that both the reproducibility of scientific results and that of performance can be explored with our tools.
- Is Part Of:
- International journal of high performance computing applications. Volume 33:Number 5(2019)
- Journal:
- International journal of high performance computing applications
- Issue:
- Volume 33:Number 5(2019)
- Issue Display:
- Volume 33, Issue 5 (2019)
- Year:
- 2019
- Volume:
- 33
- Issue:
- 5
- Issue Sort Value:
- 2019-0033-0005-0000
- Page Start:
- 763
- Page End:
- 776
- Publication Date:
- 2019-09
- Subjects:
- Computational reproducibility -- scientific workflows -- provenance -- performance analysis -- ProvEn -- Chimbuko
High performance computing -- Periodicals
Supercomputers -- Periodicals
004.1105 - Journal URLs:
- http://hpc.sagepub.com ↗
http://www.uk.sagepub.com/home.nav ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1177/1094342019839124 ↗
- Languages:
- English
- ISSNs:
- 1094-3420
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 11071.xml