EDISON‐DATA: A flexible and extensible platform for processing and analysis of computational science data. (7th August 2019)
- Record Type:
- Journal Article
- Title:
- EDISON‐DATA: A flexible and extensible platform for processing and analysis of computational science data. (7th August 2019)
- Main Title:
- EDISON‐DATA: A flexible and extensible platform for processing and analysis of computational science data
- Authors:
- Ahn, Sunil
Lee, Jeongcheol
Kim, Jaesung
Lee, JongSuk R. - Abstract:
- Summary: With the recent emergence of new paradigm, ie, open science and big data, the need for data sharing and collaboration is becoming important in the computational science field as well. The EDISON‐DATA platform aims to provide services that computational simulation data can easily published, preserved, shared, reused, discovered, and analyzed. First, this paper analyzed computational science platform‐related issues, obtained during the development of the EDISON‐DATA platform, regarding the sharing and reusing of the computational science data. These issues include data complexity, diversity, reliability, heterogeneity, etc. To solve the above issues and support data analysis in an efficient and integrated manner, this study proposes various ideas used in the EDISON‐DATA platform. First, we suggested an automated preprocessing framework to handle the complexity of computational science data. Second, to solve the diversity issue, we presented ways to develop preprocessing logic and data presentation logic customized for each data type. Third, to improve the reliability of computational science data, some quality control and provenance management techniques were presented. Fourth, we proposed a way to manage related data in groups. Fifth, to solve data heterogeneity problem and to analyze data in an integrated way, we let the preprocessing framework to use controlled vocabularies to express descriptive metadata. Lastly, we demonstrated feasibility and usability of theSummary: With the recent emergence of new paradigm, ie, open science and big data, the need for data sharing and collaboration is becoming important in the computational science field as well. The EDISON‐DATA platform aims to provide services that computational simulation data can easily published, preserved, shared, reused, discovered, and analyzed. First, this paper analyzed computational science platform‐related issues, obtained during the development of the EDISON‐DATA platform, regarding the sharing and reusing of the computational science data. These issues include data complexity, diversity, reliability, heterogeneity, etc. To solve the above issues and support data analysis in an efficient and integrated manner, this study proposes various ideas used in the EDISON‐DATA platform. First, we suggested an automated preprocessing framework to handle the complexity of computational science data. Second, to solve the diversity issue, we presented ways to develop preprocessing logic and data presentation logic customized for each data type. Third, to improve the reliability of computational science data, some quality control and provenance management techniques were presented. Fourth, we proposed a way to manage related data in groups. Fifth, to solve data heterogeneity problem and to analyze data in an integrated way, we let the preprocessing framework to use controlled vocabularies to express descriptive metadata. Lastly, we demonstrated feasibility and usability of the proposed ideas in this paper by presenting a case study of building a research portal service in the materials field based on the EDISON‐DATA platform. … (more)
- Is Part Of:
- Software, practice & experience. Volume 49:Number 10(2019)
- Journal:
- Software, practice & experience
- Issue:
- Volume 49:Number 10(2019)
- Issue Display:
- Volume 49, Issue 10 (2019)
- Year:
- 2019
- Volume:
- 49
- Issue:
- 10
- Issue Sort Value:
- 2019-0049-0010-0000
- Page Start:
- 1509
- Page End:
- 1530
- Publication Date:
- 2019-08-07
- Subjects:
- analysis -- computational science data -- EDISON -- flexible -- platform
Computer software -- Periodicals
Computer programming -- Periodicals
Computer programs -- Periodicals
005.3 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/spe.2732 ↗
- Languages:
- English
- ISSNs:
- 0038-0644
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8321.453000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 11527.xml