Porting bioinformatics applications from grid to cloud: A macromolecular surface analysis application case study. (May 2017)
- Record Type:
- Journal Article
- Title:
- Porting bioinformatics applications from grid to cloud: A macromolecular surface analysis application case study. (May 2017)
- Main Title:
- Porting bioinformatics applications from grid to cloud: A macromolecular surface analysis application case study
- Authors:
- Merelli, Ivan
Cozzi, Paolo
Ronchieri, Elisabetta
Cesini, Daniele
D'Agostino, Daniele - Other Names:
- Aldinucci Marco guest-editor.
Brorsson Mats guest-editor.
D'Agostino Daniele guest-editor.
Daneshtalab Masoud guest-editor.
Kilpatrick Peter guest-editor.
Leppänen Ville guest-editor. - Abstract:
- In this paper we describe our experience in exploiting different cloud-based environments for an actual use case taken from the bioinformatics domain – the molecular surfaces analysis - that identifies similarities and possible complementarities in the protein surfaces. The analysis of macromolecular surfaces is important since protein surface conformations drive many biological reactions. We developed a workflow that performs the macromolecular surfaces analysis and provides interesting results from a scientific point of view. An important issue is represented by the fact that it is highly compute-intensive, therefore it cannot be run on a single CPU system for meaningful use cases and a parallel infrastructure is required to obtain reasonable execution time. For a decade grid infrastructures have represented suitable solutions to achieve cost effective computational power for Bioinformatics applications. However, these solutions do not offer an adequate customisation of the computational environment (e.g. installing databases and configuring virtual network) due to the rigid organisation of the storage and computational sites. Running applications on customised machines obtained by user-defined images simplifies the computing model, decreases the failure rates and therefore reduces waiting times for production analysis with respect to the canonical grid computations. For these reasons a cloud-based approach is more suitable than a pure grid paradigm. We experimented usingIn this paper we describe our experience in exploiting different cloud-based environments for an actual use case taken from the bioinformatics domain – the molecular surfaces analysis - that identifies similarities and possible complementarities in the protein surfaces. The analysis of macromolecular surfaces is important since protein surface conformations drive many biological reactions. We developed a workflow that performs the macromolecular surfaces analysis and provides interesting results from a scientific point of view. An important issue is represented by the fact that it is highly compute-intensive, therefore it cannot be run on a single CPU system for meaningful use cases and a parallel infrastructure is required to obtain reasonable execution time. For a decade grid infrastructures have represented suitable solutions to achieve cost effective computational power for Bioinformatics applications. However, these solutions do not offer an adequate customisation of the computational environment (e.g. installing databases and configuring virtual network) due to the rigid organisation of the storage and computational sites. Running applications on customised machines obtained by user-defined images simplifies the computing model, decreases the failure rates and therefore reduces waiting times for production analysis with respect to the canonical grid computations. For these reasons a cloud-based approach is more suitable than a pure grid paradigm. We experimented using two cloud-based approaches, based on the Worker Node On Demand Service and on OpenStack, to run the molecular surfaces analysis use case and we compared the results in terms of performance, efficiency and efforts to build the computing model with respect to grid computing. … (more)
- Is Part Of:
- International journal of high performance computing applications. Volume 31:Number 3(2017)
- Journal:
- International journal of high performance computing applications
- Issue:
- Volume 31:Number 3(2017)
- Issue Display:
- Volume 31, Issue 3 (2017)
- Year:
- 2017
- Volume:
- 31
- Issue:
- 3
- Issue Sort Value:
- 2017-0031-0003-0000
- Page Start:
- 182
- Page End:
- 195
- Publication Date:
- 2017-05
- Subjects:
- Grid computing -- cloud-like environment -- virtualisation -- macromolecular surface analysis -- bioinformatics -- structural biology
High performance computing -- Periodicals
Supercomputers -- Periodicals
004.1105 - Journal URLs:
- http://hpc.sagepub.com ↗
http://www.uk.sagepub.com/home.nav ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1177/1094342015588565 ↗
- Languages:
- English
- ISSNs:
- 1094-3420
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7509.xml