Cloud‐based parallel solution for estimating statistical significance of megabyte‐scale DNA sequences‡. (12th November 2012)
- Record Type:
- Journal Article
- Title:
- Cloud‐based parallel solution for estimating statistical significance of megabyte‐scale DNA sequences‡. (12th November 2012)
- Main Title:
- Cloud‐based parallel solution for estimating statistical significance of megabyte‐scale DNA sequences‡
- Authors:
- Hosny, Ahmad M.
Shedeed, Howida A.
Hussein, Ashraf S.
Tolba, Mohamed F. - Abstract:
- <abstract abstract-type="main"> <title>Abstract</title> <p>Confidence in a pairwise local sequence alignment is a fundamental problem in bioinformatics. For huge DNA sequences, this problem is highly compute‐intensive because it involves evaluating hundreds of local alignments to construct an empirical score distribution. Recent parallel solutions support only kilobyte‐scale sequence sizes and/or are based on sophisticated infrastructures that are not available for most of the research labs. This paper presents an efficient parallel solution for evaluating the statistical significance for a pair of huge DNA sequences using cloud infrastructures. This solution can receive requests from various researchers via web‐portal and allocate resources according to their demand. In this way, the benefits of cloud‐based services can be achieved. The fundamental innovation of this research work is proposing an efficient solution that utilizes both shared and distributed memory architectures via cloud technology to enhance the performance of evaluating the statistical significance for pair of DNA sequences. Therefore, the restriction on the sequence sizes is released to be in megabyte‐scale, which was not supported before for the statistical significance problem. The performance evaluation of the proposed solution was carried out on Microsoft's cloud and compared with the existing parallel solutions. The results show that the processing speed outperforms the recent cluster solutions that<abstract abstract-type="main"> <title>Abstract</title> <p>Confidence in a pairwise local sequence alignment is a fundamental problem in bioinformatics. For huge DNA sequences, this problem is highly compute‐intensive because it involves evaluating hundreds of local alignments to construct an empirical score distribution. Recent parallel solutions support only kilobyte‐scale sequence sizes and/or are based on sophisticated infrastructures that are not available for most of the research labs. This paper presents an efficient parallel solution for evaluating the statistical significance for a pair of huge DNA sequences using cloud infrastructures. This solution can receive requests from various researchers via web‐portal and allocate resources according to their demand. In this way, the benefits of cloud‐based services can be achieved. The fundamental innovation of this research work is proposing an efficient solution that utilizes both shared and distributed memory architectures via cloud technology to enhance the performance of evaluating the statistical significance for pair of DNA sequences. Therefore, the restriction on the sequence sizes is released to be in megabyte‐scale, which was not supported before for the statistical significance problem. The performance evaluation of the proposed solution was carried out on Microsoft's cloud and compared with the existing parallel solutions. The results show that the processing speed outperforms the recent cluster solutions that target the same problem. In addition, the performance metrics exhibit linear behavior for the addressed number of instances. Copyright © 2012 John Wiley &amp; Sons, Ltd.</p> </abstract> … (more)
- Is Part Of:
- Concurrency and computation. Volume 26:Number 1(2014:Jan.)
- Journal:
- Concurrency and computation
- Issue:
- Volume 26:Number 1(2014:Jan.)
- Issue Display:
- Volume 26, Issue 1 (2014)
- Year:
- 2014
- Volume:
- 26
- Issue:
- 1
- Issue Sort Value:
- 2014-0026-0001-0000
- Page Start:
- 118
- Page End:
- 133
- Publication Date:
- 2012-11-12
- Subjects:
- Parallel processing (Electronic computers) -- Periodicals
Parallel computers -- Periodicals
004.35 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/cpe.2953 ↗
- Languages:
- English
- ISSNs:
- 1532-0626
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3405.622000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 3898.xml