A comparative study of k-spectrum-based error correction methods for next-generation sequencing data analysis. (July 2016)

Record Type:: Journal Article
Title:: A comparative study of k-spectrum-based error correction methods for next-generation sequencing data analysis. (July 2016)
Main Title:: A comparative study of k-spectrum-based error correction methods for next-generation sequencing data analysis
Authors:: Akogwu, Isaac
Wang, Nan
Zhang, Chaoyang
Gong, Ping
Abstract:: Background: Advances in high-throughput next-generation sequencing (NGS) have opened innumerable opportunities for new genomic research. However, the abundance of NGS data makes it difficult to distinguish true biological variants from sequencing errors during downstream analysis. Many error correction methods have been developed to correct erroneous NGS reads before further analysis, but an independent evaluation of how dataset features such as read length, genome size, and coverage depth affect their performance is lacking. This comparative study investigates the strengths, weaknesses, and limitations of some of the newest k-spectrum-based methods and provides recommendations to help users select suitable methods for specific NGS datasets. Methods: Six k-spectrum-based methods, i.e., Reptile, Musket, Bless, Bloocoo, Lighter, and Trowel, were compared using six simulated sets of paired-end Illumina sequencing data. These NGS datasets varied in coverage depth (10× to 120×), read length (36 to 100 bp), and genome size (4.6 to 143 MB). The Error Correction Evaluation Toolkit (ECET) was employed to derive a suite of metrics (i.e., true positives, false positives, false negatives, recall, precision, gain, and F-score) for assessing the correction quality of each method. Results: Computational experiments indicate that Musket had the best overall performance across the spectra of examined variants …
Is Part Of:: Human genomics. Volume 10(2016)Supplement 2
Journal:: Human genomics
Issue:: Volume 10(2016)Supplement 2
Issue Display:: Volume 10, Issue 2 (2016)
Year:: 2016
Volume:: 10
Issue:: 2
Issue Sort Value:: 2016-0010-0002-0000
Page Start:: 49
Page End:: 59
Publication Date:: 2016-07
Subjects:: Next-generation sequencing (NGS) -- k-mer -- k-spectrum -- Error correction -- Sequence analysis -- Bloom filter
Genomics -- Periodicals
Human genome -- Periodicals
Genetic Research -- Periodicals
Pharmacogenetics -- Periodicals
611.01816
Journal URLs:: http://www.henrystewart.com/human_genomics/
http://www.humgenomics.com/
http://link.springer.com/
DOI:: 10.1186/s40246-016-0068-0
Languages:: English
ISSNs:: 1479-7364
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms)
Physical Locations:: British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 10512.xml