From Wet‐Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing. Issue 12 (26th September 2016)
- Record Type:
- Journal Article
- Title:
- From Wet‐Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing. Issue 12 (26th September 2016)
- Main Title:
- From Wet‐Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing
- Authors:
- Laurie, Steve
Fernandez‐Callejo, Marcos
Marco‐Sola, Santiago
Trotta, Jean‐Remi
Camps, Jordi
Chacón, Alejandro
Espinosa, Antonio
Gut, Marta
Gut, Ivo
Heath, Simon
Beltran, Sergi - Abstract:
- Abstract : As Whole Genome sequencing becomes cheaper and faster it will progressively substitute targeted sequencing. However, computing cost‐performance ratio is not advancing at an equivalent rate. We have benchmarked 6 combinations of state‐of‐the‐art read aligners and variant callers on WGS and WES data. We report differences in speed of up to 20 times in some processes and have observed that SNV, and to a lesser extent InDel, detection is highly consistent only in 70% of the genome. ABSTRACT: As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next‐generation sequencing as standard practice in research and diagnostics. However, computing cost–performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state‐of‐the‐art read aligners (BWA‐MEM and GEM3) and variant callers (FreeBayes, GATK HaplotypeCaller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20%Abstract : As Whole Genome sequencing becomes cheaper and faster it will progressively substitute targeted sequencing. However, computing cost‐performance ratio is not advancing at an equivalent rate. We have benchmarked 6 combinations of state‐of‐the‐art read aligners and variant callers on WGS and WES data. We report differences in speed of up to 20 times in some processes and have observed that SNV, and to a lesser extent InDel, detection is highly consistent only in 70% of the genome. ABSTRACT: As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next‐generation sequencing as standard practice in research and diagnostics. However, computing cost–performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state‐of‐the‐art read aligners (BWA‐MEM and GEM3) and variant callers (FreeBayes, GATK HaplotypeCaller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available. … (more)
- Is Part Of:
- Human mutation. Volume 37:Issue 12(2016)
- Journal:
- Human mutation
- Issue:
- Volume 37:Issue 12(2016)
- Issue Display:
- Volume 37, Issue 12 (2016)
- Year:
- 2016
- Volume:
- 37
- Issue:
- 12
- Issue Sort Value:
- 2016-0037-0012-0000
- Page Start:
- 1263
- Page End:
- 1271
- Publication Date:
- 2016-09-26
- Subjects:
- whole genome sequencing -- whole exome sequencing -- NGS -- NA12878 -- alignment -- variant calling -- bioinformatics -- computing speed -- benchmark
Human chromosome abnormalities -- Periodicals
Mutation (Biology) -- Periodicals
616.04205 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1098-1004 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/humu.23114 ↗
- Languages:
- English
- ISSNs:
- 1059-7794
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4336.217000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1782.xml