Visual programming for next-generation sequencing data analytics. Issue 1 (December 2016)
- Record Type:
- Journal Article
- Title:
- Visual programming for next-generation sequencing data analytics. Issue 1 (December 2016)
- Main Title:
- Visual programming for next-generation sequencing data analytics
- Authors:
- Milicchio, Franco
Rose, Rebecca
Bian, Jiang
Min, Jae
Prosperi, Mattia - Abstract:
- Abstract Background High-throughput or next-generation sequencing (NGS) technologies have become an established and affordable experimental framework in biological and medical sciences for all basic and translational research. Processing and analyzing NGS data is challenging. NGS data are big, heterogeneous, sparse, and error prone. Although a plethora of tools for NGS data analysis has emerged in the past decade, (i) software development is still lagging behind data generation capabilities, and (ii) there is a 'cultural' gap between the end user and the developer. Text Generic software template libraries specifically developed for NGS can help in dealing with the former problem, whilst coupling template libraries with visual programming may help with the latter. Here we scrutinize the state-of-the-art low-level software libraries implemented specifically for NGS and graphical tools for NGS analytics. An ideal developing environment for NGS should be modular (with a native library interface), scalable in computational methods (i.e. serial, multithread, distributed), transparent (platform-independent), interoperable (with external software interface), and usable (via an intuitive graphical user interface). These characteristics should facilitate both the run of standardized NGS pipelines and the development of new workflows based on technological advancements or users' needs. We discuss in detail the potential of a computational framework blending generic template programmingAbstract Background High-throughput or next-generation sequencing (NGS) technologies have become an established and affordable experimental framework in biological and medical sciences for all basic and translational research. Processing and analyzing NGS data is challenging. NGS data are big, heterogeneous, sparse, and error prone. Although a plethora of tools for NGS data analysis has emerged in the past decade, (i) software development is still lagging behind data generation capabilities, and (ii) there is a 'cultural' gap between the end user and the developer. Text Generic software template libraries specifically developed for NGS can help in dealing with the former problem, whilst coupling template libraries with visual programming may help with the latter. Here we scrutinize the state-of-the-art low-level software libraries implemented specifically for NGS and graphical tools for NGS analytics. An ideal developing environment for NGS should be modular (with a native library interface), scalable in computational methods (i.e. serial, multithread, distributed), transparent (platform-independent), interoperable (with external software interface), and usable (via an intuitive graphical user interface). These characteristics should facilitate both the run of standardized NGS pipelines and the development of new workflows based on technological advancements or users' needs. We discuss in detail the potential of a computational framework blending generic template programming and visual programming that addresses all of the current limitations. Conclusion In the long term, a proper, well-developed (although not necessarily unique) software framework will bridge the current gap between data generation and hypothesis testing. This will eventually facilitate the development of novel diagnostic tools embedded in routine healthcare. … (more)
- Is Part Of:
- Biodata mining. Volume 9:Issue 1(2016)
- Journal:
- Biodata mining
- Issue:
- Volume 9:Issue 1(2016)
- Issue Display:
- Volume 9, Issue 1 (2016)
- Year:
- 2016
- Volume:
- 9
- Issue:
- 1
- Issue Sort Value:
- 2016-0009-0001-0000
- Page Start:
- 1
- Page End:
- 17
- Publication Date:
- 2016-12
- Subjects:
- Next-generation sequencing -- High-throughput sequencing -- Big data -- Template library -- Generic programming -- Visual programming -- Graphical user interface -- Software suite
Bioinformatics -- Periodicals
Computational biology -- Periodicals
Data mining -- Periodicals
570.285 - Journal URLs:
- http://www.biodatamining.org/ ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s13040-016-0095-3 ↗
- Languages:
- English
- ISSNs:
- 1756-0381
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 9879.xml