Discovery of large genomic inversions using long range information. (December 2017)
- Record Type:
- Journal Article
- Title:
- Discovery of large genomic inversions using long range information. (December 2017)
- Main Title:
- Discovery of large genomic inversions using long range information
- Authors:
- Eslami Rasekh, Marzieh
Chiatante, Giorgia
Miroballo, Mattia
Tang, Joyce
Ventura, Mario
Amemiya, Chris
Eichler, Evan
Antonacci, Francesca
Alkan, Can - Abstract:
- Abstract Background Although many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due to the fact that breakpoints of such events typically lie within segmental duplications or common repeats, which reduces the mappability of short reads. The algorithms developed within the 1000 Genomes Project to identify inversions are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies. Results Here we propose a novel algorithm, Valor, to discover large inversions using new sequencing methods that provide long range information such as 10X Genomics linked-read sequencing, pooled clone sequencing, or other similar technologies that we commonly refer to aslong range sequencing . We demonstrate the utility ofValor using both pooled clone sequencing and 10X Genomics linked-read sequencing generated from the genome of an individual from the HapMap project (NA12878). We also provide a comprehensive comparison ofValor against several state-of-the-art structural variation discovery algorithms that use whole genome shotgun sequencing data. Conclusions In this paper, we show thatValor is able to accurately discover all previously identified and experimentally validated large inversions in the same genome with a low false discovery rate.Abstract Background Although many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due to the fact that breakpoints of such events typically lie within segmental duplications or common repeats, which reduces the mappability of short reads. The algorithms developed within the 1000 Genomes Project to identify inversions are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies. Results Here we propose a novel algorithm, Valor, to discover large inversions using new sequencing methods that provide long range information such as 10X Genomics linked-read sequencing, pooled clone sequencing, or other similar technologies that we commonly refer to aslong range sequencing . We demonstrate the utility ofValor using both pooled clone sequencing and 10X Genomics linked-read sequencing generated from the genome of an individual from the HapMap project (NA12878). We also provide a comprehensive comparison ofValor against several state-of-the-art structural variation discovery algorithms that use whole genome shotgun sequencing data. Conclusions In this paper, we show thatValor is able to accurately discover all previously identified and experimentally validated large inversions in the same genome with a low false discovery rate. UsingValor, we also predicted a novel inversion, which we validated using fluorescent in situ hybridization. Valor is available athttps://github.com/BilkentCompGen/Valor … (more)
- Is Part Of:
- BMC genomics. Volume 18:Number 1(2017)
- Journal:
- BMC genomics
- Issue:
- Volume 18:Number 1(2017)
- Issue Display:
- Volume 18, Issue 1 (2017)
- Year:
- 2017
- Volume:
- 18
- Issue:
- 1
- Issue Sort Value:
- 2017-0018-0001-0000
- Page Start:
- 1
- Page End:
- 12
- Publication Date:
- 2017-12
- Subjects:
- Structural variation -- Long range sequencing -- Linked-reads -- Inversion -- Read clouds
Genomes -- Periodicals
Gene mapping -- Periodicals
Genomics -- Periodicals
Base Sequence -- Periodicals
Chromosome Mapping -- Periodicals
Genetic Techniques -- Periodicals
Sequence Analysis, DNA -- Periodicals
572.8605 - Journal URLs:
- http://www.biomedcentral.com/bmcgenomics/ ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?journal=32 ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s12864-016-3444-1 ↗
- Languages:
- English
- ISSNs:
- 1471-2164
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 9991.xml