A 3-way hybrid approach to generate a new high-quality chimpanzee reference genome (Pan_tro_3.0). Issue 11 (30th October 2017)
- Record Type:
- Journal Article
- Title:
- A 3-way hybrid approach to generate a new high-quality chimpanzee reference genome (Pan_tro_3.0). Issue 11 (30th October 2017)
- Main Title:
- A 3-way hybrid approach to generate a new high-quality chimpanzee reference genome (Pan_tro_3.0)
- Authors:
- Kuderna, Lukas F K
Tomlinson, Chad
Hillier, LaDeana W
Tran, Annabel
Fiddes, Ian T
Armstrong, Joel
Laayouni, Hafid
Gordon, David
Huddleston, John
Garcia Perez, Raquel
Povolotskaya, Inna
Serres Armero, Aitor
Gómez Garrido, Jèssica
Ho, Daniel
Ribeca, Paolo
Alioto, Tyler
Green, Richard E
Paten, Benedict
Navarro, Arcadi
Betranpetit, Jaume
Herrero, Javier
Eichler, Evan E
Sharp, Andrew J
Feuk, Lars
Warren, Wesley C
Marques-Bonet, Tomas - Abstract:
- Abstract: The chimpanzee is arguably the most important species for the study of human origins. A key resource for these studies is a high-quality reference genome assembly; however, as with most mammalian genomes, the current iteration of the chimpanzee reference genome assembly is highly fragmented. In the current iteration of the chimpanzee reference genome assembly (Pan_tro_2.1.4), the sequence is scattered across more then 183 000 contigs, incorporating more than 159 000 gaps, with a genome-wide contig N50 of 51 Kbp. In this work, we produce an extensive and diverse array of sequencing datasets to rapidly assemble a new chimpanzee reference that surpasses previous iterations in bases represented and organized in large scaffolds. To this end, we show substantial improvements over the current release of the chimpanzee genome (Pan_tro_2.1.4) by several metrics, such as increased contiguity by >750% and 300% on contigs and scaffolds, respectively, and closure of 77% of gaps in the Pan_tro_2.1.4 assembly gaps spanning >850 Kbp of the novel coding sequence based on RNASeq data. We further report more than 2700 genes that had putatively erroneous frame-shift predictions to human in Pan_tro_2.1.4 and show a substantial increase in the annotation of repetitive elements. We apply a simple 3-way hybrid approach to considerably improve the reference genome assembly for the chimpanzee, providing a valuable resource for the study of human origins. Furthermore, we produce extensiveAbstract: The chimpanzee is arguably the most important species for the study of human origins. A key resource for these studies is a high-quality reference genome assembly; however, as with most mammalian genomes, the current iteration of the chimpanzee reference genome assembly is highly fragmented. In the current iteration of the chimpanzee reference genome assembly (Pan_tro_2.1.4), the sequence is scattered across more then 183 000 contigs, incorporating more than 159 000 gaps, with a genome-wide contig N50 of 51 Kbp. In this work, we produce an extensive and diverse array of sequencing datasets to rapidly assemble a new chimpanzee reference that surpasses previous iterations in bases represented and organized in large scaffolds. To this end, we show substantial improvements over the current release of the chimpanzee genome (Pan_tro_2.1.4) by several metrics, such as increased contiguity by >750% and 300% on contigs and scaffolds, respectively, and closure of 77% of gaps in the Pan_tro_2.1.4 assembly gaps spanning >850 Kbp of the novel coding sequence based on RNASeq data. We further report more than 2700 genes that had putatively erroneous frame-shift predictions to human in Pan_tro_2.1.4 and show a substantial increase in the annotation of repetitive elements. We apply a simple 3-way hybrid approach to considerably improve the reference genome assembly for the chimpanzee, providing a valuable resource for the study of human origins. Furthermore, we produce extensive sequencing datasets that are all derived from the same cell line, generating a broad non-human benchmark dataset. … (more)
- Is Part Of:
- GigaScience. Volume 6:Issue 11(2017)
- Journal:
- GigaScience
- Issue:
- Volume 6:Issue 11(2017)
- Issue Display:
- Volume 6, Issue 11 (2017)
- Year:
- 2017
- Volume:
- 6
- Issue:
- 11
- Issue Sort Value:
- 2017-0006-0011-0000
- Page Start:
- Page End:
- Publication Date:
- 2017-10-30
- Subjects:
- chimpanzee reference genome -- assembly, genomics
Information storage and retrieval systems -- Research -- Periodicals
Biology -- Research -- Periodicals
Medical sciences -- Research -- Periodicals
Database management -- Periodicals
570.285 - Journal URLs:
- http://www.gigasciencejournal.com/ ↗
http://www.oxfordjournals.org/ ↗ - DOI:
- 10.1093/gigascience/gix098 ↗
- Languages:
- English
- ISSNs:
- 2047-217X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 25137.xml