Comparing de novo and reference-based transcriptome assembly strategies by applying them to the blood-sucking bug Rhodnius prolixus. (February 2016)
- Record Type:
- Journal Article
- Title:
- Comparing de novo and reference-based transcriptome assembly strategies by applying them to the blood-sucking bug Rhodnius prolixus. (February 2016)
- Main Title:
- Comparing de novo and reference-based transcriptome assembly strategies by applying them to the blood-sucking bug Rhodnius prolixus
- Authors:
- Marchant, A.
Mougel, F.
Mendonça, V.
Quartier, M.
Jacquin-Joly, E.
da Rosa, J.A.
Petit, E.
Harry, M. - Abstract:
- Abstract: High Throughput Sequencing capabilities have made the process of assembling a transcriptome easier, whether or not there is a reference genome. But the quality of a transcriptome assembly must be good enough to capture the most comprehensive catalog of transcripts and their variations, and to carry out further experiments on transcriptomics. There is currently no consensus on which of the many sequencing technologies and assembly tools are the most effective. Many non-model organisms lack a reference genome to guide the transcriptome assembly. One question, therefore, is whether or not a reference-based genome assembly gives better results than de novo assembly. The blood-sucking insect Rhodnius prolixus —a vector for Chagas disease—has a reference genome. It is therefore a good model on which to compare reference-based and de novo transcriptome assemblies. In this study, we compared de novo and reference-based genome assembly strategies using three datasets (454, Illumina, 454 combined with Illumina) and various assembly software. We developed criteria to compare the resulting assemblies: the size distribution and number of transcripts, the proportion of potentially chimeric transcripts, how complete the assembly was (completeness evaluated both through CEGMA software and R. prolixus proteome fraction retrieved). Moreover, we looked for the presence of two chemosensory gene families (Odorant-Binding Proteins and Chemosensory Proteins) to validate the assemblyAbstract: High Throughput Sequencing capabilities have made the process of assembling a transcriptome easier, whether or not there is a reference genome. But the quality of a transcriptome assembly must be good enough to capture the most comprehensive catalog of transcripts and their variations, and to carry out further experiments on transcriptomics. There is currently no consensus on which of the many sequencing technologies and assembly tools are the most effective. Many non-model organisms lack a reference genome to guide the transcriptome assembly. One question, therefore, is whether or not a reference-based genome assembly gives better results than de novo assembly. The blood-sucking insect Rhodnius prolixus —a vector for Chagas disease—has a reference genome. It is therefore a good model on which to compare reference-based and de novo transcriptome assemblies. In this study, we compared de novo and reference-based genome assembly strategies using three datasets (454, Illumina, 454 combined with Illumina) and various assembly software. We developed criteria to compare the resulting assemblies: the size distribution and number of transcripts, the proportion of potentially chimeric transcripts, how complete the assembly was (completeness evaluated both through CEGMA software and R. prolixus proteome fraction retrieved). Moreover, we looked for the presence of two chemosensory gene families (Odorant-Binding Proteins and Chemosensory Proteins) to validate the assembly quality. The reference-based assemblies after genome annotation were clearly better than those generated using de novo strategies alone. Reference-based strategies revealed new transcripts, including new isoforms unpredicted by automatic genome annotation. However, a combination of both de novo and reference-based strategies gave the best result, and allowed us to assemble fragmented transcripts. Graphical abstract: Highlights: We developed criteria to compare de novo vs referenced-based transcriptome assemblies. We validate the assembly quality using the search of two chemosensory gene families. We validate that reference-based assembly strategy is clearly better than de novo one. But the combined strategies gave the best result to assemble fragmented transcripts. Reference-based strategies revealed new transcripts, including new isoforms. … (more)
- Is Part Of:
- Insect biochemistry and molecular biology. Volume 69(2016:Feb.)
- Journal:
- Insect biochemistry and molecular biology
- Issue:
- Volume 69(2016:Feb.)
- Issue Display:
- Volume 69 (2016)
- Year:
- 2016
- Volume:
- 69
- Issue Sort Value:
- 2016-0069-0000-0000
- Page Start:
- 25
- Page End:
- 33
- Publication Date:
- 2016-02
- Subjects:
- Rhodnius prolixus -- Chagas disease vectors -- Reference transcriptome -- de novo assembly -- Referenced-based assembly -- OBP -- CSP
Insect biochemistry -- Periodicals
Insects -- Physiology -- Periodicals
Insects -- Molecular aspects -- Periodicals
Biochemistry -- Periodicals
Insectes -- Biochimie -- Périodiques
Insectes -- Composition -- Périodiques
Insectes -- Physiologie -- Périodiques
Insectes -- Aspect moléculaire -- Périodiques
Biochimie -- Périodiques
Biochemistry
Insect biochemistry
Insects -- Molecular aspects
Insects -- Physiology
Periodicals
572.8157 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09651748 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.ibmb.2015.05.009 ↗
- Languages:
- English
- ISSNs:
- 0965-1748
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4516.852000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 882.xml