Lessons for livestock genomics from genome and transcriptome sequencing in cattle and other mammals. Issue 1 (December 2016)
- Record Type:
- Journal Article
- Title:
- Lessons for livestock genomics from genome and transcriptome sequencing in cattle and other mammals. Issue 1 (December 2016)
- Main Title:
- Lessons for livestock genomics from genome and transcriptome sequencing in cattle and other mammals
- Authors:
- Taylor, Jeremy
Whitacre, Lynsey
Hoff, Jesse
Tizioto, Polyana
Kim, JaeWoo
Decker, Jared
Schnabel, Robert - Abstract:
- Abstract Background Decreasing sequencing costs and development of new protocols for characterizing global methylation, gene expression patterns and regulatory regions have stimulated the generation of large livestock datasets. Here, we discuss experiences in the analysis of whole-genome and transcriptome sequence data. Methods We analyzed whole-genome sequence (WGS) data from 132 individuals from five canid species (Canis familiaris, C. latrans, C. dingo, C. aureus andC. lupus ) and 61 breeds, three bison (Bison bison ), 64 water buffalo (Bubalus bubalis ) and 297 bovines from 17 breeds. By individual, data vary in extent of reference genome depth of coverage from 4.9X to 64.0X. We have also analyzed RNA-seq data for 580 samples representing 159Bos taurus andRattus norvegicus animals and 98 tissues. By aligning reads to a reference assembly and calling variants, we assessed effects of average depth of coverage on the actual coverage and on the number of called variants. We examined the identity of unmapped reads by assembling them and querying produced contigs against the non-redundant nucleic acids database. By imputing high-density single nucleotide polymorphism data on 4010 US registered Angus animals to WGS using Run4 of the 1000 Bull Genomes Project and assessing the accuracy of imputation, we identified misassembled reference sequence regions. Results We estimate that a 24X depth of coverage is required to achieve 99.5 % coverage of the reference assembly and identifyAbstract Background Decreasing sequencing costs and development of new protocols for characterizing global methylation, gene expression patterns and regulatory regions have stimulated the generation of large livestock datasets. Here, we discuss experiences in the analysis of whole-genome and transcriptome sequence data. Methods We analyzed whole-genome sequence (WGS) data from 132 individuals from five canid species (Canis familiaris, C. latrans, C. dingo, C. aureus andC. lupus ) and 61 breeds, three bison (Bison bison ), 64 water buffalo (Bubalus bubalis ) and 297 bovines from 17 breeds. By individual, data vary in extent of reference genome depth of coverage from 4.9X to 64.0X. We have also analyzed RNA-seq data for 580 samples representing 159Bos taurus andRattus norvegicus animals and 98 tissues. By aligning reads to a reference assembly and calling variants, we assessed effects of average depth of coverage on the actual coverage and on the number of called variants. We examined the identity of unmapped reads by assembling them and querying produced contigs against the non-redundant nucleic acids database. By imputing high-density single nucleotide polymorphism data on 4010 US registered Angus animals to WGS using Run4 of the 1000 Bull Genomes Project and assessing the accuracy of imputation, we identified misassembled reference sequence regions. Results We estimate that a 24X depth of coverage is required to achieve 99.5 % coverage of the reference assembly and identify 95 % of the variants within an individual's genome. Genomes sequenced to low average coverage (e.g., <10X) may fail to cover 10 % of the reference genome and identify <75 % of variants. About 10 % of genomic DNA or transcriptome sequence reads fail to align to the reference assembly. These reads include loci missing from the reference assembly and misassembled genes and interesting symbionts, commensal and pathogenic organisms. Conclusions Assembly errors and a lack of annotation of functional elements significantly limit the utility of the current draft livestock reference assemblies. The Functional Annotation of Animal Genomes initiative seeks to annotate functional elements, while a 70X Pac-Bio assembly for cow is underway and may result in a significantly improved reference assembly. … (more)
- Is Part Of:
- Genetics, selection, evolution. Volume 48:Issue 1(2016)
- Journal:
- Genetics, selection, evolution
- Issue:
- Volume 48:Issue 1(2016)
- Issue Display:
- Volume 48, Issue 1 (2016)
- Year:
- 2016
- Volume:
- 48
- Issue:
- 1
- Issue Sort Value:
- 2016-0048-0001-0000
- Page Start:
- 1
- Page End:
- 18
- Publication Date:
- 2016-12
- Subjects:
- Livestock -- Breeding -- Periodicals
Animal genetics -- Periodicals
Livestock -- Genetics -- Periodicals
Evolution -- Periodicals
576.505 - Journal URLs:
- http://www.edpsciences.com/docinfos/INRA-GENETICS/ ↗
http://www.gsejournal.org/ ↗
http://www.pubmedcentral.nih.gov/tocrender.fcgi?action=archive&journal=847 ↗
http://link.springer.com/ ↗ - DOI:
- 10.1186/s12711-016-0237-6 ↗
- Languages:
- English
- ISSNs:
- 1297-9686
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 10184.xml