First Draft Assembly and Annotation of the Genome of a California Endemic Oak Quercus lobata Née (Fagaceae). Issue 11 (1st November 2016)
- Record Type:
- Journal Article
- Title:
- First Draft Assembly and Annotation of the Genome of a California Endemic Oak Quercus lobata Née (Fagaceae). Issue 11 (1st November 2016)
- Main Title:
- First Draft Assembly and Annotation of the Genome of a California Endemic Oak Quercus lobata Née (Fagaceae)
- Authors:
- Sork, Victoria L
Fitz-Gibbon, Sorel T
Puiu, Daniela
Crepeau, Marc
Gugger, Paul F
Sherman, Rachel
Stevens, Kristian
Langley, Charles H
Pellegrini, Matteo
Salzberg, Steven L - Abstract:
- Abstract: Oak represents a valuable natural resource across Northern Hemisphere ecosystems, attracting a large research community studying its genetics, ecology, conservation, and management. Here we introduce a draft genome assembly of valley oak ( Quercus lobata ) using Illumina sequencing of adult leaf tissue of a tree found in an accessible, well-studied, natural southern California population. Our assembly includes a nuclear genome and a complete chloroplast genome, along with annotation of encoded genes. The assembly contains 94, 394 scaffolds, totaling 1.17 Gb with 18, 512 scaffolds of length 2 kb or longer, with a total length of 1.15 Gb, and a N50 scaffold size of 278, 077 kb. The k -mer histograms indicate an diploid genome size of ∼720–730 Mb, which is smaller than the total length due to high heterozygosity, estimated at 1.25%. A comparison with a recently published European oak ( Q. robur ) nuclear sequence indicates 93% similarity. The Q. lobata chloroplast genome has 99% identity with another North American oak, Q. rubra . Preliminary annotation yielded an estimate of 61, 773 predicted protein-coding genes, of which 71% had similarity to known protein domains. We searched 956 Benchmarking Universal Single-Copy Orthologs, and found 863 complete orthologs, of which 450 were present in > 1 copy. We also examined an earlier version (v0.5) where duplicate haplotypes were removed to discover variants. These additional sources indicate that the predicted gene countAbstract: Oak represents a valuable natural resource across Northern Hemisphere ecosystems, attracting a large research community studying its genetics, ecology, conservation, and management. Here we introduce a draft genome assembly of valley oak ( Quercus lobata ) using Illumina sequencing of adult leaf tissue of a tree found in an accessible, well-studied, natural southern California population. Our assembly includes a nuclear genome and a complete chloroplast genome, along with annotation of encoded genes. The assembly contains 94, 394 scaffolds, totaling 1.17 Gb with 18, 512 scaffolds of length 2 kb or longer, with a total length of 1.15 Gb, and a N50 scaffold size of 278, 077 kb. The k -mer histograms indicate an diploid genome size of ∼720–730 Mb, which is smaller than the total length due to high heterozygosity, estimated at 1.25%. A comparison with a recently published European oak ( Q. robur ) nuclear sequence indicates 93% similarity. The Q. lobata chloroplast genome has 99% identity with another North American oak, Q. rubra . Preliminary annotation yielded an estimate of 61, 773 predicted protein-coding genes, of which 71% had similarity to known protein domains. We searched 956 Benchmarking Universal Single-Copy Orthologs, and found 863 complete orthologs, of which 450 were present in > 1 copy. We also examined an earlier version (v0.5) where duplicate haplotypes were removed to discover variants. These additional sources indicate that the predicted gene count in Version 1.0 is overestimated by 37–52%. Nonetheless, this first draft valley oak genome assembly represents a high-quality, well-annotated genome that provides a tool for forest restoration and management practices. … (more)
- Is Part Of:
- G3. Volume 6:Issue 11(2016)
- Journal:
- G3
- Issue:
- Volume 6:Issue 11(2016)
- Issue Display:
- Volume 6, Issue 11 (2016)
- Year:
- 2016
- Volume:
- 6
- Issue:
- 11
- Issue Sort Value:
- 2016-0006-0011-0000
- Page Start:
- 3485
- Page End:
- 3495
- Publication Date:
- 2016-11-01
- Subjects:
- adaptation -- annotation -- chloroplast -- nuclear genome assembly -- Quercus -- GenPred -- Shared Data Resources -- Genomic Selection
Genetics -- Research -- Periodicals
Genomics -- Periodicals
Genetics
Genomics
Genes
Genetics -- Research
Genomics
Electronic journals
Periodical
Periodicals
Fulltext
Internet Resources
Periodicals
572.8 - Journal URLs:
- https://academic.oup.com/g3journal ↗
http://bibpurl.oclc.org/web/43467 ↗
http://www.g3journal.org ↗
http://www.oxfordjournals.org/ ↗ - DOI:
- 10.1534/g3.116.030411 ↗
- Languages:
- English
- ISSNs:
- 2160-1836
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23599.xml