A Comparison of One-Rate and Two-Rate Inference Frameworks for Site-Specific dN/dS Estimation. Issue 2 (1st October 2016)
- Record Type:
- Journal Article
- Title:
- A Comparison of One-Rate and Two-Rate Inference Frameworks for Site-Specific dN/dS Estimation. Issue 2 (1st October 2016)
- Main Title:
- A Comparison of One-Rate and Two-Rate Inference Frameworks for Site-Specific dN/dS Estimation
- Authors:
- Spielman, Stephanie J
- Wan, Suyang
- Wilke, Claus O
- Abstract:
- Two broad paradigms exist for inferring dN/dS, the ratio of nonsynonymous to synonymous substitution rates, from coding sequences: (i) a one-rate approach, where dN/dS is represented with a single parameter, or (ii) a two-rate approach, where dN and dS are estimated separately. The performances of these two approaches have been well studied in the specific context of proper model specification, i.e., when the inference model matches the simulation model. By contrast, the relative performances of one-rate vs. two-rate parameterizations when applied to data generated according to a different mechanism remain unclear. Here, we compare the relative merits of one-rate and two-rate approaches in the specific context of model misspecification by simulating alignments with mutation–selection models rather than with dN/dS-based models. We find that one-rate frameworks generally infer more accurate dN/dS point estimates, even when dS varies among sites. In other words, modeling dS variation may substantially reduce accuracy of dN/dS point estimates. These results appear to depend on the selective constraint operating at a given site. For sites under strong purifying selection (dN/dS ≲ 0.3), one-rate and two-rate models show comparable performances. However, one-rate models significantly outperform two-rate models for sites under moderate-to-weak purifying selection. We attribute this distinction to the fact that, for these more quickly evolving sites, a given substitution is more likely to be nonsynonymous than synonymous. The data will therefore be relatively enriched for nonsynonymous changes, and modeling dS contributes excessive noise to dN/dS estimates. We additionally find that high levels of divergence among sequences, rather than the number of sequences in the alignment, are more critical for obtaining precise point estimates.
- Is Part Of:
- Genetics. Volume 204:Issue 2(2016)
- Journal:
- Genetics
- Issue:
- Volume 204:Issue 2(2016)
- Issue Display:
- Volume 204, Issue 2 (2016)
- Year:
- 2016
- Volume:
- 204
- Issue:
- 2
- Issue Sort Value:
- 2016-0204-0002-0000
- Page Start:
- 499
- Page End:
- 511
- Publication Date:
- 2016-10-01
- Subjects:
- dN/dS -- mutation–selection models -- evolutionary rate -- sequence simulation -- molecular evolution
- Genetics -- Periodicals
- 576.5
- Journal URLs:
- http://www.oxfordjournals.org/
- DOI:
- 10.1534/genetics.115.185264
- Languages:
- English
- ISSNs:
- 0016-6731
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
- British Library HMNTS - ELD Digital store
- Ingest File:
- 25203.xml