Content area
Abstract
One of the most popular measures in the analysis of protein sequence evolution is the ratio of nonsynonymous distance (dN) to synonymous distance (dS). Under the assumption that synonymous substitutions in the coding region are selectively neutral, the dN/dS ratio can be used to statistically detect the adaptive evolution (or purifying selection) if dN/dS > 1 (or dN/dS < 1) significantly. However, due to strong structural constraints and/or variable functional constraints imposed on amino acid sites, most encoding genes in most species have demonstrated dN/dS < 1. Consequently, the statistical power for testing dN/dS = 1 may be insufficient to distinguish between different selection modes. In this paper, we propose a more powerful test, called dN/dS-H, in which a new parameter H, a relative measure of rate variation among sites, was introduced. Given the condition of strong purifying selections at some sites, the dN/dS-H model predicts dN/dS = 1-H for neutral evolution, dN/dS < 1-H for nearly neutral selection, and dN/dS > 1-H for adaptive evolution. The potential of this new method for resolving the neutral-adaptive debates is illustrated by the protein sequence evolution in vertebrates, Drosophila and yeasts, as well as somatic cancer evolution (specialized as the CN/CS-H test).
Details
1 Iowa State University, The Laurence H. Baker Center in Bioinformatics on Biological Statistics, Ames, USA (GRID:grid.34421.30) (ISNI:0000 0004 1936 7312); Iowa State University, Department of Genetics, Development and Cell Biology, Ames, USA (GRID:grid.34421.30) (ISNI:0000 0004 1936 7312); Iowa State University, Program of Ecological and Evolutionary Biology, Ames, USA (GRID:grid.34421.30) (ISNI:0000 0004 1936 7312)





