Back to Search Start Over

Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability.

Authors :
Frank Mannino
Sadie Wisotsky
Sergei L Kosakovsky Pond
Spencer V Muse
Source :
PLoS ONE, Vol 15, Iss 3, p e0229493 (2020)
Publication Year :
2020
Publisher :
Public Library of Science (PLoS), 2020.

Abstract

It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K and the mean of the distribution. This bound can introduce biases into statistical inference, especially when estimating parameters governing site-to-site variability of substitution rates. Applications to two large collections of sequence alignments demonstrate that this upper bound is often reached in analyses of real data. When parameter estimation is of primary interest, additional rate categories or more flexible modeling methods should be considered.

Subjects

Subjects :
Medicine
Science

Details

Language :
English
ISSN :
19326203
Volume :
15
Issue :
3
Database :
Directory of Open Access Journals
Journal :
PLoS ONE
Publication Type :
Academic Journal
Accession number :
edsdoj.919fc1c881f447fa098d308f7d75ebd
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pone.0229493