Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability

It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K...
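
The bound described in the abstract can be illustrated numerically. By an elementary argument (not quoted from this record), K non-negative rates that each carry weight 1/K and have mean m cannot have variance larger than (K - 1) * m^2; the maximum is attained when one rate equals K * m and the rest are zero. The sketch below is only an illustration under stated assumptions: the continuous distribution is taken to be a mean-one gamma with shape alpha, and the categories are approximated by sorting Monte Carlo draws and averaging equal-sized bins. The function names are hypothetical and are not taken from the article or from any phylogenetics package.

```python
import random


def variance_upper_bound(k, mean=1.0):
    """Largest variance attainable by k equiprobable non-negative rates with this mean."""
    return (k - 1) * mean ** 2


def equiprobable_variance(rates):
    """Variance of a discrete distribution that puts weight 1/K on each rate."""
    k = len(rates)
    mean = sum(rates) / k
    return sum((r - mean) ** 2 for r in rates) / k


def discretize_by_sampling(alpha, k, n_per_bin=20_000, seed=0):
    """Approximate K equiprobable category rates for a mean-one gamma (an assumption).

    Draws are sorted and split into k equal-sized bins; each category rate is
    the average of one bin, a Monte Carlo stand-in for the category mean.
    """
    rng = random.Random(seed)
    draws = sorted(rng.gammavariate(alpha, 1.0 / alpha) for _ in range(k * n_per_bin))
    return [sum(draws[i * n_per_bin:(i + 1) * n_per_bin]) / n_per_bin for i in range(k)]


if __name__ == "__main__":
    k = 4
    for alpha in (2.0, 0.5, 0.05):
        rates = discretize_by_sampling(alpha, k)
        mean = sum(rates) / k
        print(
            f"alpha={alpha}: continuous variance={1.0 / alpha:.2f}, "
            f"discretized variance={equiprobable_variance(rates):.3f}, "
            f"bound={variance_upper_bound(k, mean):.3f}"
        )
```

For small alpha the continuous gamma variance, 1/alpha, exceeds the bound for K = 4, so the discretized variance cannot follow it however the category rates are chosen. This is the kind of underestimation of rate variability that the title refers to.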

Bibliographic Details
Main Authors: Mannino, Frank; Wisotsky, Sadie; Kosakovsky Pond, Sergei L.; Muse, Spencer V.
Format: Online Article Text
Language: English
Published: Public Library of Science 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7051046/
https://www.ncbi.nlm.nih.gov/pubmed/32119689
http://dx.doi.org/10.1371/journal.pone.0229493
_version_ 1783502701983694848
author Mannino, Frank
Wisotsky, Sadie
Kosakovsky Pond, Sergei L.
Muse, Spencer V.
author_facet Mannino, Frank
Wisotsky, Sadie
Kosakovsky Pond, Sergei L.
Muse, Spencer V.
author_sort Mannino, Frank
collection PubMed
description It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K and the mean of the distribution. This bound can introduce biases into statistical inference, especially when estimating parameters governing site-to-site variability of substitution rates. Applications to two large collections of sequence alignments demonstrate that this upper bound is often reached in analyses of real data. When parameter estimation is of primary interest, additional rate categories or more flexible modeling methods should be considered.
format Online
Article
Text
id pubmed-7051046
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70510462020-03-12 Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability Mannino, Frank Wisotsky, Sadie Kosakovsky Pond, Sergei L. Muse, Spencer V. PLoS One Research Article It is standard practice to model site-to-site variability of substitution rates by discretizing a continuous distribution into a small number, K, of equiprobable rate categories. We demonstrate that the variance of this discretized distribution has an upper bound determined solely by the choice of K and the mean of the distribution. This bound can introduce biases into statistical inference, especially when estimating parameters governing site-to-site variability of substitution rates. Applications to two large collections of sequence alignments demonstrate that this upper bound is often reached in analyses of real data. When parameter estimation is of primary interest, additional rate categories or more flexible modeling methods should be considered. Public Library of Science 2020-03-02 /pmc/articles/PMC7051046/ /pubmed/32119689 http://dx.doi.org/10.1371/journal.pone.0229493 Text en © 2020 Mannino et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mannino, Frank
Wisotsky, Sadie
Kosakovsky Pond, Sergei L.
Muse, Spencer V.
Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability
title Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability
title_full Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability
title_fullStr Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability
title_full_unstemmed Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability
title_short Equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability
title_sort equiprobable discrete models of site-specific substitution rates underestimate the extent of rate variability
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7051046/
https://www.ncbi.nlm.nih.gov/pubmed/32119689
http://dx.doi.org/10.1371/journal.pone.0229493
work_keys_str_mv AT manninofrank equiprobablediscretemodelsofsitespecificsubstitutionratesunderestimatetheextentofratevariability
AT wisotskysadie equiprobablediscretemodelsofsitespecificsubstitutionratesunderestimatetheextentofratevariability
AT kosakovskypondsergeil equiprobablediscretemodelsofsitespecificsubstitutionratesunderestimatetheextentofratevariability
AT musespencerv equiprobablediscretemodelsofsitespecificsubstitutionratesunderestimatetheextentofratevariability