Cargando…
Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps
We prove that maximum likelihood phylogenetic inference is consistent on gapped multiple sequence alignments (MSAs) as long as substitution rates across each edge are greater than zero, under mild assumptions on the structure of the alignment. Under these assumptions, maximum likelihood will asympto...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4748752/ https://www.ncbi.nlm.nih.gov/pubmed/26615177 http://dx.doi.org/10.1093/sysbio/syv089 |
_version_ | 1782415176352399360 |
---|---|
author | Truszkowski, Jakub Goldman, Nick |
author_facet | Truszkowski, Jakub Goldman, Nick |
author_sort | Truszkowski, Jakub |
collection | PubMed |
description | We prove that maximum likelihood phylogenetic inference is consistent on gapped multiple sequence alignments (MSAs) as long as substitution rates across each edge are greater than zero, under mild assumptions on the structure of the alignment. Under these assumptions, maximum likelihood will asymptotically recover the tree with edge lengths corresponding to the mean number of substitutions per site on each edge. This refutes Warnow's recent suggestion (Warnow 2012) that maximum likelihood phylogenetic inference might be statistically inconsistent when gaps are treated as missing data, even if the MSA is correct. We also derive a simple new proof of maximum likelihood consistency of ungapped alignments. |
format | Online Article Text |
id | pubmed-4748752 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-47487522016-02-11 Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps Truszkowski, Jakub Goldman, Nick Syst Biol Regular Articles We prove that maximum likelihood phylogenetic inference is consistent on gapped multiple sequence alignments (MSAs) as long as substitution rates across each edge are greater than zero, under mild assumptions on the structure of the alignment. Under these assumptions, maximum likelihood will asymptotically recover the tree with edge lengths corresponding to the mean number of substitutions per site on each edge. This refutes Warnow's recent suggestion (Warnow 2012) that maximum likelihood phylogenetic inference might be statistically inconsistent when gaps are treated as missing data, even if the MSA is correct. We also derive a simple new proof of maximum likelihood consistency of ungapped alignments. Oxford University Press 2016-03 2015-11-28 /pmc/articles/PMC4748752/ /pubmed/26615177 http://dx.doi.org/10.1093/sysbio/syv089 Text en © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Regular Articles Truszkowski, Jakub Goldman, Nick Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps |
title | Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps |
title_full | Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps |
title_fullStr | Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps |
title_full_unstemmed | Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps |
title_short | Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps |
title_sort | maximum likelihood phylogenetic inference is consistent on multiple sequence alignments, with or without gaps |
topic | Regular Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4748752/ https://www.ncbi.nlm.nih.gov/pubmed/26615177 http://dx.doi.org/10.1093/sysbio/syv089 |
work_keys_str_mv | AT truszkowskijakub maximumlikelihoodphylogeneticinferenceisconsistentonmultiplesequencealignmentswithorwithoutgaps AT goldmannick maximumlikelihoodphylogeneticinferenceisconsistentonmultiplesequencealignmentswithorwithoutgaps |