Cargando…

Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps

We prove that maximum likelihood phylogenetic inference is consistent on gapped multiple sequence alignments (MSAs) as long as substitution rates across each edge are greater than zero, under mild assumptions on the structure of the alignment. Under these assumptions, maximum likelihood will asympto...

Descripción completa

Detalles Bibliográficos
Autores principales: Truszkowski, Jakub, Goldman, Nick
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4748752/
https://www.ncbi.nlm.nih.gov/pubmed/26615177
http://dx.doi.org/10.1093/sysbio/syv089
_version_ 1782415176352399360
author Truszkowski, Jakub
Goldman, Nick
author_facet Truszkowski, Jakub
Goldman, Nick
author_sort Truszkowski, Jakub
collection PubMed
description We prove that maximum likelihood phylogenetic inference is consistent on gapped multiple sequence alignments (MSAs) as long as substitution rates across each edge are greater than zero, under mild assumptions on the structure of the alignment. Under these assumptions, maximum likelihood will asymptotically recover the tree with edge lengths corresponding to the mean number of substitutions per site on each edge. This refutes Warnow's recent suggestion (Warnow 2012) that maximum likelihood phylogenetic inference might be statistically inconsistent when gaps are treated as missing data, even if the MSA is correct. We also derive a simple new proof of maximum likelihood consistency of ungapped alignments.
format Online
Article
Text
id pubmed-4748752
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-47487522016-02-11 Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps Truszkowski, Jakub Goldman, Nick Syst Biol Regular Articles We prove that maximum likelihood phylogenetic inference is consistent on gapped multiple sequence alignments (MSAs) as long as substitution rates across each edge are greater than zero, under mild assumptions on the structure of the alignment. Under these assumptions, maximum likelihood will asymptotically recover the tree with edge lengths corresponding to the mean number of substitutions per site on each edge. This refutes Warnow's recent suggestion (Warnow 2012) that maximum likelihood phylogenetic inference might be statistically inconsistent when gaps are treated as missing data, even if the MSA is correct. We also derive a simple new proof of maximum likelihood consistency of ungapped alignments. Oxford University Press 2016-03 2015-11-28 /pmc/articles/PMC4748752/ /pubmed/26615177 http://dx.doi.org/10.1093/sysbio/syv089 Text en © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Regular Articles
Truszkowski, Jakub
Goldman, Nick
Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps
title Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps
title_full Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps
title_fullStr Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps
title_full_unstemmed Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps
title_short Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps
title_sort maximum likelihood phylogenetic inference is consistent on multiple sequence alignments, with or without gaps
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4748752/
https://www.ncbi.nlm.nih.gov/pubmed/26615177
http://dx.doi.org/10.1093/sysbio/syv089
work_keys_str_mv AT truszkowskijakub maximumlikelihoodphylogeneticinferenceisconsistentonmultiplesequencealignmentswithorwithoutgaps
AT goldmannick maximumlikelihoodphylogeneticinferenceisconsistentonmultiplesequencealignmentswithorwithoutgaps