Cargando…

MASCOT: parameter and state inference under the marginal structured coalescent approximation

MOTIVATION: The structured coalescent is widely applied to study demography within and migration between sub-populations from genetic sequence data. Current methods are either exact but too computationally inefficient to analyse large datasets with many sub-populations, or make strong approximations...

Descripción completa

Detalles Bibliográficos
Autores principales: Müller, Nicola F, Rasmussen, David, Stadler, Tanja
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6223361/
https://www.ncbi.nlm.nih.gov/pubmed/29790921
http://dx.doi.org/10.1093/bioinformatics/bty406
_version_ 1783369388371476480
author Müller, Nicola F
Rasmussen, David
Stadler, Tanja
author_facet Müller, Nicola F
Rasmussen, David
Stadler, Tanja
author_sort Müller, Nicola F
collection PubMed
description MOTIVATION: The structured coalescent is widely applied to study demography within and migration between sub-populations from genetic sequence data. Current methods are either exact but too computationally inefficient to analyse large datasets with many sub-populations, or make strong approximations leading to severe biases in inference. We recently introduced an approximation based on weaker assumptions to the structured coalescent enabling the analysis of larger datasets with many different states. We showed that our approximation provides unbiased migration rate and population size estimates across a wide parameter range. RESULTS: We extend this approach by providing a new algorithm to calculate the probability of the state of internal nodes that includes the information from the full phylogenetic tree. We show that this algorithm is able to increase the probability attributed to the true sub-population of a node. Furthermore we use improved integration techniques, such that our method is now able to analyse larger datasets, including a H3N2 dataset with 433 sequences sampled from five different locations. AVAILABILITY AND IMPLEMENTATION: The presented methods are part of the BEAST2 package MASCOT, the Marginal Approximation of the Structured COalescenT. This package can be downloaded via the BEAUti package manager. The source code is available at https://github.com/nicfel/Mascot.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6223361
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-62233612018-11-14 MASCOT: parameter and state inference under the marginal structured coalescent approximation Müller, Nicola F Rasmussen, David Stadler, Tanja Bioinformatics Original Papers MOTIVATION: The structured coalescent is widely applied to study demography within and migration between sub-populations from genetic sequence data. Current methods are either exact but too computationally inefficient to analyse large datasets with many sub-populations, or make strong approximations leading to severe biases in inference. We recently introduced an approximation based on weaker assumptions to the structured coalescent enabling the analysis of larger datasets with many different states. We showed that our approximation provides unbiased migration rate and population size estimates across a wide parameter range. RESULTS: We extend this approach by providing a new algorithm to calculate the probability of the state of internal nodes that includes the information from the full phylogenetic tree. We show that this algorithm is able to increase the probability attributed to the true sub-population of a node. Furthermore we use improved integration techniques, such that our method is now able to analyse larger datasets, including a H3N2 dataset with 433 sequences sampled from five different locations. AVAILABILITY AND IMPLEMENTATION: The presented methods are part of the BEAST2 package MASCOT, the Marginal Approximation of the Structured COalescenT. This package can be downloaded via the BEAUti package manager. The source code is available at https://github.com/nicfel/Mascot.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-11-15 2018-05-22 /pmc/articles/PMC6223361/ /pubmed/29790921 http://dx.doi.org/10.1093/bioinformatics/bty406 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Müller, Nicola F
Rasmussen, David
Stadler, Tanja
MASCOT: parameter and state inference under the marginal structured coalescent approximation
title MASCOT: parameter and state inference under the marginal structured coalescent approximation
title_full MASCOT: parameter and state inference under the marginal structured coalescent approximation
title_fullStr MASCOT: parameter and state inference under the marginal structured coalescent approximation
title_full_unstemmed MASCOT: parameter and state inference under the marginal structured coalescent approximation
title_short MASCOT: parameter and state inference under the marginal structured coalescent approximation
title_sort mascot: parameter and state inference under the marginal structured coalescent approximation
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6223361/
https://www.ncbi.nlm.nih.gov/pubmed/29790921
http://dx.doi.org/10.1093/bioinformatics/bty406
work_keys_str_mv AT mullernicolaf mascotparameterandstateinferenceunderthemarginalstructuredcoalescentapproximation
AT rasmussendavid mascotparameterandstateinferenceunderthemarginalstructuredcoalescentapproximation
AT stadlertanja mascotparameterandstateinferenceunderthemarginalstructuredcoalescentapproximation