Cargando…
MASCOT: parameter and state inference under the marginal structured coalescent approximation
MOTIVATION: The structured coalescent is widely applied to study demography within and migration between sub-populations from genetic sequence data. Current methods are either exact but too computationally inefficient to analyse large datasets with many sub-populations, or make strong approximations...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6223361/ https://www.ncbi.nlm.nih.gov/pubmed/29790921 http://dx.doi.org/10.1093/bioinformatics/bty406 |
_version_ | 1783369388371476480 |
---|---|
author | Müller, Nicola F Rasmussen, David Stadler, Tanja |
author_facet | Müller, Nicola F Rasmussen, David Stadler, Tanja |
author_sort | Müller, Nicola F |
collection | PubMed |
description | MOTIVATION: The structured coalescent is widely applied to study demography within and migration between sub-populations from genetic sequence data. Current methods are either exact but too computationally inefficient to analyse large datasets with many sub-populations, or make strong approximations leading to severe biases in inference. We recently introduced an approximation based on weaker assumptions to the structured coalescent enabling the analysis of larger datasets with many different states. We showed that our approximation provides unbiased migration rate and population size estimates across a wide parameter range. RESULTS: We extend this approach by providing a new algorithm to calculate the probability of the state of internal nodes that includes the information from the full phylogenetic tree. We show that this algorithm is able to increase the probability attributed to the true sub-population of a node. Furthermore we use improved integration techniques, such that our method is now able to analyse larger datasets, including a H3N2 dataset with 433 sequences sampled from five different locations. AVAILABILITY AND IMPLEMENTATION: The presented methods are part of the BEAST2 package MASCOT, the Marginal Approximation of the Structured COalescenT. This package can be downloaded via the BEAUti package manager. The source code is available at https://github.com/nicfel/Mascot.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6223361 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-62233612018-11-14 MASCOT: parameter and state inference under the marginal structured coalescent approximation Müller, Nicola F Rasmussen, David Stadler, Tanja Bioinformatics Original Papers MOTIVATION: The structured coalescent is widely applied to study demography within and migration between sub-populations from genetic sequence data. Current methods are either exact but too computationally inefficient to analyse large datasets with many sub-populations, or make strong approximations leading to severe biases in inference. We recently introduced an approximation based on weaker assumptions to the structured coalescent enabling the analysis of larger datasets with many different states. We showed that our approximation provides unbiased migration rate and population size estimates across a wide parameter range. RESULTS: We extend this approach by providing a new algorithm to calculate the probability of the state of internal nodes that includes the information from the full phylogenetic tree. We show that this algorithm is able to increase the probability attributed to the true sub-population of a node. Furthermore we use improved integration techniques, such that our method is now able to analyse larger datasets, including a H3N2 dataset with 433 sequences sampled from five different locations. AVAILABILITY AND IMPLEMENTATION: The presented methods are part of the BEAST2 package MASCOT, the Marginal Approximation of the Structured COalescenT. This package can be downloaded via the BEAUti package manager. The source code is available at https://github.com/nicfel/Mascot.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-11-15 2018-05-22 /pmc/articles/PMC6223361/ /pubmed/29790921 http://dx.doi.org/10.1093/bioinformatics/bty406 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Müller, Nicola F Rasmussen, David Stadler, Tanja MASCOT: parameter and state inference under the marginal structured coalescent approximation |
title | MASCOT: parameter and state inference under the marginal structured coalescent approximation |
title_full | MASCOT: parameter and state inference under the marginal structured coalescent approximation |
title_fullStr | MASCOT: parameter and state inference under the marginal structured coalescent approximation |
title_full_unstemmed | MASCOT: parameter and state inference under the marginal structured coalescent approximation |
title_short | MASCOT: parameter and state inference under the marginal structured coalescent approximation |
title_sort | mascot: parameter and state inference under the marginal structured coalescent approximation |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6223361/ https://www.ncbi.nlm.nih.gov/pubmed/29790921 http://dx.doi.org/10.1093/bioinformatics/bty406 |
work_keys_str_mv | AT mullernicolaf mascotparameterandstateinferenceunderthemarginalstructuredcoalescentapproximation AT rasmussendavid mascotparameterandstateinferenceunderthemarginalstructuredcoalescentapproximation AT stadlertanja mascotparameterandstateinferenceunderthemarginalstructuredcoalescentapproximation |