Cargando…
Amount of Information Needed for Model Choice in Approximate Bayesian Computation
Approximate Bayesian Computation (ABC) has become a popular technique in evolutionary genetics for elucidating population structure and history due to its flexibility. The statistical inference framework has benefited from significant progress in recent years. In population genetics, however, its ou...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4069000/ https://www.ncbi.nlm.nih.gov/pubmed/24959900 http://dx.doi.org/10.1371/journal.pone.0099581 |
_version_ | 1782322488363974656 |
---|---|
author | Stocks, Michael Siol, Mathieu Lascoux, Martin De Mita, Stéphane |
author_facet | Stocks, Michael Siol, Mathieu Lascoux, Martin De Mita, Stéphane |
author_sort | Stocks, Michael |
collection | PubMed |
description | Approximate Bayesian Computation (ABC) has become a popular technique in evolutionary genetics for elucidating population structure and history due to its flexibility. The statistical inference framework has benefited from significant progress in recent years. In population genetics, however, its outcome depends heavily on the amount of information in the dataset, whether that be the level of genetic variation or the number of samples and loci. Here we look at the power to reject a simple constant population size coalescent model in favor of a bottleneck model in datasets of varying quality. Not only is this power dependent on the number of samples and loci, but it also depends strongly on the level of nucleotide diversity in the observed dataset. Whilst overall model choice in an ABC setting is fairly powerful and quite conservative with regard to false positives, detecting weaker bottlenecks is problematic in smaller or less genetically diverse datasets and limits the inferences possible in non-model organism where the amount of information regarding the two models is often limited. Our results show it is important to consider these limitations when performing an ABC analysis and that studies should perform simulations based on the size and nature of the dataset in order to fully assess the power of the study. |
format | Online Article Text |
id | pubmed-4069000 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-40690002014-06-27 Amount of Information Needed for Model Choice in Approximate Bayesian Computation Stocks, Michael Siol, Mathieu Lascoux, Martin De Mita, Stéphane PLoS One Research Article Approximate Bayesian Computation (ABC) has become a popular technique in evolutionary genetics for elucidating population structure and history due to its flexibility. The statistical inference framework has benefited from significant progress in recent years. In population genetics, however, its outcome depends heavily on the amount of information in the dataset, whether that be the level of genetic variation or the number of samples and loci. Here we look at the power to reject a simple constant population size coalescent model in favor of a bottleneck model in datasets of varying quality. Not only is this power dependent on the number of samples and loci, but it also depends strongly on the level of nucleotide diversity in the observed dataset. Whilst overall model choice in an ABC setting is fairly powerful and quite conservative with regard to false positives, detecting weaker bottlenecks is problematic in smaller or less genetically diverse datasets and limits the inferences possible in non-model organism where the amount of information regarding the two models is often limited. Our results show it is important to consider these limitations when performing an ABC analysis and that studies should perform simulations based on the size and nature of the dataset in order to fully assess the power of the study. Public Library of Science 2014-06-24 /pmc/articles/PMC4069000/ /pubmed/24959900 http://dx.doi.org/10.1371/journal.pone.0099581 Text en © 2014 Stocks et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Stocks, Michael Siol, Mathieu Lascoux, Martin De Mita, Stéphane Amount of Information Needed for Model Choice in Approximate Bayesian Computation |
title | Amount of Information Needed for Model Choice in Approximate Bayesian Computation |
title_full | Amount of Information Needed for Model Choice in Approximate Bayesian Computation |
title_fullStr | Amount of Information Needed for Model Choice in Approximate Bayesian Computation |
title_full_unstemmed | Amount of Information Needed for Model Choice in Approximate Bayesian Computation |
title_short | Amount of Information Needed for Model Choice in Approximate Bayesian Computation |
title_sort | amount of information needed for model choice in approximate bayesian computation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4069000/ https://www.ncbi.nlm.nih.gov/pubmed/24959900 http://dx.doi.org/10.1371/journal.pone.0099581 |
work_keys_str_mv | AT stocksmichael amountofinformationneededformodelchoiceinapproximatebayesiancomputation AT siolmathieu amountofinformationneededformodelchoiceinapproximatebayesiancomputation AT lascouxmartin amountofinformationneededformodelchoiceinapproximatebayesiancomputation AT demitastephane amountofinformationneededformodelchoiceinapproximatebayesiancomputation |