Cargando…

A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation

The choice of summary statistics is a crucial step in approximate Bayesian computation (ABC). Since statistics are often not sufficient, this choice involves a trade-off between loss of information and reduction of dimensionality. The latter may increase the efficiency of ABC. Here, we propose an ap...

Descripción completa

Detalles Bibliográficos
Autores principales: Aeschbacher, Simon, Beaumont, Mark A., Futschik, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3522150/
https://www.ncbi.nlm.nih.gov/pubmed/22960215
http://dx.doi.org/10.1534/genetics.112.143164
_version_ 1782253047777329152
author Aeschbacher, Simon
Beaumont, Mark A.
Futschik, Andreas
author_facet Aeschbacher, Simon
Beaumont, Mark A.
Futschik, Andreas
author_sort Aeschbacher, Simon
collection PubMed
description The choice of summary statistics is a crucial step in approximate Bayesian computation (ABC). Since statistics are often not sufficient, this choice involves a trade-off between loss of information and reduction of dimensionality. The latter may increase the efficiency of ABC. Here, we propose an approach for choosing summary statistics based on boosting, a technique from the machine-learning literature. We consider different types of boosting and compare them to partial least-squares regression as an alternative. To mitigate the lack of sufficiency, we also propose an approach for choosing summary statistics locally, in the putative neighborhood of the true parameter value. We study a demographic model motivated by the reintroduction of Alpine ibex (Capra ibex) into the Swiss Alps. The parameters of interest are the mean and standard deviation across microsatellites of the scaled ancestral mutation rate (θ(anc) = 4N(e)u) and the proportion of males obtaining access to matings per breeding season (ω). By simulation, we assess the properties of the posterior distribution obtained with the various methods. According to our criteria, ABC with summary statistics chosen locally via boosting with the L(2)-loss performs best. Applying that method to the ibex data, we estimate [Formula: see text] and find that most of the variation across loci of the ancestral mutation rate u is between 7.7 × 10(−4) and 3.5 × 10(−3) per locus per generation. The proportion of males with access to matings is estimated as [Formula: see text] , which is in good agreement with recent independent estimates.
format Online
Article
Text
id pubmed-3522150
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-35221502013-11-01 A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation Aeschbacher, Simon Beaumont, Mark A. Futschik, Andreas Genetics Investigations The choice of summary statistics is a crucial step in approximate Bayesian computation (ABC). Since statistics are often not sufficient, this choice involves a trade-off between loss of information and reduction of dimensionality. The latter may increase the efficiency of ABC. Here, we propose an approach for choosing summary statistics based on boosting, a technique from the machine-learning literature. We consider different types of boosting and compare them to partial least-squares regression as an alternative. To mitigate the lack of sufficiency, we also propose an approach for choosing summary statistics locally, in the putative neighborhood of the true parameter value. We study a demographic model motivated by the reintroduction of Alpine ibex (Capra ibex) into the Swiss Alps. The parameters of interest are the mean and standard deviation across microsatellites of the scaled ancestral mutation rate (θ(anc) = 4N(e)u) and the proportion of males obtaining access to matings per breeding season (ω). By simulation, we assess the properties of the posterior distribution obtained with the various methods. According to our criteria, ABC with summary statistics chosen locally via boosting with the L(2)-loss performs best. Applying that method to the ibex data, we estimate [Formula: see text] and find that most of the variation across loci of the ancestral mutation rate u is between 7.7 × 10(−4) and 3.5 × 10(−3) per locus per generation. The proportion of males with access to matings is estimated as [Formula: see text] , which is in good agreement with recent independent estimates. Genetics Society of America 2012-11 /pmc/articles/PMC3522150/ /pubmed/22960215 http://dx.doi.org/10.1534/genetics.112.143164 Text en Copyright © 2012 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Aeschbacher, Simon
Beaumont, Mark A.
Futschik, Andreas
A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation
title A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation
title_full A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation
title_fullStr A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation
title_full_unstemmed A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation
title_short A Novel Approach for Choosing Summary Statistics in Approximate Bayesian Computation
title_sort novel approach for choosing summary statistics in approximate bayesian computation
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3522150/
https://www.ncbi.nlm.nih.gov/pubmed/22960215
http://dx.doi.org/10.1534/genetics.112.143164
work_keys_str_mv AT aeschbachersimon anovelapproachforchoosingsummarystatisticsinapproximatebayesiancomputation
AT beaumontmarka anovelapproachforchoosingsummarystatisticsinapproximatebayesiancomputation
AT futschikandreas anovelapproachforchoosingsummarystatisticsinapproximatebayesiancomputation
AT aeschbachersimon novelapproachforchoosingsummarystatisticsinapproximatebayesiancomputation
AT beaumontmarka novelapproachforchoosingsummarystatisticsinapproximatebayesiancomputation
AT futschikandreas novelapproachforchoosingsummarystatisticsinapproximatebayesiancomputation