Cargando…

Sequential Monte Carlo multiple testing

Motivation: In molecular biology, as in many other scientific fields, the scale of analyses is ever increasing. Often, complex Monte Carlo simulation is required, sometimes within a large-scale multiple testing setting. The resulting computational costs may be prohibitively high. Results: We here pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Sandve, Geir Kjetil, Ferkingstad, Egil, Nygård, Ståle
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3223366/
https://www.ncbi.nlm.nih.gov/pubmed/21998154
http://dx.doi.org/10.1093/bioinformatics/btr568
_version_ 1782217286024691712
author Sandve, Geir Kjetil
Ferkingstad, Egil
Nygård, Ståle
author_facet Sandve, Geir Kjetil
Ferkingstad, Egil
Nygård, Ståle
author_sort Sandve, Geir Kjetil
collection PubMed
description Motivation: In molecular biology, as in many other scientific fields, the scale of analyses is ever increasing. Often, complex Monte Carlo simulation is required, sometimes within a large-scale multiple testing setting. The resulting computational costs may be prohibitively high. Results: We here present MCFDR, a simple, novel algorithm for false discovery rate (FDR) modulated sequential Monte Carlo (MC) multiple hypothesis testing. The algorithm iterates between adding MC samples across tests and calculating intermediate FDR values for the collection of tests. MC sampling is stopped either by sequential MC or based on a threshold on FDR. An essential property of the algorithm is that it limits the total number of MC samples whatever the number of true null hypotheses. We show on both real and simulated data that the proposed algorithm provides large gains in computational efficiency. Availability: MCFDR is implemented in the Genomic HyperBrowser (http://hyperbrowser.uio.no/mcfdr), a web-based system for genome analysis. All input data and results are available and can be reproduced through a Galaxy Pages document at: http://hyperbrowser.uio.no/mcfdr/u/sandve/p/mcfdr. Contact: geirksa@ifi.uio.no
format Online
Article
Text
id pubmed-3223366
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-32233662011-11-25 Sequential Monte Carlo multiple testing Sandve, Geir Kjetil Ferkingstad, Egil Nygård, Ståle Bioinformatics Original Papers Motivation: In molecular biology, as in many other scientific fields, the scale of analyses is ever increasing. Often, complex Monte Carlo simulation is required, sometimes within a large-scale multiple testing setting. The resulting computational costs may be prohibitively high. Results: We here present MCFDR, a simple, novel algorithm for false discovery rate (FDR) modulated sequential Monte Carlo (MC) multiple hypothesis testing. The algorithm iterates between adding MC samples across tests and calculating intermediate FDR values for the collection of tests. MC sampling is stopped either by sequential MC or based on a threshold on FDR. An essential property of the algorithm is that it limits the total number of MC samples whatever the number of true null hypotheses. We show on both real and simulated data that the proposed algorithm provides large gains in computational efficiency. Availability: MCFDR is implemented in the Genomic HyperBrowser (http://hyperbrowser.uio.no/mcfdr), a web-based system for genome analysis. All input data and results are available and can be reproduced through a Galaxy Pages document at: http://hyperbrowser.uio.no/mcfdr/u/sandve/p/mcfdr. Contact: geirksa@ifi.uio.no Oxford University Press 2011-12-01 2011-10-13 /pmc/articles/PMC3223366/ /pubmed/21998154 http://dx.doi.org/10.1093/bioinformatics/btr568 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Sandve, Geir Kjetil
Ferkingstad, Egil
Nygård, Ståle
Sequential Monte Carlo multiple testing
title Sequential Monte Carlo multiple testing
title_full Sequential Monte Carlo multiple testing
title_fullStr Sequential Monte Carlo multiple testing
title_full_unstemmed Sequential Monte Carlo multiple testing
title_short Sequential Monte Carlo multiple testing
title_sort sequential monte carlo multiple testing
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3223366/
https://www.ncbi.nlm.nih.gov/pubmed/21998154
http://dx.doi.org/10.1093/bioinformatics/btr568
work_keys_str_mv AT sandvegeirkjetil sequentialmontecarlomultipletesting
AT ferkingstadegil sequentialmontecarlomultipletesting
AT nygardstale sequentialmontecarlomultipletesting