Cargando…
Sequential Monte Carlo multiple testing
Motivation: In molecular biology, as in many other scientific fields, the scale of analyses is ever increasing. Often, complex Monte Carlo simulation is required, sometimes within a large-scale multiple testing setting. The resulting computational costs may be prohibitively high. Results: We here pr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3223366/ https://www.ncbi.nlm.nih.gov/pubmed/21998154 http://dx.doi.org/10.1093/bioinformatics/btr568 |
_version_ | 1782217286024691712 |
---|---|
author | Sandve, Geir Kjetil Ferkingstad, Egil Nygård, Ståle |
author_facet | Sandve, Geir Kjetil Ferkingstad, Egil Nygård, Ståle |
author_sort | Sandve, Geir Kjetil |
collection | PubMed |
description | Motivation: In molecular biology, as in many other scientific fields, the scale of analyses is ever increasing. Often, complex Monte Carlo simulation is required, sometimes within a large-scale multiple testing setting. The resulting computational costs may be prohibitively high. Results: We here present MCFDR, a simple, novel algorithm for false discovery rate (FDR) modulated sequential Monte Carlo (MC) multiple hypothesis testing. The algorithm iterates between adding MC samples across tests and calculating intermediate FDR values for the collection of tests. MC sampling is stopped either by sequential MC or based on a threshold on FDR. An essential property of the algorithm is that it limits the total number of MC samples whatever the number of true null hypotheses. We show on both real and simulated data that the proposed algorithm provides large gains in computational efficiency. Availability: MCFDR is implemented in the Genomic HyperBrowser (http://hyperbrowser.uio.no/mcfdr), a web-based system for genome analysis. All input data and results are available and can be reproduced through a Galaxy Pages document at: http://hyperbrowser.uio.no/mcfdr/u/sandve/p/mcfdr. Contact: geirksa@ifi.uio.no |
format | Online Article Text |
id | pubmed-3223366 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-32233662011-11-25 Sequential Monte Carlo multiple testing Sandve, Geir Kjetil Ferkingstad, Egil Nygård, Ståle Bioinformatics Original Papers Motivation: In molecular biology, as in many other scientific fields, the scale of analyses is ever increasing. Often, complex Monte Carlo simulation is required, sometimes within a large-scale multiple testing setting. The resulting computational costs may be prohibitively high. Results: We here present MCFDR, a simple, novel algorithm for false discovery rate (FDR) modulated sequential Monte Carlo (MC) multiple hypothesis testing. The algorithm iterates between adding MC samples across tests and calculating intermediate FDR values for the collection of tests. MC sampling is stopped either by sequential MC or based on a threshold on FDR. An essential property of the algorithm is that it limits the total number of MC samples whatever the number of true null hypotheses. We show on both real and simulated data that the proposed algorithm provides large gains in computational efficiency. Availability: MCFDR is implemented in the Genomic HyperBrowser (http://hyperbrowser.uio.no/mcfdr), a web-based system for genome analysis. All input data and results are available and can be reproduced through a Galaxy Pages document at: http://hyperbrowser.uio.no/mcfdr/u/sandve/p/mcfdr. Contact: geirksa@ifi.uio.no Oxford University Press 2011-12-01 2011-10-13 /pmc/articles/PMC3223366/ /pubmed/21998154 http://dx.doi.org/10.1093/bioinformatics/btr568 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Sandve, Geir Kjetil Ferkingstad, Egil Nygård, Ståle Sequential Monte Carlo multiple testing |
title | Sequential Monte Carlo multiple testing |
title_full | Sequential Monte Carlo multiple testing |
title_fullStr | Sequential Monte Carlo multiple testing |
title_full_unstemmed | Sequential Monte Carlo multiple testing |
title_short | Sequential Monte Carlo multiple testing |
title_sort | sequential monte carlo multiple testing |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3223366/ https://www.ncbi.nlm.nih.gov/pubmed/21998154 http://dx.doi.org/10.1093/bioinformatics/btr568 |
work_keys_str_mv | AT sandvegeirkjetil sequentialmontecarlomultipletesting AT ferkingstadegil sequentialmontecarlomultipletesting AT nygardstale sequentialmontecarlomultipletesting |