Cargando…

Optimization of a parallel permutation testing function for the SPRINT R package

The statistical language R and its Bioconductor package are favoured by many biostatisticians for processing microarray data. The amount of data produced by some analyses has reached the limits of many common bioinformatics computing infrastructures. High Performance Computing systems offer a soluti...

Descripción completa

Detalles Bibliográficos
Autores principales: Petrou, Savvas, Sloan, Terence M, Mewissen, Muriel, Forster, Thorsten, Piotrowski, Michal, Dobrzelecki, Bartosz, Ghazal, Peter, Trew, Arthur, Hill, Jon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Ltd 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3546371/
https://www.ncbi.nlm.nih.gov/pubmed/23335858
http://dx.doi.org/10.1002/cpe.1787
_version_ 1782256042740023296
author Petrou, Savvas
Sloan, Terence M
Mewissen, Muriel
Forster, Thorsten
Piotrowski, Michal
Dobrzelecki, Bartosz
Ghazal, Peter
Trew, Arthur
Hill, Jon
author_facet Petrou, Savvas
Sloan, Terence M
Mewissen, Muriel
Forster, Thorsten
Piotrowski, Michal
Dobrzelecki, Bartosz
Ghazal, Peter
Trew, Arthur
Hill, Jon
author_sort Petrou, Savvas
collection PubMed
description The statistical language R and its Bioconductor package are favoured by many biostatisticians for processing microarray data. The amount of data produced by some analyses has reached the limits of many common bioinformatics computing infrastructures. High Performance Computing systems offer a solution to this issue. The Simple Parallel R Interface (SPRINT) is a package that provides biostatisticians with easy access to High Performance Computing systems and allows the addition of parallelized functions to R. Previous work has established that the SPRINT implementation of an R permutation testing function has close to optimal scaling on up to 512 processors on a supercomputer. Access to supercomputers, however, is not always possible, and so the work presented here compares the performance of the SPRINT implementation on a supercomputer with benchmarks on a range of platforms including cloud resources and a common desktop machine with multiprocessing capabilities. Copyright © 2011 John Wiley & Sons, Ltd.
format Online
Article
Text
id pubmed-3546371
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher John Wiley & Sons, Ltd
record_format MEDLINE/PubMed
spelling pubmed-35463712013-01-16 Optimization of a parallel permutation testing function for the SPRINT R package Petrou, Savvas Sloan, Terence M Mewissen, Muriel Forster, Thorsten Piotrowski, Michal Dobrzelecki, Bartosz Ghazal, Peter Trew, Arthur Hill, Jon Concurr Comput Special Issue Papers The statistical language R and its Bioconductor package are favoured by many biostatisticians for processing microarray data. The amount of data produced by some analyses has reached the limits of many common bioinformatics computing infrastructures. High Performance Computing systems offer a solution to this issue. The Simple Parallel R Interface (SPRINT) is a package that provides biostatisticians with easy access to High Performance Computing systems and allows the addition of parallelized functions to R. Previous work has established that the SPRINT implementation of an R permutation testing function has close to optimal scaling on up to 512 processors on a supercomputer. Access to supercomputers, however, is not always possible, and so the work presented here compares the performance of the SPRINT implementation on a supercomputer with benchmarks on a range of platforms including cloud resources and a common desktop machine with multiprocessing capabilities. Copyright © 2011 John Wiley & Sons, Ltd. John Wiley & Sons, Ltd 2011-12-10 2011-06-23 /pmc/articles/PMC3546371/ /pubmed/23335858 http://dx.doi.org/10.1002/cpe.1787 Text en Copyright © 2011 John Wiley & Sons, Ltd. http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation.
spellingShingle Special Issue Papers
Petrou, Savvas
Sloan, Terence M
Mewissen, Muriel
Forster, Thorsten
Piotrowski, Michal
Dobrzelecki, Bartosz
Ghazal, Peter
Trew, Arthur
Hill, Jon
Optimization of a parallel permutation testing function for the SPRINT R package
title Optimization of a parallel permutation testing function for the SPRINT R package
title_full Optimization of a parallel permutation testing function for the SPRINT R package
title_fullStr Optimization of a parallel permutation testing function for the SPRINT R package
title_full_unstemmed Optimization of a parallel permutation testing function for the SPRINT R package
title_short Optimization of a parallel permutation testing function for the SPRINT R package
title_sort optimization of a parallel permutation testing function for the sprint r package
topic Special Issue Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3546371/
https://www.ncbi.nlm.nih.gov/pubmed/23335858
http://dx.doi.org/10.1002/cpe.1787
work_keys_str_mv AT petrousavvas optimizationofaparallelpermutationtestingfunctionforthesprintrpackage
AT sloanterencem optimizationofaparallelpermutationtestingfunctionforthesprintrpackage
AT mewissenmuriel optimizationofaparallelpermutationtestingfunctionforthesprintrpackage
AT forsterthorsten optimizationofaparallelpermutationtestingfunctionforthesprintrpackage
AT piotrowskimichal optimizationofaparallelpermutationtestingfunctionforthesprintrpackage
AT dobrzeleckibartosz optimizationofaparallelpermutationtestingfunctionforthesprintrpackage
AT ghazalpeter optimizationofaparallelpermutationtestingfunctionforthesprintrpackage
AT trewarthur optimizationofaparallelpermutationtestingfunctionforthesprintrpackage
AT hilljon optimizationofaparallelpermutationtestingfunctionforthesprintrpackage