Cargando…

puma: a Bioconductor package for propagating uncertainty in microarray analysis

BACKGROUND: Most analyses of microarray data are based on point estimates of expression levels and ignore the uncertainty of such estimates. By determining uncertainties from Affymetrix GeneChip data and propagating these uncertainties to downstream analyses it has been shown that we can improve res...

Descripción completa

Detalles Bibliográficos
Autores principales: Pearson, Richard D, Liu, Xuejun, Sanguinetti, Guido, Milo, Marta, Lawrence, Neil D, Rattray, Magnus
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2714555/
https://www.ncbi.nlm.nih.gov/pubmed/19589155
http://dx.doi.org/10.1186/1471-2105-10-211
_version_ 1782169687146102784
author Pearson, Richard D
Liu, Xuejun
Sanguinetti, Guido
Milo, Marta
Lawrence, Neil D
Rattray, Magnus
author_facet Pearson, Richard D
Liu, Xuejun
Sanguinetti, Guido
Milo, Marta
Lawrence, Neil D
Rattray, Magnus
author_sort Pearson, Richard D
collection PubMed
description BACKGROUND: Most analyses of microarray data are based on point estimates of expression levels and ignore the uncertainty of such estimates. By determining uncertainties from Affymetrix GeneChip data and propagating these uncertainties to downstream analyses it has been shown that we can improve results of differential expression detection, principal component analysis and clustering. Previously, implementations of these uncertainty propagation methods have only been available as separate packages, written in different languages. Previous implementations have also suffered from being very costly to compute, and in the case of differential expression detection, have been limited in the experimental designs to which they can be applied. RESULTS: puma is a Bioconductor package incorporating a suite of analysis methods for use on Affymetrix GeneChip data. puma extends the differential expression detection methods of previous work from the 2-class case to the multi-factorial case. puma can be used to automatically create design and contrast matrices for typical experimental designs, which can be used both within the package itself but also in other Bioconductor packages. The implementation of differential expression detection methods has been parallelised leading to significant decreases in processing time on a range of computer architectures. puma incorporates the first R implementation of an uncertainty propagation version of principal component analysis, and an implementation of a clustering method based on uncertainty propagation. All of these techniques are brought together in a single, easy-to-use package with clear, task-based documentation. CONCLUSION: For the first time, the puma package makes a suite of uncertainty propagation methods available to a general audience. These methods can be used to improve results from more traditional analyses of microarray data. puma also offers improvements in terms of scope and speed of execution over previously available methods. puma is recommended for anyone working with the Affymetrix GeneChip platform for gene expression analysis and can also be applied more generally.
format Text
id pubmed-2714555
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27145552009-07-24 puma: a Bioconductor package for propagating uncertainty in microarray analysis Pearson, Richard D Liu, Xuejun Sanguinetti, Guido Milo, Marta Lawrence, Neil D Rattray, Magnus BMC Bioinformatics Software BACKGROUND: Most analyses of microarray data are based on point estimates of expression levels and ignore the uncertainty of such estimates. By determining uncertainties from Affymetrix GeneChip data and propagating these uncertainties to downstream analyses it has been shown that we can improve results of differential expression detection, principal component analysis and clustering. Previously, implementations of these uncertainty propagation methods have only been available as separate packages, written in different languages. Previous implementations have also suffered from being very costly to compute, and in the case of differential expression detection, have been limited in the experimental designs to which they can be applied. RESULTS: puma is a Bioconductor package incorporating a suite of analysis methods for use on Affymetrix GeneChip data. puma extends the differential expression detection methods of previous work from the 2-class case to the multi-factorial case. puma can be used to automatically create design and contrast matrices for typical experimental designs, which can be used both within the package itself but also in other Bioconductor packages. The implementation of differential expression detection methods has been parallelised leading to significant decreases in processing time on a range of computer architectures. puma incorporates the first R implementation of an uncertainty propagation version of principal component analysis, and an implementation of a clustering method based on uncertainty propagation. All of these techniques are brought together in a single, easy-to-use package with clear, task-based documentation. CONCLUSION: For the first time, the puma package makes a suite of uncertainty propagation methods available to a general audience. These methods can be used to improve results from more traditional analyses of microarray data. puma also offers improvements in terms of scope and speed of execution over previously available methods. puma is recommended for anyone working with the Affymetrix GeneChip platform for gene expression analysis and can also be applied more generally. BioMed Central 2009-07-09 /pmc/articles/PMC2714555/ /pubmed/19589155 http://dx.doi.org/10.1186/1471-2105-10-211 Text en Copyright © 2009 Pearson et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Pearson, Richard D
Liu, Xuejun
Sanguinetti, Guido
Milo, Marta
Lawrence, Neil D
Rattray, Magnus
puma: a Bioconductor package for propagating uncertainty in microarray analysis
title puma: a Bioconductor package for propagating uncertainty in microarray analysis
title_full puma: a Bioconductor package for propagating uncertainty in microarray analysis
title_fullStr puma: a Bioconductor package for propagating uncertainty in microarray analysis
title_full_unstemmed puma: a Bioconductor package for propagating uncertainty in microarray analysis
title_short puma: a Bioconductor package for propagating uncertainty in microarray analysis
title_sort puma: a bioconductor package for propagating uncertainty in microarray analysis
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2714555/
https://www.ncbi.nlm.nih.gov/pubmed/19589155
http://dx.doi.org/10.1186/1471-2105-10-211
work_keys_str_mv AT pearsonrichardd pumaabioconductorpackageforpropagatinguncertaintyinmicroarrayanalysis
AT liuxuejun pumaabioconductorpackageforpropagatinguncertaintyinmicroarrayanalysis
AT sanguinettiguido pumaabioconductorpackageforpropagatinguncertaintyinmicroarrayanalysis
AT milomarta pumaabioconductorpackageforpropagatinguncertaintyinmicroarrayanalysis
AT lawrenceneild pumaabioconductorpackageforpropagatinguncertaintyinmicroarrayanalysis
AT rattraymagnus pumaabioconductorpackageforpropagatinguncertaintyinmicroarrayanalysis