Cargando…

A Measure of the Signal-to-Noise Ratio of Microarray Samples and Studies Using Gene Correlations

BACKGROUND: The quality of gene expression data can vary dramatically from platform to platform, study to study, and sample to sample. As reliable statistical analysis rests on reliable data, determining such quality is of the utmost importance. Quality measures to spot problematic samples exist, bu...

Descripción completa

Detalles Bibliográficos
Autores principales: Venet, David, Detours, Vincent, Bersini, Hugues
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3520972/
https://www.ncbi.nlm.nih.gov/pubmed/23251415
http://dx.doi.org/10.1371/journal.pone.0051013
_version_ 1782252873315254272
author Venet, David
Detours, Vincent
Bersini, Hugues
author_facet Venet, David
Detours, Vincent
Bersini, Hugues
author_sort Venet, David
collection PubMed
description BACKGROUND: The quality of gene expression data can vary dramatically from platform to platform, study to study, and sample to sample. As reliable statistical analysis rests on reliable data, determining such quality is of the utmost importance. Quality measures to spot problematic samples exist, but they are platform-specific, and cannot be used to compare studies. RESULTS: As a proxy for quality, we propose a signal-to-noise ratio for microarray data, the “Signal-to-Noise Applied to Gene Expression Experiments”, or SNAGEE. SNAGEE is based on the consistency of gene-gene correlations. We applied SNAGEE to a compendium of 80 large datasets on 37 platforms, for a total of 24,380 samples, and assessed the signal-to-noise ratio of studies and samples. This allowed us to discover serious issues with three studies. We show that signal-to-noise ratios of both studies and samples are linked to the statistical significance of the biological results. CONCLUSIONS: We showed that SNAGEE is an effective way to measure data quality for most types of gene expression studies, and that it often outperforms existing techniques. Furthermore, SNAGEE is platform-independent and does not require raw data files. The SNAGEE R package is available in BioConductor.
format Online
Article
Text
id pubmed-3520972
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35209722012-12-18 A Measure of the Signal-to-Noise Ratio of Microarray Samples and Studies Using Gene Correlations Venet, David Detours, Vincent Bersini, Hugues PLoS One Research Article BACKGROUND: The quality of gene expression data can vary dramatically from platform to platform, study to study, and sample to sample. As reliable statistical analysis rests on reliable data, determining such quality is of the utmost importance. Quality measures to spot problematic samples exist, but they are platform-specific, and cannot be used to compare studies. RESULTS: As a proxy for quality, we propose a signal-to-noise ratio for microarray data, the “Signal-to-Noise Applied to Gene Expression Experiments”, or SNAGEE. SNAGEE is based on the consistency of gene-gene correlations. We applied SNAGEE to a compendium of 80 large datasets on 37 platforms, for a total of 24,380 samples, and assessed the signal-to-noise ratio of studies and samples. This allowed us to discover serious issues with three studies. We show that signal-to-noise ratios of both studies and samples are linked to the statistical significance of the biological results. CONCLUSIONS: We showed that SNAGEE is an effective way to measure data quality for most types of gene expression studies, and that it often outperforms existing techniques. Furthermore, SNAGEE is platform-independent and does not require raw data files. The SNAGEE R package is available in BioConductor. Public Library of Science 2012-12-12 /pmc/articles/PMC3520972/ /pubmed/23251415 http://dx.doi.org/10.1371/journal.pone.0051013 Text en © 2012 Venet et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Venet, David
Detours, Vincent
Bersini, Hugues
A Measure of the Signal-to-Noise Ratio of Microarray Samples and Studies Using Gene Correlations
title A Measure of the Signal-to-Noise Ratio of Microarray Samples and Studies Using Gene Correlations
title_full A Measure of the Signal-to-Noise Ratio of Microarray Samples and Studies Using Gene Correlations
title_fullStr A Measure of the Signal-to-Noise Ratio of Microarray Samples and Studies Using Gene Correlations
title_full_unstemmed A Measure of the Signal-to-Noise Ratio of Microarray Samples and Studies Using Gene Correlations
title_short A Measure of the Signal-to-Noise Ratio of Microarray Samples and Studies Using Gene Correlations
title_sort measure of the signal-to-noise ratio of microarray samples and studies using gene correlations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3520972/
https://www.ncbi.nlm.nih.gov/pubmed/23251415
http://dx.doi.org/10.1371/journal.pone.0051013
work_keys_str_mv AT venetdavid ameasureofthesignaltonoiseratioofmicroarraysamplesandstudiesusinggenecorrelations
AT detoursvincent ameasureofthesignaltonoiseratioofmicroarraysamplesandstudiesusinggenecorrelations
AT bersinihugues ameasureofthesignaltonoiseratioofmicroarraysamplesandstudiesusinggenecorrelations
AT venetdavid measureofthesignaltonoiseratioofmicroarraysamplesandstudiesusinggenecorrelations
AT detoursvincent measureofthesignaltonoiseratioofmicroarraysamplesandstudiesusinggenecorrelations
AT bersinihugues measureofthesignaltonoiseratioofmicroarraysamplesandstudiesusinggenecorrelations