Cargando…

Microarray test results should not be compensated for multiplicity of gene contents

BACKGROUND: Microarray technology has enabled the measurement of comprehensive transcriptomic information. However, each data entry may reflect trivial individual differences among samples and also contain technical noise. Therefore, the certainty of each observed difference should be confirmed at e...

Descripción completa

Detalles Bibliográficos
Autor principal: Konishi, Tomokazu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287486/
https://www.ncbi.nlm.nih.gov/pubmed/22784577
http://dx.doi.org/10.1186/1752-0509-5-S2-S6
_version_ 1782224674159067136
author Konishi, Tomokazu
author_facet Konishi, Tomokazu
author_sort Konishi, Tomokazu
collection PubMed
description BACKGROUND: Microarray technology has enabled the measurement of comprehensive transcriptomic information. However, each data entry may reflect trivial individual differences among samples and also contain technical noise. Therefore, the certainty of each observed difference should be confirmed at earlier steps of the analyses, and statistical tests are frequently used for this purpose. Since microarrays analyze a huge number of genes simultaneously, concerns of multiplicity, i.e. the family wise error rate (FWER) and false discovery rate (FDR), have been raised in testing the data. To deal with these concerns, several compensation methodologies have been proposed, making the tests very conservative to the extent that arbitrary tuning of the threshold has been introduced to relax the conditions. Unexpectedly, however, the appropriateness of the test methodologies, the concerns of multiplicity, and the compensation methodologies have not been sufficiently confirmed. RESULTS: The appropriateness was checked by means of coincidence between the methodologies' premises and the statistical characteristics of data found in two typical microarray platforms. As expected, normality was observed in within-group data differences, supporting application of t-test and F-test statistics. However, genes displayed their own tendencies in the magnitude of variations, and the distributions of p-values were rather complex. These characteristics are inconsistent with premises underlying the compensation methodologies, which assume that most of the null hypotheses are true. The evidence also raised concerns about multiplicity. In transcriptomic studies, FWER should not be critical, as analyses at higher levels would not be influenced by a few false positives. Additionally, the concerns for FDR are not suitable for the sharp null hypotheses on expression levels. CONCLUSIONS: Therefore, although compensation methods have been recommended to deal with the problem of multiplicity, the compensations are actually inappropriate for transcriptome analyses. Compensations are not only unnecessary, but will increase the occurrence of false negative errors, and arbitrary adjustment of the threshold damages the objectivity of the tests. Rather, the results of parametric tests should be evaluated directly.
format Online
Article
Text
id pubmed-3287486
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32874862012-02-28 Microarray test results should not be compensated for multiplicity of gene contents Konishi, Tomokazu BMC Syst Biol Proceedings BACKGROUND: Microarray technology has enabled the measurement of comprehensive transcriptomic information. However, each data entry may reflect trivial individual differences among samples and also contain technical noise. Therefore, the certainty of each observed difference should be confirmed at earlier steps of the analyses, and statistical tests are frequently used for this purpose. Since microarrays analyze a huge number of genes simultaneously, concerns of multiplicity, i.e. the family wise error rate (FWER) and false discovery rate (FDR), have been raised in testing the data. To deal with these concerns, several compensation methodologies have been proposed, making the tests very conservative to the extent that arbitrary tuning of the threshold has been introduced to relax the conditions. Unexpectedly, however, the appropriateness of the test methodologies, the concerns of multiplicity, and the compensation methodologies have not been sufficiently confirmed. RESULTS: The appropriateness was checked by means of coincidence between the methodologies' premises and the statistical characteristics of data found in two typical microarray platforms. As expected, normality was observed in within-group data differences, supporting application of t-test and F-test statistics. However, genes displayed their own tendencies in the magnitude of variations, and the distributions of p-values were rather complex. These characteristics are inconsistent with premises underlying the compensation methodologies, which assume that most of the null hypotheses are true. The evidence also raised concerns about multiplicity. In transcriptomic studies, FWER should not be critical, as analyses at higher levels would not be influenced by a few false positives. Additionally, the concerns for FDR are not suitable for the sharp null hypotheses on expression levels. CONCLUSIONS: Therefore, although compensation methods have been recommended to deal with the problem of multiplicity, the compensations are actually inappropriate for transcriptome analyses. Compensations are not only unnecessary, but will increase the occurrence of false negative errors, and arbitrary adjustment of the threshold damages the objectivity of the tests. Rather, the results of parametric tests should be evaluated directly. BioMed Central 2011-12-14 /pmc/articles/PMC3287486/ /pubmed/22784577 http://dx.doi.org/10.1186/1752-0509-5-S2-S6 Text en Copyright ©2011 Konishi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Konishi, Tomokazu
Microarray test results should not be compensated for multiplicity of gene contents
title Microarray test results should not be compensated for multiplicity of gene contents
title_full Microarray test results should not be compensated for multiplicity of gene contents
title_fullStr Microarray test results should not be compensated for multiplicity of gene contents
title_full_unstemmed Microarray test results should not be compensated for multiplicity of gene contents
title_short Microarray test results should not be compensated for multiplicity of gene contents
title_sort microarray test results should not be compensated for multiplicity of gene contents
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287486/
https://www.ncbi.nlm.nih.gov/pubmed/22784577
http://dx.doi.org/10.1186/1752-0509-5-S2-S6
work_keys_str_mv AT konishitomokazu microarraytestresultsshouldnotbecompensatedformultiplicityofgenecontents