Cargando…

Microarray test results should not be compensated for multiplicity of gene contents

BACKGROUND: Microarray technology has enabled the measurement of comprehensive transcriptomic information. However, each data entry may reflect trivial individual differences among samples and also contain technical noise. Therefore, the certainty of each observed difference should be confirmed at e...

Descripción completa

Detalles Bibliográficos
Autor principal:	Konishi, Tomokazu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Proceedings
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287486/ https://www.ncbi.nlm.nih.gov/pubmed/22784577 http://dx.doi.org/10.1186/1752-0509-5-S2-S6

_version_	1782224674159067136
author	Konishi, Tomokazu
author_facet	Konishi, Tomokazu
author_sort	Konishi, Tomokazu
collection	PubMed
description	BACKGROUND: Microarray technology has enabled the measurement of comprehensive transcriptomic information. However, each data entry may reflect trivial individual differences among samples and also contain technical noise. Therefore, the certainty of each observed difference should be confirmed at earlier steps of the analyses, and statistical tests are frequently used for this purpose. Since microarrays analyze a huge number of genes simultaneously, concerns of multiplicity, i.e. the family wise error rate (FWER) and false discovery rate (FDR), have been raised in testing the data. To deal with these concerns, several compensation methodologies have been proposed, making the tests very conservative to the extent that arbitrary tuning of the threshold has been introduced to relax the conditions. Unexpectedly, however, the appropriateness of the test methodologies, the concerns of multiplicity, and the compensation methodologies have not been sufficiently confirmed. RESULTS: The appropriateness was checked by means of coincidence between the methodologies' premises and the statistical characteristics of data found in two typical microarray platforms. As expected, normality was observed in within-group data differences, supporting application of t-test and F-test statistics. However, genes displayed their own tendencies in the magnitude of variations, and the distributions of p-values were rather complex. These characteristics are inconsistent with premises underlying the compensation methodologies, which assume that most of the null hypotheses are true. The evidence also raised concerns about multiplicity. In transcriptomic studies, FWER should not be critical, as analyses at higher levels would not be influenced by a few false positives. Additionally, the concerns for FDR are not suitable for the sharp null hypotheses on expression levels. CONCLUSIONS: Therefore, although compensation methods have been recommended to deal with the problem of multiplicity, the compensations are actually inappropriate for transcriptome analyses. Compensations are not only unnecessary, but will increase the occurrence of false negative errors, and arbitrary adjustment of the threshold damages the objectivity of the tests. Rather, the results of parametric tests should be evaluated directly.
format	Online Article Text
id	pubmed-3287486
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-32874862012-02-28 Microarray test results should not be compensated for multiplicity of gene contents Konishi, Tomokazu BMC Syst Biol Proceedings BACKGROUND: Microarray technology has enabled the measurement of comprehensive transcriptomic information. However, each data entry may reflect trivial individual differences among samples and also contain technical noise. Therefore, the certainty of each observed difference should be confirmed at earlier steps of the analyses, and statistical tests are frequently used for this purpose. Since microarrays analyze a huge number of genes simultaneously, concerns of multiplicity, i.e. the family wise error rate (FWER) and false discovery rate (FDR), have been raised in testing the data. To deal with these concerns, several compensation methodologies have been proposed, making the tests very conservative to the extent that arbitrary tuning of the threshold has been introduced to relax the conditions. Unexpectedly, however, the appropriateness of the test methodologies, the concerns of multiplicity, and the compensation methodologies have not been sufficiently confirmed. RESULTS: The appropriateness was checked by means of coincidence between the methodologies' premises and the statistical characteristics of data found in two typical microarray platforms. As expected, normality was observed in within-group data differences, supporting application of t-test and F-test statistics. However, genes displayed their own tendencies in the magnitude of variations, and the distributions of p-values were rather complex. These characteristics are inconsistent with premises underlying the compensation methodologies, which assume that most of the null hypotheses are true. The evidence also raised concerns about multiplicity. In transcriptomic studies, FWER should not be critical, as analyses at higher levels would not be influenced by a few false positives. Additionally, the concerns for FDR are not suitable for the sharp null hypotheses on expression levels. CONCLUSIONS: Therefore, although compensation methods have been recommended to deal with the problem of multiplicity, the compensations are actually inappropriate for transcriptome analyses. Compensations are not only unnecessary, but will increase the occurrence of false negative errors, and arbitrary adjustment of the threshold damages the objectivity of the tests. Rather, the results of parametric tests should be evaluated directly. BioMed Central 2011-12-14 /pmc/articles/PMC3287486/ /pubmed/22784577 http://dx.doi.org/10.1186/1752-0509-5-S2-S6 Text en Copyright ©2011 Konishi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Proceedings Konishi, Tomokazu Microarray test results should not be compensated for multiplicity of gene contents
title	Microarray test results should not be compensated for multiplicity of gene contents
title_full	Microarray test results should not be compensated for multiplicity of gene contents
title_fullStr	Microarray test results should not be compensated for multiplicity of gene contents
title_full_unstemmed	Microarray test results should not be compensated for multiplicity of gene contents
title_short	Microarray test results should not be compensated for multiplicity of gene contents
title_sort	microarray test results should not be compensated for multiplicity of gene contents
topic	Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287486/ https://www.ncbi.nlm.nih.gov/pubmed/22784577 http://dx.doi.org/10.1186/1752-0509-5-S2-S6
work_keys_str_mv	AT konishitomokazu microarraytestresultsshouldnotbecompensatedformultiplicityofgenecontents

Microarray test results should not be compensated for multiplicity of gene contents

Ejemplares similares