Cargando…

Permutation – based statistical tests for multiple hypotheses

BACKGROUND: Genomics and proteomics analyses regularly involve the simultaneous test of hundreds of hypotheses, either on numerical or categorical data. To correct for the occurrence of false positives, validation tests based on multiple testing correction, such as Bonferroni and Benjamini and Hochb...

Descripción completa

Detalles Bibliográficos
Autores principales: Camargo, Anyela, Azuaje, Francisco, Wang, Haiying, Zheng, Huiru
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2611984/
https://www.ncbi.nlm.nih.gov/pubmed/18939983
http://dx.doi.org/10.1186/1751-0473-3-15
_version_ 1782163108312121344
author Camargo, Anyela
Azuaje, Francisco
Wang, Haiying
Zheng, Huiru
author_facet Camargo, Anyela
Azuaje, Francisco
Wang, Haiying
Zheng, Huiru
author_sort Camargo, Anyela
collection PubMed
description BACKGROUND: Genomics and proteomics analyses regularly involve the simultaneous test of hundreds of hypotheses, either on numerical or categorical data. To correct for the occurrence of false positives, validation tests based on multiple testing correction, such as Bonferroni and Benjamini and Hochberg, and re-sampling, such as permutation tests, are frequently used. Despite the known power of permutation-based tests, most available tools offer such tests for either t-test or ANOVA only. Less attention has been given to tests for categorical data, such as the Chi-square. This project takes a first step by developing an open-source software tool, Ptest, that addresses the need to offer public software tools incorporating these and other statistical tests with options for correcting for multiple hypotheses. RESULTS: This study developed a public-domain, user-friendly software whose purpose was twofold: first, to estimate test statistics for categorical and numerical data; and second, to validate the significance of the test statistics via Bonferroni, Benjamini and Hochberg, and a permutation test of numerical and categorical data. The tool allows the calculation of Chi-square test for categorical data, and ANOVA test, Bartlett's test and t-test for paired and unpaired data. Once a test statistic is calculated, Bonferroni, Benjamini and Hochberg, and a permutation tests are implemented, independently, to control for Type I errors. An evaluation of the software using different public data sets is reported, which illustrates the power of permutation tests for multiple hypotheses assessment and for controlling the rate of Type I errors. CONCLUSION: The analytical options offered by the software can be applied to support a significant spectrum of hypothesis testing tasks in functional genomics, using both numerical and categorical data.
format Text
id pubmed-2611984
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26119842008-12-30 Permutation – based statistical tests for multiple hypotheses Camargo, Anyela Azuaje, Francisco Wang, Haiying Zheng, Huiru Source Code Biol Med Software Review BACKGROUND: Genomics and proteomics analyses regularly involve the simultaneous test of hundreds of hypotheses, either on numerical or categorical data. To correct for the occurrence of false positives, validation tests based on multiple testing correction, such as Bonferroni and Benjamini and Hochberg, and re-sampling, such as permutation tests, are frequently used. Despite the known power of permutation-based tests, most available tools offer such tests for either t-test or ANOVA only. Less attention has been given to tests for categorical data, such as the Chi-square. This project takes a first step by developing an open-source software tool, Ptest, that addresses the need to offer public software tools incorporating these and other statistical tests with options for correcting for multiple hypotheses. RESULTS: This study developed a public-domain, user-friendly software whose purpose was twofold: first, to estimate test statistics for categorical and numerical data; and second, to validate the significance of the test statistics via Bonferroni, Benjamini and Hochberg, and a permutation test of numerical and categorical data. The tool allows the calculation of Chi-square test for categorical data, and ANOVA test, Bartlett's test and t-test for paired and unpaired data. Once a test statistic is calculated, Bonferroni, Benjamini and Hochberg, and a permutation tests are implemented, independently, to control for Type I errors. An evaluation of the software using different public data sets is reported, which illustrates the power of permutation tests for multiple hypotheses assessment and for controlling the rate of Type I errors. CONCLUSION: The analytical options offered by the software can be applied to support a significant spectrum of hypothesis testing tasks in functional genomics, using both numerical and categorical data. BioMed Central 2008-10-21 /pmc/articles/PMC2611984/ /pubmed/18939983 http://dx.doi.org/10.1186/1751-0473-3-15 Text en Copyright © 2008 Camargo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software Review
Camargo, Anyela
Azuaje, Francisco
Wang, Haiying
Zheng, Huiru
Permutation – based statistical tests for multiple hypotheses
title Permutation – based statistical tests for multiple hypotheses
title_full Permutation – based statistical tests for multiple hypotheses
title_fullStr Permutation – based statistical tests for multiple hypotheses
title_full_unstemmed Permutation – based statistical tests for multiple hypotheses
title_short Permutation – based statistical tests for multiple hypotheses
title_sort permutation – based statistical tests for multiple hypotheses
topic Software Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2611984/
https://www.ncbi.nlm.nih.gov/pubmed/18939983
http://dx.doi.org/10.1186/1751-0473-3-15
work_keys_str_mv AT camargoanyela permutationbasedstatisticaltestsformultiplehypotheses
AT azuajefrancisco permutationbasedstatisticaltestsformultiplehypotheses
AT wanghaiying permutationbasedstatisticaltestsformultiplehypotheses
AT zhenghuiru permutationbasedstatisticaltestsformultiplehypotheses