Cargando…

A method for computing the overall statistical significance of a treatment effect among a group of genes

BACKGROUND: In studies that use DNA arrays to assess changes in gene expression, our goal is to evaluate the statistical significance of treatments on sets of genes. Genes can be grouped by a molecular function, a biological process, or a cellular component, e.g., gene ontology (GO) terms. The meani...

Descripción completa

Detalles Bibliográficos
Autores principales: Delongchamp, Robert, Lee, Taewon, Velasco, Cruz
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1683577/
https://www.ncbi.nlm.nih.gov/pubmed/17118132
http://dx.doi.org/10.1186/1471-2105-7-S2-S11
_version_ 1782131173485445120
author Delongchamp, Robert
Lee, Taewon
Velasco, Cruz
author_facet Delongchamp, Robert
Lee, Taewon
Velasco, Cruz
author_sort Delongchamp, Robert
collection PubMed
description BACKGROUND: In studies that use DNA arrays to assess changes in gene expression, our goal is to evaluate the statistical significance of treatments on sets of genes. Genes can be grouped by a molecular function, a biological process, or a cellular component, e.g., gene ontology (GO) terms. The meaning of an affected GO group is often clearer than interpretations arising from a list of the statistically significant genes. RESULTS: Computer simulations demonstrated that correlations among genes invalidate many statistical methods that are commonly used to assign significance to GO terms. Ignoring these correlations overstates the statistical significance. Meta-analysis methods for combining p-values were modified to adjust for correlation. One of these methods is elaborated in the context of a comparison between two treatments. The form of the correlation adjustment depends upon the alternative hypothesis. CONCLUSION: Reliable corrections for the effect of correlations among genes on the significance level of a GO term can be constructed for an alternative hypothesis where all transcripts in the GO term increase (decrease) in response to treatment. For general alternatives, which allow some transcripts to increase and others to decrease, the bias of naïve significance calculations can be greatly decreased although not eliminated.
format Text
id pubmed-1683577
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16835772006-12-05 A method for computing the overall statistical significance of a treatment effect among a group of genes Delongchamp, Robert Lee, Taewon Velasco, Cruz BMC Bioinformatics Proceedings BACKGROUND: In studies that use DNA arrays to assess changes in gene expression, our goal is to evaluate the statistical significance of treatments on sets of genes. Genes can be grouped by a molecular function, a biological process, or a cellular component, e.g., gene ontology (GO) terms. The meaning of an affected GO group is often clearer than interpretations arising from a list of the statistically significant genes. RESULTS: Computer simulations demonstrated that correlations among genes invalidate many statistical methods that are commonly used to assign significance to GO terms. Ignoring these correlations overstates the statistical significance. Meta-analysis methods for combining p-values were modified to adjust for correlation. One of these methods is elaborated in the context of a comparison between two treatments. The form of the correlation adjustment depends upon the alternative hypothesis. CONCLUSION: Reliable corrections for the effect of correlations among genes on the significance level of a GO term can be constructed for an alternative hypothesis where all transcripts in the GO term increase (decrease) in response to treatment. For general alternatives, which allow some transcripts to increase and others to decrease, the bias of naïve significance calculations can be greatly decreased although not eliminated. BioMed Central 2006-09-26 /pmc/articles/PMC1683577/ /pubmed/17118132 http://dx.doi.org/10.1186/1471-2105-7-S2-S11 Text en Copyright © 2006 Delongchamp et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Delongchamp, Robert
Lee, Taewon
Velasco, Cruz
A method for computing the overall statistical significance of a treatment effect among a group of genes
title A method for computing the overall statistical significance of a treatment effect among a group of genes
title_full A method for computing the overall statistical significance of a treatment effect among a group of genes
title_fullStr A method for computing the overall statistical significance of a treatment effect among a group of genes
title_full_unstemmed A method for computing the overall statistical significance of a treatment effect among a group of genes
title_short A method for computing the overall statistical significance of a treatment effect among a group of genes
title_sort method for computing the overall statistical significance of a treatment effect among a group of genes
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1683577/
https://www.ncbi.nlm.nih.gov/pubmed/17118132
http://dx.doi.org/10.1186/1471-2105-7-S2-S11
work_keys_str_mv AT delongchamprobert amethodforcomputingtheoverallstatisticalsignificanceofatreatmenteffectamongagroupofgenes
AT leetaewon amethodforcomputingtheoverallstatisticalsignificanceofatreatmenteffectamongagroupofgenes
AT velascocruz amethodforcomputingtheoverallstatisticalsignificanceofatreatmenteffectamongagroupofgenes
AT delongchamprobert methodforcomputingtheoverallstatisticalsignificanceofatreatmenteffectamongagroupofgenes
AT leetaewon methodforcomputingtheoverallstatisticalsignificanceofatreatmenteffectamongagroupofgenes
AT velascocruz methodforcomputingtheoverallstatisticalsignificanceofatreatmenteffectamongagroupofgenes