Cargando…
An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα
An effective strategy to elucidate the signal transduction cascades activated by a transcription factor is to compare the transcriptional profiles of wild type and transcription factor knockout models. Many statistical tests have been proposed for analyzing gene expression data, but most tests are b...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3388742/ https://www.ncbi.nlm.nih.gov/pubmed/22783064 http://dx.doi.org/10.4137/BBI.S9529 |
_version_ | 1782237234439651328 |
---|---|
author | Ullah, Mohammad Ohid Müller, Michael Hooiveld, Guido J.E.J. |
author_facet | Ullah, Mohammad Ohid Müller, Michael Hooiveld, Guido J.E.J. |
author_sort | Ullah, Mohammad Ohid |
collection | PubMed |
description | An effective strategy to elucidate the signal transduction cascades activated by a transcription factor is to compare the transcriptional profiles of wild type and transcription factor knockout models. Many statistical tests have been proposed for analyzing gene expression data, but most tests are based on pair-wise comparisons. Since the analysis of microarrays involves the testing of multiple hypotheses within one study, it is generally accepted that one should control for false positives by the false discovery rate (FDR). However, it has been reported that this may be an inappropriate metric for comparing data across different experiments. Here we propose an approach that addresses the above mentioned problem by the simultaneous testing and integration of the three hypotheses (contrasts) using the cell means ANOVA model. These three contrasts test for the effect of a treatment in wild type, gene knockout, and globally over all experimental groups. We illustrate our approach on microarray experiments that focused on the identification of candidate target genes and biological processes governed by the fatty acid sensing transcription factor PPARα in liver. Compared to the often applied FDR based across experiment comparison, our approach identified a conservative but less noisy set of candidate genes with same sensitivity and specificity. However, our method had the advantage of properly adjusting for multiple testing while integrating data from two experiments, and was driven by biological inference. Taken together, in this study we present a simple, yet efficient strategy to compare differential expression of genes across experiments while controlling for multiple hypothesis testing. |
format | Online Article Text |
id | pubmed-3388742 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-33887422012-07-10 An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα Ullah, Mohammad Ohid Müller, Michael Hooiveld, Guido J.E.J. Bioinform Biol Insights Methodology An effective strategy to elucidate the signal transduction cascades activated by a transcription factor is to compare the transcriptional profiles of wild type and transcription factor knockout models. Many statistical tests have been proposed for analyzing gene expression data, but most tests are based on pair-wise comparisons. Since the analysis of microarrays involves the testing of multiple hypotheses within one study, it is generally accepted that one should control for false positives by the false discovery rate (FDR). However, it has been reported that this may be an inappropriate metric for comparing data across different experiments. Here we propose an approach that addresses the above mentioned problem by the simultaneous testing and integration of the three hypotheses (contrasts) using the cell means ANOVA model. These three contrasts test for the effect of a treatment in wild type, gene knockout, and globally over all experimental groups. We illustrate our approach on microarray experiments that focused on the identification of candidate target genes and biological processes governed by the fatty acid sensing transcription factor PPARα in liver. Compared to the often applied FDR based across experiment comparison, our approach identified a conservative but less noisy set of candidate genes with same sensitivity and specificity. However, our method had the advantage of properly adjusting for multiple testing while integrating data from two experiments, and was driven by biological inference. Taken together, in this study we present a simple, yet efficient strategy to compare differential expression of genes across experiments while controlling for multiple hypothesis testing. Libertas Academica 2012-06-19 /pmc/articles/PMC3388742/ /pubmed/22783064 http://dx.doi.org/10.4137/BBI.S9529 Text en © the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited. |
spellingShingle | Methodology Ullah, Mohammad Ohid Müller, Michael Hooiveld, Guido J.E.J. An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα |
title | An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα |
title_full | An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα |
title_fullStr | An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα |
title_full_unstemmed | An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα |
title_short | An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα |
title_sort | integrated statistical approach to compare transcriptomics data across experiments: a case study on the identification of candidate target genes of the transcription factor pparα |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3388742/ https://www.ncbi.nlm.nih.gov/pubmed/22783064 http://dx.doi.org/10.4137/BBI.S9529 |
work_keys_str_mv | AT ullahmohammadohid anintegratedstatisticalapproachtocomparetranscriptomicsdataacrossexperimentsacasestudyontheidentificationofcandidatetargetgenesofthetranscriptionfactorppara AT mullermichael anintegratedstatisticalapproachtocomparetranscriptomicsdataacrossexperimentsacasestudyontheidentificationofcandidatetargetgenesofthetranscriptionfactorppara AT hooiveldguidojej anintegratedstatisticalapproachtocomparetranscriptomicsdataacrossexperimentsacasestudyontheidentificationofcandidatetargetgenesofthetranscriptionfactorppara AT ullahmohammadohid integratedstatisticalapproachtocomparetranscriptomicsdataacrossexperimentsacasestudyontheidentificationofcandidatetargetgenesofthetranscriptionfactorppara AT mullermichael integratedstatisticalapproachtocomparetranscriptomicsdataacrossexperimentsacasestudyontheidentificationofcandidatetargetgenesofthetranscriptionfactorppara AT hooiveldguidojej integratedstatisticalapproachtocomparetranscriptomicsdataacrossexperimentsacasestudyontheidentificationofcandidatetargetgenesofthetranscriptionfactorppara |