
An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα


Bibliographic Details
Main authors: Ullah, Mohammad Ohid, Müller, Michael, Hooiveld, Guido J.E.J.
Format: Online Article Text
Language: English
Published: Libertas Academica 2012
Subjects: Methodology
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3388742/
https://www.ncbi.nlm.nih.gov/pubmed/22783064
http://dx.doi.org/10.4137/BBI.S9529
author Ullah, Mohammad Ohid
Müller, Michael
Hooiveld, Guido J.E.J.
collection PubMed
description An effective strategy to elucidate the signal transduction cascades activated by a transcription factor is to compare the transcriptional profiles of wild type and transcription factor knockout models. Many statistical tests have been proposed for analyzing gene expression data, but most tests are based on pairwise comparisons. Since the analysis of microarrays involves the testing of multiple hypotheses within one study, it is generally accepted that one should control for false positives using the false discovery rate (FDR). However, it has been reported that this may be an inappropriate metric for comparing data across different experiments. Here we propose an approach that addresses this problem by the simultaneous testing and integration of three hypotheses (contrasts) using the cell means ANOVA model. These three contrasts test for the effect of a treatment in wild type, in gene knockout, and globally over all experimental groups. We illustrate our approach on microarray experiments that focused on the identification of candidate target genes and biological processes governed by the fatty acid-sensing transcription factor PPARα in liver. Compared to the often-applied FDR-based across-experiment comparison, our approach identified a conservative but less noisy set of candidate genes with the same sensitivity and specificity. In addition, our method had the advantage of properly adjusting for multiple testing while integrating data from two experiments, and was driven by biological inference. Taken together, in this study we present a simple yet efficient strategy to compare differential expression of genes across experiments while controlling for multiple hypothesis testing.
format Online
Article
Text
id pubmed-3388742
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-3388742 2012-07-10 Bioinform Biol Insights Methodology Libertas Academica 2012-06-19 /pmc/articles/PMC3388742/ /pubmed/22783064 http://dx.doi.org/10.4137/BBI.S9529 Text en © the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
title An Integrated Statistical Approach to Compare Transcriptomics Data Across Experiments: A Case Study on the Identification of Candidate Target Genes of the Transcription Factor PPARα
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3388742/
https://www.ncbi.nlm.nih.gov/pubmed/22783064
http://dx.doi.org/10.4137/BBI.S9529