Cargando…
A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions
BACKGROUND: We address the problem of integratively analyzing multiple gene expression, microarray datasets in order to reconstruct gene-gene interaction networks. Integrating multiple datasets is generally believed to provide increased statistical power and to lead to a better characterization of t...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4905611/ https://www.ncbi.nlm.nih.gov/pubmed/27294826 http://dx.doi.org/10.1186/s12859-016-1038-1 |
_version_ | 1782437280011517952 |
---|---|
author | Lagani, Vincenzo Karozou, Argyro D. Gomez-Cabrero, David Silberberg, Gilad Tsamardinos, Ioannis |
author_facet | Lagani, Vincenzo Karozou, Argyro D. Gomez-Cabrero, David Silberberg, Gilad Tsamardinos, Ioannis |
author_sort | Lagani, Vincenzo |
collection | PubMed |
description | BACKGROUND: We address the problem of integratively analyzing multiple gene expression, microarray datasets in order to reconstruct gene-gene interaction networks. Integrating multiple datasets is generally believed to provide increased statistical power and to lead to a better characterization of the system under study. However, the presence of systematic variation across different studies makes network reverse-engineering tasks particularly challenging. We contrast two approaches that have been frequently used in the literature for addressing systematic biases: meta-analysis methods, which first calculate opportune statistics on single datasets and successively summarize them, and data-merging methods, which directly analyze the pooled data after removing eventual biases. This comparative evaluation is performed on both synthetic and real data, the latter consisting of two manually curated microarray compendia comprising several E. coli and Yeast studies, respectively. Furthermore, the reconstruction of the regulatory network of the transcription factor Ikaros in human Peripheral Blood Mononuclear Cells (PBMCs) is presented as a case-study. RESULTS: The meta-analysis and data-merging methods included in our experimentations provided comparable performances on both synthetic and real data. Furthermore, both approaches outperformed (a) the naïve solution of merging data together ignoring possible biases, and (b) the results that are expected when only one dataset out of the available ones is analyzed in isolation. Using correlation statistics proved to be more effective than using p-values for correctly ranking candidate interactions. The results from the PBMC case-study indicate that the findings of the present study generalize to different types of network reconstruction algorithms. CONCLUSIONS: Ignoring the systematic variations that differentiate heterogeneous studies can produce results that are statistically indistinguishable from random guessing. Meta-analysis and data merging methods have proved equally effective in addressing this issue, and thus researchers may safely select the approach that best suit their specific application. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1038-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4905611 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49056112016-06-14 A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions Lagani, Vincenzo Karozou, Argyro D. Gomez-Cabrero, David Silberberg, Gilad Tsamardinos, Ioannis BMC Bioinformatics Research BACKGROUND: We address the problem of integratively analyzing multiple gene expression, microarray datasets in order to reconstruct gene-gene interaction networks. Integrating multiple datasets is generally believed to provide increased statistical power and to lead to a better characterization of the system under study. However, the presence of systematic variation across different studies makes network reverse-engineering tasks particularly challenging. We contrast two approaches that have been frequently used in the literature for addressing systematic biases: meta-analysis methods, which first calculate opportune statistics on single datasets and successively summarize them, and data-merging methods, which directly analyze the pooled data after removing eventual biases. This comparative evaluation is performed on both synthetic and real data, the latter consisting of two manually curated microarray compendia comprising several E. coli and Yeast studies, respectively. Furthermore, the reconstruction of the regulatory network of the transcription factor Ikaros in human Peripheral Blood Mononuclear Cells (PBMCs) is presented as a case-study. RESULTS: The meta-analysis and data-merging methods included in our experimentations provided comparable performances on both synthetic and real data. Furthermore, both approaches outperformed (a) the naïve solution of merging data together ignoring possible biases, and (b) the results that are expected when only one dataset out of the available ones is analyzed in isolation. Using correlation statistics proved to be more effective than using p-values for correctly ranking candidate interactions. The results from the PBMC case-study indicate that the findings of the present study generalize to different types of network reconstruction algorithms. CONCLUSIONS: Ignoring the systematic variations that differentiate heterogeneous studies can produce results that are statistically indistinguishable from random guessing. Meta-analysis and data merging methods have proved equally effective in addressing this issue, and thus researchers may safely select the approach that best suit their specific application. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1038-1) contains supplementary material, which is available to authorized users. BioMed Central 2016-06-06 /pmc/articles/PMC4905611/ /pubmed/27294826 http://dx.doi.org/10.1186/s12859-016-1038-1 Text en © Lagani et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Lagani, Vincenzo Karozou, Argyro D. Gomez-Cabrero, David Silberberg, Gilad Tsamardinos, Ioannis A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions |
title | A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions |
title_full | A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions |
title_fullStr | A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions |
title_full_unstemmed | A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions |
title_short | A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions |
title_sort | comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4905611/ https://www.ncbi.nlm.nih.gov/pubmed/27294826 http://dx.doi.org/10.1186/s12859-016-1038-1 |
work_keys_str_mv | AT laganivincenzo acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions AT karozouargyrod acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions AT gomezcabrerodavid acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions AT silberberggilad acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions AT tsamardinosioannis acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions AT laganivincenzo comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions AT karozouargyrod comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions AT gomezcabrerodavid comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions AT silberberggilad comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions AT tsamardinosioannis comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions |