Cargando…

A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions

BACKGROUND: We address the problem of integratively analyzing multiple gene expression, microarray datasets in order to reconstruct gene-gene interaction networks. Integrating multiple datasets is generally believed to provide increased statistical power and to lead to a better characterization of t...

Descripción completa

Detalles Bibliográficos
Autores principales: Lagani, Vincenzo, Karozou, Argyro D., Gomez-Cabrero, David, Silberberg, Gilad, Tsamardinos, Ioannis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4905611/
https://www.ncbi.nlm.nih.gov/pubmed/27294826
http://dx.doi.org/10.1186/s12859-016-1038-1
_version_ 1782437280011517952
author Lagani, Vincenzo
Karozou, Argyro D.
Gomez-Cabrero, David
Silberberg, Gilad
Tsamardinos, Ioannis
author_facet Lagani, Vincenzo
Karozou, Argyro D.
Gomez-Cabrero, David
Silberberg, Gilad
Tsamardinos, Ioannis
author_sort Lagani, Vincenzo
collection PubMed
description BACKGROUND: We address the problem of integratively analyzing multiple gene expression, microarray datasets in order to reconstruct gene-gene interaction networks. Integrating multiple datasets is generally believed to provide increased statistical power and to lead to a better characterization of the system under study. However, the presence of systematic variation across different studies makes network reverse-engineering tasks particularly challenging. We contrast two approaches that have been frequently used in the literature for addressing systematic biases: meta-analysis methods, which first calculate opportune statistics on single datasets and successively summarize them, and data-merging methods, which directly analyze the pooled data after removing eventual biases. This comparative evaluation is performed on both synthetic and real data, the latter consisting of two manually curated microarray compendia comprising several E. coli and Yeast studies, respectively. Furthermore, the reconstruction of the regulatory network of the transcription factor Ikaros in human Peripheral Blood Mononuclear Cells (PBMCs) is presented as a case-study. RESULTS: The meta-analysis and data-merging methods included in our experimentations provided comparable performances on both synthetic and real data. Furthermore, both approaches outperformed (a) the naïve solution of merging data together ignoring possible biases, and (b) the results that are expected when only one dataset out of the available ones is analyzed in isolation. Using correlation statistics proved to be more effective than using p-values for correctly ranking candidate interactions. The results from the PBMC case-study indicate that the findings of the present study generalize to different types of network reconstruction algorithms. CONCLUSIONS: Ignoring the systematic variations that differentiate heterogeneous studies can produce results that are statistically indistinguishable from random guessing. Meta-analysis and data merging methods have proved equally effective in addressing this issue, and thus researchers may safely select the approach that best suit their specific application. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1038-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4905611
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49056112016-06-14 A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions Lagani, Vincenzo Karozou, Argyro D. Gomez-Cabrero, David Silberberg, Gilad Tsamardinos, Ioannis BMC Bioinformatics Research BACKGROUND: We address the problem of integratively analyzing multiple gene expression, microarray datasets in order to reconstruct gene-gene interaction networks. Integrating multiple datasets is generally believed to provide increased statistical power and to lead to a better characterization of the system under study. However, the presence of systematic variation across different studies makes network reverse-engineering tasks particularly challenging. We contrast two approaches that have been frequently used in the literature for addressing systematic biases: meta-analysis methods, which first calculate opportune statistics on single datasets and successively summarize them, and data-merging methods, which directly analyze the pooled data after removing eventual biases. This comparative evaluation is performed on both synthetic and real data, the latter consisting of two manually curated microarray compendia comprising several E. coli and Yeast studies, respectively. Furthermore, the reconstruction of the regulatory network of the transcription factor Ikaros in human Peripheral Blood Mononuclear Cells (PBMCs) is presented as a case-study. RESULTS: The meta-analysis and data-merging methods included in our experimentations provided comparable performances on both synthetic and real data. Furthermore, both approaches outperformed (a) the naïve solution of merging data together ignoring possible biases, and (b) the results that are expected when only one dataset out of the available ones is analyzed in isolation. Using correlation statistics proved to be more effective than using p-values for correctly ranking candidate interactions. The results from the PBMC case-study indicate that the findings of the present study generalize to different types of network reconstruction algorithms. CONCLUSIONS: Ignoring the systematic variations that differentiate heterogeneous studies can produce results that are statistically indistinguishable from random guessing. Meta-analysis and data merging methods have proved equally effective in addressing this issue, and thus researchers may safely select the approach that best suit their specific application. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1038-1) contains supplementary material, which is available to authorized users. BioMed Central 2016-06-06 /pmc/articles/PMC4905611/ /pubmed/27294826 http://dx.doi.org/10.1186/s12859-016-1038-1 Text en © Lagani et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lagani, Vincenzo
Karozou, Argyro D.
Gomez-Cabrero, David
Silberberg, Gilad
Tsamardinos, Ioannis
A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions
title A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions
title_full A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions
title_fullStr A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions
title_full_unstemmed A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions
title_short A comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions
title_sort comparative evaluation of data-merging and meta-analysis methods for reconstructing gene-gene interactions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4905611/
https://www.ncbi.nlm.nih.gov/pubmed/27294826
http://dx.doi.org/10.1186/s12859-016-1038-1
work_keys_str_mv AT laganivincenzo acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions
AT karozouargyrod acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions
AT gomezcabrerodavid acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions
AT silberberggilad acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions
AT tsamardinosioannis acomparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions
AT laganivincenzo comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions
AT karozouargyrod comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions
AT gomezcabrerodavid comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions
AT silberberggilad comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions
AT tsamardinosioannis comparativeevaluationofdatamergingandmetaanalysismethodsforreconstructinggenegeneinteractions