Cargando…

A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification

BACKGROUND: Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Xin-Ping, Xie, Yu-Feng, Wang, Hong-Qiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5568075/
https://www.ncbi.nlm.nih.gov/pubmed/28830341
http://dx.doi.org/10.1186/s12859-017-1794-6
_version_ 1783258797427392512
author Xie, Xin-Ping
Xie, Yu-Feng
Wang, Hong-Qiang
author_facet Xie, Xin-Ping
Xie, Yu-Feng
Wang, Hong-Qiang
author_sort Xie, Xin-Ping
collection PubMed
description BACKGROUND: Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. RESULTS: This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their occurring probabilities in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs identification in the context of meta-analysis. CONCLUSIONS: Data heterogeneity largely influences the performance of meta-analysis of DEGs identification. Existing different meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1794-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5568075
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55680752017-08-29 A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification Xie, Xin-Ping Xie, Yu-Feng Wang, Hong-Qiang BMC Bioinformatics Research Article BACKGROUND: Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. RESULTS: This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their occurring probabilities in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs identification in the context of meta-analysis. CONCLUSIONS: Data heterogeneity largely influences the performance of meta-analysis of DEGs identification. Existing different meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1794-6) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-23 /pmc/articles/PMC5568075/ /pubmed/28830341 http://dx.doi.org/10.1186/s12859-017-1794-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Xie, Xin-Ping
Xie, Yu-Feng
Wang, Hong-Qiang
A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification
title A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification
title_full A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification
title_fullStr A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification
title_full_unstemmed A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification
title_short A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification
title_sort regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5568075/
https://www.ncbi.nlm.nih.gov/pubmed/28830341
http://dx.doi.org/10.1186/s12859-017-1794-6
work_keys_str_mv AT xiexinping aregulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification
AT xieyufeng aregulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification
AT wanghongqiang aregulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification
AT xiexinping regulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification
AT xieyufeng regulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification
AT wanghongqiang regulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification