Cargando…
A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification
BACKGROUND: Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5568075/ https://www.ncbi.nlm.nih.gov/pubmed/28830341 http://dx.doi.org/10.1186/s12859-017-1794-6 |
_version_ | 1783258797427392512 |
---|---|
author | Xie, Xin-Ping Xie, Yu-Feng Wang, Hong-Qiang |
author_facet | Xie, Xin-Ping Xie, Yu-Feng Wang, Hong-Qiang |
author_sort | Xie, Xin-Ping |
collection | PubMed |
description | BACKGROUND: Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. RESULTS: This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their occurring probabilities in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs identification in the context of meta-analysis. CONCLUSIONS: Data heterogeneity largely influences the performance of meta-analysis of DEGs identification. Existing different meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1794-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5568075 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-55680752017-08-29 A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification Xie, Xin-Ping Xie, Yu-Feng Wang, Hong-Qiang BMC Bioinformatics Research Article BACKGROUND: Large-scale accumulation of omics data poses a pressing challenge of integrative analysis of multiple data sets in bioinformatics. An open question of such integrative analysis is how to pinpoint consistent but subtle gene activity patterns across studies. Study heterogeneity needs to be addressed carefully for this goal. RESULTS: This paper proposes a regulation probability model-based meta-analysis, jGRP, for identifying differentially expressed genes (DEGs). The method integrates multiple transcriptomics data sets in a gene regulatory space instead of in a gene expression space, which makes it easy to capture and manage data heterogeneity across studies from different laboratories or platforms. Specifically, we transform gene expression profiles into a united gene regulation profile across studies by mathematically defining two gene regulation events between two conditions and estimating their occurring probabilities in a sample. Finally, a novel differential expression statistic is established based on the gene regulation profiles, realizing accurate and flexible identification of DEGs in gene regulation space. We evaluated the proposed method on simulation data and real-world cancer datasets and showed the effectiveness and efficiency of jGRP in identifying DEGs identification in the context of meta-analysis. CONCLUSIONS: Data heterogeneity largely influences the performance of meta-analysis of DEGs identification. Existing different meta-analysis methods were revealed to exhibit very different degrees of sensitivity to study heterogeneity. The proposed method, jGRP, can be a standalone tool due to its united framework and controllable way to deal with study heterogeneity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1794-6) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-23 /pmc/articles/PMC5568075/ /pubmed/28830341 http://dx.doi.org/10.1186/s12859-017-1794-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Xie, Xin-Ping Xie, Yu-Feng Wang, Hong-Qiang A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification |
title | A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification |
title_full | A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification |
title_fullStr | A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification |
title_full_unstemmed | A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification |
title_short | A regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification |
title_sort | regulation probability model-based meta-analysis of multiple transcriptomics data sets for cancer biomarker identification |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5568075/ https://www.ncbi.nlm.nih.gov/pubmed/28830341 http://dx.doi.org/10.1186/s12859-017-1794-6 |
work_keys_str_mv | AT xiexinping aregulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification AT xieyufeng aregulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification AT wanghongqiang aregulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification AT xiexinping regulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification AT xieyufeng regulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification AT wanghongqiang regulationprobabilitymodelbasedmetaanalysisofmultipletranscriptomicsdatasetsforcancerbiomarkeridentification |