Cargando…
Simultaneous Interrogation of Cancer Omics to Identify Subtypes With Significant Clinical Differences
Recent advances in high-throughput sequencing have accelerated the accumulation of omics data on the same tumor tissue from multiple sources. Intensive study of multi-omics integration on tumor samples can stimulate progress in precision medicine and is promising in detecting potential biomarkers. H...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6448130/ https://www.ncbi.nlm.nih.gov/pubmed/30984238 http://dx.doi.org/10.3389/fgene.2019.00236 |
_version_ | 1783408638784700416 |
---|---|
author | Xu, Aodan Chen, Jiazhou Peng, Hong Han, GuoQiang Cai, Hongmin |
author_facet | Xu, Aodan Chen, Jiazhou Peng, Hong Han, GuoQiang Cai, Hongmin |
author_sort | Xu, Aodan |
collection | PubMed |
description | Recent advances in high-throughput sequencing have accelerated the accumulation of omics data on the same tumor tissue from multiple sources. Intensive study of multi-omics integration on tumor samples can stimulate progress in precision medicine and is promising in detecting potential biomarkers. However, current methods are restricted owing to highly unbalanced dimensions of omics data or difficulty in assigning weights between different data sources. Therefore, the appropriate approximation and constraints of integrated targets remain a major challenge. In this paper, we proposed an omics data integration method, named high-order path elucidated similarity (HOPES). HOPES fuses the similarities derived from various omics data sources to solve the dimensional discrepancy, and progressively elucidate the similarities from each type of omics data into an integrated similarity with various high-order connected paths. Through a series of incremental constraints for commonality, HOPES can take both specificity of single data and consistency between different data types into consideration. The fused similarity matrix gives global insight into patients' correlation and efficiently distinguishes subgroups. We tested the performance of HOPES on both a simulated dataset and several empirical tumor datasets. The test datasets contain three omics types including gene expression, DNA methylation, and microRNA data for five different TCGA cancer projects. Our method was shown to achieve superior accuracy and high robustness compared with several benchmark methods on simulated data. Further experiments on five cancer datasets demonstrated that HOPES achieved superior performances in cancer classification. The stratified subgroups were shown to have statistically significant differences in survival. We further located and identified the key genes, methylation sites, and microRNAs within each subgroup. They were shown to achieve high potential prognostic value and were enriched in many cancer-related biological processes or pathways. |
format | Online Article Text |
id | pubmed-6448130 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-64481302019-04-12 Simultaneous Interrogation of Cancer Omics to Identify Subtypes With Significant Clinical Differences Xu, Aodan Chen, Jiazhou Peng, Hong Han, GuoQiang Cai, Hongmin Front Genet Genetics Recent advances in high-throughput sequencing have accelerated the accumulation of omics data on the same tumor tissue from multiple sources. Intensive study of multi-omics integration on tumor samples can stimulate progress in precision medicine and is promising in detecting potential biomarkers. However, current methods are restricted owing to highly unbalanced dimensions of omics data or difficulty in assigning weights between different data sources. Therefore, the appropriate approximation and constraints of integrated targets remain a major challenge. In this paper, we proposed an omics data integration method, named high-order path elucidated similarity (HOPES). HOPES fuses the similarities derived from various omics data sources to solve the dimensional discrepancy, and progressively elucidate the similarities from each type of omics data into an integrated similarity with various high-order connected paths. Through a series of incremental constraints for commonality, HOPES can take both specificity of single data and consistency between different data types into consideration. The fused similarity matrix gives global insight into patients' correlation and efficiently distinguishes subgroups. We tested the performance of HOPES on both a simulated dataset and several empirical tumor datasets. The test datasets contain three omics types including gene expression, DNA methylation, and microRNA data for five different TCGA cancer projects. Our method was shown to achieve superior accuracy and high robustness compared with several benchmark methods on simulated data. Further experiments on five cancer datasets demonstrated that HOPES achieved superior performances in cancer classification. The stratified subgroups were shown to have statistically significant differences in survival. We further located and identified the key genes, methylation sites, and microRNAs within each subgroup. They were shown to achieve high potential prognostic value and were enriched in many cancer-related biological processes or pathways. Frontiers Media S.A. 2019-03-28 /pmc/articles/PMC6448130/ /pubmed/30984238 http://dx.doi.org/10.3389/fgene.2019.00236 Text en Copyright © 2019 Xu, Chen, Peng, Han and Cai. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Xu, Aodan Chen, Jiazhou Peng, Hong Han, GuoQiang Cai, Hongmin Simultaneous Interrogation of Cancer Omics to Identify Subtypes With Significant Clinical Differences |
title | Simultaneous Interrogation of Cancer Omics to Identify Subtypes With Significant Clinical Differences |
title_full | Simultaneous Interrogation of Cancer Omics to Identify Subtypes With Significant Clinical Differences |
title_fullStr | Simultaneous Interrogation of Cancer Omics to Identify Subtypes With Significant Clinical Differences |
title_full_unstemmed | Simultaneous Interrogation of Cancer Omics to Identify Subtypes With Significant Clinical Differences |
title_short | Simultaneous Interrogation of Cancer Omics to Identify Subtypes With Significant Clinical Differences |
title_sort | simultaneous interrogation of cancer omics to identify subtypes with significant clinical differences |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6448130/ https://www.ncbi.nlm.nih.gov/pubmed/30984238 http://dx.doi.org/10.3389/fgene.2019.00236 |
work_keys_str_mv | AT xuaodan simultaneousinterrogationofcanceromicstoidentifysubtypeswithsignificantclinicaldifferences AT chenjiazhou simultaneousinterrogationofcanceromicstoidentifysubtypeswithsignificantclinicaldifferences AT penghong simultaneousinterrogationofcanceromicstoidentifysubtypeswithsignificantclinicaldifferences AT hanguoqiang simultaneousinterrogationofcanceromicstoidentifysubtypeswithsignificantclinicaldifferences AT caihongmin simultaneousinterrogationofcanceromicstoidentifysubtypeswithsignificantclinicaldifferences |