Cargando…
Predicting clinical outcomes in neuroblastoma with genomic data integration
BACKGROUND: Neuroblastoma is a heterogeneous disease with diverse clinical outcomes. Current risk group models require improvement as patients within the same risk group can still show variable prognosis. Recently collected genome-wide datasets provide opportunities to infer neuroblastoma subtypes i...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889397/ https://www.ncbi.nlm.nih.gov/pubmed/30621745 http://dx.doi.org/10.1186/s13062-018-0223-8 |
_version_ | 1783475408176414720 |
---|---|
author | Baali, Ilyes Acar, D Alp Emre Aderinwale, Tunde W. HafezQorani, Saber Kazan, Hilal |
author_facet | Baali, Ilyes Acar, D Alp Emre Aderinwale, Tunde W. HafezQorani, Saber Kazan, Hilal |
author_sort | Baali, Ilyes |
collection | PubMed |
description | BACKGROUND: Neuroblastoma is a heterogeneous disease with diverse clinical outcomes. Current risk group models require improvement as patients within the same risk group can still show variable prognosis. Recently collected genome-wide datasets provide opportunities to infer neuroblastoma subtypes in a more unified way. Within this context, data integration is critical as different molecular characteristics can contain complementary signals. To this end, we utilized the genomic datasets available for the SEQC cohort patients to develop supervised and unsupervised models that can predict disease prognosis. RESULTS: Our supervised model trained on the SEQC cohort can accurately predict overall survival and event-free survival profiles of patients in two independent cohorts. We also performed extensive experiments to assess the prediction accuracy of high risk patients and patients without MYCN amplification. Our results from this part suggest that clinical endpoints can be predicted accurately across multiple cohorts. To explore the data in an unsupervised manner, we used an integrative clustering strategy named multi-view kernel k-means (MVKKM) that can effectively integrate multiple high-dimensional datasets with varying weights. We observed that integrating different gene expression datasets results in a better patient stratification compared to using these datasets individually. Also, our identified subgroups provide a better Cox regression model fit compared to the existing risk group definitions. CONCLUSION: Altogether, our results indicate that integration of multiple genomic characterizations enables the discovery of subtypes that improve over existing definitions of risk groups. Effective prediction of survival times will have a direct impact on choosing the right therapies for patients. REVIEWERS: This article was reviewed by Susmita Datta, Wenzhong Xiao and Ziv Shkedy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13062-018-0223-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6889397 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68893972019-12-11 Predicting clinical outcomes in neuroblastoma with genomic data integration Baali, Ilyes Acar, D Alp Emre Aderinwale, Tunde W. HafezQorani, Saber Kazan, Hilal Biol Direct Research BACKGROUND: Neuroblastoma is a heterogeneous disease with diverse clinical outcomes. Current risk group models require improvement as patients within the same risk group can still show variable prognosis. Recently collected genome-wide datasets provide opportunities to infer neuroblastoma subtypes in a more unified way. Within this context, data integration is critical as different molecular characteristics can contain complementary signals. To this end, we utilized the genomic datasets available for the SEQC cohort patients to develop supervised and unsupervised models that can predict disease prognosis. RESULTS: Our supervised model trained on the SEQC cohort can accurately predict overall survival and event-free survival profiles of patients in two independent cohorts. We also performed extensive experiments to assess the prediction accuracy of high risk patients and patients without MYCN amplification. Our results from this part suggest that clinical endpoints can be predicted accurately across multiple cohorts. To explore the data in an unsupervised manner, we used an integrative clustering strategy named multi-view kernel k-means (MVKKM) that can effectively integrate multiple high-dimensional datasets with varying weights. We observed that integrating different gene expression datasets results in a better patient stratification compared to using these datasets individually. Also, our identified subgroups provide a better Cox regression model fit compared to the existing risk group definitions. CONCLUSION: Altogether, our results indicate that integration of multiple genomic characterizations enables the discovery of subtypes that improve over existing definitions of risk groups. Effective prediction of survival times will have a direct impact on choosing the right therapies for patients. REVIEWERS: This article was reviewed by Susmita Datta, Wenzhong Xiao and Ziv Shkedy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13062-018-0223-8) contains supplementary material, which is available to authorized users. BioMed Central 2018-09-27 /pmc/articles/PMC6889397/ /pubmed/30621745 http://dx.doi.org/10.1186/s13062-018-0223-8 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Baali, Ilyes Acar, D Alp Emre Aderinwale, Tunde W. HafezQorani, Saber Kazan, Hilal Predicting clinical outcomes in neuroblastoma with genomic data integration |
title | Predicting clinical outcomes in neuroblastoma with genomic data integration |
title_full | Predicting clinical outcomes in neuroblastoma with genomic data integration |
title_fullStr | Predicting clinical outcomes in neuroblastoma with genomic data integration |
title_full_unstemmed | Predicting clinical outcomes in neuroblastoma with genomic data integration |
title_short | Predicting clinical outcomes in neuroblastoma with genomic data integration |
title_sort | predicting clinical outcomes in neuroblastoma with genomic data integration |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889397/ https://www.ncbi.nlm.nih.gov/pubmed/30621745 http://dx.doi.org/10.1186/s13062-018-0223-8 |
work_keys_str_mv | AT baaliilyes predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration AT acardalpemre predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration AT aderinwaletundew predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration AT hafezqoranisaber predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration AT kazanhilal predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration |