Cargando…

Predicting clinical outcomes in neuroblastoma with genomic data integration

BACKGROUND: Neuroblastoma is a heterogeneous disease with diverse clinical outcomes. Current risk group models require improvement as patients within the same risk group can still show variable prognosis. Recently collected genome-wide datasets provide opportunities to infer neuroblastoma subtypes i...

Descripción completa

Detalles Bibliográficos
Autores principales: Baali, Ilyes, Acar, D Alp Emre, Aderinwale, Tunde W., HafezQorani, Saber, Kazan, Hilal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889397/
https://www.ncbi.nlm.nih.gov/pubmed/30621745
http://dx.doi.org/10.1186/s13062-018-0223-8
_version_ 1783475408176414720
author Baali, Ilyes
Acar, D Alp Emre
Aderinwale, Tunde W.
HafezQorani, Saber
Kazan, Hilal
author_facet Baali, Ilyes
Acar, D Alp Emre
Aderinwale, Tunde W.
HafezQorani, Saber
Kazan, Hilal
author_sort Baali, Ilyes
collection PubMed
description BACKGROUND: Neuroblastoma is a heterogeneous disease with diverse clinical outcomes. Current risk group models require improvement as patients within the same risk group can still show variable prognosis. Recently collected genome-wide datasets provide opportunities to infer neuroblastoma subtypes in a more unified way. Within this context, data integration is critical as different molecular characteristics can contain complementary signals. To this end, we utilized the genomic datasets available for the SEQC cohort patients to develop supervised and unsupervised models that can predict disease prognosis. RESULTS: Our supervised model trained on the SEQC cohort can accurately predict overall survival and event-free survival profiles of patients in two independent cohorts. We also performed extensive experiments to assess the prediction accuracy of high risk patients and patients without MYCN amplification. Our results from this part suggest that clinical endpoints can be predicted accurately across multiple cohorts. To explore the data in an unsupervised manner, we used an integrative clustering strategy named multi-view kernel k-means (MVKKM) that can effectively integrate multiple high-dimensional datasets with varying weights. We observed that integrating different gene expression datasets results in a better patient stratification compared to using these datasets individually. Also, our identified subgroups provide a better Cox regression model fit compared to the existing risk group definitions. CONCLUSION: Altogether, our results indicate that integration of multiple genomic characterizations enables the discovery of subtypes that improve over existing definitions of risk groups. Effective prediction of survival times will have a direct impact on choosing the right therapies for patients. REVIEWERS: This article was reviewed by Susmita Datta, Wenzhong Xiao and Ziv Shkedy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13062-018-0223-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6889397
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68893972019-12-11 Predicting clinical outcomes in neuroblastoma with genomic data integration Baali, Ilyes Acar, D Alp Emre Aderinwale, Tunde W. HafezQorani, Saber Kazan, Hilal Biol Direct Research BACKGROUND: Neuroblastoma is a heterogeneous disease with diverse clinical outcomes. Current risk group models require improvement as patients within the same risk group can still show variable prognosis. Recently collected genome-wide datasets provide opportunities to infer neuroblastoma subtypes in a more unified way. Within this context, data integration is critical as different molecular characteristics can contain complementary signals. To this end, we utilized the genomic datasets available for the SEQC cohort patients to develop supervised and unsupervised models that can predict disease prognosis. RESULTS: Our supervised model trained on the SEQC cohort can accurately predict overall survival and event-free survival profiles of patients in two independent cohorts. We also performed extensive experiments to assess the prediction accuracy of high risk patients and patients without MYCN amplification. Our results from this part suggest that clinical endpoints can be predicted accurately across multiple cohorts. To explore the data in an unsupervised manner, we used an integrative clustering strategy named multi-view kernel k-means (MVKKM) that can effectively integrate multiple high-dimensional datasets with varying weights. We observed that integrating different gene expression datasets results in a better patient stratification compared to using these datasets individually. Also, our identified subgroups provide a better Cox regression model fit compared to the existing risk group definitions. CONCLUSION: Altogether, our results indicate that integration of multiple genomic characterizations enables the discovery of subtypes that improve over existing definitions of risk groups. Effective prediction of survival times will have a direct impact on choosing the right therapies for patients. REVIEWERS: This article was reviewed by Susmita Datta, Wenzhong Xiao and Ziv Shkedy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13062-018-0223-8) contains supplementary material, which is available to authorized users. BioMed Central 2018-09-27 /pmc/articles/PMC6889397/ /pubmed/30621745 http://dx.doi.org/10.1186/s13062-018-0223-8 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Baali, Ilyes
Acar, D Alp Emre
Aderinwale, Tunde W.
HafezQorani, Saber
Kazan, Hilal
Predicting clinical outcomes in neuroblastoma with genomic data integration
title Predicting clinical outcomes in neuroblastoma with genomic data integration
title_full Predicting clinical outcomes in neuroblastoma with genomic data integration
title_fullStr Predicting clinical outcomes in neuroblastoma with genomic data integration
title_full_unstemmed Predicting clinical outcomes in neuroblastoma with genomic data integration
title_short Predicting clinical outcomes in neuroblastoma with genomic data integration
title_sort predicting clinical outcomes in neuroblastoma with genomic data integration
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889397/
https://www.ncbi.nlm.nih.gov/pubmed/30621745
http://dx.doi.org/10.1186/s13062-018-0223-8
work_keys_str_mv AT baaliilyes predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration
AT acardalpemre predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration
AT aderinwaletundew predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration
AT hafezqoranisaber predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration
AT kazanhilal predictingclinicaloutcomesinneuroblastomawithgenomicdataintegration