Cargando…

Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery

Motivation: Despite ongoing cancer research, available therapies are still limited in quantity and effectiveness, and making treatment decisions for individual patients remains a hard problem. Established subtypes, which help guide these decisions, are mainly based on individual data types. However,...

Descripción completa

Detalles Bibliográficos
Autores principales: Speicher, Nora K., Pfeifer, Nico
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765854/
https://www.ncbi.nlm.nih.gov/pubmed/26072491
http://dx.doi.org/10.1093/bioinformatics/btv244
_version_ 1782417582890942464
author Speicher, Nora K.
Pfeifer, Nico
author_facet Speicher, Nora K.
Pfeifer, Nico
author_sort Speicher, Nora K.
collection PubMed
description Motivation: Despite ongoing cancer research, available therapies are still limited in quantity and effectiveness, and making treatment decisions for individual patients remains a hard problem. Established subtypes, which help guide these decisions, are mainly based on individual data types. However, the analysis of multidimensional patient data involving the measurements of various molecular features could reveal intrinsic characteristics of the tumor. Large-scale projects accumulate this kind of data for various cancer types, but we still lack the computational methods to reliably integrate this information in a meaningful manner. Therefore, we apply and extend current multiple kernel learning for dimensionality reduction approaches. On the one hand, we add a regularization term to avoid overfitting during the optimization procedure, and on the other hand, we show that one can even use several kernels per data type and thereby alleviate the user from having to choose the best kernel functions and kernel parameters for each data type beforehand. Results: We have identified biologically meaningful subgroups for five different cancer types. Survival analysis has revealed significant differences between the survival times of the identified subtypes, with P values comparable or even better than state-of-the-art methods. Moreover, our resulting subtypes reflect combined patterns from the different data sources, and we demonstrate that input kernel matrices with only little information have less impact on the integrated kernel matrix. Our subtypes show different responses to specific therapies, which could eventually assist in treatment decision making. Availability and implementation: An executable is available upon request. Contact: nora@mpi-inf.mpg.de or npfeifer@mpi-inf.mpg.de
format Online
Article
Text
id pubmed-4765854
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-47658542016-03-04 Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery Speicher, Nora K. Pfeifer, Nico Bioinformatics Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland Motivation: Despite ongoing cancer research, available therapies are still limited in quantity and effectiveness, and making treatment decisions for individual patients remains a hard problem. Established subtypes, which help guide these decisions, are mainly based on individual data types. However, the analysis of multidimensional patient data involving the measurements of various molecular features could reveal intrinsic characteristics of the tumor. Large-scale projects accumulate this kind of data for various cancer types, but we still lack the computational methods to reliably integrate this information in a meaningful manner. Therefore, we apply and extend current multiple kernel learning for dimensionality reduction approaches. On the one hand, we add a regularization term to avoid overfitting during the optimization procedure, and on the other hand, we show that one can even use several kernels per data type and thereby alleviate the user from having to choose the best kernel functions and kernel parameters for each data type beforehand. Results: We have identified biologically meaningful subgroups for five different cancer types. Survival analysis has revealed significant differences between the survival times of the identified subtypes, with P values comparable or even better than state-of-the-art methods. Moreover, our resulting subtypes reflect combined patterns from the different data sources, and we demonstrate that input kernel matrices with only little information have less impact on the integrated kernel matrix. Our subtypes show different responses to specific therapies, which could eventually assist in treatment decision making. Availability and implementation: An executable is available upon request. Contact: nora@mpi-inf.mpg.de or npfeifer@mpi-inf.mpg.de Oxford University Press 2015-06-15 2015-06-10 /pmc/articles/PMC4765854/ /pubmed/26072491 http://dx.doi.org/10.1093/bioinformatics/btv244 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland
Speicher, Nora K.
Pfeifer, Nico
Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery
title Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery
title_full Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery
title_fullStr Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery
title_full_unstemmed Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery
title_short Integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery
title_sort integrating different data types by regularized unsupervised multiple kernel learning with application to cancer subtype discovery
topic Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765854/
https://www.ncbi.nlm.nih.gov/pubmed/26072491
http://dx.doi.org/10.1093/bioinformatics/btv244
work_keys_str_mv AT speichernorak integratingdifferentdatatypesbyregularizedunsupervisedmultiplekernellearningwithapplicationtocancersubtypediscovery
AT pfeifernico integratingdifferentdatatypesbyregularizedunsupervisedmultiplekernellearningwithapplicationtocancersubtypediscovery