Cargando…

Towards Multiple Kernel Principal Component Analysis for Integrative Analysis of Tumor Samples

Personalized treatment of patients based on tissue-specific cancer subtypes has strongly increased the efficacy of the chosen therapies. Even though the amount of data measured for cancer patients has increased over the last years, most cancer subtypes are still diagnosed based on individual data so...

Descripción completa

Detalles Bibliográficos
Autores principales: Speicher, Nora K., Pfeifer, Nico
Formato: Online Artículo Texto
Lenguaje:English
Publicado: De Gruyter 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6042822/
https://www.ncbi.nlm.nih.gov/pubmed/28688226
http://dx.doi.org/10.1515/jib-2017-0019
_version_ 1783339228471492608
author Speicher, Nora K.
Pfeifer, Nico
author_facet Speicher, Nora K.
Pfeifer, Nico
author_sort Speicher, Nora K.
collection PubMed
description Personalized treatment of patients based on tissue-specific cancer subtypes has strongly increased the efficacy of the chosen therapies. Even though the amount of data measured for cancer patients has increased over the last years, most cancer subtypes are still diagnosed based on individual data sources (e.g. gene expression data). We propose an unsupervised data integration method based on kernel principal component analysis. Principal component analysis is one of the most widely used techniques in data analysis. Unfortunately, the straightforward multiple kernel extension of this method leads to the use of only one of the input matrices, which does not fit the goal of gaining information from all data sources. Therefore, we present a scoring function to determine the impact of each input matrix. The approach enables visualizing the integrated data and subsequent clustering for cancer subtype identification. Due to the nature of the method, no hyperparameters have to be set. We apply the methodology to five different cancer data sets and demonstrate its advantages in terms of results and usability.
format Online
Article
Text
id pubmed-6042822
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher De Gruyter
record_format MEDLINE/PubMed
spelling pubmed-60428222019-01-28 Towards Multiple Kernel Principal Component Analysis for Integrative Analysis of Tumor Samples Speicher, Nora K. Pfeifer, Nico J Integr Bioinform Research Articles Personalized treatment of patients based on tissue-specific cancer subtypes has strongly increased the efficacy of the chosen therapies. Even though the amount of data measured for cancer patients has increased over the last years, most cancer subtypes are still diagnosed based on individual data sources (e.g. gene expression data). We propose an unsupervised data integration method based on kernel principal component analysis. Principal component analysis is one of the most widely used techniques in data analysis. Unfortunately, the straightforward multiple kernel extension of this method leads to the use of only one of the input matrices, which does not fit the goal of gaining information from all data sources. Therefore, we present a scoring function to determine the impact of each input matrix. The approach enables visualizing the integrated data and subsequent clustering for cancer subtype identification. Due to the nature of the method, no hyperparameters have to be set. We apply the methodology to five different cancer data sets and demonstrate its advantages in terms of results and usability. De Gruyter 2017-07-08 /pmc/articles/PMC6042822/ /pubmed/28688226 http://dx.doi.org/10.1515/jib-2017-0019 Text en ©2017, Nora K. Speicher and Nico Pfeifer, published by De Gruyter, Berlin/Boston http://creativecommons.org/licenses/by-nc-nd/3.0 This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.
spellingShingle Research Articles
Speicher, Nora K.
Pfeifer, Nico
Towards Multiple Kernel Principal Component Analysis for Integrative Analysis of Tumor Samples
title Towards Multiple Kernel Principal Component Analysis for Integrative Analysis of Tumor Samples
title_full Towards Multiple Kernel Principal Component Analysis for Integrative Analysis of Tumor Samples
title_fullStr Towards Multiple Kernel Principal Component Analysis for Integrative Analysis of Tumor Samples
title_full_unstemmed Towards Multiple Kernel Principal Component Analysis for Integrative Analysis of Tumor Samples
title_short Towards Multiple Kernel Principal Component Analysis for Integrative Analysis of Tumor Samples
title_sort towards multiple kernel principal component analysis for integrative analysis of tumor samples
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6042822/
https://www.ncbi.nlm.nih.gov/pubmed/28688226
http://dx.doi.org/10.1515/jib-2017-0019
work_keys_str_mv AT speichernorak towardsmultiplekernelprincipalcomponentanalysisforintegrativeanalysisoftumorsamples
AT pfeifernico towardsmultiplekernelprincipalcomponentanalysisforintegrativeanalysisoftumorsamples