Cargando…

Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data

Molecular gene-expression datasets consist of samples with tens of thousands of measured quantities (e.g., high dimensional data). However, there exist lower-dimensional representations that retain the useful information. We present a novel algorithm for such dimensionality reduction called Pathway...

Descripción completa

Detalles Bibliográficos
Autores principales: Karagiannaki, Ioulia, Pantazis, Yannis, Chatzaki, Ekaterini, Tsamardinos, Ioannis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7556388/
http://dx.doi.org/10.1007/978-3-030-61527-7_17
_version_ 1783594208356990976
author Karagiannaki, Ioulia
Pantazis, Yannis
Chatzaki, Ekaterini
Tsamardinos, Ioannis
author_facet Karagiannaki, Ioulia
Pantazis, Yannis
Chatzaki, Ekaterini
Tsamardinos, Ioannis
author_sort Karagiannaki, Ioulia
collection PubMed
description Molecular gene-expression datasets consist of samples with tens of thousands of measured quantities (e.g., high dimensional data). However, there exist lower-dimensional representations that retain the useful information. We present a novel algorithm for such dimensionality reduction called Pathway Activity Score Learning (PASL). The major novelty of PASL is that the constructed features directly correspond to known molecular pathways and can be interpreted as pathway activity scores. Hence, unlike PCA and similar methods, PASL’s latent space has a relatively straight-forward biological interpretation. As a use-case, PASL is applied on two collections of breast cancer and leukemia gene expression datasets. We show that PASL does retain the predictive information for disease classification on new, unseen datasets, as well as outperforming PLIER, a recently proposed competitive method. We also show that differential activation pathway analysis provides complementary information to standard gene set enrichment analysis. The code is available at https://github.com/mensxmachina/PASL.
format Online
Article
Text
id pubmed-7556388
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-75563882020-10-15 Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data Karagiannaki, Ioulia Pantazis, Yannis Chatzaki, Ekaterini Tsamardinos, Ioannis Discovery Science Article Molecular gene-expression datasets consist of samples with tens of thousands of measured quantities (e.g., high dimensional data). However, there exist lower-dimensional representations that retain the useful information. We present a novel algorithm for such dimensionality reduction called Pathway Activity Score Learning (PASL). The major novelty of PASL is that the constructed features directly correspond to known molecular pathways and can be interpreted as pathway activity scores. Hence, unlike PCA and similar methods, PASL’s latent space has a relatively straight-forward biological interpretation. As a use-case, PASL is applied on two collections of breast cancer and leukemia gene expression datasets. We show that PASL does retain the predictive information for disease classification on new, unseen datasets, as well as outperforming PLIER, a recently proposed competitive method. We also show that differential activation pathway analysis provides complementary information to standard gene set enrichment analysis. The code is available at https://github.com/mensxmachina/PASL. 2020-09-19 /pmc/articles/PMC7556388/ http://dx.doi.org/10.1007/978-3-030-61527-7_17 Text en © The Author(s) 2020 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
spellingShingle Article
Karagiannaki, Ioulia
Pantazis, Yannis
Chatzaki, Ekaterini
Tsamardinos, Ioannis
Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data
title Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data
title_full Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data
title_fullStr Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data
title_full_unstemmed Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data
title_short Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data
title_sort pathway activity score learning for dimensionality reduction of gene expression data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7556388/
http://dx.doi.org/10.1007/978-3-030-61527-7_17
work_keys_str_mv AT karagiannakiioulia pathwayactivityscorelearningfordimensionalityreductionofgeneexpressiondata
AT pantazisyannis pathwayactivityscorelearningfordimensionalityreductionofgeneexpressiondata
AT chatzakiekaterini pathwayactivityscorelearningfordimensionalityreductionofgeneexpressiondata
AT tsamardinosioannis pathwayactivityscorelearningfordimensionalityreductionofgeneexpressiondata