Cargando…
Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data
Molecular gene-expression datasets consist of samples with tens of thousands of measured quantities (e.g., high dimensional data). However, there exist lower-dimensional representations that retain the useful information. We present a novel algorithm for such dimensionality reduction called Pathway...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7556388/ http://dx.doi.org/10.1007/978-3-030-61527-7_17 |
_version_ | 1783594208356990976 |
---|---|
author | Karagiannaki, Ioulia Pantazis, Yannis Chatzaki, Ekaterini Tsamardinos, Ioannis |
author_facet | Karagiannaki, Ioulia Pantazis, Yannis Chatzaki, Ekaterini Tsamardinos, Ioannis |
author_sort | Karagiannaki, Ioulia |
collection | PubMed |
description | Molecular gene-expression datasets consist of samples with tens of thousands of measured quantities (e.g., high dimensional data). However, there exist lower-dimensional representations that retain the useful information. We present a novel algorithm for such dimensionality reduction called Pathway Activity Score Learning (PASL). The major novelty of PASL is that the constructed features directly correspond to known molecular pathways and can be interpreted as pathway activity scores. Hence, unlike PCA and similar methods, PASL’s latent space has a relatively straight-forward biological interpretation. As a use-case, PASL is applied on two collections of breast cancer and leukemia gene expression datasets. We show that PASL does retain the predictive information for disease classification on new, unseen datasets, as well as outperforming PLIER, a recently proposed competitive method. We also show that differential activation pathway analysis provides complementary information to standard gene set enrichment analysis. The code is available at https://github.com/mensxmachina/PASL. |
format | Online Article Text |
id | pubmed-7556388 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-75563882020-10-15 Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data Karagiannaki, Ioulia Pantazis, Yannis Chatzaki, Ekaterini Tsamardinos, Ioannis Discovery Science Article Molecular gene-expression datasets consist of samples with tens of thousands of measured quantities (e.g., high dimensional data). However, there exist lower-dimensional representations that retain the useful information. We present a novel algorithm for such dimensionality reduction called Pathway Activity Score Learning (PASL). The major novelty of PASL is that the constructed features directly correspond to known molecular pathways and can be interpreted as pathway activity scores. Hence, unlike PCA and similar methods, PASL’s latent space has a relatively straight-forward biological interpretation. As a use-case, PASL is applied on two collections of breast cancer and leukemia gene expression datasets. We show that PASL does retain the predictive information for disease classification on new, unseen datasets, as well as outperforming PLIER, a recently proposed competitive method. We also show that differential activation pathway analysis provides complementary information to standard gene set enrichment analysis. The code is available at https://github.com/mensxmachina/PASL. 2020-09-19 /pmc/articles/PMC7556388/ http://dx.doi.org/10.1007/978-3-030-61527-7_17 Text en © The Author(s) 2020 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. |
spellingShingle | Article Karagiannaki, Ioulia Pantazis, Yannis Chatzaki, Ekaterini Tsamardinos, Ioannis Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data |
title | Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data |
title_full | Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data |
title_fullStr | Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data |
title_full_unstemmed | Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data |
title_short | Pathway Activity Score Learning for Dimensionality Reduction of Gene Expression Data |
title_sort | pathway activity score learning for dimensionality reduction of gene expression data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7556388/ http://dx.doi.org/10.1007/978-3-030-61527-7_17 |
work_keys_str_mv | AT karagiannakiioulia pathwayactivityscorelearningfordimensionalityreductionofgeneexpressiondata AT pantazisyannis pathwayactivityscorelearningfordimensionalityreductionofgeneexpressiondata AT chatzakiekaterini pathwayactivityscorelearningfordimensionalityreductionofgeneexpressiondata AT tsamardinosioannis pathwayactivityscorelearningfordimensionalityreductionofgeneexpressiondata |