Cargando…
Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization
Gene expression is controlled by many simultaneous interactions, frequently measured collectively in biology and medicine by high-throughput technologies. It is a highly challenging task to infer from these data the generating effects and cooperating genes. Here, we present an unsupervised hypothesi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6883077/ https://www.ncbi.nlm.nih.gov/pubmed/31780653 http://dx.doi.org/10.1038/s41467-019-12713-5 |
_version_ | 1783474297539395584 |
---|---|
author | Grau, Michael Lenz, Georg Lenz, Peter |
author_facet | Grau, Michael Lenz, Georg Lenz, Peter |
author_sort | Grau, Michael |
collection | PubMed |
description | Gene expression is controlled by many simultaneous interactions, frequently measured collectively in biology and medicine by high-throughput technologies. It is a highly challenging task to infer from these data the generating effects and cooperating genes. Here, we present an unsupervised hypothesis-generating learning concept termed signal dissection by correlation maximization (SDCM) that dissects large high-dimensional datasets into signatures. Each signature captures a particular signal pattern that was consistently observed for multiple genes and samples, likely caused by the same underlying interaction. A key difference to other methods is our flexible nonlinear signal superposition model, combined with a precise regression technique. Analyzing gene expression of diffuse large B-cell lymphoma, our method discovers previously unidentified signatures that reveal significant differences in patient survival. These signatures are more predictive than those from various methods used for comparison and robustly validate across technological platforms. This implies highly specific extraction of clinically relevant gene interactions. |
format | Online Article Text |
id | pubmed-6883077 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-68830772019-12-03 Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization Grau, Michael Lenz, Georg Lenz, Peter Nat Commun Article Gene expression is controlled by many simultaneous interactions, frequently measured collectively in biology and medicine by high-throughput technologies. It is a highly challenging task to infer from these data the generating effects and cooperating genes. Here, we present an unsupervised hypothesis-generating learning concept termed signal dissection by correlation maximization (SDCM) that dissects large high-dimensional datasets into signatures. Each signature captures a particular signal pattern that was consistently observed for multiple genes and samples, likely caused by the same underlying interaction. A key difference to other methods is our flexible nonlinear signal superposition model, combined with a precise regression technique. Analyzing gene expression of diffuse large B-cell lymphoma, our method discovers previously unidentified signatures that reveal significant differences in patient survival. These signatures are more predictive than those from various methods used for comparison and robustly validate across technological platforms. This implies highly specific extraction of clinically relevant gene interactions. Nature Publishing Group UK 2019-11-28 /pmc/articles/PMC6883077/ /pubmed/31780653 http://dx.doi.org/10.1038/s41467-019-12713-5 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Grau, Michael Lenz, Georg Lenz, Peter Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization |
title | Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization |
title_full | Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization |
title_fullStr | Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization |
title_full_unstemmed | Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization |
title_short | Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization |
title_sort | dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6883077/ https://www.ncbi.nlm.nih.gov/pubmed/31780653 http://dx.doi.org/10.1038/s41467-019-12713-5 |
work_keys_str_mv | AT graumichael dissectionofgeneexpressiondatasetsintoclinicallyrelevantinteractionsignaturesviahighdimensionalcorrelationmaximization AT lenzgeorg dissectionofgeneexpressiondatasetsintoclinicallyrelevantinteractionsignaturesviahighdimensionalcorrelationmaximization AT lenzpeter dissectionofgeneexpressiondatasetsintoclinicallyrelevantinteractionsignaturesviahighdimensionalcorrelationmaximization |