Cargando…

Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization

Gene expression is controlled by many simultaneous interactions, frequently measured collectively in biology and medicine by high-throughput technologies. It is a highly challenging task to infer from these data the generating effects and cooperating genes. Here, we present an unsupervised hypothesi...

Descripción completa

Detalles Bibliográficos
Autores principales: Grau, Michael, Lenz, Georg, Lenz, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6883077/
https://www.ncbi.nlm.nih.gov/pubmed/31780653
http://dx.doi.org/10.1038/s41467-019-12713-5
_version_ 1783474297539395584
author Grau, Michael
Lenz, Georg
Lenz, Peter
author_facet Grau, Michael
Lenz, Georg
Lenz, Peter
author_sort Grau, Michael
collection PubMed
description Gene expression is controlled by many simultaneous interactions, frequently measured collectively in biology and medicine by high-throughput technologies. It is a highly challenging task to infer from these data the generating effects and cooperating genes. Here, we present an unsupervised hypothesis-generating learning concept termed signal dissection by correlation maximization (SDCM) that dissects large high-dimensional datasets into signatures. Each signature captures a particular signal pattern that was consistently observed for multiple genes and samples, likely caused by the same underlying interaction. A key difference to other methods is our flexible nonlinear signal superposition model, combined with a precise regression technique. Analyzing gene expression of diffuse large B-cell lymphoma, our method discovers previously unidentified signatures that reveal significant differences in patient survival. These signatures are more predictive than those from various methods used for comparison and robustly validate across technological platforms. This implies highly specific extraction of clinically relevant gene interactions.
format Online
Article
Text
id pubmed-6883077
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-68830772019-12-03 Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization Grau, Michael Lenz, Georg Lenz, Peter Nat Commun Article Gene expression is controlled by many simultaneous interactions, frequently measured collectively in biology and medicine by high-throughput technologies. It is a highly challenging task to infer from these data the generating effects and cooperating genes. Here, we present an unsupervised hypothesis-generating learning concept termed signal dissection by correlation maximization (SDCM) that dissects large high-dimensional datasets into signatures. Each signature captures a particular signal pattern that was consistently observed for multiple genes and samples, likely caused by the same underlying interaction. A key difference to other methods is our flexible nonlinear signal superposition model, combined with a precise regression technique. Analyzing gene expression of diffuse large B-cell lymphoma, our method discovers previously unidentified signatures that reveal significant differences in patient survival. These signatures are more predictive than those from various methods used for comparison and robustly validate across technological platforms. This implies highly specific extraction of clinically relevant gene interactions. Nature Publishing Group UK 2019-11-28 /pmc/articles/PMC6883077/ /pubmed/31780653 http://dx.doi.org/10.1038/s41467-019-12713-5 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Grau, Michael
Lenz, Georg
Lenz, Peter
Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization
title Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization
title_full Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization
title_fullStr Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization
title_full_unstemmed Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization
title_short Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization
title_sort dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6883077/
https://www.ncbi.nlm.nih.gov/pubmed/31780653
http://dx.doi.org/10.1038/s41467-019-12713-5
work_keys_str_mv AT graumichael dissectionofgeneexpressiondatasetsintoclinicallyrelevantinteractionsignaturesviahighdimensionalcorrelationmaximization
AT lenzgeorg dissectionofgeneexpressiondatasetsintoclinicallyrelevantinteractionsignaturesviahighdimensionalcorrelationmaximization
AT lenzpeter dissectionofgeneexpressiondatasetsintoclinicallyrelevantinteractionsignaturesviahighdimensionalcorrelationmaximization