Cargando…

Data-driven identification of complex disease phenotypes

Disease interaction in multimorbid patients is relevant to treatment and prognosis, yet poorly understood. In the present work, we combine approaches from network science, machine learning and computational phenotyping to assess interactions between two or more diseases in a transparent way across t...

Descripción completa

Detalles Bibliográficos
Autores principales: Strauss, Markus J., Niederkrotenthaler, Thomas, Thurner, Stefan, Kautzky-Willer, Alexandra, Klimek, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8315834/
https://www.ncbi.nlm.nih.gov/pubmed/34314651
http://dx.doi.org/10.1098/rsif.2020.1040
_version_ 1783729782823845888
author Strauss, Markus J.
Niederkrotenthaler, Thomas
Thurner, Stefan
Kautzky-Willer, Alexandra
Klimek, Peter
author_facet Strauss, Markus J.
Niederkrotenthaler, Thomas
Thurner, Stefan
Kautzky-Willer, Alexandra
Klimek, Peter
author_sort Strauss, Markus J.
collection PubMed
description Disease interaction in multimorbid patients is relevant to treatment and prognosis, yet poorly understood. In the present work, we combine approaches from network science, machine learning and computational phenotyping to assess interactions between two or more diseases in a transparent way across the full diagnostic spectrum. We demonstrate that health states of hospitalized patients can be better characterized by including higher-order features capturing interactions between more than two diseases. We identify a meaningful set of higher-order diagnosis features that account for synergistic disease interactions in a population-wide (N = 9 M) medical claims dataset. We construct a generalized disease network where (higher-order) diagnosis features are linked if they predict similar diagnoses across the whole diagnostic spectrum. The fact that specific diagnoses are generally represented multiple times in the network allows for the identification of putatively different disease phenotypes that may reflect different disease aetiologies. At the example of obesity, we demonstrate the purely data-driven detection of two complex phenotypes of obesity. As indicated by a matched comparison between patients having these phenotypes, we show that these phenotypes show specific characteristics of what has been controversially discussed in the medical literature as metabolically healthy and unhealthy obesity, respectively. The findings also suggest that metabolically healthy patients show some progression towards more unhealthy obesity over time, a finding that is consistent with longitudinal studies indicating a transient nature of metabolically healthy obesity. The disease network is available for exploration at https://disease.network/.
format Online
Article
Text
id pubmed-8315834
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-83158342021-07-31 Data-driven identification of complex disease phenotypes Strauss, Markus J. Niederkrotenthaler, Thomas Thurner, Stefan Kautzky-Willer, Alexandra Klimek, Peter J R Soc Interface Life Sciences–Physics interface Disease interaction in multimorbid patients is relevant to treatment and prognosis, yet poorly understood. In the present work, we combine approaches from network science, machine learning and computational phenotyping to assess interactions between two or more diseases in a transparent way across the full diagnostic spectrum. We demonstrate that health states of hospitalized patients can be better characterized by including higher-order features capturing interactions between more than two diseases. We identify a meaningful set of higher-order diagnosis features that account for synergistic disease interactions in a population-wide (N = 9 M) medical claims dataset. We construct a generalized disease network where (higher-order) diagnosis features are linked if they predict similar diagnoses across the whole diagnostic spectrum. The fact that specific diagnoses are generally represented multiple times in the network allows for the identification of putatively different disease phenotypes that may reflect different disease aetiologies. At the example of obesity, we demonstrate the purely data-driven detection of two complex phenotypes of obesity. As indicated by a matched comparison between patients having these phenotypes, we show that these phenotypes show specific characteristics of what has been controversially discussed in the medical literature as metabolically healthy and unhealthy obesity, respectively. The findings also suggest that metabolically healthy patients show some progression towards more unhealthy obesity over time, a finding that is consistent with longitudinal studies indicating a transient nature of metabolically healthy obesity. The disease network is available for exploration at https://disease.network/. The Royal Society 2021-07-28 /pmc/articles/PMC8315834/ /pubmed/34314651 http://dx.doi.org/10.1098/rsif.2020.1040 Text en © 2021 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited.
spellingShingle Life Sciences–Physics interface
Strauss, Markus J.
Niederkrotenthaler, Thomas
Thurner, Stefan
Kautzky-Willer, Alexandra
Klimek, Peter
Data-driven identification of complex disease phenotypes
title Data-driven identification of complex disease phenotypes
title_full Data-driven identification of complex disease phenotypes
title_fullStr Data-driven identification of complex disease phenotypes
title_full_unstemmed Data-driven identification of complex disease phenotypes
title_short Data-driven identification of complex disease phenotypes
title_sort data-driven identification of complex disease phenotypes
topic Life Sciences–Physics interface
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8315834/
https://www.ncbi.nlm.nih.gov/pubmed/34314651
http://dx.doi.org/10.1098/rsif.2020.1040
work_keys_str_mv AT straussmarkusj datadrivenidentificationofcomplexdiseasephenotypes
AT niederkrotenthalerthomas datadrivenidentificationofcomplexdiseasephenotypes
AT thurnerstefan datadrivenidentificationofcomplexdiseasephenotypes
AT kautzkywilleralexandra datadrivenidentificationofcomplexdiseasephenotypes
AT klimekpeter datadrivenidentificationofcomplexdiseasephenotypes