Cargando…
An unsupervised learning approach to identify novel signatures of health and disease from multimodal data
BACKGROUND: Modern medicine is rapidly moving towards a data-driven paradigm based on comprehensive multimodal health assessments. Integrated analysis of data from different modalities has the potential of uncovering novel biomarkers and disease signatures. METHODS: We collected 1385 data features f...
Autores principales: | , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6953286/ https://www.ncbi.nlm.nih.gov/pubmed/31924279 http://dx.doi.org/10.1186/s13073-019-0705-z |
_version_ | 1783486610266914816 |
---|---|
author | Shomorony, Ilan Cirulli, Elizabeth T. Huang, Lei Napier, Lori A. Heister, Robyn R. Hicks, Michael Cohen, Isaac V. Yu, Hung-Chun Swisher, Christine Leon Schenker-Ahmed, Natalie M. Li, Weizhong Nelson, Karen E. Brar, Pamila Kahn, Andrew M. Spector, Timothy D. Caskey, C. Thomas Venter, J. Craig Karow, David S. Kirkness, Ewen F. Shah, Naisha |
author_facet | Shomorony, Ilan Cirulli, Elizabeth T. Huang, Lei Napier, Lori A. Heister, Robyn R. Hicks, Michael Cohen, Isaac V. Yu, Hung-Chun Swisher, Christine Leon Schenker-Ahmed, Natalie M. Li, Weizhong Nelson, Karen E. Brar, Pamila Kahn, Andrew M. Spector, Timothy D. Caskey, C. Thomas Venter, J. Craig Karow, David S. Kirkness, Ewen F. Shah, Naisha |
author_sort | Shomorony, Ilan |
collection | PubMed |
description | BACKGROUND: Modern medicine is rapidly moving towards a data-driven paradigm based on comprehensive multimodal health assessments. Integrated analysis of data from different modalities has the potential of uncovering novel biomarkers and disease signatures. METHODS: We collected 1385 data features from diverse modalities, including metabolome, microbiome, genetics, and advanced imaging, from 1253 individuals and from a longitudinal validation cohort of 1083 individuals. We utilized a combination of unsupervised machine learning methods to identify multimodal biomarker signatures of health and disease risk. RESULTS: Our method identified a set of cardiometabolic biomarkers that goes beyond standard clinical biomarkers. Stratification of individuals based on the signatures of these biomarkers identified distinct subsets of individuals with similar health statuses. Subset membership was a better predictor for diabetes than established clinical biomarkers such as glucose, insulin resistance, and body mass index. The novel biomarkers in the diabetes signature included 1-stearoyl-2-dihomo-linolenoyl-GPC and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC. Another metabolite, cinnamoylglycine, was identified as a potential biomarker for both gut microbiome health and lean mass percentage. We identified potential early signatures for hypertension and a poor metabolic health outcome. Additionally, we found novel associations between a uremic toxin, p-cresol sulfate, and the abundance of the microbiome genera Intestinimonas and an unclassified genus in the Erysipelotrichaceae family. CONCLUSIONS: Our methodology and results demonstrate the potential of multimodal data integration, from the identification of novel biomarker signatures to a data-driven stratification of individuals into disease subtypes and stages—an essential step towards personalized, preventative health risk assessment. |
format | Online Article Text |
id | pubmed-6953286 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69532862020-01-14 An unsupervised learning approach to identify novel signatures of health and disease from multimodal data Shomorony, Ilan Cirulli, Elizabeth T. Huang, Lei Napier, Lori A. Heister, Robyn R. Hicks, Michael Cohen, Isaac V. Yu, Hung-Chun Swisher, Christine Leon Schenker-Ahmed, Natalie M. Li, Weizhong Nelson, Karen E. Brar, Pamila Kahn, Andrew M. Spector, Timothy D. Caskey, C. Thomas Venter, J. Craig Karow, David S. Kirkness, Ewen F. Shah, Naisha Genome Med Research BACKGROUND: Modern medicine is rapidly moving towards a data-driven paradigm based on comprehensive multimodal health assessments. Integrated analysis of data from different modalities has the potential of uncovering novel biomarkers and disease signatures. METHODS: We collected 1385 data features from diverse modalities, including metabolome, microbiome, genetics, and advanced imaging, from 1253 individuals and from a longitudinal validation cohort of 1083 individuals. We utilized a combination of unsupervised machine learning methods to identify multimodal biomarker signatures of health and disease risk. RESULTS: Our method identified a set of cardiometabolic biomarkers that goes beyond standard clinical biomarkers. Stratification of individuals based on the signatures of these biomarkers identified distinct subsets of individuals with similar health statuses. Subset membership was a better predictor for diabetes than established clinical biomarkers such as glucose, insulin resistance, and body mass index. The novel biomarkers in the diabetes signature included 1-stearoyl-2-dihomo-linolenoyl-GPC and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC. Another metabolite, cinnamoylglycine, was identified as a potential biomarker for both gut microbiome health and lean mass percentage. We identified potential early signatures for hypertension and a poor metabolic health outcome. Additionally, we found novel associations between a uremic toxin, p-cresol sulfate, and the abundance of the microbiome genera Intestinimonas and an unclassified genus in the Erysipelotrichaceae family. CONCLUSIONS: Our methodology and results demonstrate the potential of multimodal data integration, from the identification of novel biomarker signatures to a data-driven stratification of individuals into disease subtypes and stages—an essential step towards personalized, preventative health risk assessment. BioMed Central 2020-01-10 /pmc/articles/PMC6953286/ /pubmed/31924279 http://dx.doi.org/10.1186/s13073-019-0705-z Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Shomorony, Ilan Cirulli, Elizabeth T. Huang, Lei Napier, Lori A. Heister, Robyn R. Hicks, Michael Cohen, Isaac V. Yu, Hung-Chun Swisher, Christine Leon Schenker-Ahmed, Natalie M. Li, Weizhong Nelson, Karen E. Brar, Pamila Kahn, Andrew M. Spector, Timothy D. Caskey, C. Thomas Venter, J. Craig Karow, David S. Kirkness, Ewen F. Shah, Naisha An unsupervised learning approach to identify novel signatures of health and disease from multimodal data |
title | An unsupervised learning approach to identify novel signatures of health and disease from multimodal data |
title_full | An unsupervised learning approach to identify novel signatures of health and disease from multimodal data |
title_fullStr | An unsupervised learning approach to identify novel signatures of health and disease from multimodal data |
title_full_unstemmed | An unsupervised learning approach to identify novel signatures of health and disease from multimodal data |
title_short | An unsupervised learning approach to identify novel signatures of health and disease from multimodal data |
title_sort | unsupervised learning approach to identify novel signatures of health and disease from multimodal data |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6953286/ https://www.ncbi.nlm.nih.gov/pubmed/31924279 http://dx.doi.org/10.1186/s13073-019-0705-z |
work_keys_str_mv | AT shomoronyilan anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT cirullielizabetht anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT huanglei anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT napierloria anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT heisterrobynr anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT hicksmichael anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT cohenisaacv anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT yuhungchun anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT swisherchristineleon anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT schenkerahmednataliem anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT liweizhong anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT nelsonkarene anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT brarpamila anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT kahnandrewm anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT spectortimothyd anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT caskeycthomas anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT venterjcraig anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT karowdavids anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT kirknessewenf anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT shahnaisha anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT shomoronyilan unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT cirullielizabetht unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT huanglei unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT napierloria unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT heisterrobynr unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT hicksmichael unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT cohenisaacv unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT yuhungchun unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT swisherchristineleon unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT schenkerahmednataliem unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT liweizhong unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT nelsonkarene unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT brarpamila unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT kahnandrewm unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT spectortimothyd unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT caskeycthomas unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT venterjcraig unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT karowdavids unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT kirknessewenf unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata AT shahnaisha unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata |