Cargando…

An unsupervised learning approach to identify novel signatures of health and disease from multimodal data

BACKGROUND: Modern medicine is rapidly moving towards a data-driven paradigm based on comprehensive multimodal health assessments. Integrated analysis of data from different modalities has the potential of uncovering novel biomarkers and disease signatures. METHODS: We collected 1385 data features f...

Descripción completa

Detalles Bibliográficos
Autores principales: Shomorony, Ilan, Cirulli, Elizabeth T., Huang, Lei, Napier, Lori A., Heister, Robyn R., Hicks, Michael, Cohen, Isaac V., Yu, Hung-Chun, Swisher, Christine Leon, Schenker-Ahmed, Natalie M., Li, Weizhong, Nelson, Karen E., Brar, Pamila, Kahn, Andrew M., Spector, Timothy D., Caskey, C. Thomas, Venter, J. Craig, Karow, David S., Kirkness, Ewen F., Shah, Naisha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6953286/
https://www.ncbi.nlm.nih.gov/pubmed/31924279
http://dx.doi.org/10.1186/s13073-019-0705-z
_version_ 1783486610266914816
author Shomorony, Ilan
Cirulli, Elizabeth T.
Huang, Lei
Napier, Lori A.
Heister, Robyn R.
Hicks, Michael
Cohen, Isaac V.
Yu, Hung-Chun
Swisher, Christine Leon
Schenker-Ahmed, Natalie M.
Li, Weizhong
Nelson, Karen E.
Brar, Pamila
Kahn, Andrew M.
Spector, Timothy D.
Caskey, C. Thomas
Venter, J. Craig
Karow, David S.
Kirkness, Ewen F.
Shah, Naisha
author_facet Shomorony, Ilan
Cirulli, Elizabeth T.
Huang, Lei
Napier, Lori A.
Heister, Robyn R.
Hicks, Michael
Cohen, Isaac V.
Yu, Hung-Chun
Swisher, Christine Leon
Schenker-Ahmed, Natalie M.
Li, Weizhong
Nelson, Karen E.
Brar, Pamila
Kahn, Andrew M.
Spector, Timothy D.
Caskey, C. Thomas
Venter, J. Craig
Karow, David S.
Kirkness, Ewen F.
Shah, Naisha
author_sort Shomorony, Ilan
collection PubMed
description BACKGROUND: Modern medicine is rapidly moving towards a data-driven paradigm based on comprehensive multimodal health assessments. Integrated analysis of data from different modalities has the potential of uncovering novel biomarkers and disease signatures. METHODS: We collected 1385 data features from diverse modalities, including metabolome, microbiome, genetics, and advanced imaging, from 1253 individuals and from a longitudinal validation cohort of 1083 individuals. We utilized a combination of unsupervised machine learning methods to identify multimodal biomarker signatures of health and disease risk. RESULTS: Our method identified a set of cardiometabolic biomarkers that goes beyond standard clinical biomarkers. Stratification of individuals based on the signatures of these biomarkers identified distinct subsets of individuals with similar health statuses. Subset membership was a better predictor for diabetes than established clinical biomarkers such as glucose, insulin resistance, and body mass index. The novel biomarkers in the diabetes signature included 1-stearoyl-2-dihomo-linolenoyl-GPC and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC. Another metabolite, cinnamoylglycine, was identified as a potential biomarker for both gut microbiome health and lean mass percentage. We identified potential early signatures for hypertension and a poor metabolic health outcome. Additionally, we found novel associations between a uremic toxin, p-cresol sulfate, and the abundance of the microbiome genera Intestinimonas and an unclassified genus in the Erysipelotrichaceae family. CONCLUSIONS: Our methodology and results demonstrate the potential of multimodal data integration, from the identification of novel biomarker signatures to a data-driven stratification of individuals into disease subtypes and stages—an essential step towards personalized, preventative health risk assessment.
format Online
Article
Text
id pubmed-6953286
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69532862020-01-14 An unsupervised learning approach to identify novel signatures of health and disease from multimodal data Shomorony, Ilan Cirulli, Elizabeth T. Huang, Lei Napier, Lori A. Heister, Robyn R. Hicks, Michael Cohen, Isaac V. Yu, Hung-Chun Swisher, Christine Leon Schenker-Ahmed, Natalie M. Li, Weizhong Nelson, Karen E. Brar, Pamila Kahn, Andrew M. Spector, Timothy D. Caskey, C. Thomas Venter, J. Craig Karow, David S. Kirkness, Ewen F. Shah, Naisha Genome Med Research BACKGROUND: Modern medicine is rapidly moving towards a data-driven paradigm based on comprehensive multimodal health assessments. Integrated analysis of data from different modalities has the potential of uncovering novel biomarkers and disease signatures. METHODS: We collected 1385 data features from diverse modalities, including metabolome, microbiome, genetics, and advanced imaging, from 1253 individuals and from a longitudinal validation cohort of 1083 individuals. We utilized a combination of unsupervised machine learning methods to identify multimodal biomarker signatures of health and disease risk. RESULTS: Our method identified a set of cardiometabolic biomarkers that goes beyond standard clinical biomarkers. Stratification of individuals based on the signatures of these biomarkers identified distinct subsets of individuals with similar health statuses. Subset membership was a better predictor for diabetes than established clinical biomarkers such as glucose, insulin resistance, and body mass index. The novel biomarkers in the diabetes signature included 1-stearoyl-2-dihomo-linolenoyl-GPC and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC. Another metabolite, cinnamoylglycine, was identified as a potential biomarker for both gut microbiome health and lean mass percentage. We identified potential early signatures for hypertension and a poor metabolic health outcome. Additionally, we found novel associations between a uremic toxin, p-cresol sulfate, and the abundance of the microbiome genera Intestinimonas and an unclassified genus in the Erysipelotrichaceae family. CONCLUSIONS: Our methodology and results demonstrate the potential of multimodal data integration, from the identification of novel biomarker signatures to a data-driven stratification of individuals into disease subtypes and stages—an essential step towards personalized, preventative health risk assessment. BioMed Central 2020-01-10 /pmc/articles/PMC6953286/ /pubmed/31924279 http://dx.doi.org/10.1186/s13073-019-0705-z Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Shomorony, Ilan
Cirulli, Elizabeth T.
Huang, Lei
Napier, Lori A.
Heister, Robyn R.
Hicks, Michael
Cohen, Isaac V.
Yu, Hung-Chun
Swisher, Christine Leon
Schenker-Ahmed, Natalie M.
Li, Weizhong
Nelson, Karen E.
Brar, Pamila
Kahn, Andrew M.
Spector, Timothy D.
Caskey, C. Thomas
Venter, J. Craig
Karow, David S.
Kirkness, Ewen F.
Shah, Naisha
An unsupervised learning approach to identify novel signatures of health and disease from multimodal data
title An unsupervised learning approach to identify novel signatures of health and disease from multimodal data
title_full An unsupervised learning approach to identify novel signatures of health and disease from multimodal data
title_fullStr An unsupervised learning approach to identify novel signatures of health and disease from multimodal data
title_full_unstemmed An unsupervised learning approach to identify novel signatures of health and disease from multimodal data
title_short An unsupervised learning approach to identify novel signatures of health and disease from multimodal data
title_sort unsupervised learning approach to identify novel signatures of health and disease from multimodal data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6953286/
https://www.ncbi.nlm.nih.gov/pubmed/31924279
http://dx.doi.org/10.1186/s13073-019-0705-z
work_keys_str_mv AT shomoronyilan anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT cirullielizabetht anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT huanglei anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT napierloria anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT heisterrobynr anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT hicksmichael anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT cohenisaacv anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT yuhungchun anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT swisherchristineleon anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT schenkerahmednataliem anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT liweizhong anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT nelsonkarene anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT brarpamila anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT kahnandrewm anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT spectortimothyd anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT caskeycthomas anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT venterjcraig anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT karowdavids anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT kirknessewenf anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT shahnaisha anunsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT shomoronyilan unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT cirullielizabetht unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT huanglei unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT napierloria unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT heisterrobynr unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT hicksmichael unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT cohenisaacv unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT yuhungchun unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT swisherchristineleon unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT schenkerahmednataliem unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT liweizhong unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT nelsonkarene unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT brarpamila unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT kahnandrewm unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT spectortimothyd unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT caskeycthomas unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT venterjcraig unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT karowdavids unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT kirknessewenf unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata
AT shahnaisha unsupervisedlearningapproachtoidentifynovelsignaturesofhealthanddiseasefrommultimodaldata