Cargando…

DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction

BACKGROUND: Genotypes are strongly associated with disease phenotypes, particularly in brain disorders. However, the molecular and cellular mechanisms behind this association remain elusive. With emerging multimodal data for these mechanisms, machine learning methods can be applied for phenotype pre...

Descripción completa

Detalles Bibliográficos
Autores principales: Chandrashekar, Pramod Bharadwaj, Alatkar, Sayali, Wang, Jiebiao, Hoffman, Gabriel E., He, Chenfeng, Jin, Ting, Khullar, Saniya, Bendl, Jaroslav, Fullard, John F., Roussos, Panos, Wang, Daifeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10617196/
https://www.ncbi.nlm.nih.gov/pubmed/37904203
http://dx.doi.org/10.1186/s13073-023-01248-6
_version_ 1785129555099385856
author Chandrashekar, Pramod Bharadwaj
Alatkar, Sayali
Wang, Jiebiao
Hoffman, Gabriel E.
He, Chenfeng
Jin, Ting
Khullar, Saniya
Bendl, Jaroslav
Fullard, John F.
Roussos, Panos
Wang, Daifeng
author_facet Chandrashekar, Pramod Bharadwaj
Alatkar, Sayali
Wang, Jiebiao
Hoffman, Gabriel E.
He, Chenfeng
Jin, Ting
Khullar, Saniya
Bendl, Jaroslav
Fullard, John F.
Roussos, Panos
Wang, Daifeng
author_sort Chandrashekar, Pramod Bharadwaj
collection PubMed
description BACKGROUND: Genotypes are strongly associated with disease phenotypes, particularly in brain disorders. However, the molecular and cellular mechanisms behind this association remain elusive. With emerging multimodal data for these mechanisms, machine learning methods can be applied for phenotype prediction at different scales, but due to the black-box nature of machine learning, integrating these modalities and interpreting biological mechanisms can be challenging. Additionally, the partial availability of these multimodal data presents a challenge in developing these predictive models. METHOD: To address these challenges, we developed DeepGAMI, an interpretable neural network model to improve genotype–phenotype prediction from multimodal data. DeepGAMI leverages functional genomic information, such as eQTLs and gene regulation, to guide neural network connections. Additionally, it includes an auxiliary learning layer for cross-modal imputation allowing the imputation of latent features of missing modalities and thus predicting phenotypes from a single modality. Finally, DeepGAMI uses integrated gradient to prioritize multimodal features for various phenotypes. RESULTS: We applied DeepGAMI to several multimodal datasets including genotype and bulk and cell-type gene expression data in brain diseases, and gene expression and electrophysiology data of mouse neuronal cells. Using cross-validation and independent validation, DeepGAMI outperformed existing methods for classifying disease types, and cellular and clinical phenotypes, even using single modalities (e.g., AUC score of 0.79 for Schizophrenia and 0.73 for cognitive impairment in Alzheimer’s disease). CONCLUSION: We demonstrated that DeepGAMI improves phenotype prediction and prioritizes phenotypic features and networks in multiple multimodal datasets in complex brains and brain diseases. Also, it prioritized disease-associated variants, genes, and regulatory networks linked to different phenotypes, providing novel insights into the interpretation of gene regulatory mechanisms. DeepGAMI is open-source and available for general use. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-023-01248-6.
format Online
Article
Text
id pubmed-10617196
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-106171962023-11-01 DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction Chandrashekar, Pramod Bharadwaj Alatkar, Sayali Wang, Jiebiao Hoffman, Gabriel E. He, Chenfeng Jin, Ting Khullar, Saniya Bendl, Jaroslav Fullard, John F. Roussos, Panos Wang, Daifeng Genome Med Research BACKGROUND: Genotypes are strongly associated with disease phenotypes, particularly in brain disorders. However, the molecular and cellular mechanisms behind this association remain elusive. With emerging multimodal data for these mechanisms, machine learning methods can be applied for phenotype prediction at different scales, but due to the black-box nature of machine learning, integrating these modalities and interpreting biological mechanisms can be challenging. Additionally, the partial availability of these multimodal data presents a challenge in developing these predictive models. METHOD: To address these challenges, we developed DeepGAMI, an interpretable neural network model to improve genotype–phenotype prediction from multimodal data. DeepGAMI leverages functional genomic information, such as eQTLs and gene regulation, to guide neural network connections. Additionally, it includes an auxiliary learning layer for cross-modal imputation allowing the imputation of latent features of missing modalities and thus predicting phenotypes from a single modality. Finally, DeepGAMI uses integrated gradient to prioritize multimodal features for various phenotypes. RESULTS: We applied DeepGAMI to several multimodal datasets including genotype and bulk and cell-type gene expression data in brain diseases, and gene expression and electrophysiology data of mouse neuronal cells. Using cross-validation and independent validation, DeepGAMI outperformed existing methods for classifying disease types, and cellular and clinical phenotypes, even using single modalities (e.g., AUC score of 0.79 for Schizophrenia and 0.73 for cognitive impairment in Alzheimer’s disease). CONCLUSION: We demonstrated that DeepGAMI improves phenotype prediction and prioritizes phenotypic features and networks in multiple multimodal datasets in complex brains and brain diseases. Also, it prioritized disease-associated variants, genes, and regulatory networks linked to different phenotypes, providing novel insights into the interpretation of gene regulatory mechanisms. DeepGAMI is open-source and available for general use. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-023-01248-6. BioMed Central 2023-10-31 /pmc/articles/PMC10617196/ /pubmed/37904203 http://dx.doi.org/10.1186/s13073-023-01248-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Chandrashekar, Pramod Bharadwaj
Alatkar, Sayali
Wang, Jiebiao
Hoffman, Gabriel E.
He, Chenfeng
Jin, Ting
Khullar, Saniya
Bendl, Jaroslav
Fullard, John F.
Roussos, Panos
Wang, Daifeng
DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction
title DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction
title_full DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction
title_fullStr DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction
title_full_unstemmed DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction
title_short DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction
title_sort deepgami: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10617196/
https://www.ncbi.nlm.nih.gov/pubmed/37904203
http://dx.doi.org/10.1186/s13073-023-01248-6
work_keys_str_mv AT chandrashekarpramodbharadwaj deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT alatkarsayali deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT wangjiebiao deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT hoffmangabriele deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT hechenfeng deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT jinting deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT khullarsaniya deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT bendljaroslav deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT fullardjohnf deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT roussospanos deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction
AT wangdaifeng deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction