Cargando…
DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction
BACKGROUND: Genotypes are strongly associated with disease phenotypes, particularly in brain disorders. However, the molecular and cellular mechanisms behind this association remain elusive. With emerging multimodal data for these mechanisms, machine learning methods can be applied for phenotype pre...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10617196/ https://www.ncbi.nlm.nih.gov/pubmed/37904203 http://dx.doi.org/10.1186/s13073-023-01248-6 |
_version_ | 1785129555099385856 |
---|---|
author | Chandrashekar, Pramod Bharadwaj Alatkar, Sayali Wang, Jiebiao Hoffman, Gabriel E. He, Chenfeng Jin, Ting Khullar, Saniya Bendl, Jaroslav Fullard, John F. Roussos, Panos Wang, Daifeng |
author_facet | Chandrashekar, Pramod Bharadwaj Alatkar, Sayali Wang, Jiebiao Hoffman, Gabriel E. He, Chenfeng Jin, Ting Khullar, Saniya Bendl, Jaroslav Fullard, John F. Roussos, Panos Wang, Daifeng |
author_sort | Chandrashekar, Pramod Bharadwaj |
collection | PubMed |
description | BACKGROUND: Genotypes are strongly associated with disease phenotypes, particularly in brain disorders. However, the molecular and cellular mechanisms behind this association remain elusive. With emerging multimodal data for these mechanisms, machine learning methods can be applied for phenotype prediction at different scales, but due to the black-box nature of machine learning, integrating these modalities and interpreting biological mechanisms can be challenging. Additionally, the partial availability of these multimodal data presents a challenge in developing these predictive models. METHOD: To address these challenges, we developed DeepGAMI, an interpretable neural network model to improve genotype–phenotype prediction from multimodal data. DeepGAMI leverages functional genomic information, such as eQTLs and gene regulation, to guide neural network connections. Additionally, it includes an auxiliary learning layer for cross-modal imputation allowing the imputation of latent features of missing modalities and thus predicting phenotypes from a single modality. Finally, DeepGAMI uses integrated gradient to prioritize multimodal features for various phenotypes. RESULTS: We applied DeepGAMI to several multimodal datasets including genotype and bulk and cell-type gene expression data in brain diseases, and gene expression and electrophysiology data of mouse neuronal cells. Using cross-validation and independent validation, DeepGAMI outperformed existing methods for classifying disease types, and cellular and clinical phenotypes, even using single modalities (e.g., AUC score of 0.79 for Schizophrenia and 0.73 for cognitive impairment in Alzheimer’s disease). CONCLUSION: We demonstrated that DeepGAMI improves phenotype prediction and prioritizes phenotypic features and networks in multiple multimodal datasets in complex brains and brain diseases. Also, it prioritized disease-associated variants, genes, and regulatory networks linked to different phenotypes, providing novel insights into the interpretation of gene regulatory mechanisms. DeepGAMI is open-source and available for general use. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-023-01248-6. |
format | Online Article Text |
id | pubmed-10617196 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-106171962023-11-01 DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction Chandrashekar, Pramod Bharadwaj Alatkar, Sayali Wang, Jiebiao Hoffman, Gabriel E. He, Chenfeng Jin, Ting Khullar, Saniya Bendl, Jaroslav Fullard, John F. Roussos, Panos Wang, Daifeng Genome Med Research BACKGROUND: Genotypes are strongly associated with disease phenotypes, particularly in brain disorders. However, the molecular and cellular mechanisms behind this association remain elusive. With emerging multimodal data for these mechanisms, machine learning methods can be applied for phenotype prediction at different scales, but due to the black-box nature of machine learning, integrating these modalities and interpreting biological mechanisms can be challenging. Additionally, the partial availability of these multimodal data presents a challenge in developing these predictive models. METHOD: To address these challenges, we developed DeepGAMI, an interpretable neural network model to improve genotype–phenotype prediction from multimodal data. DeepGAMI leverages functional genomic information, such as eQTLs and gene regulation, to guide neural network connections. Additionally, it includes an auxiliary learning layer for cross-modal imputation allowing the imputation of latent features of missing modalities and thus predicting phenotypes from a single modality. Finally, DeepGAMI uses integrated gradient to prioritize multimodal features for various phenotypes. RESULTS: We applied DeepGAMI to several multimodal datasets including genotype and bulk and cell-type gene expression data in brain diseases, and gene expression and electrophysiology data of mouse neuronal cells. Using cross-validation and independent validation, DeepGAMI outperformed existing methods for classifying disease types, and cellular and clinical phenotypes, even using single modalities (e.g., AUC score of 0.79 for Schizophrenia and 0.73 for cognitive impairment in Alzheimer’s disease). CONCLUSION: We demonstrated that DeepGAMI improves phenotype prediction and prioritizes phenotypic features and networks in multiple multimodal datasets in complex brains and brain diseases. Also, it prioritized disease-associated variants, genes, and regulatory networks linked to different phenotypes, providing novel insights into the interpretation of gene regulatory mechanisms. DeepGAMI is open-source and available for general use. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13073-023-01248-6. BioMed Central 2023-10-31 /pmc/articles/PMC10617196/ /pubmed/37904203 http://dx.doi.org/10.1186/s13073-023-01248-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Chandrashekar, Pramod Bharadwaj Alatkar, Sayali Wang, Jiebiao Hoffman, Gabriel E. He, Chenfeng Jin, Ting Khullar, Saniya Bendl, Jaroslav Fullard, John F. Roussos, Panos Wang, Daifeng DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction |
title | DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction |
title_full | DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction |
title_fullStr | DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction |
title_full_unstemmed | DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction |
title_short | DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction |
title_sort | deepgami: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype–phenotype prediction |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10617196/ https://www.ncbi.nlm.nih.gov/pubmed/37904203 http://dx.doi.org/10.1186/s13073-023-01248-6 |
work_keys_str_mv | AT chandrashekarpramodbharadwaj deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT alatkarsayali deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT wangjiebiao deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT hoffmangabriele deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT hechenfeng deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT jinting deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT khullarsaniya deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT bendljaroslav deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT fullardjohnf deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT roussospanos deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction AT wangdaifeng deepgamideepbiologicallyguidedauxiliarylearningformultimodalintegrationandimputationtoimprovegenotypephenotypeprediction |