Cargando…
Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models
Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between gen...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624391/ https://www.ncbi.nlm.nih.gov/pubmed/36265006 http://dx.doi.org/10.1371/journal.pcbi.1009880 |
_version_ | 1784822225028775936 |
---|---|
author | Malina, Stephen Cizin, Daniel Knowles, David A. |
author_facet | Malina, Stephen Cizin, Daniel Knowles, David A. |
author_sort | Malina, Stephen |
collection | PubMed |
description | Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between genomic marks learned by genomic DL models. By combining Mendelian randomization with in silico mutagenesis, DeepMR obtains local (locus specific) and global estimates of (an assumed) linear causal relationship between marks. In a simulation designed to test recovery of pairwise causal relations between transcription factors (TFs), DeepMR gives accurate and unbiased estimates of the ‘true’ global causal effect, but its coverage decays in the presence of sequence-dependent confounding. We then apply DeepMR to examine the global relationships learned by a state-of-the-art DL model, BPNet, between TFs involved in reprogramming. DeepMR’s causal effect estimates validate previously hypothesized relationships between TFs and suggest new relationships for future investigation. |
format | Online Article Text |
id | pubmed-9624391 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-96243912022-11-02 Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models Malina, Stephen Cizin, Daniel Knowles, David A. PLoS Comput Biol Research Article Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between genomic marks learned by genomic DL models. By combining Mendelian randomization with in silico mutagenesis, DeepMR obtains local (locus specific) and global estimates of (an assumed) linear causal relationship between marks. In a simulation designed to test recovery of pairwise causal relations between transcription factors (TFs), DeepMR gives accurate and unbiased estimates of the ‘true’ global causal effect, but its coverage decays in the presence of sequence-dependent confounding. We then apply DeepMR to examine the global relationships learned by a state-of-the-art DL model, BPNet, between TFs involved in reprogramming. DeepMR’s causal effect estimates validate previously hypothesized relationships between TFs and suggest new relationships for future investigation. Public Library of Science 2022-10-20 /pmc/articles/PMC9624391/ /pubmed/36265006 http://dx.doi.org/10.1371/journal.pcbi.1009880 Text en © 2022 Malina et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Malina, Stephen Cizin, Daniel Knowles, David A. Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models |
title | Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models |
title_full | Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models |
title_fullStr | Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models |
title_full_unstemmed | Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models |
title_short | Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models |
title_sort | deep mendelian randomization: investigating the causal knowledge of genomic deep learning models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624391/ https://www.ncbi.nlm.nih.gov/pubmed/36265006 http://dx.doi.org/10.1371/journal.pcbi.1009880 |
work_keys_str_mv | AT malinastephen deepmendelianrandomizationinvestigatingthecausalknowledgeofgenomicdeeplearningmodels AT cizindaniel deepmendelianrandomizationinvestigatingthecausalknowledgeofgenomicdeeplearningmodels AT knowlesdavida deepmendelianrandomizationinvestigatingthecausalknowledgeofgenomicdeeplearningmodels |