Cargando…

Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models

Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Malina, Stephen, Cizin, Daniel, Knowles, David A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624391/
https://www.ncbi.nlm.nih.gov/pubmed/36265006
http://dx.doi.org/10.1371/journal.pcbi.1009880
_version_ 1784822225028775936
author Malina, Stephen
Cizin, Daniel
Knowles, David A.
author_facet Malina, Stephen
Cizin, Daniel
Knowles, David A.
author_sort Malina, Stephen
collection PubMed
description Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between genomic marks learned by genomic DL models. By combining Mendelian randomization with in silico mutagenesis, DeepMR obtains local (locus specific) and global estimates of (an assumed) linear causal relationship between marks. In a simulation designed to test recovery of pairwise causal relations between transcription factors (TFs), DeepMR gives accurate and unbiased estimates of the ‘true’ global causal effect, but its coverage decays in the presence of sequence-dependent confounding. We then apply DeepMR to examine the global relationships learned by a state-of-the-art DL model, BPNet, between TFs involved in reprogramming. DeepMR’s causal effect estimates validate previously hypothesized relationships between TFs and suggest new relationships for future investigation.
format Online
Article
Text
id pubmed-9624391
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-96243912022-11-02 Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models Malina, Stephen Cizin, Daniel Knowles, David A. PLoS Comput Biol Research Article Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between genomic marks learned by genomic DL models. By combining Mendelian randomization with in silico mutagenesis, DeepMR obtains local (locus specific) and global estimates of (an assumed) linear causal relationship between marks. In a simulation designed to test recovery of pairwise causal relations between transcription factors (TFs), DeepMR gives accurate and unbiased estimates of the ‘true’ global causal effect, but its coverage decays in the presence of sequence-dependent confounding. We then apply DeepMR to examine the global relationships learned by a state-of-the-art DL model, BPNet, between TFs involved in reprogramming. DeepMR’s causal effect estimates validate previously hypothesized relationships between TFs and suggest new relationships for future investigation. Public Library of Science 2022-10-20 /pmc/articles/PMC9624391/ /pubmed/36265006 http://dx.doi.org/10.1371/journal.pcbi.1009880 Text en © 2022 Malina et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Malina, Stephen
Cizin, Daniel
Knowles, David A.
Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models
title Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models
title_full Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models
title_fullStr Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models
title_full_unstemmed Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models
title_short Deep mendelian randomization: Investigating the causal knowledge of genomic deep learning models
title_sort deep mendelian randomization: investigating the causal knowledge of genomic deep learning models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624391/
https://www.ncbi.nlm.nih.gov/pubmed/36265006
http://dx.doi.org/10.1371/journal.pcbi.1009880
work_keys_str_mv AT malinastephen deepmendelianrandomizationinvestigatingthecausalknowledgeofgenomicdeeplearningmodels
AT cizindaniel deepmendelianrandomizationinvestigatingthecausalknowledgeofgenomicdeeplearningmodels
AT knowlesdavida deepmendelianrandomizationinvestigatingthecausalknowledgeofgenomicdeeplearningmodels