Cargando…

Deciphering RNA splicing logic with interpretable machine learning

Machine learning methods, particularly neural networks trained on large datasets, are transforming how scientists approach scientific discovery and experimental design. However, current state-of-the-art neural networks are limited by their uninterpretability: Despite their excellent accuracy, they c...

Descripción completa

Detalles Bibliográficos
Autores principales: Liao, Susan E., Sudarshan, Mukund, Regev, Oded
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10576025/
https://www.ncbi.nlm.nih.gov/pubmed/37796983
http://dx.doi.org/10.1073/pnas.2221165120
_version_ 1785121035916410880
author Liao, Susan E.
Sudarshan, Mukund
Regev, Oded
author_facet Liao, Susan E.
Sudarshan, Mukund
Regev, Oded
author_sort Liao, Susan E.
collection PubMed
description Machine learning methods, particularly neural networks trained on large datasets, are transforming how scientists approach scientific discovery and experimental design. However, current state-of-the-art neural networks are limited by their uninterpretability: Despite their excellent accuracy, they cannot describe how they arrived at their predictions. Here, using an “interpretable-by-design” approach, we present a neural network model that provides insights into RNA splicing, a fundamental process in the transfer of genomic information into functional biochemical products. Although we designed our model to emphasize interpretability, its predictive accuracy is on par with state-of-the-art models. To demonstrate the model’s interpretability, we introduce a visualization that, for any given exon, allows us to trace and quantify the entire decision process from input sequence to output splicing prediction. Importantly, the model revealed uncharacterized components of the splicing logic, which we experimentally validated. This study highlights how interpretable machine learning can advance scientific discovery.
format Online
Article
Text
id pubmed-10576025
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-105760252023-10-15 Deciphering RNA splicing logic with interpretable machine learning Liao, Susan E. Sudarshan, Mukund Regev, Oded Proc Natl Acad Sci U S A Biological Sciences Machine learning methods, particularly neural networks trained on large datasets, are transforming how scientists approach scientific discovery and experimental design. However, current state-of-the-art neural networks are limited by their uninterpretability: Despite their excellent accuracy, they cannot describe how they arrived at their predictions. Here, using an “interpretable-by-design” approach, we present a neural network model that provides insights into RNA splicing, a fundamental process in the transfer of genomic information into functional biochemical products. Although we designed our model to emphasize interpretability, its predictive accuracy is on par with state-of-the-art models. To demonstrate the model’s interpretability, we introduce a visualization that, for any given exon, allows us to trace and quantify the entire decision process from input sequence to output splicing prediction. Importantly, the model revealed uncharacterized components of the splicing logic, which we experimentally validated. This study highlights how interpretable machine learning can advance scientific discovery. National Academy of Sciences 2023-10-05 2023-10-10 /pmc/articles/PMC10576025/ /pubmed/37796983 http://dx.doi.org/10.1073/pnas.2221165120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Biological Sciences
Liao, Susan E.
Sudarshan, Mukund
Regev, Oded
Deciphering RNA splicing logic with interpretable machine learning
title Deciphering RNA splicing logic with interpretable machine learning
title_full Deciphering RNA splicing logic with interpretable machine learning
title_fullStr Deciphering RNA splicing logic with interpretable machine learning
title_full_unstemmed Deciphering RNA splicing logic with interpretable machine learning
title_short Deciphering RNA splicing logic with interpretable machine learning
title_sort deciphering rna splicing logic with interpretable machine learning
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10576025/
https://www.ncbi.nlm.nih.gov/pubmed/37796983
http://dx.doi.org/10.1073/pnas.2221165120
work_keys_str_mv AT liaosusane decipheringrnasplicinglogicwithinterpretablemachinelearning
AT sudarshanmukund decipheringrnasplicinglogicwithinterpretablemachinelearning
AT regevoded decipheringrnasplicinglogicwithinterpretablemachinelearning