Cargando…

Interpretation of deep learning in genomics and epigenomics

Machine learning methods have been widely applied to big data analysis in genomics and epigenomics research. Although accuracy and efficiency are common goals in many modeling tasks, model interpretability is especially important to these studies towards understanding the underlying molecular and ce...

Descripción completa

Detalles Bibliográficos
Autores principales: Talukder, Amlan, Barham, Clayton, Li, Xiaoman, Hu, Haiyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138893/
https://www.ncbi.nlm.nih.gov/pubmed/34020542
http://dx.doi.org/10.1093/bib/bbaa177
_version_ 1783695896084480000
author Talukder, Amlan
Barham, Clayton
Li, Xiaoman
Hu, Haiyan
author_facet Talukder, Amlan
Barham, Clayton
Li, Xiaoman
Hu, Haiyan
author_sort Talukder, Amlan
collection PubMed
description Machine learning methods have been widely applied to big data analysis in genomics and epigenomics research. Although accuracy and efficiency are common goals in many modeling tasks, model interpretability is especially important to these studies towards understanding the underlying molecular and cellular mechanisms. Deep neural networks (DNNs) have recently gained popularity in various types of genomic and epigenomic studies due to their capabilities in utilizing large-scale high-throughput bioinformatics data and achieving high accuracy in predictions and classifications. However, DNNs are often challenged by their potential to explain the predictions due to their black-box nature. In this review, we present current development in the model interpretation of DNNs, focusing on their applications in genomics and epigenomics. We first describe state-of-the-art DNN interpretation methods in representative machine learning fields. We then summarize the DNN interpretation methods in recent studies on genomics and epigenomics, focusing on current data- and computing-intensive topics such as sequence motif identification, genetic variations, gene expression, chromatin interactions and non-coding RNAs. We also present the biological discoveries that resulted from these interpretation methods. We finally discuss the advantages and limitations of current interpretation approaches in the context of genomic and epigenomic studies. Contact:xiaoman@mail.ucf.edu, haihu@cs.ucf.edu
format Online
Article
Text
id pubmed-8138893
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-81388932021-05-25 Interpretation of deep learning in genomics and epigenomics Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan Brief Bioinform Method Review Machine learning methods have been widely applied to big data analysis in genomics and epigenomics research. Although accuracy and efficiency are common goals in many modeling tasks, model interpretability is especially important to these studies towards understanding the underlying molecular and cellular mechanisms. Deep neural networks (DNNs) have recently gained popularity in various types of genomic and epigenomic studies due to their capabilities in utilizing large-scale high-throughput bioinformatics data and achieving high accuracy in predictions and classifications. However, DNNs are often challenged by their potential to explain the predictions due to their black-box nature. In this review, we present current development in the model interpretation of DNNs, focusing on their applications in genomics and epigenomics. We first describe state-of-the-art DNN interpretation methods in representative machine learning fields. We then summarize the DNN interpretation methods in recent studies on genomics and epigenomics, focusing on current data- and computing-intensive topics such as sequence motif identification, genetic variations, gene expression, chromatin interactions and non-coding RNAs. We also present the biological discoveries that resulted from these interpretation methods. We finally discuss the advantages and limitations of current interpretation approaches in the context of genomic and epigenomic studies. Contact:xiaoman@mail.ucf.edu, haihu@cs.ucf.edu Oxford University Press 2020-08-20 /pmc/articles/PMC8138893/ /pubmed/34020542 http://dx.doi.org/10.1093/bib/bbaa177 Text en © The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Method Review
Talukder, Amlan
Barham, Clayton
Li, Xiaoman
Hu, Haiyan
Interpretation of deep learning in genomics and epigenomics
title Interpretation of deep learning in genomics and epigenomics
title_full Interpretation of deep learning in genomics and epigenomics
title_fullStr Interpretation of deep learning in genomics and epigenomics
title_full_unstemmed Interpretation of deep learning in genomics and epigenomics
title_short Interpretation of deep learning in genomics and epigenomics
title_sort interpretation of deep learning in genomics and epigenomics
topic Method Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138893/
https://www.ncbi.nlm.nih.gov/pubmed/34020542
http://dx.doi.org/10.1093/bib/bbaa177
work_keys_str_mv AT talukderamlan interpretationofdeeplearningingenomicsandepigenomics
AT barhamclayton interpretationofdeeplearningingenomicsandepigenomics
AT lixiaoman interpretationofdeeplearningingenomicsandepigenomics
AT huhaiyan interpretationofdeeplearningingenomicsandepigenomics