Cargando…
Interpretation of deep learning in genomics and epigenomics
Machine learning methods have been widely applied to big data analysis in genomics and epigenomics research. Although accuracy and efficiency are common goals in many modeling tasks, model interpretability is especially important to these studies towards understanding the underlying molecular and ce...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138893/ https://www.ncbi.nlm.nih.gov/pubmed/34020542 http://dx.doi.org/10.1093/bib/bbaa177 |
_version_ | 1783695896084480000 |
---|---|
author | Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan |
author_facet | Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan |
author_sort | Talukder, Amlan |
collection | PubMed |
description | Machine learning methods have been widely applied to big data analysis in genomics and epigenomics research. Although accuracy and efficiency are common goals in many modeling tasks, model interpretability is especially important to these studies towards understanding the underlying molecular and cellular mechanisms. Deep neural networks (DNNs) have recently gained popularity in various types of genomic and epigenomic studies due to their capabilities in utilizing large-scale high-throughput bioinformatics data and achieving high accuracy in predictions and classifications. However, DNNs are often challenged by their potential to explain the predictions due to their black-box nature. In this review, we present current development in the model interpretation of DNNs, focusing on their applications in genomics and epigenomics. We first describe state-of-the-art DNN interpretation methods in representative machine learning fields. We then summarize the DNN interpretation methods in recent studies on genomics and epigenomics, focusing on current data- and computing-intensive topics such as sequence motif identification, genetic variations, gene expression, chromatin interactions and non-coding RNAs. We also present the biological discoveries that resulted from these interpretation methods. We finally discuss the advantages and limitations of current interpretation approaches in the context of genomic and epigenomic studies. Contact:xiaoman@mail.ucf.edu, haihu@cs.ucf.edu |
format | Online Article Text |
id | pubmed-8138893 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-81388932021-05-25 Interpretation of deep learning in genomics and epigenomics Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan Brief Bioinform Method Review Machine learning methods have been widely applied to big data analysis in genomics and epigenomics research. Although accuracy and efficiency are common goals in many modeling tasks, model interpretability is especially important to these studies towards understanding the underlying molecular and cellular mechanisms. Deep neural networks (DNNs) have recently gained popularity in various types of genomic and epigenomic studies due to their capabilities in utilizing large-scale high-throughput bioinformatics data and achieving high accuracy in predictions and classifications. However, DNNs are often challenged by their potential to explain the predictions due to their black-box nature. In this review, we present current development in the model interpretation of DNNs, focusing on their applications in genomics and epigenomics. We first describe state-of-the-art DNN interpretation methods in representative machine learning fields. We then summarize the DNN interpretation methods in recent studies on genomics and epigenomics, focusing on current data- and computing-intensive topics such as sequence motif identification, genetic variations, gene expression, chromatin interactions and non-coding RNAs. We also present the biological discoveries that resulted from these interpretation methods. We finally discuss the advantages and limitations of current interpretation approaches in the context of genomic and epigenomic studies. Contact:xiaoman@mail.ucf.edu, haihu@cs.ucf.edu Oxford University Press 2020-08-20 /pmc/articles/PMC8138893/ /pubmed/34020542 http://dx.doi.org/10.1093/bib/bbaa177 Text en © The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Method Review Talukder, Amlan Barham, Clayton Li, Xiaoman Hu, Haiyan Interpretation of deep learning in genomics and epigenomics |
title | Interpretation of deep learning in genomics and epigenomics |
title_full | Interpretation of deep learning in genomics and epigenomics |
title_fullStr | Interpretation of deep learning in genomics and epigenomics |
title_full_unstemmed | Interpretation of deep learning in genomics and epigenomics |
title_short | Interpretation of deep learning in genomics and epigenomics |
title_sort | interpretation of deep learning in genomics and epigenomics |
topic | Method Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8138893/ https://www.ncbi.nlm.nih.gov/pubmed/34020542 http://dx.doi.org/10.1093/bib/bbaa177 |
work_keys_str_mv | AT talukderamlan interpretationofdeeplearningingenomicsandepigenomics AT barhamclayton interpretationofdeeplearningingenomicsandepigenomics AT lixiaoman interpretationofdeeplearningingenomicsandepigenomics AT huhaiyan interpretationofdeeplearningingenomicsandepigenomics |