Cargando…

CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection

ChIP-seq is one of the core experimental resources available to understand genome-wide epigenetic interactions and identify the functional elements associated with diseases. The analysis of ChIP-seq data is important but poses a difficult computational challenge, due to the presence of irregular noi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Oh, Dongpin, Strattan, J. Seth, Hur, Junho K., Bento, José, Urban, Alexander Eckehart, Song, Giltae, Cherry, J. Michael
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7220942/ https://www.ncbi.nlm.nih.gov/pubmed/32404971 http://dx.doi.org/10.1038/s41598-020-64655-4

_version_	1783533266881478656
author	Oh, Dongpin Strattan, J. Seth Hur, Junho K. Bento, José Urban, Alexander Eckehart Song, Giltae Cherry, J. Michael
author_facet	Oh, Dongpin Strattan, J. Seth Hur, Junho K. Bento, José Urban, Alexander Eckehart Song, Giltae Cherry, J. Michael
author_sort	Oh, Dongpin
collection	PubMed
description	ChIP-seq is one of the core experimental resources available to understand genome-wide epigenetic interactions and identify the functional elements associated with diseases. The analysis of ChIP-seq data is important but poses a difficult computational challenge, due to the presence of irregular noise and bias on various levels. Although many peak-calling methods have been developed, the current computational tools still require, in some cases, human manual inspection using data visualization. However, the huge volumes of ChIP-seq data make it almost impossible for human researchers to manually uncover all the peaks. Recently developed convolutional neural networks (CNN), which are capable of achieving human-like classification accuracy, can be applied to this challenging problem. In this study, we design a novel supervised learning approach for identifying ChIP-seq peaks using CNNs, and integrate it into a software pipeline called CNN-Peaks. We use data labeled by human researchers who annotate the presence or absence of peaks in some genomic segments, as training data for our model. The trained model is then applied to predict peaks in previously unseen genomic segments from multiple ChIP-seq datasets including benchmark datasets commonly used for validation of peak calling methods. We observe a performance superior to that of previous methods.
format	Online Article Text
id	pubmed-7220942
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-72209422020-05-20 CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection Oh, Dongpin Strattan, J. Seth Hur, Junho K. Bento, José Urban, Alexander Eckehart Song, Giltae Cherry, J. Michael Sci Rep Article ChIP-seq is one of the core experimental resources available to understand genome-wide epigenetic interactions and identify the functional elements associated with diseases. The analysis of ChIP-seq data is important but poses a difficult computational challenge, due to the presence of irregular noise and bias on various levels. Although many peak-calling methods have been developed, the current computational tools still require, in some cases, human manual inspection using data visualization. However, the huge volumes of ChIP-seq data make it almost impossible for human researchers to manually uncover all the peaks. Recently developed convolutional neural networks (CNN), which are capable of achieving human-like classification accuracy, can be applied to this challenging problem. In this study, we design a novel supervised learning approach for identifying ChIP-seq peaks using CNNs, and integrate it into a software pipeline called CNN-Peaks. We use data labeled by human researchers who annotate the presence or absence of peaks in some genomic segments, as training data for our model. The trained model is then applied to predict peaks in previously unseen genomic segments from multiple ChIP-seq datasets including benchmark datasets commonly used for validation of peak calling methods. We observe a performance superior to that of previous methods. Nature Publishing Group UK 2020-05-13 /pmc/articles/PMC7220942/ /pubmed/32404971 http://dx.doi.org/10.1038/s41598-020-64655-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Oh, Dongpin Strattan, J. Seth Hur, Junho K. Bento, José Urban, Alexander Eckehart Song, Giltae Cherry, J. Michael CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection
title	CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection
title_full	CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection
title_fullStr	CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection
title_full_unstemmed	CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection
title_short	CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection
title_sort	cnn-peaks: chip-seq peak detection pipeline using convolutional neural networks that imitate human visual inspection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7220942/ https://www.ncbi.nlm.nih.gov/pubmed/32404971 http://dx.doi.org/10.1038/s41598-020-64655-4
work_keys_str_mv	AT ohdongpin cnnpeakschipseqpeakdetectionpipelineusingconvolutionalneuralnetworksthatimitatehumanvisualinspection AT strattanjseth cnnpeakschipseqpeakdetectionpipelineusingconvolutionalneuralnetworksthatimitatehumanvisualinspection AT hurjunhok cnnpeakschipseqpeakdetectionpipelineusingconvolutionalneuralnetworksthatimitatehumanvisualinspection AT bentojose cnnpeakschipseqpeakdetectionpipelineusingconvolutionalneuralnetworksthatimitatehumanvisualinspection AT urbanalexandereckehart cnnpeakschipseqpeakdetectionpipelineusingconvolutionalneuralnetworksthatimitatehumanvisualinspection AT songgiltae cnnpeakschipseqpeakdetectionpipelineusingconvolutionalneuralnetworksthatimitatehumanvisualinspection AT cherryjmichael cnnpeakschipseqpeakdetectionpipelineusingconvolutionalneuralnetworksthatimitatehumanvisualinspection

CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection

Ejemplares similares