Cargando…

Denoising genome-wide histone ChIP-seq with convolutional neural networks

MOTIVATION: Chromatin immune-precipitation sequencing (ChIP-seq) experiments are commonly used to obtain genome-wide profiles of histone modifications associated with different types of functional genomic elements. However, the quality of histone ChIP-seq data is affected by many experimental parame...

Descripción completa

Detalles Bibliográficos
Autores principales: Koh, Pang Wei, Pierson, Emma, Kundaje, Anshul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870713/
https://www.ncbi.nlm.nih.gov/pubmed/28881977
http://dx.doi.org/10.1093/bioinformatics/btx243
_version_ 1783309538889302016
author Koh, Pang Wei
Pierson, Emma
Kundaje, Anshul
author_facet Koh, Pang Wei
Pierson, Emma
Kundaje, Anshul
author_sort Koh, Pang Wei
collection PubMed
description MOTIVATION: Chromatin immune-precipitation sequencing (ChIP-seq) experiments are commonly used to obtain genome-wide profiles of histone modifications associated with different types of functional genomic elements. However, the quality of histone ChIP-seq data is affected by many experimental parameters such as the amount of input DNA, antibody specificity, ChIP enrichment and sequencing depth. Making accurate inferences from chromatin profiling experiments that involve diverse experimental parameters is challenging. RESULTS: We introduce a convolutional denoising algorithm, Coda, that uses convolutional neural networks to learn a mapping from suboptimal to high-quality histone ChIP-seq data. This overcomes various sources of noise and variability, substantially enhancing and recovering signal when applied to low-quality chromatin profiling datasets across individuals, cell types and species. Our method has the potential to improve data quality at reduced costs. More broadly, this approach—using a high-dimensional discriminative model to encode a generative noise process—is generally applicable to other biological domains where it is easy to generate noisy data but difficult to analytically characterize the noise or underlying data distribution. AVAILABILITY AND IMPLEMENTATION: https://github.com/kundajelab/coda.
format Online
Article
Text
id pubmed-5870713
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58707132018-04-05 Denoising genome-wide histone ChIP-seq with convolutional neural networks Koh, Pang Wei Pierson, Emma Kundaje, Anshul Bioinformatics Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017 MOTIVATION: Chromatin immune-precipitation sequencing (ChIP-seq) experiments are commonly used to obtain genome-wide profiles of histone modifications associated with different types of functional genomic elements. However, the quality of histone ChIP-seq data is affected by many experimental parameters such as the amount of input DNA, antibody specificity, ChIP enrichment and sequencing depth. Making accurate inferences from chromatin profiling experiments that involve diverse experimental parameters is challenging. RESULTS: We introduce a convolutional denoising algorithm, Coda, that uses convolutional neural networks to learn a mapping from suboptimal to high-quality histone ChIP-seq data. This overcomes various sources of noise and variability, substantially enhancing and recovering signal when applied to low-quality chromatin profiling datasets across individuals, cell types and species. Our method has the potential to improve data quality at reduced costs. More broadly, this approach—using a high-dimensional discriminative model to encode a generative noise process—is generally applicable to other biological domains where it is easy to generate noisy data but difficult to analytically characterize the noise or underlying data distribution. AVAILABILITY AND IMPLEMENTATION: https://github.com/kundajelab/coda. Oxford University Press 2017-07-15 2017-07-12 /pmc/articles/PMC5870713/ /pubmed/28881977 http://dx.doi.org/10.1093/bioinformatics/btx243 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017
Koh, Pang Wei
Pierson, Emma
Kundaje, Anshul
Denoising genome-wide histone ChIP-seq with convolutional neural networks
title Denoising genome-wide histone ChIP-seq with convolutional neural networks
title_full Denoising genome-wide histone ChIP-seq with convolutional neural networks
title_fullStr Denoising genome-wide histone ChIP-seq with convolutional neural networks
title_full_unstemmed Denoising genome-wide histone ChIP-seq with convolutional neural networks
title_short Denoising genome-wide histone ChIP-seq with convolutional neural networks
title_sort denoising genome-wide histone chip-seq with convolutional neural networks
topic Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870713/
https://www.ncbi.nlm.nih.gov/pubmed/28881977
http://dx.doi.org/10.1093/bioinformatics/btx243
work_keys_str_mv AT kohpangwei denoisinggenomewidehistonechipseqwithconvolutionalneuralnetworks
AT piersonemma denoisinggenomewidehistonechipseqwithconvolutionalneuralnetworks
AT kundajeanshul denoisinggenomewidehistonechipseqwithconvolutionalneuralnetworks