Cargando…

A neural network based model effectively predicts enhancers from clinical ATAC-seq samples

Enhancers are cis-acting sequences that regulate transcription rates of their target genes in a cell-specific manner and harbor disease-associated sequence variants in cognate cell types. Many complex diseases are associated with enhancer malfunction, necessitating the discovery and study of enhance...

Descripción completa

Detalles Bibliográficos
Autores principales: Thibodeau, Asa, Uyar, Asli, Khetan, Shubham, Stitzel, Michael L., Ucar, Duygu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6207744/
https://www.ncbi.nlm.nih.gov/pubmed/30375457
http://dx.doi.org/10.1038/s41598-018-34420-9
_version_ 1783366574797750272
author Thibodeau, Asa
Uyar, Asli
Khetan, Shubham
Stitzel, Michael L.
Ucar, Duygu
author_facet Thibodeau, Asa
Uyar, Asli
Khetan, Shubham
Stitzel, Michael L.
Ucar, Duygu
author_sort Thibodeau, Asa
collection PubMed
description Enhancers are cis-acting sequences that regulate transcription rates of their target genes in a cell-specific manner and harbor disease-associated sequence variants in cognate cell types. Many complex diseases are associated with enhancer malfunction, necessitating the discovery and study of enhancers from clinical samples. Assay for Transposase Accessible Chromatin (ATAC-seq) technology can interrogate chromatin accessibility from small cell numbers and facilitate studying enhancers in pathologies. However, on average, ~35% of open chromatin regions (OCRs) from ATAC-seq samples map to enhancers. We developed a neural network-based model, Predicting Enhancers from ATAC-Seq data (PEAS), to effectively infer enhancers from clinical ATAC-seq samples by extracting ATAC-seq data features and integrating these with sequence-related features (e.g., GC ratio). PEAS recapitulated ChromHMM-defined enhancers in CD14+ monocytes, CD4+ T cells, GM12878, peripheral blood mononuclear cells, and pancreatic islets. PEAS models trained on these 5 cell types effectively predicted enhancers in four cell types that are not used in model training (EndoC-βH1, naïve CD8+ T, MCF7, and K562 cells). Finally, PEAS inferred individual-specific enhancers from 19 islet ATAC-seq samples and revealed variability in enhancer activity across individuals, including those driven by genetic differences. PEAS is an easy-to-use tool developed to study enhancers in pathologies by taking advantage of the increasing number of clinical epigenomes.
format Online
Article
Text
id pubmed-6207744
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-62077442018-11-01 A neural network based model effectively predicts enhancers from clinical ATAC-seq samples Thibodeau, Asa Uyar, Asli Khetan, Shubham Stitzel, Michael L. Ucar, Duygu Sci Rep Article Enhancers are cis-acting sequences that regulate transcription rates of their target genes in a cell-specific manner and harbor disease-associated sequence variants in cognate cell types. Many complex diseases are associated with enhancer malfunction, necessitating the discovery and study of enhancers from clinical samples. Assay for Transposase Accessible Chromatin (ATAC-seq) technology can interrogate chromatin accessibility from small cell numbers and facilitate studying enhancers in pathologies. However, on average, ~35% of open chromatin regions (OCRs) from ATAC-seq samples map to enhancers. We developed a neural network-based model, Predicting Enhancers from ATAC-Seq data (PEAS), to effectively infer enhancers from clinical ATAC-seq samples by extracting ATAC-seq data features and integrating these with sequence-related features (e.g., GC ratio). PEAS recapitulated ChromHMM-defined enhancers in CD14+ monocytes, CD4+ T cells, GM12878, peripheral blood mononuclear cells, and pancreatic islets. PEAS models trained on these 5 cell types effectively predicted enhancers in four cell types that are not used in model training (EndoC-βH1, naïve CD8+ T, MCF7, and K562 cells). Finally, PEAS inferred individual-specific enhancers from 19 islet ATAC-seq samples and revealed variability in enhancer activity across individuals, including those driven by genetic differences. PEAS is an easy-to-use tool developed to study enhancers in pathologies by taking advantage of the increasing number of clinical epigenomes. Nature Publishing Group UK 2018-10-30 /pmc/articles/PMC6207744/ /pubmed/30375457 http://dx.doi.org/10.1038/s41598-018-34420-9 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Thibodeau, Asa
Uyar, Asli
Khetan, Shubham
Stitzel, Michael L.
Ucar, Duygu
A neural network based model effectively predicts enhancers from clinical ATAC-seq samples
title A neural network based model effectively predicts enhancers from clinical ATAC-seq samples
title_full A neural network based model effectively predicts enhancers from clinical ATAC-seq samples
title_fullStr A neural network based model effectively predicts enhancers from clinical ATAC-seq samples
title_full_unstemmed A neural network based model effectively predicts enhancers from clinical ATAC-seq samples
title_short A neural network based model effectively predicts enhancers from clinical ATAC-seq samples
title_sort neural network based model effectively predicts enhancers from clinical atac-seq samples
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6207744/
https://www.ncbi.nlm.nih.gov/pubmed/30375457
http://dx.doi.org/10.1038/s41598-018-34420-9
work_keys_str_mv AT thibodeauasa aneuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples
AT uyarasli aneuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples
AT khetanshubham aneuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples
AT stitzelmichaell aneuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples
AT ucarduygu aneuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples
AT thibodeauasa neuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples
AT uyarasli neuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples
AT khetanshubham neuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples
AT stitzelmichaell neuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples
AT ucarduygu neuralnetworkbasedmodeleffectivelypredictsenhancersfromclinicalatacseqsamples