Cargando…

DeepCAGE: Incorporating Transcription Factors in Genome-wide Prediction of Chromatin Accessibility

Although computational approaches have been complementing high-throughput biological experiments for the identification of functional regions in the human genome, it remains a great challenge to systematically decipher interactions between transcription factors (TFs) and regulatory elements to achie...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Qiao, Hua, Kui, Zhang, Xuegong, Wong, Wing Hung, Jiang, Rui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9801045/
https://www.ncbi.nlm.nih.gov/pubmed/35293310
http://dx.doi.org/10.1016/j.gpb.2021.08.015
_version_ 1784861416401928192
author Liu, Qiao
Hua, Kui
Zhang, Xuegong
Wong, Wing Hung
Jiang, Rui
author_facet Liu, Qiao
Hua, Kui
Zhang, Xuegong
Wong, Wing Hung
Jiang, Rui
author_sort Liu, Qiao
collection PubMed
description Although computational approaches have been complementing high-throughput biological experiments for the identification of functional regions in the human genome, it remains a great challenge to systematically decipher interactions between transcription factors (TFs) and regulatory elements to achieve interpretable annotations of chromatin accessibility across diverse cellular contexts. To solve this problem, we propose DeepCAGE, a deep learning framework that integrates sequence information and binding statuses of TFs, for the accurate prediction of chromatin accessible regions at a genome-wide scale in a variety of cell types. DeepCAGE takes advantage of a densely connected deep convolutional neural network architecture to automatically learn sequence signatures of known chromatin accessible regions and then incorporates such features with expression levels and binding activities of human core TFs to predict novel chromatin accessible regions. In a series of systematic comparisons with existing methods, DeepCAGE exhibits superior performance in not only the classification but also the regression of chromatin accessibility signals. In a detailed analysis of TF activities, DeepCAGE successfully extracts novel binding motifs and measures the contribution of a TF to the regulation with respect to a specific locus in a certain cell type. When applied to whole-genome sequencing data analysis, our method successfully prioritizes putative deleterious variants underlying a human complex trait and thus provides insights into the understanding of disease-associated genetic variants. DeepCAGE can be downloaded from https://github.com/kimmo1019/DeepCAGE.
format Online
Article
Text
id pubmed-9801045
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-98010452022-12-31 DeepCAGE: Incorporating Transcription Factors in Genome-wide Prediction of Chromatin Accessibility Liu, Qiao Hua, Kui Zhang, Xuegong Wong, Wing Hung Jiang, Rui Genomics Proteomics Bioinformatics Method Although computational approaches have been complementing high-throughput biological experiments for the identification of functional regions in the human genome, it remains a great challenge to systematically decipher interactions between transcription factors (TFs) and regulatory elements to achieve interpretable annotations of chromatin accessibility across diverse cellular contexts. To solve this problem, we propose DeepCAGE, a deep learning framework that integrates sequence information and binding statuses of TFs, for the accurate prediction of chromatin accessible regions at a genome-wide scale in a variety of cell types. DeepCAGE takes advantage of a densely connected deep convolutional neural network architecture to automatically learn sequence signatures of known chromatin accessible regions and then incorporates such features with expression levels and binding activities of human core TFs to predict novel chromatin accessible regions. In a series of systematic comparisons with existing methods, DeepCAGE exhibits superior performance in not only the classification but also the regression of chromatin accessibility signals. In a detailed analysis of TF activities, DeepCAGE successfully extracts novel binding motifs and measures the contribution of a TF to the regulation with respect to a specific locus in a certain cell type. When applied to whole-genome sequencing data analysis, our method successfully prioritizes putative deleterious variants underlying a human complex trait and thus provides insights into the understanding of disease-associated genetic variants. DeepCAGE can be downloaded from https://github.com/kimmo1019/DeepCAGE. Elsevier 2022-06 2022-03-12 /pmc/articles/PMC9801045/ /pubmed/35293310 http://dx.doi.org/10.1016/j.gpb.2021.08.015 Text en © 2022 The Authors. Published by Elsevier B.V. and Science Press on behalf of Beijing Institute of Genomics, Chinese Academy of Sciences / China National Center for Bioinformation and Genetics Society of China. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Method
Liu, Qiao
Hua, Kui
Zhang, Xuegong
Wong, Wing Hung
Jiang, Rui
DeepCAGE: Incorporating Transcription Factors in Genome-wide Prediction of Chromatin Accessibility
title DeepCAGE: Incorporating Transcription Factors in Genome-wide Prediction of Chromatin Accessibility
title_full DeepCAGE: Incorporating Transcription Factors in Genome-wide Prediction of Chromatin Accessibility
title_fullStr DeepCAGE: Incorporating Transcription Factors in Genome-wide Prediction of Chromatin Accessibility
title_full_unstemmed DeepCAGE: Incorporating Transcription Factors in Genome-wide Prediction of Chromatin Accessibility
title_short DeepCAGE: Incorporating Transcription Factors in Genome-wide Prediction of Chromatin Accessibility
title_sort deepcage: incorporating transcription factors in genome-wide prediction of chromatin accessibility
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9801045/
https://www.ncbi.nlm.nih.gov/pubmed/35293310
http://dx.doi.org/10.1016/j.gpb.2021.08.015
work_keys_str_mv AT liuqiao deepcageincorporatingtranscriptionfactorsingenomewidepredictionofchromatinaccessibility
AT huakui deepcageincorporatingtranscriptionfactorsingenomewidepredictionofchromatinaccessibility
AT zhangxuegong deepcageincorporatingtranscriptionfactorsingenomewidepredictionofchromatinaccessibility
AT wongwinghung deepcageincorporatingtranscriptionfactorsingenomewidepredictionofchromatinaccessibility
AT jiangrui deepcageincorporatingtranscriptionfactorsingenomewidepredictionofchromatinaccessibility