Cargando…

Imputation for transcription factor binding predictions based on deep learning

Understanding the cell-specific binding patterns of transcription factors (TFs) is fundamental to studying gene regulatory networks in biological systems, for which ChIP-seq not only provides valuable data but is also considered as the gold standard. Despite tremendous efforts from the scientific co...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Qian, Feng, Jianxing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345877/
https://www.ncbi.nlm.nih.gov/pubmed/28234893
http://dx.doi.org/10.1371/journal.pcbi.1005403
_version_ 1782513798484066304
author Qin, Qian
Feng, Jianxing
author_facet Qin, Qian
Feng, Jianxing
author_sort Qin, Qian
collection PubMed
description Understanding the cell-specific binding patterns of transcription factors (TFs) is fundamental to studying gene regulatory networks in biological systems, for which ChIP-seq not only provides valuable data but is also considered as the gold standard. Despite tremendous efforts from the scientific community to conduct TF ChIP-seq experiments, the available data represent only a limited percentage of ChIP-seq experiments, considering all possible combinations of TFs and cell lines. In this study, we demonstrate a method for accurately predicting cell-specific TF binding for TF-cell line combinations based on only a small fraction (4%) of the combinations using available ChIP-seq data. The proposed model, termed TFImpute, is based on a deep neural network with a multi-task learning setting to borrow information across transcription factors and cell lines. Compared with existing methods, TFImpute achieves comparable accuracy on TF-cell line combinations with ChIP-seq data; moreover, TFImpute achieves better accuracy on TF-cell line combinations without ChIP-seq data. This approach can predict cell line specific enhancer activities in K562 and HepG2 cell lines, as measured by massively parallel reporter assays, and predicts the impact of SNPs on TF binding.
format Online
Article
Text
id pubmed-5345877
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53458772017-03-29 Imputation for transcription factor binding predictions based on deep learning Qin, Qian Feng, Jianxing PLoS Comput Biol Research Article Understanding the cell-specific binding patterns of transcription factors (TFs) is fundamental to studying gene regulatory networks in biological systems, for which ChIP-seq not only provides valuable data but is also considered as the gold standard. Despite tremendous efforts from the scientific community to conduct TF ChIP-seq experiments, the available data represent only a limited percentage of ChIP-seq experiments, considering all possible combinations of TFs and cell lines. In this study, we demonstrate a method for accurately predicting cell-specific TF binding for TF-cell line combinations based on only a small fraction (4%) of the combinations using available ChIP-seq data. The proposed model, termed TFImpute, is based on a deep neural network with a multi-task learning setting to borrow information across transcription factors and cell lines. Compared with existing methods, TFImpute achieves comparable accuracy on TF-cell line combinations with ChIP-seq data; moreover, TFImpute achieves better accuracy on TF-cell line combinations without ChIP-seq data. This approach can predict cell line specific enhancer activities in K562 and HepG2 cell lines, as measured by massively parallel reporter assays, and predicts the impact of SNPs on TF binding. Public Library of Science 2017-02-24 /pmc/articles/PMC5345877/ /pubmed/28234893 http://dx.doi.org/10.1371/journal.pcbi.1005403 Text en © 2017 Qin, Feng http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Qin, Qian
Feng, Jianxing
Imputation for transcription factor binding predictions based on deep learning
title Imputation for transcription factor binding predictions based on deep learning
title_full Imputation for transcription factor binding predictions based on deep learning
title_fullStr Imputation for transcription factor binding predictions based on deep learning
title_full_unstemmed Imputation for transcription factor binding predictions based on deep learning
title_short Imputation for transcription factor binding predictions based on deep learning
title_sort imputation for transcription factor binding predictions based on deep learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345877/
https://www.ncbi.nlm.nih.gov/pubmed/28234893
http://dx.doi.org/10.1371/journal.pcbi.1005403
work_keys_str_mv AT qinqian imputationfortranscriptionfactorbindingpredictionsbasedondeeplearning
AT fengjianxing imputationfortranscriptionfactorbindingpredictionsbasedondeeplearning