Cargando…
Imputation for transcription factor binding predictions based on deep learning
Understanding the cell-specific binding patterns of transcription factors (TFs) is fundamental to studying gene regulatory networks in biological systems, for which ChIP-seq not only provides valuable data but is also considered as the gold standard. Despite tremendous efforts from the scientific co...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345877/ https://www.ncbi.nlm.nih.gov/pubmed/28234893 http://dx.doi.org/10.1371/journal.pcbi.1005403 |
_version_ | 1782513798484066304 |
---|---|
author | Qin, Qian Feng, Jianxing |
author_facet | Qin, Qian Feng, Jianxing |
author_sort | Qin, Qian |
collection | PubMed |
description | Understanding the cell-specific binding patterns of transcription factors (TFs) is fundamental to studying gene regulatory networks in biological systems, for which ChIP-seq not only provides valuable data but is also considered as the gold standard. Despite tremendous efforts from the scientific community to conduct TF ChIP-seq experiments, the available data represent only a limited percentage of ChIP-seq experiments, considering all possible combinations of TFs and cell lines. In this study, we demonstrate a method for accurately predicting cell-specific TF binding for TF-cell line combinations based on only a small fraction (4%) of the combinations using available ChIP-seq data. The proposed model, termed TFImpute, is based on a deep neural network with a multi-task learning setting to borrow information across transcription factors and cell lines. Compared with existing methods, TFImpute achieves comparable accuracy on TF-cell line combinations with ChIP-seq data; moreover, TFImpute achieves better accuracy on TF-cell line combinations without ChIP-seq data. This approach can predict cell line specific enhancer activities in K562 and HepG2 cell lines, as measured by massively parallel reporter assays, and predicts the impact of SNPs on TF binding. |
format | Online Article Text |
id | pubmed-5345877 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-53458772017-03-29 Imputation for transcription factor binding predictions based on deep learning Qin, Qian Feng, Jianxing PLoS Comput Biol Research Article Understanding the cell-specific binding patterns of transcription factors (TFs) is fundamental to studying gene regulatory networks in biological systems, for which ChIP-seq not only provides valuable data but is also considered as the gold standard. Despite tremendous efforts from the scientific community to conduct TF ChIP-seq experiments, the available data represent only a limited percentage of ChIP-seq experiments, considering all possible combinations of TFs and cell lines. In this study, we demonstrate a method for accurately predicting cell-specific TF binding for TF-cell line combinations based on only a small fraction (4%) of the combinations using available ChIP-seq data. The proposed model, termed TFImpute, is based on a deep neural network with a multi-task learning setting to borrow information across transcription factors and cell lines. Compared with existing methods, TFImpute achieves comparable accuracy on TF-cell line combinations with ChIP-seq data; moreover, TFImpute achieves better accuracy on TF-cell line combinations without ChIP-seq data. This approach can predict cell line specific enhancer activities in K562 and HepG2 cell lines, as measured by massively parallel reporter assays, and predicts the impact of SNPs on TF binding. Public Library of Science 2017-02-24 /pmc/articles/PMC5345877/ /pubmed/28234893 http://dx.doi.org/10.1371/journal.pcbi.1005403 Text en © 2017 Qin, Feng http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Qin, Qian Feng, Jianxing Imputation for transcription factor binding predictions based on deep learning |
title | Imputation for transcription factor binding predictions based on deep learning |
title_full | Imputation for transcription factor binding predictions based on deep learning |
title_fullStr | Imputation for transcription factor binding predictions based on deep learning |
title_full_unstemmed | Imputation for transcription factor binding predictions based on deep learning |
title_short | Imputation for transcription factor binding predictions based on deep learning |
title_sort | imputation for transcription factor binding predictions based on deep learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345877/ https://www.ncbi.nlm.nih.gov/pubmed/28234893 http://dx.doi.org/10.1371/journal.pcbi.1005403 |
work_keys_str_mv | AT qinqian imputationfortranscriptionfactorbindingpredictionsbasedondeeplearning AT fengjianxing imputationfortranscriptionfactorbindingpredictionsbasedondeeplearning |