Cargando…

NetProphet 2.0: mapping transcription factor networks by exploiting scalable data resources

MOTIVATION: Cells process information, in part, through transcription factor (TF) networks, which control the rates at which individual genes produce their products. A TF network map is a graph that indicates which TFs bind and directly regulate each gene. Previous work has described network mapping...

Descripción completa

Detalles Bibliográficos
Autores principales: Kang, Yiming, Liow, Hien-Haw, Maier, Ezekiel J, Brent, Michael R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860202/
https://www.ncbi.nlm.nih.gov/pubmed/28968736
http://dx.doi.org/10.1093/bioinformatics/btx563
_version_ 1783307959985504256
author Kang, Yiming
Liow, Hien-Haw
Maier, Ezekiel J
Brent, Michael R
author_facet Kang, Yiming
Liow, Hien-Haw
Maier, Ezekiel J
Brent, Michael R
author_sort Kang, Yiming
collection PubMed
description MOTIVATION: Cells process information, in part, through transcription factor (TF) networks, which control the rates at which individual genes produce their products. A TF network map is a graph that indicates which TFs bind and directly regulate each gene. Previous work has described network mapping algorithms that rely exclusively on gene expression data and ‘integrative’ algorithms that exploit a wide range of data sources including chromatin immunoprecipitation sequencing (ChIP-seq) of many TFs, genome-wide chromatin marks, and binding specificities for many TFs determined in vitro. However, such resources are available only for a few major model systems and cannot be easily replicated for new organisms or cell types. RESULTS: We present NetProphet 2.0, a ‘data light’ algorithm for TF network mapping, and show that it is more accurate at identifying direct targets of TFs than other, similarly data light algorithms. In particular, it improves on the accuracy of NetProphet 1.0, which used only gene expression data, by exploiting three principles. First, combining multiple approaches to network mapping from expression data can improve accuracy relative to the constituent approaches. Second, TFs with similar DNA binding domains bind similar sets of target genes. Third, even a noisy, preliminary network map can be used to infer DNA binding specificities from promoter sequences and these inferred specificities can be used to further improve the accuracy of the network map. AVAILABILITY AND IMPLEMENTATION: Source code and comprehensive documentation are freely available at https://github.com/yiming-kang/NetProphet_2.0. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5860202
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58602022018-03-21 NetProphet 2.0: mapping transcription factor networks by exploiting scalable data resources Kang, Yiming Liow, Hien-Haw Maier, Ezekiel J Brent, Michael R Bioinformatics Original Papers MOTIVATION: Cells process information, in part, through transcription factor (TF) networks, which control the rates at which individual genes produce their products. A TF network map is a graph that indicates which TFs bind and directly regulate each gene. Previous work has described network mapping algorithms that rely exclusively on gene expression data and ‘integrative’ algorithms that exploit a wide range of data sources including chromatin immunoprecipitation sequencing (ChIP-seq) of many TFs, genome-wide chromatin marks, and binding specificities for many TFs determined in vitro. However, such resources are available only for a few major model systems and cannot be easily replicated for new organisms or cell types. RESULTS: We present NetProphet 2.0, a ‘data light’ algorithm for TF network mapping, and show that it is more accurate at identifying direct targets of TFs than other, similarly data light algorithms. In particular, it improves on the accuracy of NetProphet 1.0, which used only gene expression data, by exploiting three principles. First, combining multiple approaches to network mapping from expression data can improve accuracy relative to the constituent approaches. Second, TFs with similar DNA binding domains bind similar sets of target genes. Third, even a noisy, preliminary network map can be used to infer DNA binding specificities from promoter sequences and these inferred specificities can be used to further improve the accuracy of the network map. AVAILABILITY AND IMPLEMENTATION: Source code and comprehensive documentation are freely available at https://github.com/yiming-kang/NetProphet_2.0. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-01-15 2017-09-12 /pmc/articles/PMC5860202/ /pubmed/28968736 http://dx.doi.org/10.1093/bioinformatics/btx563 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Kang, Yiming
Liow, Hien-Haw
Maier, Ezekiel J
Brent, Michael R
NetProphet 2.0: mapping transcription factor networks by exploiting scalable data resources
title NetProphet 2.0: mapping transcription factor networks by exploiting scalable data resources
title_full NetProphet 2.0: mapping transcription factor networks by exploiting scalable data resources
title_fullStr NetProphet 2.0: mapping transcription factor networks by exploiting scalable data resources
title_full_unstemmed NetProphet 2.0: mapping transcription factor networks by exploiting scalable data resources
title_short NetProphet 2.0: mapping transcription factor networks by exploiting scalable data resources
title_sort netprophet 2.0: mapping transcription factor networks by exploiting scalable data resources
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860202/
https://www.ncbi.nlm.nih.gov/pubmed/28968736
http://dx.doi.org/10.1093/bioinformatics/btx563
work_keys_str_mv AT kangyiming netprophet20mappingtranscriptionfactornetworksbyexploitingscalabledataresources
AT liowhienhaw netprophet20mappingtranscriptionfactornetworksbyexploitingscalabledataresources
AT maierezekielj netprophet20mappingtranscriptionfactornetworksbyexploitingscalabledataresources
AT brentmichaelr netprophet20mappingtranscriptionfactornetworksbyexploitingscalabledataresources