Cargando…

BTR: training asynchronous Boolean models using single-cell expression data

BACKGROUND: Rapid technological innovation for the generation of single-cell genomics data presents new challenges and opportunities for bioinformatics analysis. One such area lies in the development of new ways to train gene regulatory networks. The use of single-cell expression profiling technique...

Descripción completa

Detalles Bibliográficos
Autores principales: Lim, Chee Yee, Wang, Huange, Woodhouse, Steven, Piterman, Nir, Wernisch, Lorenz, Fisher, Jasmin, Göttgens, Berthold
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5012073/
https://www.ncbi.nlm.nih.gov/pubmed/27600248
http://dx.doi.org/10.1186/s12859-016-1235-y
_version_ 1782451951723610112
author Lim, Chee Yee
Wang, Huange
Woodhouse, Steven
Piterman, Nir
Wernisch, Lorenz
Fisher, Jasmin
Göttgens, Berthold
author_facet Lim, Chee Yee
Wang, Huange
Woodhouse, Steven
Piterman, Nir
Wernisch, Lorenz
Fisher, Jasmin
Göttgens, Berthold
author_sort Lim, Chee Yee
collection PubMed
description BACKGROUND: Rapid technological innovation for the generation of single-cell genomics data presents new challenges and opportunities for bioinformatics analysis. One such area lies in the development of new ways to train gene regulatory networks. The use of single-cell expression profiling technique allows the profiling of the expression states of hundreds of cells, but these expression states are typically noisier due to the presence of technical artefacts such as drop-outs. While many algorithms exist to infer a gene regulatory network, very few of them are able to harness the extra expression states present in single-cell expression data without getting adversely affected by the substantial technical noise present. RESULTS: Here we introduce BTR, an algorithm for training asynchronous Boolean models with single-cell expression data using a novel Boolean state space scoring function. BTR is capable of refining existing Boolean models and reconstructing new Boolean models by improving the match between model prediction and expression data. We demonstrate that the Boolean scoring function performed favourably against the BIC scoring function for Bayesian networks. In addition, we show that BTR outperforms many other network inference algorithms in both bulk and single-cell synthetic expression data. Lastly, we introduce two case studies, in which we use BTR to improve published Boolean models in order to generate potentially new biological insights. CONCLUSIONS: BTR provides a novel way to refine or reconstruct Boolean models using single-cell expression data. Boolean model is particularly useful for network reconstruction using single-cell data because it is more robust to the effect of drop-outs. In addition, BTR does not assume any relationship in the expression states among cells, it is useful for reconstructing a gene regulatory network with as few assumptions as possible. Given the simplicity of Boolean models and the rapid adoption of single-cell genomics by biologists, BTR has the potential to make an impact across many fields of biomedical research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1235-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5012073
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50120732016-09-15 BTR: training asynchronous Boolean models using single-cell expression data Lim, Chee Yee Wang, Huange Woodhouse, Steven Piterman, Nir Wernisch, Lorenz Fisher, Jasmin Göttgens, Berthold BMC Bioinformatics Methodology Article BACKGROUND: Rapid technological innovation for the generation of single-cell genomics data presents new challenges and opportunities for bioinformatics analysis. One such area lies in the development of new ways to train gene regulatory networks. The use of single-cell expression profiling technique allows the profiling of the expression states of hundreds of cells, but these expression states are typically noisier due to the presence of technical artefacts such as drop-outs. While many algorithms exist to infer a gene regulatory network, very few of them are able to harness the extra expression states present in single-cell expression data without getting adversely affected by the substantial technical noise present. RESULTS: Here we introduce BTR, an algorithm for training asynchronous Boolean models with single-cell expression data using a novel Boolean state space scoring function. BTR is capable of refining existing Boolean models and reconstructing new Boolean models by improving the match between model prediction and expression data. We demonstrate that the Boolean scoring function performed favourably against the BIC scoring function for Bayesian networks. In addition, we show that BTR outperforms many other network inference algorithms in both bulk and single-cell synthetic expression data. Lastly, we introduce two case studies, in which we use BTR to improve published Boolean models in order to generate potentially new biological insights. CONCLUSIONS: BTR provides a novel way to refine or reconstruct Boolean models using single-cell expression data. Boolean model is particularly useful for network reconstruction using single-cell data because it is more robust to the effect of drop-outs. In addition, BTR does not assume any relationship in the expression states among cells, it is useful for reconstructing a gene regulatory network with as few assumptions as possible. Given the simplicity of Boolean models and the rapid adoption of single-cell genomics by biologists, BTR has the potential to make an impact across many fields of biomedical research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1235-y) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-06 /pmc/articles/PMC5012073/ /pubmed/27600248 http://dx.doi.org/10.1186/s12859-016-1235-y Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Lim, Chee Yee
Wang, Huange
Woodhouse, Steven
Piterman, Nir
Wernisch, Lorenz
Fisher, Jasmin
Göttgens, Berthold
BTR: training asynchronous Boolean models using single-cell expression data
title BTR: training asynchronous Boolean models using single-cell expression data
title_full BTR: training asynchronous Boolean models using single-cell expression data
title_fullStr BTR: training asynchronous Boolean models using single-cell expression data
title_full_unstemmed BTR: training asynchronous Boolean models using single-cell expression data
title_short BTR: training asynchronous Boolean models using single-cell expression data
title_sort btr: training asynchronous boolean models using single-cell expression data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5012073/
https://www.ncbi.nlm.nih.gov/pubmed/27600248
http://dx.doi.org/10.1186/s12859-016-1235-y
work_keys_str_mv AT limcheeyee btrtrainingasynchronousbooleanmodelsusingsinglecellexpressiondata
AT wanghuange btrtrainingasynchronousbooleanmodelsusingsinglecellexpressiondata
AT woodhousesteven btrtrainingasynchronousbooleanmodelsusingsinglecellexpressiondata
AT pitermannir btrtrainingasynchronousbooleanmodelsusingsinglecellexpressiondata
AT wernischlorenz btrtrainingasynchronousbooleanmodelsusingsinglecellexpressiondata
AT fisherjasmin btrtrainingasynchronousbooleanmodelsusingsinglecellexpressiondata
AT gottgensberthold btrtrainingasynchronousbooleanmodelsusingsinglecellexpressiondata