Cargando…

Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data

BACKGROUND: A fundamental fact in biology states that genes do not operate in isolation, and yet, methods that infer regulatory networks for single cell gene expression data have been slow to emerge. With single cell sequencing methods now becoming accessible, general network inference algorithms th...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Shuonan, Mar, Jessica C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6006753/
https://www.ncbi.nlm.nih.gov/pubmed/29914350
http://dx.doi.org/10.1186/s12859-018-2217-z
_version_ 1783332901720424448
author Chen, Shuonan
Mar, Jessica C.
author_facet Chen, Shuonan
Mar, Jessica C.
author_sort Chen, Shuonan
collection PubMed
description BACKGROUND: A fundamental fact in biology states that genes do not operate in isolation, and yet, methods that infer regulatory networks for single cell gene expression data have been slow to emerge. With single cell sequencing methods now becoming accessible, general network inference algorithms that were initially developed for data collected from bulk samples may not be suitable for single cells. Meanwhile, although methods that are specific for single cell data are now emerging, whether they have improved performance over general methods is unknown. In this study, we evaluate the applicability of five general methods and three single cell methods for inferring gene regulatory networks from both experimental single cell gene expression data and in silico simulated data. RESULTS: Standard evaluation metrics using ROC curves and Precision-Recall curves against reference sets sourced from the literature demonstrated that most of the methods performed poorly when they were applied to either experimental single cell data, or simulated single cell data, which demonstrates their lack of performance for this task. Using default settings, network methods were applied to the same datasets. Comparisons of the learned networks highlighted the uniqueness of some predicted edges for each method. The fact that different methods infer networks that vary substantially reflects the underlying mathematical rationale and assumptions that distinguish network methods from each other. CONCLUSIONS: This study provides a comprehensive evaluation of network modeling algorithms applied to experimental single cell gene expression data and in silico simulated datasets where the network structure is known. Comparisons demonstrate that most of these assessed network methods are not able to predict network structures from single cell expression data accurately, even if they are specifically developed for single cell methods. Also, single cell methods, which usually depend on more elaborative algorithms, in general have less similarity to each other in the sets of edges detected. The results from this study emphasize the importance for developing more accurate optimized network modeling methods that are compatible for single cell data. Newly-developed single cell methods may uniquely capture particular features of potential gene-gene relationships, and caution should be taken when we interpret these results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2217-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6006753
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60067532018-06-26 Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data Chen, Shuonan Mar, Jessica C. BMC Bioinformatics Research Article BACKGROUND: A fundamental fact in biology states that genes do not operate in isolation, and yet, methods that infer regulatory networks for single cell gene expression data have been slow to emerge. With single cell sequencing methods now becoming accessible, general network inference algorithms that were initially developed for data collected from bulk samples may not be suitable for single cells. Meanwhile, although methods that are specific for single cell data are now emerging, whether they have improved performance over general methods is unknown. In this study, we evaluate the applicability of five general methods and three single cell methods for inferring gene regulatory networks from both experimental single cell gene expression data and in silico simulated data. RESULTS: Standard evaluation metrics using ROC curves and Precision-Recall curves against reference sets sourced from the literature demonstrated that most of the methods performed poorly when they were applied to either experimental single cell data, or simulated single cell data, which demonstrates their lack of performance for this task. Using default settings, network methods were applied to the same datasets. Comparisons of the learned networks highlighted the uniqueness of some predicted edges for each method. The fact that different methods infer networks that vary substantially reflects the underlying mathematical rationale and assumptions that distinguish network methods from each other. CONCLUSIONS: This study provides a comprehensive evaluation of network modeling algorithms applied to experimental single cell gene expression data and in silico simulated datasets where the network structure is known. Comparisons demonstrate that most of these assessed network methods are not able to predict network structures from single cell expression data accurately, even if they are specifically developed for single cell methods. Also, single cell methods, which usually depend on more elaborative algorithms, in general have less similarity to each other in the sets of edges detected. The results from this study emphasize the importance for developing more accurate optimized network modeling methods that are compatible for single cell data. Newly-developed single cell methods may uniquely capture particular features of potential gene-gene relationships, and caution should be taken when we interpret these results. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2217-z) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-19 /pmc/articles/PMC6006753/ /pubmed/29914350 http://dx.doi.org/10.1186/s12859-018-2217-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Chen, Shuonan
Mar, Jessica C.
Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
title Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
title_full Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
title_fullStr Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
title_full_unstemmed Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
title_short Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
title_sort evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6006753/
https://www.ncbi.nlm.nih.gov/pubmed/29914350
http://dx.doi.org/10.1186/s12859-018-2217-z
work_keys_str_mv AT chenshuonan evaluatingmethodsofinferringgeneregulatorynetworkshighlightstheirlackofperformanceforsinglecellgeneexpressiondata
AT marjessicac evaluatingmethodsofinferringgeneregulatorynetworkshighlightstheirlackofperformanceforsinglecellgeneexpressiondata