Cargando…

Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data

Single-cell RNA-sequencing (scRNA-seq) offers unparalleled insight into the transcriptional programs of different cellular states by measuring the transcriptome of thousands of individual cells. An emerging problem in the analysis of scRNA-seq is the inference of transcriptional gene regulatory netw...

Descripción completa

Detalles Bibliográficos
Autores principales: McCalla, Sunnie Grace, Fotuhi Siahpirani, Alireza, Li, Jiaxin, Pyne, Saptarshi, Stone, Matthew, Periyasamy, Viswesh, Shin, Junha, Roy, Sushmita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9997554/
https://www.ncbi.nlm.nih.gov/pubmed/36626328
http://dx.doi.org/10.1093/g3journal/jkad004
_version_ 1784903277916192768
author McCalla, Sunnie Grace
Fotuhi Siahpirani, Alireza
Li, Jiaxin
Pyne, Saptarshi
Stone, Matthew
Periyasamy, Viswesh
Shin, Junha
Roy, Sushmita
author_facet McCalla, Sunnie Grace
Fotuhi Siahpirani, Alireza
Li, Jiaxin
Pyne, Saptarshi
Stone, Matthew
Periyasamy, Viswesh
Shin, Junha
Roy, Sushmita
author_sort McCalla, Sunnie Grace
collection PubMed
description Single-cell RNA-sequencing (scRNA-seq) offers unparalleled insight into the transcriptional programs of different cellular states by measuring the transcriptome of thousands of individual cells. An emerging problem in the analysis of scRNA-seq is the inference of transcriptional gene regulatory networks and a number of methods with different learning frameworks have been developed to address this problem. Here, we present an expanded benchmarking study of eleven recent network inference methods on seven published scRNA-seq datasets in human, mouse, and yeast considering different types of gold standard networks and evaluation metrics. We evaluate methods based on their computing requirements as well as on their ability to recover the network structure. We find that, while most methods have a modest recovery of experimentally derived interactions based on global metrics such as Area Under the Precision Recall curve, methods are able to capture targets of regulators that are relevant to the system under study. Among the top performing methods that use only expression were SCENIC, PIDC, MERLIN or Correlation. Addition of prior biological knowledge and the estimation of transcription factor activities resulted in the best overall performance with the Inferelator and MERLIN methods that use prior knowledge outperforming methods that use expression alone. We found that imputation for network inference did not improve network inference accuracy and could be detrimental. Comparisons of inferred networks for comparable bulk conditions showed that the networks inferred from scRNA-seq datasets are often better or at par with the networks inferred from bulk datasets. Our analysis should be beneficial in selecting methods for network inference. At the same time, this highlights the need for improved methods and better gold standards for regulatory network inference from scRNAseq datasets.
format Online
Article
Text
id pubmed-9997554
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99975542023-03-10 Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data McCalla, Sunnie Grace Fotuhi Siahpirani, Alireza Li, Jiaxin Pyne, Saptarshi Stone, Matthew Periyasamy, Viswesh Shin, Junha Roy, Sushmita G3 (Bethesda) Investigation Single-cell RNA-sequencing (scRNA-seq) offers unparalleled insight into the transcriptional programs of different cellular states by measuring the transcriptome of thousands of individual cells. An emerging problem in the analysis of scRNA-seq is the inference of transcriptional gene regulatory networks and a number of methods with different learning frameworks have been developed to address this problem. Here, we present an expanded benchmarking study of eleven recent network inference methods on seven published scRNA-seq datasets in human, mouse, and yeast considering different types of gold standard networks and evaluation metrics. We evaluate methods based on their computing requirements as well as on their ability to recover the network structure. We find that, while most methods have a modest recovery of experimentally derived interactions based on global metrics such as Area Under the Precision Recall curve, methods are able to capture targets of regulators that are relevant to the system under study. Among the top performing methods that use only expression were SCENIC, PIDC, MERLIN or Correlation. Addition of prior biological knowledge and the estimation of transcription factor activities resulted in the best overall performance with the Inferelator and MERLIN methods that use prior knowledge outperforming methods that use expression alone. We found that imputation for network inference did not improve network inference accuracy and could be detrimental. Comparisons of inferred networks for comparable bulk conditions showed that the networks inferred from scRNA-seq datasets are often better or at par with the networks inferred from bulk datasets. Our analysis should be beneficial in selecting methods for network inference. At the same time, this highlights the need for improved methods and better gold standards for regulatory network inference from scRNAseq datasets. Oxford University Press 2023-01-10 /pmc/articles/PMC9997554/ /pubmed/36626328 http://dx.doi.org/10.1093/g3journal/jkad004 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
McCalla, Sunnie Grace
Fotuhi Siahpirani, Alireza
Li, Jiaxin
Pyne, Saptarshi
Stone, Matthew
Periyasamy, Viswesh
Shin, Junha
Roy, Sushmita
Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data
title Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data
title_full Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data
title_fullStr Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data
title_full_unstemmed Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data
title_short Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data
title_sort identifying strengths and weaknesses of methods for computational network inference from single-cell rna-seq data
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9997554/
https://www.ncbi.nlm.nih.gov/pubmed/36626328
http://dx.doi.org/10.1093/g3journal/jkad004
work_keys_str_mv AT mccallasunniegrace identifyingstrengthsandweaknessesofmethodsforcomputationalnetworkinferencefromsinglecellrnaseqdata
AT fotuhisiahpiranialireza identifyingstrengthsandweaknessesofmethodsforcomputationalnetworkinferencefromsinglecellrnaseqdata
AT lijiaxin identifyingstrengthsandweaknessesofmethodsforcomputationalnetworkinferencefromsinglecellrnaseqdata
AT pynesaptarshi identifyingstrengthsandweaknessesofmethodsforcomputationalnetworkinferencefromsinglecellrnaseqdata
AT stonematthew identifyingstrengthsandweaknessesofmethodsforcomputationalnetworkinferencefromsinglecellrnaseqdata
AT periyasamyviswesh identifyingstrengthsandweaknessesofmethodsforcomputationalnetworkinferencefromsinglecellrnaseqdata
AT shinjunha identifyingstrengthsandweaknessesofmethodsforcomputationalnetworkinferencefromsinglecellrnaseqdata
AT roysushmita identifyingstrengthsandweaknessesofmethodsforcomputationalnetworkinferencefromsinglecellrnaseqdata