Cargando…

SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference

BACKGROUND: The identification of gene regulatory networks (GRNs) facilitates the understanding of the underlying molecular mechanism of various biological processes and complex diseases. With the availability of single-cell RNA sequencing data, it is essential to infer GRNs from single-cell express...

Descripción completa

Detalles Bibliográficos
Autores principales: Guan, Jinting, Wang, Yang, Wang, Yongjie, Zhuang, Yan, Ji, Guoli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710113/
https://www.ncbi.nlm.nih.gov/pubmed/36451086
http://dx.doi.org/10.1186/s12864-022-09020-7
_version_ 1784841299863535616
author Guan, Jinting
Wang, Yang
Wang, Yongjie
Zhuang, Yan
Ji, Guoli
author_facet Guan, Jinting
Wang, Yang
Wang, Yongjie
Zhuang, Yan
Ji, Guoli
author_sort Guan, Jinting
collection PubMed
description BACKGROUND: The identification of gene regulatory networks (GRNs) facilitates the understanding of the underlying molecular mechanism of various biological processes and complex diseases. With the availability of single-cell RNA sequencing data, it is essential to infer GRNs from single-cell expression. Although some GRN methods originally developed for bulk expression data can be applicable to single-cell data and several single-cell specific GRN algorithms were developed, recent benchmarking studies have emphasized the need of developing more accurate and robust GRN modeling methods that are compatible for single-cell expression data. RESULTS: We present SRGS, SPLS (sparse partial least squares)-based recursive gene selection, to infer GRNs from bulk or single-cell expression data. SRGS recursively selects and scores the genes which may have regulations on the considered target gene based on SPLS. When dealing with gene expression data with dropouts, we randomly scramble samples, set some values in the expression matrix to zeroes, and generate multiple copies of data through multiple iterations to make SRGS more robust. We test SRGS on different kinds of expression data, including simulated bulk data, simulated single-cell data without and with dropouts, and experimental single-cell data, and also compared with the existing GRN methods, including the ones originally developed for bulk data, the ones developed specifically for single-cell data, and even the ones recommended by recent benchmarking studies. CONCLUSIONS: It has been shown that SRGS is competitive with the existing GRN methods and effective in the gene regulatory network inference from bulk or single-cell gene expression data. SRGS is available at: https://github.com/JGuan-lab/SRGS. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-09020-7.
format Online
Article
Text
id pubmed-9710113
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-97101132022-12-01 SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference Guan, Jinting Wang, Yang Wang, Yongjie Zhuang, Yan Ji, Guoli BMC Genomics Software BACKGROUND: The identification of gene regulatory networks (GRNs) facilitates the understanding of the underlying molecular mechanism of various biological processes and complex diseases. With the availability of single-cell RNA sequencing data, it is essential to infer GRNs from single-cell expression. Although some GRN methods originally developed for bulk expression data can be applicable to single-cell data and several single-cell specific GRN algorithms were developed, recent benchmarking studies have emphasized the need of developing more accurate and robust GRN modeling methods that are compatible for single-cell expression data. RESULTS: We present SRGS, SPLS (sparse partial least squares)-based recursive gene selection, to infer GRNs from bulk or single-cell expression data. SRGS recursively selects and scores the genes which may have regulations on the considered target gene based on SPLS. When dealing with gene expression data with dropouts, we randomly scramble samples, set some values in the expression matrix to zeroes, and generate multiple copies of data through multiple iterations to make SRGS more robust. We test SRGS on different kinds of expression data, including simulated bulk data, simulated single-cell data without and with dropouts, and experimental single-cell data, and also compared with the existing GRN methods, including the ones originally developed for bulk data, the ones developed specifically for single-cell data, and even the ones recommended by recent benchmarking studies. CONCLUSIONS: It has been shown that SRGS is competitive with the existing GRN methods and effective in the gene regulatory network inference from bulk or single-cell gene expression data. SRGS is available at: https://github.com/JGuan-lab/SRGS. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-09020-7. BioMed Central 2022-11-30 /pmc/articles/PMC9710113/ /pubmed/36451086 http://dx.doi.org/10.1186/s12864-022-09020-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Guan, Jinting
Wang, Yang
Wang, Yongjie
Zhuang, Yan
Ji, Guoli
SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference
title SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference
title_full SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference
title_fullStr SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference
title_full_unstemmed SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference
title_short SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference
title_sort srgs: sparse partial least squares-based recursive gene selection for gene regulatory network inference
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710113/
https://www.ncbi.nlm.nih.gov/pubmed/36451086
http://dx.doi.org/10.1186/s12864-022-09020-7
work_keys_str_mv AT guanjinting srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference
AT wangyang srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference
AT wangyongjie srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference
AT zhuangyan srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference
AT jiguoli srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference