Cargando…
SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference
BACKGROUND: The identification of gene regulatory networks (GRNs) facilitates the understanding of the underlying molecular mechanism of various biological processes and complex diseases. With the availability of single-cell RNA sequencing data, it is essential to infer GRNs from single-cell express...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710113/ https://www.ncbi.nlm.nih.gov/pubmed/36451086 http://dx.doi.org/10.1186/s12864-022-09020-7 |
_version_ | 1784841299863535616 |
---|---|
author | Guan, Jinting Wang, Yang Wang, Yongjie Zhuang, Yan Ji, Guoli |
author_facet | Guan, Jinting Wang, Yang Wang, Yongjie Zhuang, Yan Ji, Guoli |
author_sort | Guan, Jinting |
collection | PubMed |
description | BACKGROUND: The identification of gene regulatory networks (GRNs) facilitates the understanding of the underlying molecular mechanism of various biological processes and complex diseases. With the availability of single-cell RNA sequencing data, it is essential to infer GRNs from single-cell expression. Although some GRN methods originally developed for bulk expression data can be applicable to single-cell data and several single-cell specific GRN algorithms were developed, recent benchmarking studies have emphasized the need of developing more accurate and robust GRN modeling methods that are compatible for single-cell expression data. RESULTS: We present SRGS, SPLS (sparse partial least squares)-based recursive gene selection, to infer GRNs from bulk or single-cell expression data. SRGS recursively selects and scores the genes which may have regulations on the considered target gene based on SPLS. When dealing with gene expression data with dropouts, we randomly scramble samples, set some values in the expression matrix to zeroes, and generate multiple copies of data through multiple iterations to make SRGS more robust. We test SRGS on different kinds of expression data, including simulated bulk data, simulated single-cell data without and with dropouts, and experimental single-cell data, and also compared with the existing GRN methods, including the ones originally developed for bulk data, the ones developed specifically for single-cell data, and even the ones recommended by recent benchmarking studies. CONCLUSIONS: It has been shown that SRGS is competitive with the existing GRN methods and effective in the gene regulatory network inference from bulk or single-cell gene expression data. SRGS is available at: https://github.com/JGuan-lab/SRGS. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-09020-7. |
format | Online Article Text |
id | pubmed-9710113 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-97101132022-12-01 SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference Guan, Jinting Wang, Yang Wang, Yongjie Zhuang, Yan Ji, Guoli BMC Genomics Software BACKGROUND: The identification of gene regulatory networks (GRNs) facilitates the understanding of the underlying molecular mechanism of various biological processes and complex diseases. With the availability of single-cell RNA sequencing data, it is essential to infer GRNs from single-cell expression. Although some GRN methods originally developed for bulk expression data can be applicable to single-cell data and several single-cell specific GRN algorithms were developed, recent benchmarking studies have emphasized the need of developing more accurate and robust GRN modeling methods that are compatible for single-cell expression data. RESULTS: We present SRGS, SPLS (sparse partial least squares)-based recursive gene selection, to infer GRNs from bulk or single-cell expression data. SRGS recursively selects and scores the genes which may have regulations on the considered target gene based on SPLS. When dealing with gene expression data with dropouts, we randomly scramble samples, set some values in the expression matrix to zeroes, and generate multiple copies of data through multiple iterations to make SRGS more robust. We test SRGS on different kinds of expression data, including simulated bulk data, simulated single-cell data without and with dropouts, and experimental single-cell data, and also compared with the existing GRN methods, including the ones originally developed for bulk data, the ones developed specifically for single-cell data, and even the ones recommended by recent benchmarking studies. CONCLUSIONS: It has been shown that SRGS is competitive with the existing GRN methods and effective in the gene regulatory network inference from bulk or single-cell gene expression data. SRGS is available at: https://github.com/JGuan-lab/SRGS. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-09020-7. BioMed Central 2022-11-30 /pmc/articles/PMC9710113/ /pubmed/36451086 http://dx.doi.org/10.1186/s12864-022-09020-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Guan, Jinting Wang, Yang Wang, Yongjie Zhuang, Yan Ji, Guoli SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference |
title | SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference |
title_full | SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference |
title_fullStr | SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference |
title_full_unstemmed | SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference |
title_short | SRGS: sparse partial least squares-based recursive gene selection for gene regulatory network inference |
title_sort | srgs: sparse partial least squares-based recursive gene selection for gene regulatory network inference |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710113/ https://www.ncbi.nlm.nih.gov/pubmed/36451086 http://dx.doi.org/10.1186/s12864-022-09020-7 |
work_keys_str_mv | AT guanjinting srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference AT wangyang srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference AT wangyongjie srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference AT zhuangyan srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference AT jiguoli srgssparsepartialleastsquaresbasedrecursivegeneselectionforgeneregulatorynetworkinference |