Cargando…

A novel method for single-cell data imputation using subspace regression

Recent advances in biochemistry and single-cell RNA sequencing (scRNA-seq) have allowed us to monitor the biological systems at the single-cell resolution. However, the low capture of mRNA material within individual cells often leads to inaccurate quantification of genetic material. Consequently, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Tran, Duc, Tran, Bang, Nguyen, Hung, Nguyen, Tin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8854597/
https://www.ncbi.nlm.nih.gov/pubmed/35177662
http://dx.doi.org/10.1038/s41598-022-06500-4
_version_ 1784653465242304512
author Tran, Duc
Tran, Bang
Nguyen, Hung
Nguyen, Tin
author_facet Tran, Duc
Tran, Bang
Nguyen, Hung
Nguyen, Tin
author_sort Tran, Duc
collection PubMed
description Recent advances in biochemistry and single-cell RNA sequencing (scRNA-seq) have allowed us to monitor the biological systems at the single-cell resolution. However, the low capture of mRNA material within individual cells often leads to inaccurate quantification of genetic material. Consequently, a significant amount of expression values are reported as missing, which are often referred to as dropouts. To overcome this challenge, we develop a novel imputation method, named single-cell Imputation via Subspace Regression (scISR), that can reliably recover the dropout values of scRNA-seq data. The scISR method first uses a hypothesis-testing technique to identify zero-valued entries that are most likely affected by dropout events and then estimates the dropout values using a subspace regression model. Our comprehensive evaluation using 25 publicly available scRNA-seq datasets and various simulation scenarios against five state-of-the-art methods demonstrates that scISR is better than other imputation methods in recovering scRNA-seq expression profiles via imputation. scISR consistently improves the quality of cluster analysis regardless of dropout rates, normalization techniques, and quantification schemes. The source code of scISR can be found on GitHub at https://github.com/duct317/scISR.
format Online
Article
Text
id pubmed-8854597
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-88545972022-02-18 A novel method for single-cell data imputation using subspace regression Tran, Duc Tran, Bang Nguyen, Hung Nguyen, Tin Sci Rep Article Recent advances in biochemistry and single-cell RNA sequencing (scRNA-seq) have allowed us to monitor the biological systems at the single-cell resolution. However, the low capture of mRNA material within individual cells often leads to inaccurate quantification of genetic material. Consequently, a significant amount of expression values are reported as missing, which are often referred to as dropouts. To overcome this challenge, we develop a novel imputation method, named single-cell Imputation via Subspace Regression (scISR), that can reliably recover the dropout values of scRNA-seq data. The scISR method first uses a hypothesis-testing technique to identify zero-valued entries that are most likely affected by dropout events and then estimates the dropout values using a subspace regression model. Our comprehensive evaluation using 25 publicly available scRNA-seq datasets and various simulation scenarios against five state-of-the-art methods demonstrates that scISR is better than other imputation methods in recovering scRNA-seq expression profiles via imputation. scISR consistently improves the quality of cluster analysis regardless of dropout rates, normalization techniques, and quantification schemes. The source code of scISR can be found on GitHub at https://github.com/duct317/scISR. Nature Publishing Group UK 2022-02-17 /pmc/articles/PMC8854597/ /pubmed/35177662 http://dx.doi.org/10.1038/s41598-022-06500-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Tran, Duc
Tran, Bang
Nguyen, Hung
Nguyen, Tin
A novel method for single-cell data imputation using subspace regression
title A novel method for single-cell data imputation using subspace regression
title_full A novel method for single-cell data imputation using subspace regression
title_fullStr A novel method for single-cell data imputation using subspace regression
title_full_unstemmed A novel method for single-cell data imputation using subspace regression
title_short A novel method for single-cell data imputation using subspace regression
title_sort novel method for single-cell data imputation using subspace regression
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8854597/
https://www.ncbi.nlm.nih.gov/pubmed/35177662
http://dx.doi.org/10.1038/s41598-022-06500-4
work_keys_str_mv AT tranduc anovelmethodforsinglecelldataimputationusingsubspaceregression
AT tranbang anovelmethodforsinglecelldataimputationusingsubspaceregression
AT nguyenhung anovelmethodforsinglecelldataimputationusingsubspaceregression
AT nguyentin anovelmethodforsinglecelldataimputationusingsubspaceregression
AT tranduc novelmethodforsinglecelldataimputationusingsubspaceregression
AT tranbang novelmethodforsinglecelldataimputationusingsubspaceregression
AT nguyenhung novelmethodforsinglecelldataimputationusingsubspaceregression
AT nguyentin novelmethodforsinglecelldataimputationusingsubspaceregression