Cargando…

A flexible network-based imputing-and-fusing approach towards the identification of cell types from single-cell RNA-seq data

BACKGROUND: Single-cell RNA sequencing (scRNA-seq) provides an effective tool to investigate the transcriptomic characteristics at the single-cell resolution. Due to the low amounts of transcripts in single cells and the technical biases in experiments, the raw scRNA-seq data usually includes large...

Descripción completa

Detalles Bibliográficos
Autores principales: Qi, Yang, Guo, Yang, Jiao, Huixin, Shang, Xuequn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7291547/
https://www.ncbi.nlm.nih.gov/pubmed/32527285
http://dx.doi.org/10.1186/s12859-020-03547-w
_version_ 1783545928036122624
author Qi, Yang
Guo, Yang
Jiao, Huixin
Shang, Xuequn
author_facet Qi, Yang
Guo, Yang
Jiao, Huixin
Shang, Xuequn
author_sort Qi, Yang
collection PubMed
description BACKGROUND: Single-cell RNA sequencing (scRNA-seq) provides an effective tool to investigate the transcriptomic characteristics at the single-cell resolution. Due to the low amounts of transcripts in single cells and the technical biases in experiments, the raw scRNA-seq data usually includes large noise and makes the downstream analyses complicated. Although many methods have been proposed to impute the noisy scRNA-seq data in recent years, few of them take into account the prior associations across genes in imputation and integrate multiple types of imputation data to identify cell types. RESULTS: We present a new framework, NetImpute, towards the identification of cell types from scRNA-seq data by integrating multiple types of biological networks. We employ a statistic method to detect the noise data items in scRNA-seq data and develop a new imputation model to estimate the real values of data noise by integrating the PPI network and gene pathways. Meanwhile, based on the data imputed by multiple types of biological networks, we propose an integrated approach to identify cell types from scRNA-seq data. Comprehensive experiments demonstrate that the proposed network-based imputation model can estimate the real values of noise data items accurately and integrating the imputation data based on multiple types of biological networks can improve the identification of cell types from scRNA-seq data. CONCLUSIONS: Incorporating the prior gene associations in biological networks can potentially help to improve the imputation of noisy scRNA-seq data and integrating multiple types of network-based imputation data can enhance the identification of cell types. The proposed NetImpute provides an open framework for incorporating multiple types of biological network data to identify cell types from scRNA-seq data.
format Online
Article
Text
id pubmed-7291547
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72915472020-06-12 A flexible network-based imputing-and-fusing approach towards the identification of cell types from single-cell RNA-seq data Qi, Yang Guo, Yang Jiao, Huixin Shang, Xuequn BMC Bioinformatics Methodology Article BACKGROUND: Single-cell RNA sequencing (scRNA-seq) provides an effective tool to investigate the transcriptomic characteristics at the single-cell resolution. Due to the low amounts of transcripts in single cells and the technical biases in experiments, the raw scRNA-seq data usually includes large noise and makes the downstream analyses complicated. Although many methods have been proposed to impute the noisy scRNA-seq data in recent years, few of them take into account the prior associations across genes in imputation and integrate multiple types of imputation data to identify cell types. RESULTS: We present a new framework, NetImpute, towards the identification of cell types from scRNA-seq data by integrating multiple types of biological networks. We employ a statistic method to detect the noise data items in scRNA-seq data and develop a new imputation model to estimate the real values of data noise by integrating the PPI network and gene pathways. Meanwhile, based on the data imputed by multiple types of biological networks, we propose an integrated approach to identify cell types from scRNA-seq data. Comprehensive experiments demonstrate that the proposed network-based imputation model can estimate the real values of noise data items accurately and integrating the imputation data based on multiple types of biological networks can improve the identification of cell types from scRNA-seq data. CONCLUSIONS: Incorporating the prior gene associations in biological networks can potentially help to improve the imputation of noisy scRNA-seq data and integrating multiple types of network-based imputation data can enhance the identification of cell types. The proposed NetImpute provides an open framework for incorporating multiple types of biological network data to identify cell types from scRNA-seq data. BioMed Central 2020-06-11 /pmc/articles/PMC7291547/ /pubmed/32527285 http://dx.doi.org/10.1186/s12859-020-03547-w Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Qi, Yang
Guo, Yang
Jiao, Huixin
Shang, Xuequn
A flexible network-based imputing-and-fusing approach towards the identification of cell types from single-cell RNA-seq data
title A flexible network-based imputing-and-fusing approach towards the identification of cell types from single-cell RNA-seq data
title_full A flexible network-based imputing-and-fusing approach towards the identification of cell types from single-cell RNA-seq data
title_fullStr A flexible network-based imputing-and-fusing approach towards the identification of cell types from single-cell RNA-seq data
title_full_unstemmed A flexible network-based imputing-and-fusing approach towards the identification of cell types from single-cell RNA-seq data
title_short A flexible network-based imputing-and-fusing approach towards the identification of cell types from single-cell RNA-seq data
title_sort flexible network-based imputing-and-fusing approach towards the identification of cell types from single-cell rna-seq data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7291547/
https://www.ncbi.nlm.nih.gov/pubmed/32527285
http://dx.doi.org/10.1186/s12859-020-03547-w
work_keys_str_mv AT qiyang aflexiblenetworkbasedimputingandfusingapproachtowardstheidentificationofcelltypesfromsinglecellrnaseqdata
AT guoyang aflexiblenetworkbasedimputingandfusingapproachtowardstheidentificationofcelltypesfromsinglecellrnaseqdata
AT jiaohuixin aflexiblenetworkbasedimputingandfusingapproachtowardstheidentificationofcelltypesfromsinglecellrnaseqdata
AT shangxuequn aflexiblenetworkbasedimputingandfusingapproachtowardstheidentificationofcelltypesfromsinglecellrnaseqdata
AT qiyang flexiblenetworkbasedimputingandfusingapproachtowardstheidentificationofcelltypesfromsinglecellrnaseqdata
AT guoyang flexiblenetworkbasedimputingandfusingapproachtowardstheidentificationofcelltypesfromsinglecellrnaseqdata
AT jiaohuixin flexiblenetworkbasedimputingandfusingapproachtowardstheidentificationofcelltypesfromsinglecellrnaseqdata
AT shangxuequn flexiblenetworkbasedimputingandfusingapproachtowardstheidentificationofcelltypesfromsinglecellrnaseqdata