Cargando…

IKAP—Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis

BACKGROUND: In single-cell RNA-sequencing analysis, clustering cells into groups and differentiating cell groups by differentially expressed (DE) genes are 2 separate steps for investigating cell identity. However, the ability to differentiate between cell groups could be affected by clustering. Thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Yun-Ching, Suresh, Abhilash, Underbayev, Chingiz, Sun, Clare, Singh, Komudi, Seifuddin, Fayaz, Wiestner, Adrian, Pirooznia, Mehdi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6771546/
https://www.ncbi.nlm.nih.gov/pubmed/31574155
http://dx.doi.org/10.1093/gigascience/giz121
_version_ 1783455710924767232
author Chen, Yun-Ching
Suresh, Abhilash
Underbayev, Chingiz
Sun, Clare
Singh, Komudi
Seifuddin, Fayaz
Wiestner, Adrian
Pirooznia, Mehdi
author_facet Chen, Yun-Ching
Suresh, Abhilash
Underbayev, Chingiz
Sun, Clare
Singh, Komudi
Seifuddin, Fayaz
Wiestner, Adrian
Pirooznia, Mehdi
author_sort Chen, Yun-Ching
collection PubMed
description BACKGROUND: In single-cell RNA-sequencing analysis, clustering cells into groups and differentiating cell groups by differentially expressed (DE) genes are 2 separate steps for investigating cell identity. However, the ability to differentiate between cell groups could be affected by clustering. This interdependency often creates a bottleneck in the analysis pipeline, requiring researchers to repeat these 2 steps multiple times by setting different clustering parameters to identify a set of cell groups that are more differentiated and biologically relevant. FINDINGS: To accelerate this process, we have developed IKAP—an algorithm to identify major cell groups and improve differentiating cell groups by systematically tuning parameters for clustering. We demonstrate that, with default parameters, IKAP successfully identifies major cell types such as T cells, B cells, natural killer cells, and monocytes in 2 peripheral blood mononuclear cell datasets and recovers major cell types in a previously published mouse cortex dataset. These major cell groups identified by IKAP present more distinguishing DE genes compared with cell groups generated by different combinations of clustering parameters. We further show that cell subtypes can be identified by recursively applying IKAP within identified major cell types, thereby delineating cell identities in a multi-layered ontology. CONCLUSIONS: By tuning the clustering parameters to identify major cell groups, IKAP greatly improves the automation of single-cell RNA-sequencing analysis to produce distinguishing DE genes and refine cell ontology using single-cell RNA-sequencing data.
format Online
Article
Text
id pubmed-6771546
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-67715462019-10-07 IKAP—Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis Chen, Yun-Ching Suresh, Abhilash Underbayev, Chingiz Sun, Clare Singh, Komudi Seifuddin, Fayaz Wiestner, Adrian Pirooznia, Mehdi Gigascience Technical Note BACKGROUND: In single-cell RNA-sequencing analysis, clustering cells into groups and differentiating cell groups by differentially expressed (DE) genes are 2 separate steps for investigating cell identity. However, the ability to differentiate between cell groups could be affected by clustering. This interdependency often creates a bottleneck in the analysis pipeline, requiring researchers to repeat these 2 steps multiple times by setting different clustering parameters to identify a set of cell groups that are more differentiated and biologically relevant. FINDINGS: To accelerate this process, we have developed IKAP—an algorithm to identify major cell groups and improve differentiating cell groups by systematically tuning parameters for clustering. We demonstrate that, with default parameters, IKAP successfully identifies major cell types such as T cells, B cells, natural killer cells, and monocytes in 2 peripheral blood mononuclear cell datasets and recovers major cell types in a previously published mouse cortex dataset. These major cell groups identified by IKAP present more distinguishing DE genes compared with cell groups generated by different combinations of clustering parameters. We further show that cell subtypes can be identified by recursively applying IKAP within identified major cell types, thereby delineating cell identities in a multi-layered ontology. CONCLUSIONS: By tuning the clustering parameters to identify major cell groups, IKAP greatly improves the automation of single-cell RNA-sequencing analysis to produce distinguishing DE genes and refine cell ontology using single-cell RNA-sequencing data. Oxford University Press 2019-10-01 /pmc/articles/PMC6771546/ /pubmed/31574155 http://dx.doi.org/10.1093/gigascience/giz121 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Chen, Yun-Ching
Suresh, Abhilash
Underbayev, Chingiz
Sun, Clare
Singh, Komudi
Seifuddin, Fayaz
Wiestner, Adrian
Pirooznia, Mehdi
IKAP—Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis
title IKAP—Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis
title_full IKAP—Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis
title_fullStr IKAP—Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis
title_full_unstemmed IKAP—Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis
title_short IKAP—Identifying K mAjor cell Population groups in single-cell RNA-sequencing analysis
title_sort ikap—identifying k major cell population groups in single-cell rna-sequencing analysis
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6771546/
https://www.ncbi.nlm.nih.gov/pubmed/31574155
http://dx.doi.org/10.1093/gigascience/giz121
work_keys_str_mv AT chenyunching ikapidentifyingkmajorcellpopulationgroupsinsinglecellrnasequencinganalysis
AT sureshabhilash ikapidentifyingkmajorcellpopulationgroupsinsinglecellrnasequencinganalysis
AT underbayevchingiz ikapidentifyingkmajorcellpopulationgroupsinsinglecellrnasequencinganalysis
AT sunclare ikapidentifyingkmajorcellpopulationgroupsinsinglecellrnasequencinganalysis
AT singhkomudi ikapidentifyingkmajorcellpopulationgroupsinsinglecellrnasequencinganalysis
AT seifuddinfayaz ikapidentifyingkmajorcellpopulationgroupsinsinglecellrnasequencinganalysis
AT wiestneradrian ikapidentifyingkmajorcellpopulationgroupsinsinglecellrnasequencinganalysis
AT piroozniamehdi ikapidentifyingkmajorcellpopulationgroupsinsinglecellrnasequencinganalysis