Cargando…
Comparative analysis of common alignment tools for single-cell RNA sequencing
BACKGROUND: With the rise of single-cell RNA sequencing new bioinformatic tools have been developed to handle specific demands, such as quantifying unique molecular identifiers and correcting cell barcodes. Here, we benchmarked several datasets with the most common alignment tools for single-cell RN...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8848315/ https://www.ncbi.nlm.nih.gov/pubmed/35084033 http://dx.doi.org/10.1093/gigascience/giac001 |
_version_ | 1784652223310987264 |
---|---|
author | Brüning, Ralf Schulze Tombor, Lukas Schulz, Marcel H Dimmeler, Stefanie John, David |
author_facet | Brüning, Ralf Schulze Tombor, Lukas Schulz, Marcel H Dimmeler, Stefanie John, David |
author_sort | Brüning, Ralf Schulze |
collection | PubMed |
description | BACKGROUND: With the rise of single-cell RNA sequencing new bioinformatic tools have been developed to handle specific demands, such as quantifying unique molecular identifiers and correcting cell barcodes. Here, we benchmarked several datasets with the most common alignment tools for single-cell RNA sequencing data. We evaluated differences in the whitelisting, gene quantification, overall performance, and potential variations in clustering or detection of differentially expressed genes. We compared the tools Cell Ranger version 6, STARsolo, Kallisto, Alevin, and Alevin-fry on 3 published datasets for human and mouse, sequenced with different versions of the 10X sequencing protocol. RESULTS: Striking differences were observed in the overall runtime of the mappers. Besides that, Kallisto and Alevin showed variances in the number of valid cells and detected genes per cell. Kallisto reported the highest number of cells; however, we observed an overrepresentation of cells with low gene content and unknown cell type. Conversely, Alevin rarely reported such low-content cells. Further variations were detected in the set of expressed genes. While STARsolo, Cell Ranger 6, Alevin-fry, and Alevin produced similar gene sets, Kallisto detected additional genes from the Vmn and Olfr gene family, which are likely mapping artefacts. We also observed differences in the mitochondrial content of the resulting cells when comparing a prefiltered annotation set to the full annotation set that includes pseudogenes and other biotypes. CONCLUSION: Overall, this study provides a detailed comparison of common single-cell RNA sequencing mappers and shows their specific properties on 10X Genomics data. |
format | Online Article Text |
id | pubmed-8848315 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-88483152022-02-17 Comparative analysis of common alignment tools for single-cell RNA sequencing Brüning, Ralf Schulze Tombor, Lukas Schulz, Marcel H Dimmeler, Stefanie John, David Gigascience Research BACKGROUND: With the rise of single-cell RNA sequencing new bioinformatic tools have been developed to handle specific demands, such as quantifying unique molecular identifiers and correcting cell barcodes. Here, we benchmarked several datasets with the most common alignment tools for single-cell RNA sequencing data. We evaluated differences in the whitelisting, gene quantification, overall performance, and potential variations in clustering or detection of differentially expressed genes. We compared the tools Cell Ranger version 6, STARsolo, Kallisto, Alevin, and Alevin-fry on 3 published datasets for human and mouse, sequenced with different versions of the 10X sequencing protocol. RESULTS: Striking differences were observed in the overall runtime of the mappers. Besides that, Kallisto and Alevin showed variances in the number of valid cells and detected genes per cell. Kallisto reported the highest number of cells; however, we observed an overrepresentation of cells with low gene content and unknown cell type. Conversely, Alevin rarely reported such low-content cells. Further variations were detected in the set of expressed genes. While STARsolo, Cell Ranger 6, Alevin-fry, and Alevin produced similar gene sets, Kallisto detected additional genes from the Vmn and Olfr gene family, which are likely mapping artefacts. We also observed differences in the mitochondrial content of the resulting cells when comparing a prefiltered annotation set to the full annotation set that includes pseudogenes and other biotypes. CONCLUSION: Overall, this study provides a detailed comparison of common single-cell RNA sequencing mappers and shows their specific properties on 10X Genomics data. Oxford University Press 2022-01-27 /pmc/articles/PMC8848315/ /pubmed/35084033 http://dx.doi.org/10.1093/gigascience/giac001 Text en © The Author(s) 2022. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Brüning, Ralf Schulze Tombor, Lukas Schulz, Marcel H Dimmeler, Stefanie John, David Comparative analysis of common alignment tools for single-cell RNA sequencing |
title | Comparative analysis of common alignment tools for single-cell RNA sequencing |
title_full | Comparative analysis of common alignment tools for single-cell RNA sequencing |
title_fullStr | Comparative analysis of common alignment tools for single-cell RNA sequencing |
title_full_unstemmed | Comparative analysis of common alignment tools for single-cell RNA sequencing |
title_short | Comparative analysis of common alignment tools for single-cell RNA sequencing |
title_sort | comparative analysis of common alignment tools for single-cell rna sequencing |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8848315/ https://www.ncbi.nlm.nih.gov/pubmed/35084033 http://dx.doi.org/10.1093/gigascience/giac001 |
work_keys_str_mv | AT bruningralfschulze comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing AT tomborlukas comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing AT schulzmarcelh comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing AT dimmelerstefanie comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing AT johndavid comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing |