Cargando…

Comparative analysis of common alignment tools for single-cell RNA sequencing

BACKGROUND: With the rise of single-cell RNA sequencing new bioinformatic tools have been developed to handle specific demands, such as quantifying unique molecular identifiers and correcting cell barcodes. Here, we benchmarked several datasets with the most common alignment tools for single-cell RN...

Descripción completa

Detalles Bibliográficos
Autores principales: Brüning, Ralf Schulze, Tombor, Lukas, Schulz, Marcel H, Dimmeler, Stefanie, John, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8848315/
https://www.ncbi.nlm.nih.gov/pubmed/35084033
http://dx.doi.org/10.1093/gigascience/giac001
_version_ 1784652223310987264
author Brüning, Ralf Schulze
Tombor, Lukas
Schulz, Marcel H
Dimmeler, Stefanie
John, David
author_facet Brüning, Ralf Schulze
Tombor, Lukas
Schulz, Marcel H
Dimmeler, Stefanie
John, David
author_sort Brüning, Ralf Schulze
collection PubMed
description BACKGROUND: With the rise of single-cell RNA sequencing new bioinformatic tools have been developed to handle specific demands, such as quantifying unique molecular identifiers and correcting cell barcodes. Here, we benchmarked several datasets with the most common alignment tools for single-cell RNA sequencing data. We evaluated differences in the whitelisting, gene quantification, overall performance, and potential variations in clustering or detection of differentially expressed genes. We compared the tools Cell Ranger version 6, STARsolo, Kallisto, Alevin, and Alevin-fry on 3 published datasets for human and mouse, sequenced with different versions of the 10X sequencing protocol. RESULTS: Striking differences were observed in the overall runtime of the mappers. Besides that, Kallisto and Alevin showed variances in the number of valid cells and detected genes per cell. Kallisto reported the highest number of cells; however, we observed an overrepresentation of cells with low gene content and unknown cell type. Conversely, Alevin rarely reported such low-content cells. Further variations were detected in the set of expressed genes. While STARsolo, Cell Ranger 6, Alevin-fry, and Alevin produced similar gene sets, Kallisto detected additional genes from the Vmn and Olfr gene family, which are likely mapping artefacts. We also observed differences in the mitochondrial content of the resulting cells when comparing a prefiltered annotation set to the full annotation set that includes pseudogenes and other biotypes. CONCLUSION: Overall, this study provides a detailed comparison of common single-cell RNA sequencing mappers and shows their specific properties on 10X Genomics data.
format Online
Article
Text
id pubmed-8848315
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-88483152022-02-17 Comparative analysis of common alignment tools for single-cell RNA sequencing Brüning, Ralf Schulze Tombor, Lukas Schulz, Marcel H Dimmeler, Stefanie John, David Gigascience Research BACKGROUND: With the rise of single-cell RNA sequencing new bioinformatic tools have been developed to handle specific demands, such as quantifying unique molecular identifiers and correcting cell barcodes. Here, we benchmarked several datasets with the most common alignment tools for single-cell RNA sequencing data. We evaluated differences in the whitelisting, gene quantification, overall performance, and potential variations in clustering or detection of differentially expressed genes. We compared the tools Cell Ranger version 6, STARsolo, Kallisto, Alevin, and Alevin-fry on 3 published datasets for human and mouse, sequenced with different versions of the 10X sequencing protocol. RESULTS: Striking differences were observed in the overall runtime of the mappers. Besides that, Kallisto and Alevin showed variances in the number of valid cells and detected genes per cell. Kallisto reported the highest number of cells; however, we observed an overrepresentation of cells with low gene content and unknown cell type. Conversely, Alevin rarely reported such low-content cells. Further variations were detected in the set of expressed genes. While STARsolo, Cell Ranger 6, Alevin-fry, and Alevin produced similar gene sets, Kallisto detected additional genes from the Vmn and Olfr gene family, which are likely mapping artefacts. We also observed differences in the mitochondrial content of the resulting cells when comparing a prefiltered annotation set to the full annotation set that includes pseudogenes and other biotypes. CONCLUSION: Overall, this study provides a detailed comparison of common single-cell RNA sequencing mappers and shows their specific properties on 10X Genomics data. Oxford University Press 2022-01-27 /pmc/articles/PMC8848315/ /pubmed/35084033 http://dx.doi.org/10.1093/gigascience/giac001 Text en © The Author(s) 2022. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Brüning, Ralf Schulze
Tombor, Lukas
Schulz, Marcel H
Dimmeler, Stefanie
John, David
Comparative analysis of common alignment tools for single-cell RNA sequencing
title Comparative analysis of common alignment tools for single-cell RNA sequencing
title_full Comparative analysis of common alignment tools for single-cell RNA sequencing
title_fullStr Comparative analysis of common alignment tools for single-cell RNA sequencing
title_full_unstemmed Comparative analysis of common alignment tools for single-cell RNA sequencing
title_short Comparative analysis of common alignment tools for single-cell RNA sequencing
title_sort comparative analysis of common alignment tools for single-cell rna sequencing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8848315/
https://www.ncbi.nlm.nih.gov/pubmed/35084033
http://dx.doi.org/10.1093/gigascience/giac001
work_keys_str_mv AT bruningralfschulze comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing
AT tomborlukas comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing
AT schulzmarcelh comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing
AT dimmelerstefanie comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing
AT johndavid comparativeanalysisofcommonalignmenttoolsforsinglecellrnasequencing