Cargando…

Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment

Alignment of scRNA-Seq data are the first and one of the most critical steps of the scRNA-Seq analysis workflow, and thus the choice of proper aligners is of paramount importance. Recently, STAR an alignment method and Kallisto a pseudoalignment method have both gained a vast amount of popularity in...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Yuheng, Huang, Qianhui, Arisdakessian, Cedric, Garmire, Lana X.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7202009/
https://www.ncbi.nlm.nih.gov/pubmed/32220951
http://dx.doi.org/10.1534/g3.120.401160
_version_ 1783529654285500416
author Du, Yuheng
Huang, Qianhui
Arisdakessian, Cedric
Garmire, Lana X.
author_facet Du, Yuheng
Huang, Qianhui
Arisdakessian, Cedric
Garmire, Lana X.
author_sort Du, Yuheng
collection PubMed
description Alignment of scRNA-Seq data are the first and one of the most critical steps of the scRNA-Seq analysis workflow, and thus the choice of proper aligners is of paramount importance. Recently, STAR an alignment method and Kallisto a pseudoalignment method have both gained a vast amount of popularity in the single cell sequencing field. However, an unbiased third-party comparison of these two methods in scRNA-Seq is lacking. Here we conduct a systematic comparison of them on a variety of Drop-seq, Fluidigm and 10x genomics data, from the aspects of gene abundance, alignment accuracy, as well as computational speed and memory use. We observe that STAR globally produces more genes and higher gene-expression values, compared to Kallisto, as well as Bowtie2, another popular alignment method for bulk RNA-Seq. STAR also yields higher correlations of the Gini index for the genes with RNA-FISH validation results. Using 10x genomics PBMC 3K scRNA-Seq and mouse cortex single nuclei RNA-Seq data, STAR shows similar or better cell-type annotation results, by detecting a larger subset of known gene markers. However, the gain of accuracy and gene abundance of STAR alignment comes with the price of significantly slower computation time (4 folds) and more memory (7.7 folds), compared to Kallisto.
format Online
Article
Text
id pubmed-7202009
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-72020092020-05-09 Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment Du, Yuheng Huang, Qianhui Arisdakessian, Cedric Garmire, Lana X. G3 (Bethesda) Investigations Alignment of scRNA-Seq data are the first and one of the most critical steps of the scRNA-Seq analysis workflow, and thus the choice of proper aligners is of paramount importance. Recently, STAR an alignment method and Kallisto a pseudoalignment method have both gained a vast amount of popularity in the single cell sequencing field. However, an unbiased third-party comparison of these two methods in scRNA-Seq is lacking. Here we conduct a systematic comparison of them on a variety of Drop-seq, Fluidigm and 10x genomics data, from the aspects of gene abundance, alignment accuracy, as well as computational speed and memory use. We observe that STAR globally produces more genes and higher gene-expression values, compared to Kallisto, as well as Bowtie2, another popular alignment method for bulk RNA-Seq. STAR also yields higher correlations of the Gini index for the genes with RNA-FISH validation results. Using 10x genomics PBMC 3K scRNA-Seq and mouse cortex single nuclei RNA-Seq data, STAR shows similar or better cell-type annotation results, by detecting a larger subset of known gene markers. However, the gain of accuracy and gene abundance of STAR alignment comes with the price of significantly slower computation time (4 folds) and more memory (7.7 folds), compared to Kallisto. Genetics Society of America 2020-03-27 /pmc/articles/PMC7202009/ /pubmed/32220951 http://dx.doi.org/10.1534/g3.120.401160 Text en Copyright © 2020 Du et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Du, Yuheng
Huang, Qianhui
Arisdakessian, Cedric
Garmire, Lana X.
Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment
title Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment
title_full Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment
title_fullStr Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment
title_full_unstemmed Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment
title_short Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment
title_sort evaluation of star and kallisto on single cell rna-seq data alignment
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7202009/
https://www.ncbi.nlm.nih.gov/pubmed/32220951
http://dx.doi.org/10.1534/g3.120.401160
work_keys_str_mv AT duyuheng evaluationofstarandkallistoonsinglecellrnaseqdataalignment
AT huangqianhui evaluationofstarandkallistoonsinglecellrnaseqdataalignment
AT arisdakessiancedric evaluationofstarandkallistoonsinglecellrnaseqdataalignment
AT garmirelanax evaluationofstarandkallistoonsinglecellrnaseqdataalignment