Cargando…
Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment
Alignment of scRNA-Seq data are the first and one of the most critical steps of the scRNA-Seq analysis workflow, and thus the choice of proper aligners is of paramount importance. Recently, STAR an alignment method and Kallisto a pseudoalignment method have both gained a vast amount of popularity in...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7202009/ https://www.ncbi.nlm.nih.gov/pubmed/32220951 http://dx.doi.org/10.1534/g3.120.401160 |
_version_ | 1783529654285500416 |
---|---|
author | Du, Yuheng Huang, Qianhui Arisdakessian, Cedric Garmire, Lana X. |
author_facet | Du, Yuheng Huang, Qianhui Arisdakessian, Cedric Garmire, Lana X. |
author_sort | Du, Yuheng |
collection | PubMed |
description | Alignment of scRNA-Seq data are the first and one of the most critical steps of the scRNA-Seq analysis workflow, and thus the choice of proper aligners is of paramount importance. Recently, STAR an alignment method and Kallisto a pseudoalignment method have both gained a vast amount of popularity in the single cell sequencing field. However, an unbiased third-party comparison of these two methods in scRNA-Seq is lacking. Here we conduct a systematic comparison of them on a variety of Drop-seq, Fluidigm and 10x genomics data, from the aspects of gene abundance, alignment accuracy, as well as computational speed and memory use. We observe that STAR globally produces more genes and higher gene-expression values, compared to Kallisto, as well as Bowtie2, another popular alignment method for bulk RNA-Seq. STAR also yields higher correlations of the Gini index for the genes with RNA-FISH validation results. Using 10x genomics PBMC 3K scRNA-Seq and mouse cortex single nuclei RNA-Seq data, STAR shows similar or better cell-type annotation results, by detecting a larger subset of known gene markers. However, the gain of accuracy and gene abundance of STAR alignment comes with the price of significantly slower computation time (4 folds) and more memory (7.7 folds), compared to Kallisto. |
format | Online Article Text |
id | pubmed-7202009 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-72020092020-05-09 Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment Du, Yuheng Huang, Qianhui Arisdakessian, Cedric Garmire, Lana X. G3 (Bethesda) Investigations Alignment of scRNA-Seq data are the first and one of the most critical steps of the scRNA-Seq analysis workflow, and thus the choice of proper aligners is of paramount importance. Recently, STAR an alignment method and Kallisto a pseudoalignment method have both gained a vast amount of popularity in the single cell sequencing field. However, an unbiased third-party comparison of these two methods in scRNA-Seq is lacking. Here we conduct a systematic comparison of them on a variety of Drop-seq, Fluidigm and 10x genomics data, from the aspects of gene abundance, alignment accuracy, as well as computational speed and memory use. We observe that STAR globally produces more genes and higher gene-expression values, compared to Kallisto, as well as Bowtie2, another popular alignment method for bulk RNA-Seq. STAR also yields higher correlations of the Gini index for the genes with RNA-FISH validation results. Using 10x genomics PBMC 3K scRNA-Seq and mouse cortex single nuclei RNA-Seq data, STAR shows similar or better cell-type annotation results, by detecting a larger subset of known gene markers. However, the gain of accuracy and gene abundance of STAR alignment comes with the price of significantly slower computation time (4 folds) and more memory (7.7 folds), compared to Kallisto. Genetics Society of America 2020-03-27 /pmc/articles/PMC7202009/ /pubmed/32220951 http://dx.doi.org/10.1534/g3.120.401160 Text en Copyright © 2020 Du et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigations Du, Yuheng Huang, Qianhui Arisdakessian, Cedric Garmire, Lana X. Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment |
title | Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment |
title_full | Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment |
title_fullStr | Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment |
title_full_unstemmed | Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment |
title_short | Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment |
title_sort | evaluation of star and kallisto on single cell rna-seq data alignment |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7202009/ https://www.ncbi.nlm.nih.gov/pubmed/32220951 http://dx.doi.org/10.1534/g3.120.401160 |
work_keys_str_mv | AT duyuheng evaluationofstarandkallistoonsinglecellrnaseqdataalignment AT huangqianhui evaluationofstarandkallistoonsinglecellrnaseqdataalignment AT arisdakessiancedric evaluationofstarandkallistoonsinglecellrnaseqdataalignment AT garmirelanax evaluationofstarandkallistoonsinglecellrnaseqdataalignment |