Cargando…
Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco
BACKGROUND: Read alignment and transcript assembly are the core of RNA-seq analysis for transcript isoform discovery. Nonetheless, current tools are not designed to be scalable for analysis of full-length bulk or single cell RNA-seq (scRNA-seq) data. The previous version of our cloud-based tool Falc...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6936136/ https://www.ncbi.nlm.nih.gov/pubmed/31888474 http://dx.doi.org/10.1186/s12864-019-6341-6 |
_version_ | 1783483690514382848 |
---|---|
author | Yang, Andrian Kishore, Abhinav Phipps, Benjamin Ho, Joshua W. K. |
author_facet | Yang, Andrian Kishore, Abhinav Phipps, Benjamin Ho, Joshua W. K. |
author_sort | Yang, Andrian |
collection | PubMed |
description | BACKGROUND: Read alignment and transcript assembly are the core of RNA-seq analysis for transcript isoform discovery. Nonetheless, current tools are not designed to be scalable for analysis of full-length bulk or single cell RNA-seq (scRNA-seq) data. The previous version of our cloud-based tool Falco only focuses on RNA-seq read counting, but does not allow for more flexible steps such as alignment and read assembly. RESULTS: The Falco framework can harness the parallel and distributed computing environment in modern cloud platforms to accelerate read alignment and transcript assembly of full-length bulk RNA-seq and scRNA-seq data. There are two new modes in Falco: alignment-only and transcript assembly. In the alignment-only mode, Falco can speed up the alignment process by 2.5–16.4x based on two public scRNA-seq datasets when compared to alignment on a highly optimised standalone computer. Furthermore, it also provides a 10x average speed-up compared to alignment using published cloud-enabled tool for read alignment, Rail-RNA. In the transcript assembly mode, Falco can speed up the transcript assembly process by 1.7–16.5x compared to performing transcript assembly on a highly optimised computer. CONCLUSION: Falco is a significantly updated open source big data processing framework that enables scalable and accelerated alignment and assembly of full-length scRNA-seq data on the cloud. The source code can be found at https://github.com/VCCRI/Falco. |
format | Online Article Text |
id | pubmed-6936136 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69361362019-12-31 Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco Yang, Andrian Kishore, Abhinav Phipps, Benjamin Ho, Joshua W. K. BMC Genomics Software BACKGROUND: Read alignment and transcript assembly are the core of RNA-seq analysis for transcript isoform discovery. Nonetheless, current tools are not designed to be scalable for analysis of full-length bulk or single cell RNA-seq (scRNA-seq) data. The previous version of our cloud-based tool Falco only focuses on RNA-seq read counting, but does not allow for more flexible steps such as alignment and read assembly. RESULTS: The Falco framework can harness the parallel and distributed computing environment in modern cloud platforms to accelerate read alignment and transcript assembly of full-length bulk RNA-seq and scRNA-seq data. There are two new modes in Falco: alignment-only and transcript assembly. In the alignment-only mode, Falco can speed up the alignment process by 2.5–16.4x based on two public scRNA-seq datasets when compared to alignment on a highly optimised standalone computer. Furthermore, it also provides a 10x average speed-up compared to alignment using published cloud-enabled tool for read alignment, Rail-RNA. In the transcript assembly mode, Falco can speed up the transcript assembly process by 1.7–16.5x compared to performing transcript assembly on a highly optimised computer. CONCLUSION: Falco is a significantly updated open source big data processing framework that enables scalable and accelerated alignment and assembly of full-length scRNA-seq data on the cloud. The source code can be found at https://github.com/VCCRI/Falco. BioMed Central 2019-12-30 /pmc/articles/PMC6936136/ /pubmed/31888474 http://dx.doi.org/10.1186/s12864-019-6341-6 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Yang, Andrian Kishore, Abhinav Phipps, Benjamin Ho, Joshua W. K. Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco |
title | Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco |
title_full | Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco |
title_fullStr | Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco |
title_full_unstemmed | Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco |
title_short | Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco |
title_sort | cloud accelerated alignment and assembly of full-length single-cell rna-seq data using falco |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6936136/ https://www.ncbi.nlm.nih.gov/pubmed/31888474 http://dx.doi.org/10.1186/s12864-019-6341-6 |
work_keys_str_mv | AT yangandrian cloudacceleratedalignmentandassemblyoffulllengthsinglecellrnaseqdatausingfalco AT kishoreabhinav cloudacceleratedalignmentandassemblyoffulllengthsinglecellrnaseqdatausingfalco AT phippsbenjamin cloudacceleratedalignmentandassemblyoffulllengthsinglecellrnaseqdatausingfalco AT hojoshuawk cloudacceleratedalignmentandassemblyoffulllengthsinglecellrnaseqdatausingfalco |