Cargando…

DART: a fast and accurate RNA-seq mapper with a partitioning strategy

MOTIVATION: In recent years, the massively parallel cDNA sequencing (RNA-Seq) technologies have become a powerful tool to provide high resolution measurement of expression and high sensitivity in detecting low abundance transcripts. However, RNA-seq data requires a huge amount of computational effor...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Hsin-Nan, Hsu, Wen-Lian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860201/
https://www.ncbi.nlm.nih.gov/pubmed/28968831
http://dx.doi.org/10.1093/bioinformatics/btx558
_version_ 1783307959762157568
author Lin, Hsin-Nan
Hsu, Wen-Lian
author_facet Lin, Hsin-Nan
Hsu, Wen-Lian
author_sort Lin, Hsin-Nan
collection PubMed
description MOTIVATION: In recent years, the massively parallel cDNA sequencing (RNA-Seq) technologies have become a powerful tool to provide high resolution measurement of expression and high sensitivity in detecting low abundance transcripts. However, RNA-seq data requires a huge amount of computational efforts. The very fundamental and critical step is to align each sequence fragment against the reference genome. Various de novo spliced RNA aligners have been developed in recent years. Though these aligners can handle spliced alignment and detect splice junctions, some challenges still remain to be solved. With the advances in sequencing technologies and the ongoing collection of sequencing data in the ENCODE project, more efficient alignment algorithms are highly demanded. Most read mappers follow the conventional seed-and-extend strategy to deal with inexact matches for sequence alignment. However, the extension is much more time consuming than the seeding step. RESULTS: We proposed a novel RNA-seq de novo mapping algorithm, call DART, which adopts a partitioning strategy to avoid the extension step. The experiment results on synthetic datasets and real NGS datasets showed that DART is a highly efficient aligner that yields the highest or comparable sensitivity and accuracy compared to most state-of-the-art aligners, and more importantly, it spends the least amount of time among the selected aligners. AVAILABILITY AND IMPLEMENTATION: https://github.com/hsinnan75/DART SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5860201
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58602012018-03-21 DART: a fast and accurate RNA-seq mapper with a partitioning strategy Lin, Hsin-Nan Hsu, Wen-Lian Bioinformatics Original Papers MOTIVATION: In recent years, the massively parallel cDNA sequencing (RNA-Seq) technologies have become a powerful tool to provide high resolution measurement of expression and high sensitivity in detecting low abundance transcripts. However, RNA-seq data requires a huge amount of computational efforts. The very fundamental and critical step is to align each sequence fragment against the reference genome. Various de novo spliced RNA aligners have been developed in recent years. Though these aligners can handle spliced alignment and detect splice junctions, some challenges still remain to be solved. With the advances in sequencing technologies and the ongoing collection of sequencing data in the ENCODE project, more efficient alignment algorithms are highly demanded. Most read mappers follow the conventional seed-and-extend strategy to deal with inexact matches for sequence alignment. However, the extension is much more time consuming than the seeding step. RESULTS: We proposed a novel RNA-seq de novo mapping algorithm, call DART, which adopts a partitioning strategy to avoid the extension step. The experiment results on synthetic datasets and real NGS datasets showed that DART is a highly efficient aligner that yields the highest or comparable sensitivity and accuracy compared to most state-of-the-art aligners, and more importantly, it spends the least amount of time among the selected aligners. AVAILABILITY AND IMPLEMENTATION: https://github.com/hsinnan75/DART SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-01-15 2017-09-05 /pmc/articles/PMC5860201/ /pubmed/28968831 http://dx.doi.org/10.1093/bioinformatics/btx558 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Lin, Hsin-Nan
Hsu, Wen-Lian
DART: a fast and accurate RNA-seq mapper with a partitioning strategy
title DART: a fast and accurate RNA-seq mapper with a partitioning strategy
title_full DART: a fast and accurate RNA-seq mapper with a partitioning strategy
title_fullStr DART: a fast and accurate RNA-seq mapper with a partitioning strategy
title_full_unstemmed DART: a fast and accurate RNA-seq mapper with a partitioning strategy
title_short DART: a fast and accurate RNA-seq mapper with a partitioning strategy
title_sort dart: a fast and accurate rna-seq mapper with a partitioning strategy
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860201/
https://www.ncbi.nlm.nih.gov/pubmed/28968831
http://dx.doi.org/10.1093/bioinformatics/btx558
work_keys_str_mv AT linhsinnan dartafastandaccuraternaseqmapperwithapartitioningstrategy
AT hsuwenlian dartafastandaccuraternaseqmapperwithapartitioningstrategy