Cargando…
DART: a fast and accurate RNA-seq mapper with a partitioning strategy
MOTIVATION: In recent years, the massively parallel cDNA sequencing (RNA-Seq) technologies have become a powerful tool to provide high resolution measurement of expression and high sensitivity in detecting low abundance transcripts. However, RNA-seq data requires a huge amount of computational effor...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860201/ https://www.ncbi.nlm.nih.gov/pubmed/28968831 http://dx.doi.org/10.1093/bioinformatics/btx558 |
_version_ | 1783307959762157568 |
---|---|
author | Lin, Hsin-Nan Hsu, Wen-Lian |
author_facet | Lin, Hsin-Nan Hsu, Wen-Lian |
author_sort | Lin, Hsin-Nan |
collection | PubMed |
description | MOTIVATION: In recent years, the massively parallel cDNA sequencing (RNA-Seq) technologies have become a powerful tool to provide high resolution measurement of expression and high sensitivity in detecting low abundance transcripts. However, RNA-seq data requires a huge amount of computational efforts. The very fundamental and critical step is to align each sequence fragment against the reference genome. Various de novo spliced RNA aligners have been developed in recent years. Though these aligners can handle spliced alignment and detect splice junctions, some challenges still remain to be solved. With the advances in sequencing technologies and the ongoing collection of sequencing data in the ENCODE project, more efficient alignment algorithms are highly demanded. Most read mappers follow the conventional seed-and-extend strategy to deal with inexact matches for sequence alignment. However, the extension is much more time consuming than the seeding step. RESULTS: We proposed a novel RNA-seq de novo mapping algorithm, call DART, which adopts a partitioning strategy to avoid the extension step. The experiment results on synthetic datasets and real NGS datasets showed that DART is a highly efficient aligner that yields the highest or comparable sensitivity and accuracy compared to most state-of-the-art aligners, and more importantly, it spends the least amount of time among the selected aligners. AVAILABILITY AND IMPLEMENTATION: https://github.com/hsinnan75/DART SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5860201 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-58602012018-03-21 DART: a fast and accurate RNA-seq mapper with a partitioning strategy Lin, Hsin-Nan Hsu, Wen-Lian Bioinformatics Original Papers MOTIVATION: In recent years, the massively parallel cDNA sequencing (RNA-Seq) technologies have become a powerful tool to provide high resolution measurement of expression and high sensitivity in detecting low abundance transcripts. However, RNA-seq data requires a huge amount of computational efforts. The very fundamental and critical step is to align each sequence fragment against the reference genome. Various de novo spliced RNA aligners have been developed in recent years. Though these aligners can handle spliced alignment and detect splice junctions, some challenges still remain to be solved. With the advances in sequencing technologies and the ongoing collection of sequencing data in the ENCODE project, more efficient alignment algorithms are highly demanded. Most read mappers follow the conventional seed-and-extend strategy to deal with inexact matches for sequence alignment. However, the extension is much more time consuming than the seeding step. RESULTS: We proposed a novel RNA-seq de novo mapping algorithm, call DART, which adopts a partitioning strategy to avoid the extension step. The experiment results on synthetic datasets and real NGS datasets showed that DART is a highly efficient aligner that yields the highest or comparable sensitivity and accuracy compared to most state-of-the-art aligners, and more importantly, it spends the least amount of time among the selected aligners. AVAILABILITY AND IMPLEMENTATION: https://github.com/hsinnan75/DART SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-01-15 2017-09-05 /pmc/articles/PMC5860201/ /pubmed/28968831 http://dx.doi.org/10.1093/bioinformatics/btx558 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Lin, Hsin-Nan Hsu, Wen-Lian DART: a fast and accurate RNA-seq mapper with a partitioning strategy |
title | DART: a fast and accurate RNA-seq mapper with a partitioning strategy |
title_full | DART: a fast and accurate RNA-seq mapper with a partitioning strategy |
title_fullStr | DART: a fast and accurate RNA-seq mapper with a partitioning strategy |
title_full_unstemmed | DART: a fast and accurate RNA-seq mapper with a partitioning strategy |
title_short | DART: a fast and accurate RNA-seq mapper with a partitioning strategy |
title_sort | dart: a fast and accurate rna-seq mapper with a partitioning strategy |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860201/ https://www.ncbi.nlm.nih.gov/pubmed/28968831 http://dx.doi.org/10.1093/bioinformatics/btx558 |
work_keys_str_mv | AT linhsinnan dartafastandaccuraternaseqmapperwithapartitioningstrategy AT hsuwenlian dartafastandaccuraternaseqmapperwithapartitioningstrategy |