Cargando…

An approximate Bayesian approach for mapping paired-end DNA reads to a reference genome

Summary: Many high-throughput sequencing experiments produce paired DNA reads. Paired-end DNA reads provide extra positional information that is useful in reliable mapping of short reads to a reference genome, as well as in downstream analyses of structural variations. Given the importance of paired...

Descripción completa

Detalles Bibliográficos
Autores principales: Shrestha, Anish Man Singh, Frith, Martin C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3624798/
https://www.ncbi.nlm.nih.gov/pubmed/23413433
http://dx.doi.org/10.1093/bioinformatics/btt073
_version_ 1782266055937228800
author Shrestha, Anish Man Singh
Frith, Martin C.
author_facet Shrestha, Anish Man Singh
Frith, Martin C.
author_sort Shrestha, Anish Man Singh
collection PubMed
description Summary: Many high-throughput sequencing experiments produce paired DNA reads. Paired-end DNA reads provide extra positional information that is useful in reliable mapping of short reads to a reference genome, as well as in downstream analyses of structural variations. Given the importance of paired-end alignments, it is surprising that there have been no previous publications focusing on this topic. In this article, we present a new probabilistic framework to predict the alignment of paired-end reads to a reference genome. Using both simulated and real data, we compare the performance of our method with six other read-mapping tools that provide a paired-end option. We show that our method provides a good combination of accuracy, error rate and computation time, especially in more challenging and practical cases, such as when the reference genome is incomplete or unavailable for the sample, or when there are large variations between the reference genome and the source of the reads. An open-source implementation of our method is available as part of Last, a multi-purpose alignment program freely available at http://last.cbrc.jp. Contact: martin@cbrc.jp Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3624798
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36247982013-04-12 An approximate Bayesian approach for mapping paired-end DNA reads to a reference genome Shrestha, Anish Man Singh Frith, Martin C. Bioinformatics Original Papers Summary: Many high-throughput sequencing experiments produce paired DNA reads. Paired-end DNA reads provide extra positional information that is useful in reliable mapping of short reads to a reference genome, as well as in downstream analyses of structural variations. Given the importance of paired-end alignments, it is surprising that there have been no previous publications focusing on this topic. In this article, we present a new probabilistic framework to predict the alignment of paired-end reads to a reference genome. Using both simulated and real data, we compare the performance of our method with six other read-mapping tools that provide a paired-end option. We show that our method provides a good combination of accuracy, error rate and computation time, especially in more challenging and practical cases, such as when the reference genome is incomplete or unavailable for the sample, or when there are large variations between the reference genome and the source of the reads. An open-source implementation of our method is available as part of Last, a multi-purpose alignment program freely available at http://last.cbrc.jp. Contact: martin@cbrc.jp Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-04-15 2013-02-14 /pmc/articles/PMC3624798/ /pubmed/23413433 http://dx.doi.org/10.1093/bioinformatics/btt073 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Shrestha, Anish Man Singh
Frith, Martin C.
An approximate Bayesian approach for mapping paired-end DNA reads to a reference genome
title An approximate Bayesian approach for mapping paired-end DNA reads to a reference genome
title_full An approximate Bayesian approach for mapping paired-end DNA reads to a reference genome
title_fullStr An approximate Bayesian approach for mapping paired-end DNA reads to a reference genome
title_full_unstemmed An approximate Bayesian approach for mapping paired-end DNA reads to a reference genome
title_short An approximate Bayesian approach for mapping paired-end DNA reads to a reference genome
title_sort approximate bayesian approach for mapping paired-end dna reads to a reference genome
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3624798/
https://www.ncbi.nlm.nih.gov/pubmed/23413433
http://dx.doi.org/10.1093/bioinformatics/btt073
work_keys_str_mv AT shresthaanishmansingh anapproximatebayesianapproachformappingpairedenddnareadstoareferencegenome
AT frithmartinc anapproximatebayesianapproachformappingpairedenddnareadstoareferencegenome
AT shresthaanishmansingh approximatebayesianapproachformappingpairedenddnareadstoareferencegenome
AT frithmartinc approximatebayesianapproachformappingpairedenddnareadstoareferencegenome