Assessing the impact of exact reads on reducing the error rate of read mapping
BACKGROUND: With the growing availability of high-quality genome sequences, accurate and efficient reference-based assembly methods are in strong demand. Many algorithms have been designed for mapping reads onto a genome sequence, each trying to enhance the accuracy of...
Main authors: | Salari, Farzaneh; Zare-Mirakabad, Fatemeh; Sadeghi, Mehdi; Rokni-Zadeh, Hassan |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2018 |
Subjects: | Methodology Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6220446/ https://www.ncbi.nlm.nih.gov/pubmed/30400807 http://dx.doi.org/10.1186/s12859-018-2432-7 |
_version_ | 1783368831577620480 |
---|---|
author | Salari, Farzaneh; Zare-Mirakabad, Fatemeh; Sadeghi, Mehdi; Rokni-Zadeh, Hassan |
author_facet | Salari, Farzaneh; Zare-Mirakabad, Fatemeh; Sadeghi, Mehdi; Rokni-Zadeh, Hassan |
author_sort | Salari, Farzaneh |
collection | PubMed |
description | BACKGROUND: With the growing availability of high-quality genome sequences, accurate and efficient reference-based assembly methods are in strong demand. Many algorithms have been designed for mapping reads onto a genome sequence, each trying to enhance the accuracy of the reconstructed genome. One challenge in this problem arises when reads align to multiple locations because of repetitive regions in the genome. RESULTS: In this paper, our goal is to decrease the error rate of rebuilt genomes by resolving multi-mapping reads. To achieve this, we reduce the search space for reads that align to the genome with mismatches, insertions or deletions, which decreases the probability of incorrect read mapping. We propose a pipeline divided into three steps: ExactMapping, InExactMapping, and MergingContigs, in which exact and inexact reads are aligned in two separate phases. We test our pipeline on simulated and real data sets using several read mappers. The results show that the two-step mapping of reads onto the contigs generated by a mapper such as Bowtie2, BWA or Yara is effective in reducing the error rate of the contigs. CONCLUSIONS: The assessment of our pipeline suggests that reducing the error rate of read mapping can not only improve genomes reconstructed by reference-based assembly in a reasonable running time, but can also improve genomes generated by de novo assembly. In fact, by decreasing the number of multi-mapping reads, our pipeline produces genomes comparable to those of MMR, a multi-mapping read resolution tool. Consequently, we introduce EIM as a post-processing step for genomes reconstructed by mappers. |
format | Online Article Text |
id | pubmed-6220446 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-6220446 2018-11-16 Assessing the impact of exact reads on reducing the error rate of read mapping Salari, Farzaneh Zare-Mirakabad, Fatemeh Sadeghi, Mehdi Rokni-Zadeh, Hassan BMC Bioinformatics Methodology Article BACKGROUND: With the growing availability of high-quality genome sequences, accurate and efficient reference-based assembly methods are in strong demand. Many algorithms have been designed for mapping reads onto a genome sequence, each trying to enhance the accuracy of the reconstructed genome. One challenge in this problem arises when reads align to multiple locations because of repetitive regions in the genome. RESULTS: In this paper, our goal is to decrease the error rate of rebuilt genomes by resolving multi-mapping reads. To achieve this, we reduce the search space for reads that align to the genome with mismatches, insertions or deletions, which decreases the probability of incorrect read mapping. We propose a pipeline divided into three steps: ExactMapping, InExactMapping, and MergingContigs, in which exact and inexact reads are aligned in two separate phases. We test our pipeline on simulated and real data sets using several read mappers. The results show that the two-step mapping of reads onto the contigs generated by a mapper such as Bowtie2, BWA or Yara is effective in reducing the error rate of the contigs. CONCLUSIONS: The assessment of our pipeline suggests that reducing the error rate of read mapping can not only improve genomes reconstructed by reference-based assembly in a reasonable running time, but can also improve genomes generated by de novo assembly. In fact, by decreasing the number of multi-mapping reads, our pipeline produces genomes comparable to those of MMR, a multi-mapping read resolution tool. Consequently, we introduce EIM as a post-processing step for genomes reconstructed by mappers. BioMed Central 2018-11-06 /pmc/articles/PMC6220446/ /pubmed/30400807 http://dx.doi.org/10.1186/s12859-018-2432-7 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Salari, Farzaneh Zare-Mirakabad, Fatemeh Sadeghi, Mehdi Rokni-Zadeh, Hassan Assessing the impact of exact reads on reducing the error rate of read mapping |
title | Assessing the impact of exact reads on reducing the error rate of read mapping |
title_full | Assessing the impact of exact reads on reducing the error rate of read mapping |
title_fullStr | Assessing the impact of exact reads on reducing the error rate of read mapping |
title_full_unstemmed | Assessing the impact of exact reads on reducing the error rate of read mapping |
title_short | Assessing the impact of exact reads on reducing the error rate of read mapping |
title_sort | assessing the impact of exact reads on reducing the error rate of read mapping |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6220446/ https://www.ncbi.nlm.nih.gov/pubmed/30400807 http://dx.doi.org/10.1186/s12859-018-2432-7 |
work_keys_str_mv | AT salarifarzaneh assessingtheimpactofexactreadsonreducingtheerrorrateofreadmapping AT zaremirakabadfatemeh assessingtheimpactofexactreadsonreducingtheerrorrateofreadmapping AT sadeghimehdi assessingtheimpactofexactreadsonreducingtheerrorrateofreadmapping AT roknizadehhassan assessingtheimpactofexactreadsonreducingtheerrorrateofreadmapping |
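
The description above outlines a two-phase idea: align exact reads first, then align inexact reads within a reduced search space. The following is a toy Python sketch of that idea only, not the authors' EIM implementation. All names (`find_all`, `hamming`, `two_phase_map`, `max_mismatches`) are hypothetical, the rule that inexact reads are only tried at windows overlapping regions already covered by exact reads is an assumption made for illustration, and only mismatches (no insertions or deletions) are handled. The MergingContigs step is not modeled.

```python
from typing import Dict, List, Tuple


def find_all(reference: str, read: str) -> List[int]:
    """Return every start position where `read` occurs exactly in `reference`."""
    positions = []
    start = reference.find(read)
    while start != -1:
        positions.append(start)
        start = reference.find(read, start + 1)
    return positions


def hamming(a: str, b: str) -> int:
    """Count mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def two_phase_map(
    reference: str, reads: List[str], max_mismatches: int = 1
) -> Tuple[Dict[str, List[int]], Dict[str, List[int]]]:
    """Toy two-phase mapping: exact reads first, then inexact reads.

    Assumption (not from the paper): the reduced search space for inexact
    reads is the set of windows overlapping positions covered by exact reads.
    """
    exact: Dict[str, List[int]] = {}
    pending: List[str] = []

    # Phase 1: reads that occur verbatim in the reference.
    for read in reads:
        hits = find_all(reference, read)
        if hits:
            exact[read] = hits
        else:
            pending.append(read)

    # Reference positions covered by at least one exact read.
    covered = set()
    for read, hits in exact.items():
        for hit in hits:
            covered.update(range(hit, hit + len(read)))

    # Phase 2: remaining reads, mismatches only, restricted search space.
    inexact: Dict[str, List[int]] = {}
    for read in pending:
        hits = []
        for start in range(len(reference) - len(read) + 1):
            window = range(start, start + len(read))
            if not any(pos in covered for pos in window):
                continue  # outside the reduced search space
            if hamming(read, reference[start:start + len(read)]) <= max_mismatches:
                hits.append(start)
        inexact[read] = hits

    return exact, inexact


if __name__ == "__main__":
    exact_hits, inexact_hits = two_phase_map(
        "ACGTACGTTTGACCA", ["ACGT", "TTGA", "ACCT"]
    )
    print("exact:", exact_hits)
    print("inexact:", inexact_hits)
```

In this toy input, the read `ACGT` occurs twice in the reference and `ACCT` matches several windows with one mismatch, which illustrates the multi-mapping problem the description refers to; a real pipeline would rely on mappers such as Bowtie2, BWA or Yara rather than brute-force scanning.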