Objective and Comprehensive Evaluation of Bisulfite Short Read Mapping Tools
Background. Large-scale bisulfite treatment and short reads sequencing technology allow comprehensive estimation of methylation states of Cs in the genomes of different tissues, cell types, and developmental stages. Accurate characterization of DNA methylation is essential for understanding genotype...
Main authors: | Tran, Hong; Porter, Jacob; Sun, Ming-an; Xie, Hehuang; Zhang, Liqing |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Hindawi Publishing Corporation 2014 |
Subjects: | Research Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4009243/ https://www.ncbi.nlm.nih.gov/pubmed/24839440 http://dx.doi.org/10.1155/2014/472045 |
_version_ | 1782479733949202432 |
---|---|
author | Tran, Hong Porter, Jacob Sun, Ming-an Xie, Hehuang Zhang, Liqing |
author_facet | Tran, Hong Porter, Jacob Sun, Ming-an Xie, Hehuang Zhang, Liqing |
author_sort | Tran, Hong |
collection | PubMed |
description | Background. Large-scale bisulfite treatment and short-read sequencing technology allow comprehensive estimation of the methylation states of Cs in the genomes of different tissues, cell types, and developmental stages. Accurate characterization of DNA methylation is essential for understanding genotype-phenotype association, gene-environment interaction, diseases, and cancer. Aligning bisulfite short reads to a reference genome has been a challenging task. We compared five bisulfite short read mapping tools, BSMAP, Bismark, BS-Seeker, BiSS, and BRAT-BW, representing two classes of mapping algorithms (hash table and suffix/prefix tries). We examined their mapping efficiency (i.e., the percentage of reads that can be mapped to the genomes), usability, running time, and the effects of changing default parameter settings using both real and simulated reads. We also investigated how preprocessing data might affect mapping efficiency. Conclusion. Among the five programs compared, in terms of mapping efficiency, Bismark performs the best on the real data, followed by BiSS, BSMAP, and finally BRAT-BW and BS-Seeker with very similar performance. If CPU time is not a constraint, Bismark is a good choice of program for mapping bisulfite-treated short reads. Data quality greatly affects mapping efficiency. Although increasing the number of mismatches allowed can increase mapping efficiency, it not only significantly slows down the program but also risks increasing the number of false positives. Therefore, users should carefully set the related parameters depending on the quality of their sequencing data. |
format | Online Article Text |
id | pubmed-4009243 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-4009243 2014-05-18 Objective and Comprehensive Evaluation of Bisulfite Short Read Mapping Tools Tran, Hong Porter, Jacob Sun, Ming-an Xie, Hehuang Zhang, Liqing Adv Bioinformatics Research Article Hindawi Publishing Corporation 2014 2014-04-15 /pmc/articles/PMC4009243/ /pubmed/24839440 http://dx.doi.org/10.1155/2014/472045 Text en Copyright © 2014 Hong Tran et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Tran, Hong Porter, Jacob Sun, Ming-an Xie, Hehuang Zhang, Liqing Objective and Comprehensive Evaluation of Bisulfite Short Read Mapping Tools |
title | Objective and Comprehensive Evaluation of Bisulfite Short Read Mapping Tools |
title_full | Objective and Comprehensive Evaluation of Bisulfite Short Read Mapping Tools |
title_fullStr | Objective and Comprehensive Evaluation of Bisulfite Short Read Mapping Tools |
title_full_unstemmed | Objective and Comprehensive Evaluation of Bisulfite Short Read Mapping Tools |
title_short | Objective and Comprehensive Evaluation of Bisulfite Short Read Mapping Tools |
title_sort | objective and comprehensive evaluation of bisulfite short read mapping tools |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4009243/ https://www.ncbi.nlm.nih.gov/pubmed/24839440 http://dx.doi.org/10.1155/2014/472045 |