Cargando…

BS Seeker: precise mapping for bisulfite sequencing

BACKGROUND: Bisulfite sequencing using next generation sequencers yields genome-wide measurements of DNA methylation at single nucleotide resolution. Traditional aligners are not designed for mapping bisulfite-treated reads, where the unmethylated Cs are converted to Ts. We have developed BS Seeker,...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Pao-Yang, Cokus, Shawn J, Pellegrini, Matteo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2871274/
https://www.ncbi.nlm.nih.gov/pubmed/20416082
http://dx.doi.org/10.1186/1471-2105-11-203
_version_ 1782181166031306752
author Chen, Pao-Yang
Cokus, Shawn J
Pellegrini, Matteo
author_facet Chen, Pao-Yang
Cokus, Shawn J
Pellegrini, Matteo
author_sort Chen, Pao-Yang
collection PubMed
description BACKGROUND: Bisulfite sequencing using next generation sequencers yields genome-wide measurements of DNA methylation at single nucleotide resolution. Traditional aligners are not designed for mapping bisulfite-treated reads, where the unmethylated Cs are converted to Ts. We have developed BS Seeker, an approach that converts the genome to a three-letter alphabet and uses Bowtie to align bisulfite-treated reads to a reference genome. It uses sequence tags to reduce mapping ambiguity. Post-processing of the alignments removes non-unique and low-quality mappings. RESULTS: We tested our aligner on synthetic data, a bisulfite-converted Arabidopsis library, and human libraries generated from two different experimental protocols. We evaluated the performance of our approach and compared it to other bisulfite aligners. The results demonstrate that among the aligners tested, BS Seeker is more versatile and faster. When mapping to the human genome, BS Seeker generates alignments significantly faster than RMAP and BSMAP. Furthermore, BS Seeker is the only alignment tool that can explicitly account for tags which are generated by certain library construction protocols. CONCLUSIONS: BS Seeker provides fast and accurate mapping of bisulfite-converted reads. It can work with BS reads generated from the two different experimental protocols, and is able to efficiently map reads to large mammalian genomes. The Python program is freely available at http://pellegrini.mcdb.ucla.edu/BS_Seeker/BS_Seeker.html.
format Text
id pubmed-2871274
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28712742010-05-17 BS Seeker: precise mapping for bisulfite sequencing Chen, Pao-Yang Cokus, Shawn J Pellegrini, Matteo BMC Bioinformatics Software BACKGROUND: Bisulfite sequencing using next generation sequencers yields genome-wide measurements of DNA methylation at single nucleotide resolution. Traditional aligners are not designed for mapping bisulfite-treated reads, where the unmethylated Cs are converted to Ts. We have developed BS Seeker, an approach that converts the genome to a three-letter alphabet and uses Bowtie to align bisulfite-treated reads to a reference genome. It uses sequence tags to reduce mapping ambiguity. Post-processing of the alignments removes non-unique and low-quality mappings. RESULTS: We tested our aligner on synthetic data, a bisulfite-converted Arabidopsis library, and human libraries generated from two different experimental protocols. We evaluated the performance of our approach and compared it to other bisulfite aligners. The results demonstrate that among the aligners tested, BS Seeker is more versatile and faster. When mapping to the human genome, BS Seeker generates alignments significantly faster than RMAP and BSMAP. Furthermore, BS Seeker is the only alignment tool that can explicitly account for tags which are generated by certain library construction protocols. CONCLUSIONS: BS Seeker provides fast and accurate mapping of bisulfite-converted reads. It can work with BS reads generated from the two different experimental protocols, and is able to efficiently map reads to large mammalian genomes. The Python program is freely available at http://pellegrini.mcdb.ucla.edu/BS_Seeker/BS_Seeker.html. BioMed Central 2010-04-23 /pmc/articles/PMC2871274/ /pubmed/20416082 http://dx.doi.org/10.1186/1471-2105-11-203 Text en Copyright ©2010 Chen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Chen, Pao-Yang
Cokus, Shawn J
Pellegrini, Matteo
BS Seeker: precise mapping for bisulfite sequencing
title BS Seeker: precise mapping for bisulfite sequencing
title_full BS Seeker: precise mapping for bisulfite sequencing
title_fullStr BS Seeker: precise mapping for bisulfite sequencing
title_full_unstemmed BS Seeker: precise mapping for bisulfite sequencing
title_short BS Seeker: precise mapping for bisulfite sequencing
title_sort bs seeker: precise mapping for bisulfite sequencing
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2871274/
https://www.ncbi.nlm.nih.gov/pubmed/20416082
http://dx.doi.org/10.1186/1471-2105-11-203
work_keys_str_mv AT chenpaoyang bsseekerprecisemappingforbisulfitesequencing
AT cokusshawnj bsseekerprecisemappingforbisulfitesequencing
AT pellegrinimatteo bsseekerprecisemappingforbisulfitesequencing