Cargando…

Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data

BACKGROUND: DNA methylation is an important epigenetic modification involved in many biological processes. Reduced representation bisulfite sequencing (RRBS) is a cost-effective method for studying DNA methylation at single base resolution. Although several tools are available for RRBS data processi...

Descripción completa

Detalles Bibliográficos
Autores principales: Baheti, Saurabh, Kanwar, Rahul, Goelzenleuchter, Meike, Kocher, Jean-Pierre A., Beutler, Andreas S., Sun, Zhifu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4769831/
https://www.ncbi.nlm.nih.gov/pubmed/26922377
http://dx.doi.org/10.1186/s12864-016-2494-8
_version_ 1782418164764639232
author Baheti, Saurabh
Kanwar, Rahul
Goelzenleuchter, Meike
Kocher, Jean-Pierre A.
Beutler, Andreas S.
Sun, Zhifu
author_facet Baheti, Saurabh
Kanwar, Rahul
Goelzenleuchter, Meike
Kocher, Jean-Pierre A.
Beutler, Andreas S.
Sun, Zhifu
author_sort Baheti, Saurabh
collection PubMed
description BACKGROUND: DNA methylation is an important epigenetic modification involved in many biological processes. Reduced representation bisulfite sequencing (RRBS) is a cost-effective method for studying DNA methylation at single base resolution. Although several tools are available for RRBS data processing and analysis, it is not clear which strategy performs the best and there has not been much attention to the contamination issue from artificial cytosines incorporated during the end repair step of library preparation. To address these issues, we describe a new method, Targeted Alignment and Artificial Cytosine Elimination for RRBS (TRACE-RRBS), which aligns bisulfite sequence reads to MSP1 digitally digested reference and specifically removes the end repair cytosines. We compared this approach on a simulated and a real dataset with 7 other RRBS analysis tools and Illumina 450 K microarray platform. RESULTS: TRACE-RRBS aligns sequence reads to a small fraction of the genome where RRBS protocol targets on and was demonstrated as the fastest, most sensitive and specific tool for the simulated dataset. For the real dataset, TRACE-RRBS took about the same time as RRBSMAP, a third to a sixth of time needed for BISMARK and NOVOALIGN. TRACE-RRBS aligned more reads uniquely than other tools and achieved the highest correlation with 450 k microarray data. The end repair artificial cytosine removal increased correlation between nearby CpGs and accuracy of methylation quantification. CONCLUSIONS: TRACE-RRBS is fast and more accurate tool for RRBS data analysis. It is freely available for academic use at http://bioinformaticstools.mayo.edu/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2494-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4769831
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47698312016-02-29 Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data Baheti, Saurabh Kanwar, Rahul Goelzenleuchter, Meike Kocher, Jean-Pierre A. Beutler, Andreas S. Sun, Zhifu BMC Genomics Methodology Article BACKGROUND: DNA methylation is an important epigenetic modification involved in many biological processes. Reduced representation bisulfite sequencing (RRBS) is a cost-effective method for studying DNA methylation at single base resolution. Although several tools are available for RRBS data processing and analysis, it is not clear which strategy performs the best and there has not been much attention to the contamination issue from artificial cytosines incorporated during the end repair step of library preparation. To address these issues, we describe a new method, Targeted Alignment and Artificial Cytosine Elimination for RRBS (TRACE-RRBS), which aligns bisulfite sequence reads to MSP1 digitally digested reference and specifically removes the end repair cytosines. We compared this approach on a simulated and a real dataset with 7 other RRBS analysis tools and Illumina 450 K microarray platform. RESULTS: TRACE-RRBS aligns sequence reads to a small fraction of the genome where RRBS protocol targets on and was demonstrated as the fastest, most sensitive and specific tool for the simulated dataset. For the real dataset, TRACE-RRBS took about the same time as RRBSMAP, a third to a sixth of time needed for BISMARK and NOVOALIGN. TRACE-RRBS aligned more reads uniquely than other tools and achieved the highest correlation with 450 k microarray data. The end repair artificial cytosine removal increased correlation between nearby CpGs and accuracy of methylation quantification. CONCLUSIONS: TRACE-RRBS is fast and more accurate tool for RRBS data analysis. It is freely available for academic use at http://bioinformaticstools.mayo.edu/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2494-8) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-27 /pmc/articles/PMC4769831/ /pubmed/26922377 http://dx.doi.org/10.1186/s12864-016-2494-8 Text en © Baheti et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Baheti, Saurabh
Kanwar, Rahul
Goelzenleuchter, Meike
Kocher, Jean-Pierre A.
Beutler, Andreas S.
Sun, Zhifu
Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data
title Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data
title_full Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data
title_fullStr Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data
title_full_unstemmed Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data
title_short Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data
title_sort targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4769831/
https://www.ncbi.nlm.nih.gov/pubmed/26922377
http://dx.doi.org/10.1186/s12864-016-2494-8
work_keys_str_mv AT bahetisaurabh targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata
AT kanwarrahul targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata
AT goelzenleuchtermeike targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata
AT kocherjeanpierrea targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata
AT beutlerandreass targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata
AT sunzhifu targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata