Cargando…
Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data
BACKGROUND: DNA methylation is an important epigenetic modification involved in many biological processes. Reduced representation bisulfite sequencing (RRBS) is a cost-effective method for studying DNA methylation at single base resolution. Although several tools are available for RRBS data processi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4769831/ https://www.ncbi.nlm.nih.gov/pubmed/26922377 http://dx.doi.org/10.1186/s12864-016-2494-8 |
_version_ | 1782418164764639232 |
---|---|
author | Baheti, Saurabh Kanwar, Rahul Goelzenleuchter, Meike Kocher, Jean-Pierre A. Beutler, Andreas S. Sun, Zhifu |
author_facet | Baheti, Saurabh Kanwar, Rahul Goelzenleuchter, Meike Kocher, Jean-Pierre A. Beutler, Andreas S. Sun, Zhifu |
author_sort | Baheti, Saurabh |
collection | PubMed |
description | BACKGROUND: DNA methylation is an important epigenetic modification involved in many biological processes. Reduced representation bisulfite sequencing (RRBS) is a cost-effective method for studying DNA methylation at single base resolution. Although several tools are available for RRBS data processing and analysis, it is not clear which strategy performs the best and there has not been much attention to the contamination issue from artificial cytosines incorporated during the end repair step of library preparation. To address these issues, we describe a new method, Targeted Alignment and Artificial Cytosine Elimination for RRBS (TRACE-RRBS), which aligns bisulfite sequence reads to MSP1 digitally digested reference and specifically removes the end repair cytosines. We compared this approach on a simulated and a real dataset with 7 other RRBS analysis tools and Illumina 450 K microarray platform. RESULTS: TRACE-RRBS aligns sequence reads to a small fraction of the genome where RRBS protocol targets on and was demonstrated as the fastest, most sensitive and specific tool for the simulated dataset. For the real dataset, TRACE-RRBS took about the same time as RRBSMAP, a third to a sixth of time needed for BISMARK and NOVOALIGN. TRACE-RRBS aligned more reads uniquely than other tools and achieved the highest correlation with 450 k microarray data. The end repair artificial cytosine removal increased correlation between nearby CpGs and accuracy of methylation quantification. CONCLUSIONS: TRACE-RRBS is fast and more accurate tool for RRBS data analysis. It is freely available for academic use at http://bioinformaticstools.mayo.edu/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2494-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4769831 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-47698312016-02-29 Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data Baheti, Saurabh Kanwar, Rahul Goelzenleuchter, Meike Kocher, Jean-Pierre A. Beutler, Andreas S. Sun, Zhifu BMC Genomics Methodology Article BACKGROUND: DNA methylation is an important epigenetic modification involved in many biological processes. Reduced representation bisulfite sequencing (RRBS) is a cost-effective method for studying DNA methylation at single base resolution. Although several tools are available for RRBS data processing and analysis, it is not clear which strategy performs the best and there has not been much attention to the contamination issue from artificial cytosines incorporated during the end repair step of library preparation. To address these issues, we describe a new method, Targeted Alignment and Artificial Cytosine Elimination for RRBS (TRACE-RRBS), which aligns bisulfite sequence reads to MSP1 digitally digested reference and specifically removes the end repair cytosines. We compared this approach on a simulated and a real dataset with 7 other RRBS analysis tools and Illumina 450 K microarray platform. RESULTS: TRACE-RRBS aligns sequence reads to a small fraction of the genome where RRBS protocol targets on and was demonstrated as the fastest, most sensitive and specific tool for the simulated dataset. For the real dataset, TRACE-RRBS took about the same time as RRBSMAP, a third to a sixth of time needed for BISMARK and NOVOALIGN. TRACE-RRBS aligned more reads uniquely than other tools and achieved the highest correlation with 450 k microarray data. The end repair artificial cytosine removal increased correlation between nearby CpGs and accuracy of methylation quantification. CONCLUSIONS: TRACE-RRBS is fast and more accurate tool for RRBS data analysis. It is freely available for academic use at http://bioinformaticstools.mayo.edu/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2494-8) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-27 /pmc/articles/PMC4769831/ /pubmed/26922377 http://dx.doi.org/10.1186/s12864-016-2494-8 Text en © Baheti et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Baheti, Saurabh Kanwar, Rahul Goelzenleuchter, Meike Kocher, Jean-Pierre A. Beutler, Andreas S. Sun, Zhifu Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data |
title | Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data |
title_full | Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data |
title_fullStr | Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data |
title_full_unstemmed | Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data |
title_short | Targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data |
title_sort | targeted alignment and end repair elimination increase alignment and methylation measure accuracy for reduced representation bisulfite sequencing data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4769831/ https://www.ncbi.nlm.nih.gov/pubmed/26922377 http://dx.doi.org/10.1186/s12864-016-2494-8 |
work_keys_str_mv | AT bahetisaurabh targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata AT kanwarrahul targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata AT goelzenleuchtermeike targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata AT kocherjeanpierrea targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata AT beutlerandreass targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata AT sunzhifu targetedalignmentandendrepaireliminationincreasealignmentandmethylationmeasureaccuracyforreducedrepresentationbisulfitesequencingdata |