Cargando…

An integrative approach for efficient analysis of whole genome bisulfite sequencing data

BACKGROUND: Whole genome bisulfite sequencing (WGBS) is a high-throughput technique for profiling genome-wide DNA methylation at single nucleotide resolution. However, the applications of WGBS are limited by low accuracy resulting from bisulfite-induced damage on DNA fragments. Although many compute...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Jong-Hun, Park, Sung-Joon, Kenta, Nakai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682396/
https://www.ncbi.nlm.nih.gov/pubmed/26680746
http://dx.doi.org/10.1186/1471-2164-16-S12-S14
_version_ 1782405881895321600
author Lee, Jong-Hun
Park, Sung-Joon
Kenta, Nakai
author_facet Lee, Jong-Hun
Park, Sung-Joon
Kenta, Nakai
author_sort Lee, Jong-Hun
collection PubMed
description BACKGROUND: Whole genome bisulfite sequencing (WGBS) is a high-throughput technique for profiling genome-wide DNA methylation at single nucleotide resolution. However, the applications of WGBS are limited by low accuracy resulting from bisulfite-induced damage on DNA fragments. Although many computer programs have been developed for accurate detecting, most of the programs have barely succeeded in improving either quantity or quality of the methylation results. To improve both, we attempted to develop a novel integration of most widely used bisulfite-read mappers: Bismark, BSMAP, and BS-seeker2. RESULTS: A comprehensive analysis of the three mappers revealed that the mapping results of the mappers were mutually complementary under diverse read conditions. Therefore, we sought to integrate the characteristics of the mappers by scoring them to gain robustness against artifacts. As a result, the integration significantly increased detection accuracy compared with the individual mappers. In addition, the amount of detected cytosine was higher than that by Bismark. Furthermore, the integration successfully reduced the fluctuation of detection accuracy induced by read conditions. We applied the integration to real WGBS samples and succeeded in classifying the samples according to the originated tissues by both CpG and CpH methylation patterns. CONCLUSIONS: In this study, we improved both quality and quantity of methylation results from WGBS data by integrating the mapping results of three bisulfite-read mappers. Also, we succeeded in combining and comparing WGBS samples by reducing the effects of read heterogeneity on methylation detection. This study contributes to DNA methylation researches by improving efficiency of methylation detection from WGBS data and facilitating the comprehensive analysis of public WGBS data.
format Online
Article
Text
id pubmed-4682396
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46823962015-12-21 An integrative approach for efficient analysis of whole genome bisulfite sequencing data Lee, Jong-Hun Park, Sung-Joon Kenta, Nakai BMC Genomics Research BACKGROUND: Whole genome bisulfite sequencing (WGBS) is a high-throughput technique for profiling genome-wide DNA methylation at single nucleotide resolution. However, the applications of WGBS are limited by low accuracy resulting from bisulfite-induced damage on DNA fragments. Although many computer programs have been developed for accurate detecting, most of the programs have barely succeeded in improving either quantity or quality of the methylation results. To improve both, we attempted to develop a novel integration of most widely used bisulfite-read mappers: Bismark, BSMAP, and BS-seeker2. RESULTS: A comprehensive analysis of the three mappers revealed that the mapping results of the mappers were mutually complementary under diverse read conditions. Therefore, we sought to integrate the characteristics of the mappers by scoring them to gain robustness against artifacts. As a result, the integration significantly increased detection accuracy compared with the individual mappers. In addition, the amount of detected cytosine was higher than that by Bismark. Furthermore, the integration successfully reduced the fluctuation of detection accuracy induced by read conditions. We applied the integration to real WGBS samples and succeeded in classifying the samples according to the originated tissues by both CpG and CpH methylation patterns. CONCLUSIONS: In this study, we improved both quality and quantity of methylation results from WGBS data by integrating the mapping results of three bisulfite-read mappers. Also, we succeeded in combining and comparing WGBS samples by reducing the effects of read heterogeneity on methylation detection. This study contributes to DNA methylation researches by improving efficiency of methylation detection from WGBS data and facilitating the comprehensive analysis of public WGBS data. BioMed Central 2015-12-09 /pmc/articles/PMC4682396/ /pubmed/26680746 http://dx.doi.org/10.1186/1471-2164-16-S12-S14 Text en Copyright © 2015 Lee et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lee, Jong-Hun
Park, Sung-Joon
Kenta, Nakai
An integrative approach for efficient analysis of whole genome bisulfite sequencing data
title An integrative approach for efficient analysis of whole genome bisulfite sequencing data
title_full An integrative approach for efficient analysis of whole genome bisulfite sequencing data
title_fullStr An integrative approach for efficient analysis of whole genome bisulfite sequencing data
title_full_unstemmed An integrative approach for efficient analysis of whole genome bisulfite sequencing data
title_short An integrative approach for efficient analysis of whole genome bisulfite sequencing data
title_sort integrative approach for efficient analysis of whole genome bisulfite sequencing data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682396/
https://www.ncbi.nlm.nih.gov/pubmed/26680746
http://dx.doi.org/10.1186/1471-2164-16-S12-S14
work_keys_str_mv AT leejonghun anintegrativeapproachforefficientanalysisofwholegenomebisulfitesequencingdata
AT parksungjoon anintegrativeapproachforefficientanalysisofwholegenomebisulfitesequencingdata
AT kentanakai anintegrativeapproachforefficientanalysisofwholegenomebisulfitesequencingdata
AT leejonghun integrativeapproachforefficientanalysisofwholegenomebisulfitesequencingdata
AT parksungjoon integrativeapproachforefficientanalysisofwholegenomebisulfitesequencingdata
AT kentanakai integrativeapproachforefficientanalysisofwholegenomebisulfitesequencingdata