Cargando…

FadE: whole genome methylation analysis for multiple sequencing platforms

DNA methylation plays a central role in genomic regulation and disease. Sodium bisulfite treatment (SBT) causes unmethylated cytosines to be sequenced as thymine, which allows methylation levels to reflected in the number of ‘C’-‘C’ alignments covering reference cytosines. Di-base color reads produc...

Descripción completa

Detalles Bibliográficos
Autores principales: Souaiaia, Tade, Zhang, Zheng, Chen, Ting
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592442/
https://www.ncbi.nlm.nih.gov/pubmed/22965123
http://dx.doi.org/10.1093/nar/gks830
_version_ 1782262116910104576
author Souaiaia, Tade
Zhang, Zheng
Chen, Ting
author_facet Souaiaia, Tade
Zhang, Zheng
Chen, Ting
author_sort Souaiaia, Tade
collection PubMed
description DNA methylation plays a central role in genomic regulation and disease. Sodium bisulfite treatment (SBT) causes unmethylated cytosines to be sequenced as thymine, which allows methylation levels to reflected in the number of ‘C’-‘C’ alignments covering reference cytosines. Di-base color reads produced by lifetech’s SOLiD sequencer provide unreliable results when translated to bases because single sequencing errors effect the downstream sequence. We describe FadE, an algorithm to accurately determine genome-wide methylation rates directly in color or nucleotide space. FadE uses SBT unmethylated and untreated data to determine background error rates and incorporate them into a model which uses Newton–Raphson optimization to estimate the methylation rate and provide a credible interval describing its distribution at every reference cytosine. We sequenced two slides of human fibroblast cell-line bisulfite-converted fragment library with the SOLiD sequencer to investigate genome-wide methylation levels. FadE reported widespread differences in methylation levels across CpG islands and a large number of differentially methylated regions adjacent to genes which compares favorably to the results of an investigation on the same cell-line using nucleotide-space reads at higher coverage levels, suggesting that FadE is an accurate method to estimate genome-wide methylation with color or nucleotide reads. http://code.google.com/p/fade/.
format Online
Article
Text
id pubmed-3592442
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35924422013-03-08 FadE: whole genome methylation analysis for multiple sequencing platforms Souaiaia, Tade Zhang, Zheng Chen, Ting Nucleic Acids Res Methods Online DNA methylation plays a central role in genomic regulation and disease. Sodium bisulfite treatment (SBT) causes unmethylated cytosines to be sequenced as thymine, which allows methylation levels to reflected in the number of ‘C’-‘C’ alignments covering reference cytosines. Di-base color reads produced by lifetech’s SOLiD sequencer provide unreliable results when translated to bases because single sequencing errors effect the downstream sequence. We describe FadE, an algorithm to accurately determine genome-wide methylation rates directly in color or nucleotide space. FadE uses SBT unmethylated and untreated data to determine background error rates and incorporate them into a model which uses Newton–Raphson optimization to estimate the methylation rate and provide a credible interval describing its distribution at every reference cytosine. We sequenced two slides of human fibroblast cell-line bisulfite-converted fragment library with the SOLiD sequencer to investigate genome-wide methylation levels. FadE reported widespread differences in methylation levels across CpG islands and a large number of differentially methylated regions adjacent to genes which compares favorably to the results of an investigation on the same cell-line using nucleotide-space reads at higher coverage levels, suggesting that FadE is an accurate method to estimate genome-wide methylation with color or nucleotide reads. http://code.google.com/p/fade/. Oxford University Press 2013-01 2012-09-08 /pmc/articles/PMC3592442/ /pubmed/22965123 http://dx.doi.org/10.1093/nar/gks830 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Souaiaia, Tade
Zhang, Zheng
Chen, Ting
FadE: whole genome methylation analysis for multiple sequencing platforms
title FadE: whole genome methylation analysis for multiple sequencing platforms
title_full FadE: whole genome methylation analysis for multiple sequencing platforms
title_fullStr FadE: whole genome methylation analysis for multiple sequencing platforms
title_full_unstemmed FadE: whole genome methylation analysis for multiple sequencing platforms
title_short FadE: whole genome methylation analysis for multiple sequencing platforms
title_sort fade: whole genome methylation analysis for multiple sequencing platforms
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592442/
https://www.ncbi.nlm.nih.gov/pubmed/22965123
http://dx.doi.org/10.1093/nar/gks830
work_keys_str_mv AT souaiaiatade fadewholegenomemethylationanalysisformultiplesequencingplatforms
AT zhangzheng fadewholegenomemethylationanalysisformultiplesequencingplatforms
AT chenting fadewholegenomemethylationanalysisformultiplesequencingplatforms