Cargando…
FadE: whole genome methylation analysis for multiple sequencing platforms
DNA methylation plays a central role in genomic regulation and disease. Sodium bisulfite treatment (SBT) causes unmethylated cytosines to be sequenced as thymine, which allows methylation levels to reflected in the number of ‘C’-‘C’ alignments covering reference cytosines. Di-base color reads produc...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592442/ https://www.ncbi.nlm.nih.gov/pubmed/22965123 http://dx.doi.org/10.1093/nar/gks830 |
_version_ | 1782262116910104576 |
---|---|
author | Souaiaia, Tade Zhang, Zheng Chen, Ting |
author_facet | Souaiaia, Tade Zhang, Zheng Chen, Ting |
author_sort | Souaiaia, Tade |
collection | PubMed |
description | DNA methylation plays a central role in genomic regulation and disease. Sodium bisulfite treatment (SBT) causes unmethylated cytosines to be sequenced as thymine, which allows methylation levels to reflected in the number of ‘C’-‘C’ alignments covering reference cytosines. Di-base color reads produced by lifetech’s SOLiD sequencer provide unreliable results when translated to bases because single sequencing errors effect the downstream sequence. We describe FadE, an algorithm to accurately determine genome-wide methylation rates directly in color or nucleotide space. FadE uses SBT unmethylated and untreated data to determine background error rates and incorporate them into a model which uses Newton–Raphson optimization to estimate the methylation rate and provide a credible interval describing its distribution at every reference cytosine. We sequenced two slides of human fibroblast cell-line bisulfite-converted fragment library with the SOLiD sequencer to investigate genome-wide methylation levels. FadE reported widespread differences in methylation levels across CpG islands and a large number of differentially methylated regions adjacent to genes which compares favorably to the results of an investigation on the same cell-line using nucleotide-space reads at higher coverage levels, suggesting that FadE is an accurate method to estimate genome-wide methylation with color or nucleotide reads. http://code.google.com/p/fade/. |
format | Online Article Text |
id | pubmed-3592442 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-35924422013-03-08 FadE: whole genome methylation analysis for multiple sequencing platforms Souaiaia, Tade Zhang, Zheng Chen, Ting Nucleic Acids Res Methods Online DNA methylation plays a central role in genomic regulation and disease. Sodium bisulfite treatment (SBT) causes unmethylated cytosines to be sequenced as thymine, which allows methylation levels to reflected in the number of ‘C’-‘C’ alignments covering reference cytosines. Di-base color reads produced by lifetech’s SOLiD sequencer provide unreliable results when translated to bases because single sequencing errors effect the downstream sequence. We describe FadE, an algorithm to accurately determine genome-wide methylation rates directly in color or nucleotide space. FadE uses SBT unmethylated and untreated data to determine background error rates and incorporate them into a model which uses Newton–Raphson optimization to estimate the methylation rate and provide a credible interval describing its distribution at every reference cytosine. We sequenced two slides of human fibroblast cell-line bisulfite-converted fragment library with the SOLiD sequencer to investigate genome-wide methylation levels. FadE reported widespread differences in methylation levels across CpG islands and a large number of differentially methylated regions adjacent to genes which compares favorably to the results of an investigation on the same cell-line using nucleotide-space reads at higher coverage levels, suggesting that FadE is an accurate method to estimate genome-wide methylation with color or nucleotide reads. http://code.google.com/p/fade/. Oxford University Press 2013-01 2012-09-08 /pmc/articles/PMC3592442/ /pubmed/22965123 http://dx.doi.org/10.1093/nar/gks830 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Souaiaia, Tade Zhang, Zheng Chen, Ting FadE: whole genome methylation analysis for multiple sequencing platforms |
title | FadE: whole genome methylation analysis for multiple sequencing platforms |
title_full | FadE: whole genome methylation analysis for multiple sequencing platforms |
title_fullStr | FadE: whole genome methylation analysis for multiple sequencing platforms |
title_full_unstemmed | FadE: whole genome methylation analysis for multiple sequencing platforms |
title_short | FadE: whole genome methylation analysis for multiple sequencing platforms |
title_sort | fade: whole genome methylation analysis for multiple sequencing platforms |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592442/ https://www.ncbi.nlm.nih.gov/pubmed/22965123 http://dx.doi.org/10.1093/nar/gks830 |
work_keys_str_mv | AT souaiaiatade fadewholegenomemethylationanalysisformultiplesequencingplatforms AT zhangzheng fadewholegenomemethylationanalysisformultiplesequencingplatforms AT chenting fadewholegenomemethylationanalysisformultiplesequencingplatforms |