Cargando…
A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation
Identifying disease‐associated changes in DNA methylation can help us gain a better understanding of disease etiology. Bisulfite sequencing allows the generation of high‐throughput methylation profiles at single‐base resolution of DNA. However, optimally modeling and analyzing these sparse and discr...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8359306/ https://www.ncbi.nlm.nih.gov/pubmed/32438470 http://dx.doi.org/10.1111/biom.13307 |
_version_ | 1783737519709356032 |
---|---|
author | Zhao, Kaiqiong Oualkacha, Karim Lakhal‐Chaieb, Lajmi Labbe, Aurélie Klein, Kathleen Ciampi, Antonio Hudson, Marie Colmegna, Inés Pastinen, Tomi Zhang, Tieyuan Daley, Denise Greenwood, Celia M.T. |
author_facet | Zhao, Kaiqiong Oualkacha, Karim Lakhal‐Chaieb, Lajmi Labbe, Aurélie Klein, Kathleen Ciampi, Antonio Hudson, Marie Colmegna, Inés Pastinen, Tomi Zhang, Tieyuan Daley, Denise Greenwood, Celia M.T. |
author_sort | Zhao, Kaiqiong |
collection | PubMed |
description | Identifying disease‐associated changes in DNA methylation can help us gain a better understanding of disease etiology. Bisulfite sequencing allows the generation of high‐throughput methylation profiles at single‐base resolution of DNA. However, optimally modeling and analyzing these sparse and discrete sequencing data is still very challenging due to variable read depth, missing data patterns, long‐range correlations, data errors, and confounding from cell type mixtures. We propose a regression‐based hierarchical model that allows covariate effects to vary smoothly along genomic positions and we have built a specialized EM algorithm, which explicitly allows for experimental errors and cell type mixtures, to make inference about smooth covariate effects in the model. Simulations show that the proposed method provides accurate estimates of covariate effects and captures the major underlying methylation patterns with excellent power. We also apply our method to analyze data from rheumatoid arthritis patients and controls. The method has been implemented in R package SOMNiBUS. |
format | Online Article Text |
id | pubmed-8359306 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-83593062021-08-17 A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation Zhao, Kaiqiong Oualkacha, Karim Lakhal‐Chaieb, Lajmi Labbe, Aurélie Klein, Kathleen Ciampi, Antonio Hudson, Marie Colmegna, Inés Pastinen, Tomi Zhang, Tieyuan Daley, Denise Greenwood, Celia M.T. Biometrics Biometric Methodology Identifying disease‐associated changes in DNA methylation can help us gain a better understanding of disease etiology. Bisulfite sequencing allows the generation of high‐throughput methylation profiles at single‐base resolution of DNA. However, optimally modeling and analyzing these sparse and discrete sequencing data is still very challenging due to variable read depth, missing data patterns, long‐range correlations, data errors, and confounding from cell type mixtures. We propose a regression‐based hierarchical model that allows covariate effects to vary smoothly along genomic positions and we have built a specialized EM algorithm, which explicitly allows for experimental errors and cell type mixtures, to make inference about smooth covariate effects in the model. Simulations show that the proposed method provides accurate estimates of covariate effects and captures the major underlying methylation patterns with excellent power. We also apply our method to analyze data from rheumatoid arthritis patients and controls. The method has been implemented in R package SOMNiBUS. John Wiley and Sons Inc. 2020-06-05 2021-06 /pmc/articles/PMC8359306/ /pubmed/32438470 http://dx.doi.org/10.1111/biom.13307 Text en © 2020 The Authors. Biometrics published by Wiley Periodicals, Inc. on behalf of International Biometric Society. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Biometric Methodology Zhao, Kaiqiong Oualkacha, Karim Lakhal‐Chaieb, Lajmi Labbe, Aurélie Klein, Kathleen Ciampi, Antonio Hudson, Marie Colmegna, Inés Pastinen, Tomi Zhang, Tieyuan Daley, Denise Greenwood, Celia M.T. A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation |
title | A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation |
title_full | A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation |
title_fullStr | A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation |
title_full_unstemmed | A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation |
title_short | A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation |
title_sort | novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of dna methylation |
topic | Biometric Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8359306/ https://www.ncbi.nlm.nih.gov/pubmed/32438470 http://dx.doi.org/10.1111/biom.13307 |
work_keys_str_mv | AT zhaokaiqiong anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT oualkachakarim anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT lakhalchaieblajmi anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT labbeaurelie anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT kleinkathleen anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT ciampiantonio anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT hudsonmarie anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT colmegnaines anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT pastinentomi anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT zhangtieyuan anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT daleydenise anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT greenwoodceliamt anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT zhaokaiqiong novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT oualkachakarim novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT lakhalchaieblajmi novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT labbeaurelie novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT kleinkathleen novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT ciampiantonio novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT hudsonmarie novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT colmegnaines novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT pastinentomi novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT zhangtieyuan novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT daleydenise novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation AT greenwoodceliamt novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation |