Cargando…

A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation

Identifying disease‐associated changes in DNA methylation can help us gain a better understanding of disease etiology. Bisulfite sequencing allows the generation of high‐throughput methylation profiles at single‐base resolution of DNA. However, optimally modeling and analyzing these sparse and discr...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Kaiqiong, Oualkacha, Karim, Lakhal‐Chaieb, Lajmi, Labbe, Aurélie, Klein, Kathleen, Ciampi, Antonio, Hudson, Marie, Colmegna, Inés, Pastinen, Tomi, Zhang, Tieyuan, Daley, Denise, Greenwood, Celia M.T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8359306/
https://www.ncbi.nlm.nih.gov/pubmed/32438470
http://dx.doi.org/10.1111/biom.13307
_version_ 1783737519709356032
author Zhao, Kaiqiong
Oualkacha, Karim
Lakhal‐Chaieb, Lajmi
Labbe, Aurélie
Klein, Kathleen
Ciampi, Antonio
Hudson, Marie
Colmegna, Inés
Pastinen, Tomi
Zhang, Tieyuan
Daley, Denise
Greenwood, Celia M.T.
author_facet Zhao, Kaiqiong
Oualkacha, Karim
Lakhal‐Chaieb, Lajmi
Labbe, Aurélie
Klein, Kathleen
Ciampi, Antonio
Hudson, Marie
Colmegna, Inés
Pastinen, Tomi
Zhang, Tieyuan
Daley, Denise
Greenwood, Celia M.T.
author_sort Zhao, Kaiqiong
collection PubMed
description Identifying disease‐associated changes in DNA methylation can help us gain a better understanding of disease etiology. Bisulfite sequencing allows the generation of high‐throughput methylation profiles at single‐base resolution of DNA. However, optimally modeling and analyzing these sparse and discrete sequencing data is still very challenging due to variable read depth, missing data patterns, long‐range correlations, data errors, and confounding from cell type mixtures. We propose a regression‐based hierarchical model that allows covariate effects to vary smoothly along genomic positions and we have built a specialized EM algorithm, which explicitly allows for experimental errors and cell type mixtures, to make inference about smooth covariate effects in the model. Simulations show that the proposed method provides accurate estimates of covariate effects and captures the major underlying methylation patterns with excellent power. We also apply our method to analyze data from rheumatoid arthritis patients and controls. The method has been implemented in R package SOMNiBUS.
format Online
Article
Text
id pubmed-8359306
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-83593062021-08-17 A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation Zhao, Kaiqiong Oualkacha, Karim Lakhal‐Chaieb, Lajmi Labbe, Aurélie Klein, Kathleen Ciampi, Antonio Hudson, Marie Colmegna, Inés Pastinen, Tomi Zhang, Tieyuan Daley, Denise Greenwood, Celia M.T. Biometrics Biometric Methodology Identifying disease‐associated changes in DNA methylation can help us gain a better understanding of disease etiology. Bisulfite sequencing allows the generation of high‐throughput methylation profiles at single‐base resolution of DNA. However, optimally modeling and analyzing these sparse and discrete sequencing data is still very challenging due to variable read depth, missing data patterns, long‐range correlations, data errors, and confounding from cell type mixtures. We propose a regression‐based hierarchical model that allows covariate effects to vary smoothly along genomic positions and we have built a specialized EM algorithm, which explicitly allows for experimental errors and cell type mixtures, to make inference about smooth covariate effects in the model. Simulations show that the proposed method provides accurate estimates of covariate effects and captures the major underlying methylation patterns with excellent power. We also apply our method to analyze data from rheumatoid arthritis patients and controls. The method has been implemented in R package SOMNiBUS. John Wiley and Sons Inc. 2020-06-05 2021-06 /pmc/articles/PMC8359306/ /pubmed/32438470 http://dx.doi.org/10.1111/biom.13307 Text en © 2020 The Authors. Biometrics published by Wiley Periodicals, Inc. on behalf of International Biometric Society. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Biometric Methodology
Zhao, Kaiqiong
Oualkacha, Karim
Lakhal‐Chaieb, Lajmi
Labbe, Aurélie
Klein, Kathleen
Ciampi, Antonio
Hudson, Marie
Colmegna, Inés
Pastinen, Tomi
Zhang, Tieyuan
Daley, Denise
Greenwood, Celia M.T.
A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation
title A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation
title_full A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation
title_fullStr A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation
title_full_unstemmed A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation
title_short A novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of DNA methylation
title_sort novel statistical method for modeling covariate effects in bisulfite sequencing derived measures of dna methylation
topic Biometric Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8359306/
https://www.ncbi.nlm.nih.gov/pubmed/32438470
http://dx.doi.org/10.1111/biom.13307
work_keys_str_mv AT zhaokaiqiong anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT oualkachakarim anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT lakhalchaieblajmi anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT labbeaurelie anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT kleinkathleen anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT ciampiantonio anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT hudsonmarie anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT colmegnaines anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT pastinentomi anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT zhangtieyuan anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT daleydenise anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT greenwoodceliamt anovelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT zhaokaiqiong novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT oualkachakarim novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT lakhalchaieblajmi novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT labbeaurelie novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT kleinkathleen novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT ciampiantonio novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT hudsonmarie novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT colmegnaines novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT pastinentomi novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT zhangtieyuan novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT daleydenise novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation
AT greenwoodceliamt novelstatisticalmethodformodelingcovariateeffectsinbisulfitesequencingderivedmeasuresofdnamethylation