Cargando…

A novel principal component based method for identifying differentially methylated regions in Illumina Infinium MethylationEPIC BeadChip data

Differentially methylated regions (DMRs) are genomic regions with methylation patterns across multiple CpG sites that are associated with a phenotype. In this study, we proposed a Principal Component (PC) based DMR analysis method for use with data generated using the Illumina Infinium MethylationEP...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Yuanchao, Lunetta, Kathryn L., Liu, Chunyu, Smith, Alicia K., Sherva, Richard, Miller, Mark W., Logue, Mark W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193914/
https://www.ncbi.nlm.nih.gov/pubmed/37196182
http://dx.doi.org/10.1080/15592294.2023.2207959
Descripción
Sumario:Differentially methylated regions (DMRs) are genomic regions with methylation patterns across multiple CpG sites that are associated with a phenotype. In this study, we proposed a Principal Component (PC) based DMR analysis method for use with data generated using the Illumina Infinium MethylationEPIC BeadChip (EPIC) array. We obtained methylation residuals by regressing the M-values of CpGs within a region on covariates, extracted PCs of the residuals, and then combined association information across PCs to obtain regional significance. Simulation-based genome-wide false positive (GFP) rates and true positive rates were estimated under a variety of conditions before determining the final version of our method, which we have named DMR(PC). Then, DMR(PC) and another DMR method, coMethDMR, were used to perform epigenome-wide analyses of several phenotypes known to have multiple associated methylation loci (age, sex, and smoking) in a discovery and a replication cohort. Among regions that were analysed by both methods, DMR(PC) identified 50% more genome-wide significant age-associated DMRs than coMethDMR. The replication rate for the loci that were identified by only DMR(PC) was higher than the rate for those that were identified by only coMethDMR (90% for DMRPC vs. 76% for coMethDMR). Furthermore, DMR(PC) identified replicable associations in regions of moderate between-CpG correlation which are typically not analysed by coMethDMR. For the analyses of sex and smoking, the advantage of DMR(PC) was less clear. In conclusion, DMR(PC) is a new powerful DMR discovery tool that retains power in genomic regions with moderate correlation across CpGs.