Cargando…
Permutation-based significance analysis reduces the type 1 error rate in bisulphite sequencing data analysis of human umbilical cord blood samples
DNA methylation patterns are largely established in-utero and might mediate the impacts of in-utero conditions on later health outcomes. Associations between perinatal DNA methylation marks and pregnancy-related variables, such as maternal age and gestational weight gain, have been earlier studied w...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Taylor & Francis
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620995/ https://www.ncbi.nlm.nih.gov/pubmed/35246015 http://dx.doi.org/10.1080/15592294.2022.2044127 |
_version_ | 1784821441333559296 |
---|---|
author | Laajala, Essi Halla-aho, Viivi Grönroos, Toni Kalim, Ubaid Ullah Vähä-Mäkilä, Mari Nurmio, Mirja Kallionpää, Henna Lietzén, Niina Mykkänen, Juha Rasool, Omid Toppari, Jorma Orešič, Matej Knip, Mikael Lund, Riikka Lahesmaa, Riitta Lähdesmäki, Harri |
author_facet | Laajala, Essi Halla-aho, Viivi Grönroos, Toni Kalim, Ubaid Ullah Vähä-Mäkilä, Mari Nurmio, Mirja Kallionpää, Henna Lietzén, Niina Mykkänen, Juha Rasool, Omid Toppari, Jorma Orešič, Matej Knip, Mikael Lund, Riikka Lahesmaa, Riitta Lähdesmäki, Harri |
author_sort | Laajala, Essi |
collection | PubMed |
description | DNA methylation patterns are largely established in-utero and might mediate the impacts of in-utero conditions on later health outcomes. Associations between perinatal DNA methylation marks and pregnancy-related variables, such as maternal age and gestational weight gain, have been earlier studied with methylation microarrays, which typically cover less than 2% of human CpG sites. To detect such associations outside these regions, we chose the bisulphite sequencing approach. We collected and curated clinical data on 200 newborn infants; whose umbilical cord blood samples were analysed with the reduced representation bisulphite sequencing (RRBS) method. A generalized linear mixed-effects model was fit for each high coverage CpG site, followed by spatial and multiple testing adjustment of P values to identify differentially methylated cytosines (DMCs) and regions (DMRs) associated with clinical variables, such as maternal age, mode of delivery, and birth weight. Type 1 error rate was then evaluated with a permutation analysis. We discovered a strong inflation of spatially adjusted P values through the permutation analysis, which we then applied for empirical type 1 error control. The inflation of P values was caused by a common method for spatial adjustment and DMR detection, implemented in tools comb-p and RADMeth. Based on empirically estimated significance thresholds, very little differential methylation was associated with any of the studied clinical variables, other than sex. With this analysis workflow, the sex-associated differentially methylated regions were highly reproducible across studies, technologies, and statistical models. |
format | Online Article Text |
id | pubmed-9620995 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Taylor & Francis |
record_format | MEDLINE/PubMed |
spelling | pubmed-96209952022-11-01 Permutation-based significance analysis reduces the type 1 error rate in bisulphite sequencing data analysis of human umbilical cord blood samples Laajala, Essi Halla-aho, Viivi Grönroos, Toni Kalim, Ubaid Ullah Vähä-Mäkilä, Mari Nurmio, Mirja Kallionpää, Henna Lietzén, Niina Mykkänen, Juha Rasool, Omid Toppari, Jorma Orešič, Matej Knip, Mikael Lund, Riikka Lahesmaa, Riitta Lähdesmäki, Harri Epigenetics Research Paper DNA methylation patterns are largely established in-utero and might mediate the impacts of in-utero conditions on later health outcomes. Associations between perinatal DNA methylation marks and pregnancy-related variables, such as maternal age and gestational weight gain, have been earlier studied with methylation microarrays, which typically cover less than 2% of human CpG sites. To detect such associations outside these regions, we chose the bisulphite sequencing approach. We collected and curated clinical data on 200 newborn infants; whose umbilical cord blood samples were analysed with the reduced representation bisulphite sequencing (RRBS) method. A generalized linear mixed-effects model was fit for each high coverage CpG site, followed by spatial and multiple testing adjustment of P values to identify differentially methylated cytosines (DMCs) and regions (DMRs) associated with clinical variables, such as maternal age, mode of delivery, and birth weight. Type 1 error rate was then evaluated with a permutation analysis. We discovered a strong inflation of spatially adjusted P values through the permutation analysis, which we then applied for empirical type 1 error control. The inflation of P values was caused by a common method for spatial adjustment and DMR detection, implemented in tools comb-p and RADMeth. Based on empirically estimated significance thresholds, very little differential methylation was associated with any of the studied clinical variables, other than sex. With this analysis workflow, the sex-associated differentially methylated regions were highly reproducible across studies, technologies, and statistical models. Taylor & Francis 2022-03-04 /pmc/articles/PMC9620995/ /pubmed/35246015 http://dx.doi.org/10.1080/15592294.2022.2044127 Text en © 2022 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Paper Laajala, Essi Halla-aho, Viivi Grönroos, Toni Kalim, Ubaid Ullah Vähä-Mäkilä, Mari Nurmio, Mirja Kallionpää, Henna Lietzén, Niina Mykkänen, Juha Rasool, Omid Toppari, Jorma Orešič, Matej Knip, Mikael Lund, Riikka Lahesmaa, Riitta Lähdesmäki, Harri Permutation-based significance analysis reduces the type 1 error rate in bisulphite sequencing data analysis of human umbilical cord blood samples |
title | Permutation-based significance analysis reduces the type 1 error rate in bisulphite sequencing data analysis of human umbilical cord blood samples |
title_full | Permutation-based significance analysis reduces the type 1 error rate in bisulphite sequencing data analysis of human umbilical cord blood samples |
title_fullStr | Permutation-based significance analysis reduces the type 1 error rate in bisulphite sequencing data analysis of human umbilical cord blood samples |
title_full_unstemmed | Permutation-based significance analysis reduces the type 1 error rate in bisulphite sequencing data analysis of human umbilical cord blood samples |
title_short | Permutation-based significance analysis reduces the type 1 error rate in bisulphite sequencing data analysis of human umbilical cord blood samples |
title_sort | permutation-based significance analysis reduces the type 1 error rate in bisulphite sequencing data analysis of human umbilical cord blood samples |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620995/ https://www.ncbi.nlm.nih.gov/pubmed/35246015 http://dx.doi.org/10.1080/15592294.2022.2044127 |
work_keys_str_mv | AT laajalaessi permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT hallaahoviivi permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT gronroostoni permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT kalimubaidullah permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT vahamakilamari permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT nurmiomirja permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT kallionpaahenna permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT lietzenniina permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT mykkanenjuha permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT rasoolomid permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT topparijorma permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT oresicmatej permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT knipmikael permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT lundriikka permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT lahesmaariitta permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples AT lahdesmakiharri permutationbasedsignificanceanalysisreducesthetype1errorrateinbisulphitesequencingdataanalysisofhumanumbilicalcordbloodsamples |