Cargando…

Simultaneous Statistical Inference for Epigenetic Data

Epigenetic research leads to complex data structures. Since parametric model assumptions for the distribution of epigenetic data are hard to verify we introduce in the present work a nonparametric statistical framework for two-group comparisons. Furthermore, epigenetic analyses are often performed a...

Descripción completa

Detalles Bibliográficos
Autores principales: Schildknecht, Konstantin, Olek, Sven, Dickhaus, Thorsten
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428829/
https://www.ncbi.nlm.nih.gov/pubmed/25965389
http://dx.doi.org/10.1371/journal.pone.0125587
_version_ 1782370946827419648
author Schildknecht, Konstantin
Olek, Sven
Dickhaus, Thorsten
author_facet Schildknecht, Konstantin
Olek, Sven
Dickhaus, Thorsten
author_sort Schildknecht, Konstantin
collection PubMed
description Epigenetic research leads to complex data structures. Since parametric model assumptions for the distribution of epigenetic data are hard to verify we introduce in the present work a nonparametric statistical framework for two-group comparisons. Furthermore, epigenetic analyses are often performed at various genetic loci simultaneously. Hence, in order to be able to draw valid conclusions for specific loci, an appropriate multiple testing correction is necessary. Finally, with technologies available for the simultaneous assessment of many interrelated biological parameters (such as gene arrays), statistical approaches also need to deal with a possibly unknown dependency structure in the data. Our statistical approach to the nonparametric comparison of two samples with independent multivariate observables is based on recently developed multivariate multiple permutation tests. We adapt their theory in order to cope with families of hypotheses regarding relative effects. Our results indicate that the multivariate multiple permutation test keeps the pre-assigned type I error level for the global null hypothesis. In combination with the closure principle, the family-wise error rate for the simultaneous test of the corresponding locus/parameter-specific null hypotheses can be controlled. In applications we demonstrate that group differences in epigenetic data can be detected reliably with our methodology.
format Online
Article
Text
id pubmed-4428829
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44288292015-05-21 Simultaneous Statistical Inference for Epigenetic Data Schildknecht, Konstantin Olek, Sven Dickhaus, Thorsten PLoS One Research Article Epigenetic research leads to complex data structures. Since parametric model assumptions for the distribution of epigenetic data are hard to verify we introduce in the present work a nonparametric statistical framework for two-group comparisons. Furthermore, epigenetic analyses are often performed at various genetic loci simultaneously. Hence, in order to be able to draw valid conclusions for specific loci, an appropriate multiple testing correction is necessary. Finally, with technologies available for the simultaneous assessment of many interrelated biological parameters (such as gene arrays), statistical approaches also need to deal with a possibly unknown dependency structure in the data. Our statistical approach to the nonparametric comparison of two samples with independent multivariate observables is based on recently developed multivariate multiple permutation tests. We adapt their theory in order to cope with families of hypotheses regarding relative effects. Our results indicate that the multivariate multiple permutation test keeps the pre-assigned type I error level for the global null hypothesis. In combination with the closure principle, the family-wise error rate for the simultaneous test of the corresponding locus/parameter-specific null hypotheses can be controlled. In applications we demonstrate that group differences in epigenetic data can be detected reliably with our methodology. Public Library of Science 2015-05-12 /pmc/articles/PMC4428829/ /pubmed/25965389 http://dx.doi.org/10.1371/journal.pone.0125587 Text en © 2015 Schildknecht et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Schildknecht, Konstantin
Olek, Sven
Dickhaus, Thorsten
Simultaneous Statistical Inference for Epigenetic Data
title Simultaneous Statistical Inference for Epigenetic Data
title_full Simultaneous Statistical Inference for Epigenetic Data
title_fullStr Simultaneous Statistical Inference for Epigenetic Data
title_full_unstemmed Simultaneous Statistical Inference for Epigenetic Data
title_short Simultaneous Statistical Inference for Epigenetic Data
title_sort simultaneous statistical inference for epigenetic data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428829/
https://www.ncbi.nlm.nih.gov/pubmed/25965389
http://dx.doi.org/10.1371/journal.pone.0125587
work_keys_str_mv AT schildknechtkonstantin simultaneousstatisticalinferenceforepigeneticdata
AT oleksven simultaneousstatisticalinferenceforepigeneticdata
AT dickhausthorsten simultaneousstatisticalinferenceforepigeneticdata