Cargando…

Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures

BACKGROUND: An important feature in many genomic studies is quality control and normalization. This is particularly important when analyzing epigenetic data, where the process of obtaining measurements can be bias prone. The GAW20 data was from the Genetics of Lipid Lowering Drugs and Diet Network (...

Descripción completa

Detalles Bibliográficos
Autores principales: LeBlanc, Marissa, Nustad, Haakon E., Zucknick, Manuela, Page, Christian M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6156833/
https://www.ncbi.nlm.nih.gov/pubmed/30255766
http://dx.doi.org/10.1186/s12863-018-0636-5
_version_ 1783358166006759424
author LeBlanc, Marissa
Nustad, Haakon E.
Zucknick, Manuela
Page, Christian M.
author_facet LeBlanc, Marissa
Nustad, Haakon E.
Zucknick, Manuela
Page, Christian M.
author_sort LeBlanc, Marissa
collection PubMed
description BACKGROUND: An important feature in many genomic studies is quality control and normalization. This is particularly important when analyzing epigenetic data, where the process of obtaining measurements can be bias prone. The GAW20 data was from the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN), a study with multigeneration families, where DNA cytosine-phosphate-guanine (CpG) methylation was measured pre- and posttreatment with fenofibrate. We performed quality control assessment of the GAW20 DNA methylation data, including normalization, assessment of batch effects and detection of sample swaps. RESULTS: We show that even after normalization, the GOLDN methylation data has systematic differences pre- and posttreatment. Through investigation of (a) CpGs sites containing a single nucleotide polymorphism, (b) the stability of breeding values for methylation across time points, and (c) autosomal gender-associated CpGs, 13 sample swaps were detected, 11 of which were posttreatment. CONCLUSIONS: This paper demonstrates several ways to perform quality control of methylation data in the absence of raw data files and highlights the importance of normalization and quality control of the GAW20 methylation data from the GOLDN study.
format Online
Article
Text
id pubmed-6156833
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61568332018-09-27 Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures LeBlanc, Marissa Nustad, Haakon E. Zucknick, Manuela Page, Christian M. BMC Genet Research BACKGROUND: An important feature in many genomic studies is quality control and normalization. This is particularly important when analyzing epigenetic data, where the process of obtaining measurements can be bias prone. The GAW20 data was from the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN), a study with multigeneration families, where DNA cytosine-phosphate-guanine (CpG) methylation was measured pre- and posttreatment with fenofibrate. We performed quality control assessment of the GAW20 DNA methylation data, including normalization, assessment of batch effects and detection of sample swaps. RESULTS: We show that even after normalization, the GOLDN methylation data has systematic differences pre- and posttreatment. Through investigation of (a) CpGs sites containing a single nucleotide polymorphism, (b) the stability of breeding values for methylation across time points, and (c) autosomal gender-associated CpGs, 13 sample swaps were detected, 11 of which were posttreatment. CONCLUSIONS: This paper demonstrates several ways to perform quality control of methylation data in the absence of raw data files and highlights the importance of normalization and quality control of the GAW20 methylation data from the GOLDN study. BioMed Central 2018-09-17 /pmc/articles/PMC6156833/ /pubmed/30255766 http://dx.doi.org/10.1186/s12863-018-0636-5 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
LeBlanc, Marissa
Nustad, Haakon E.
Zucknick, Manuela
Page, Christian M.
Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures
title Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures
title_full Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures
title_fullStr Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures
title_full_unstemmed Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures
title_short Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures
title_sort quality control for illumina 450k methylation data in the absence of idat files using correlation structure in pedigrees and repeated measures
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6156833/
https://www.ncbi.nlm.nih.gov/pubmed/30255766
http://dx.doi.org/10.1186/s12863-018-0636-5
work_keys_str_mv AT leblancmarissa qualitycontrolforillumina450kmethylationdataintheabsenceofidatfilesusingcorrelationstructureinpedigreesandrepeatedmeasures
AT nustadhaakone qualitycontrolforillumina450kmethylationdataintheabsenceofidatfilesusingcorrelationstructureinpedigreesandrepeatedmeasures
AT zucknickmanuela qualitycontrolforillumina450kmethylationdataintheabsenceofidatfilesusingcorrelationstructureinpedigreesandrepeatedmeasures
AT pagechristianm qualitycontrolforillumina450kmethylationdataintheabsenceofidatfilesusingcorrelationstructureinpedigreesandrepeatedmeasures