Cargando…

DNA methylation-based sex classifier to predict sex and identify sex chromosome aneuploidy

BACKGROUND: Sex is an important covariate of epigenome-wide association studies due to its strong influence on DNA methylation patterns across numerous genomic positions. Nevertheless, many samples on the Gene Expression Omnibus (GEO) frequently lack a sex annotation or are incorrectly labelled. Con...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yucheng, Hannon, Eilis, Grant, Olivia A., Gorrie-Stone, Tyler J., Kumari, Meena, Mill, Jonathan, Zhai, Xiaojun, McDonald-Maier, Klaus D., Schalkwyk, Leonard C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8240370/
https://www.ncbi.nlm.nih.gov/pubmed/34182928
http://dx.doi.org/10.1186/s12864-021-07675-2
_version_ 1783715202324234240
author Wang, Yucheng
Hannon, Eilis
Grant, Olivia A.
Gorrie-Stone, Tyler J.
Kumari, Meena
Mill, Jonathan
Zhai, Xiaojun
McDonald-Maier, Klaus D.
Schalkwyk, Leonard C.
author_facet Wang, Yucheng
Hannon, Eilis
Grant, Olivia A.
Gorrie-Stone, Tyler J.
Kumari, Meena
Mill, Jonathan
Zhai, Xiaojun
McDonald-Maier, Klaus D.
Schalkwyk, Leonard C.
author_sort Wang, Yucheng
collection PubMed
description BACKGROUND: Sex is an important covariate of epigenome-wide association studies due to its strong influence on DNA methylation patterns across numerous genomic positions. Nevertheless, many samples on the Gene Expression Omnibus (GEO) frequently lack a sex annotation or are incorrectly labelled. Considering the influence that sex imposes on DNA methylation patterns, it is necessary to ensure that methods for filtering poor samples and checking of sex assignment are accurate and widely applicable. RESULTS: Here we presented a novel method to predict sex using only DNA methylation beta values, which can be readily applied to almost all DNA methylation datasets of different formats (raw IDATs or text files with only signal intensities) uploaded to GEO. We identified 4345 significantly (p<0.01) sex-associated CpG sites present on both 450K and EPIC arrays, and constructed a sex classifier based on the two first principal components of the DNA methylation data of sex-associated probes mapped on sex chromosomes. The proposed method is constructed using whole blood samples and exhibits good performance across a wide range of tissues. We further demonstrated that our method can be used to identify samples with sex chromosome aneuploidy, this function is validated by five Turner syndrome cases and one Klinefelter syndrome case. CONCLUSIONS: This proposed sex classifier not only can be used for sex predictions but also applied to identify samples with sex chromosome aneuploidy, and it is freely and easily accessible by calling the ‘estimateSex’ function from the newest wateRmelon Bioconductor package (https://github.com/schalkwyk/wateRmelon). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-07675-2).
format Online
Article
Text
id pubmed-8240370
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-82403702021-06-30 DNA methylation-based sex classifier to predict sex and identify sex chromosome aneuploidy Wang, Yucheng Hannon, Eilis Grant, Olivia A. Gorrie-Stone, Tyler J. Kumari, Meena Mill, Jonathan Zhai, Xiaojun McDonald-Maier, Klaus D. Schalkwyk, Leonard C. BMC Genomics Methodology Article BACKGROUND: Sex is an important covariate of epigenome-wide association studies due to its strong influence on DNA methylation patterns across numerous genomic positions. Nevertheless, many samples on the Gene Expression Omnibus (GEO) frequently lack a sex annotation or are incorrectly labelled. Considering the influence that sex imposes on DNA methylation patterns, it is necessary to ensure that methods for filtering poor samples and checking of sex assignment are accurate and widely applicable. RESULTS: Here we presented a novel method to predict sex using only DNA methylation beta values, which can be readily applied to almost all DNA methylation datasets of different formats (raw IDATs or text files with only signal intensities) uploaded to GEO. We identified 4345 significantly (p<0.01) sex-associated CpG sites present on both 450K and EPIC arrays, and constructed a sex classifier based on the two first principal components of the DNA methylation data of sex-associated probes mapped on sex chromosomes. The proposed method is constructed using whole blood samples and exhibits good performance across a wide range of tissues. We further demonstrated that our method can be used to identify samples with sex chromosome aneuploidy, this function is validated by five Turner syndrome cases and one Klinefelter syndrome case. CONCLUSIONS: This proposed sex classifier not only can be used for sex predictions but also applied to identify samples with sex chromosome aneuploidy, and it is freely and easily accessible by calling the ‘estimateSex’ function from the newest wateRmelon Bioconductor package (https://github.com/schalkwyk/wateRmelon). SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-021-07675-2). BioMed Central 2021-06-28 /pmc/articles/PMC8240370/ /pubmed/34182928 http://dx.doi.org/10.1186/s12864-021-07675-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Wang, Yucheng
Hannon, Eilis
Grant, Olivia A.
Gorrie-Stone, Tyler J.
Kumari, Meena
Mill, Jonathan
Zhai, Xiaojun
McDonald-Maier, Klaus D.
Schalkwyk, Leonard C.
DNA methylation-based sex classifier to predict sex and identify sex chromosome aneuploidy
title DNA methylation-based sex classifier to predict sex and identify sex chromosome aneuploidy
title_full DNA methylation-based sex classifier to predict sex and identify sex chromosome aneuploidy
title_fullStr DNA methylation-based sex classifier to predict sex and identify sex chromosome aneuploidy
title_full_unstemmed DNA methylation-based sex classifier to predict sex and identify sex chromosome aneuploidy
title_short DNA methylation-based sex classifier to predict sex and identify sex chromosome aneuploidy
title_sort dna methylation-based sex classifier to predict sex and identify sex chromosome aneuploidy
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8240370/
https://www.ncbi.nlm.nih.gov/pubmed/34182928
http://dx.doi.org/10.1186/s12864-021-07675-2
work_keys_str_mv AT wangyucheng dnamethylationbasedsexclassifiertopredictsexandidentifysexchromosomeaneuploidy
AT hannoneilis dnamethylationbasedsexclassifiertopredictsexandidentifysexchromosomeaneuploidy
AT grantoliviaa dnamethylationbasedsexclassifiertopredictsexandidentifysexchromosomeaneuploidy
AT gorriestonetylerj dnamethylationbasedsexclassifiertopredictsexandidentifysexchromosomeaneuploidy
AT kumarimeena dnamethylationbasedsexclassifiertopredictsexandidentifysexchromosomeaneuploidy
AT milljonathan dnamethylationbasedsexclassifiertopredictsexandidentifysexchromosomeaneuploidy
AT zhaixiaojun dnamethylationbasedsexclassifiertopredictsexandidentifysexchromosomeaneuploidy
AT mcdonaldmaierklausd dnamethylationbasedsexclassifiertopredictsexandidentifysexchromosomeaneuploidy
AT schalkwykleonardc dnamethylationbasedsexclassifiertopredictsexandidentifysexchromosomeaneuploidy