Cargando…
Who’s afraid of the X? Incorporating the X and Y chromosomes into the analysis of DNA methylation array data
BACKGROUND: Many human disease phenotypes manifest differently by sex, making the development of methods for incorporating X and Y-chromosome data into analyses vital. Unfortunately, X and Y chromosome data are frequently excluded from large-scale analyses of the human genome and epigenome due to an...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825011/ https://www.ncbi.nlm.nih.gov/pubmed/36609459 http://dx.doi.org/10.1186/s13072-022-00477-0 |
_version_ | 1784866548538671104 |
---|---|
author | Inkster, Amy M. Wong, Martin T. Matthews, Allison M. Brown, Carolyn J. Robinson, Wendy P. |
author_facet | Inkster, Amy M. Wong, Martin T. Matthews, Allison M. Brown, Carolyn J. Robinson, Wendy P. |
author_sort | Inkster, Amy M. |
collection | PubMed |
description | BACKGROUND: Many human disease phenotypes manifest differently by sex, making the development of methods for incorporating X and Y-chromosome data into analyses vital. Unfortunately, X and Y chromosome data are frequently excluded from large-scale analyses of the human genome and epigenome due to analytical complexity associated with sex chromosome dosage differences between XX and XY individuals, and the impact of X-chromosome inactivation (XCI) on the epigenome. As such, little attention has been given to considering the methods by which sex chromosome data may be included in analyses of DNA methylation (DNAme) array data. RESULTS: With Illumina Infinium HumanMethylation450 DNAme array data from 634 placental samples, we investigated the effects of probe filtering, normalization, and batch correction on DNAme data from the X and Y chromosomes. Processing steps were evaluated in both mixed-sex and sex-stratified subsets of the analysis cohort to identify whether including both sexes impacted processing results. We found that identification of probes that have a high detection p-value, or that are non-variable, should be performed in sex-stratified data subsets to avoid over- and under-estimation of the quantity of probes eligible for removal, respectively. All normalization techniques investigated returned X and Y DNAme data that were highly correlated with the raw data from the same samples. We found no difference in batch correction results after application to mixed-sex or sex-stratified cohorts. Additionally, we identify two analytical methods suitable for XY chromosome data, the choice between which should be guided by the research question of interest, and we performed a proof-of-concept analysis studying differential DNAme on the X and Y chromosome in the context of placental acute chorioamnionitis. Finally, we provide an annotation of probe types that may be desirable to filter in X and Y chromosome analyses, including probes in repetitive elements, the X-transposed region, and cancer-testis gene promoters. CONCLUSION: While there may be no single “best” approach for analyzing DNAme array data from the X and Y chromosome, analysts must consider key factors during processing and analysis of sex chromosome data to accommodate the underlying biology of these chromosomes, and the technical limitations of DNA methylation arrays. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13072-022-00477-0. |
format | Online Article Text |
id | pubmed-9825011 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-98250112023-01-08 Who’s afraid of the X? Incorporating the X and Y chromosomes into the analysis of DNA methylation array data Inkster, Amy M. Wong, Martin T. Matthews, Allison M. Brown, Carolyn J. Robinson, Wendy P. Epigenetics Chromatin Methodology BACKGROUND: Many human disease phenotypes manifest differently by sex, making the development of methods for incorporating X and Y-chromosome data into analyses vital. Unfortunately, X and Y chromosome data are frequently excluded from large-scale analyses of the human genome and epigenome due to analytical complexity associated with sex chromosome dosage differences between XX and XY individuals, and the impact of X-chromosome inactivation (XCI) on the epigenome. As such, little attention has been given to considering the methods by which sex chromosome data may be included in analyses of DNA methylation (DNAme) array data. RESULTS: With Illumina Infinium HumanMethylation450 DNAme array data from 634 placental samples, we investigated the effects of probe filtering, normalization, and batch correction on DNAme data from the X and Y chromosomes. Processing steps were evaluated in both mixed-sex and sex-stratified subsets of the analysis cohort to identify whether including both sexes impacted processing results. We found that identification of probes that have a high detection p-value, or that are non-variable, should be performed in sex-stratified data subsets to avoid over- and under-estimation of the quantity of probes eligible for removal, respectively. All normalization techniques investigated returned X and Y DNAme data that were highly correlated with the raw data from the same samples. We found no difference in batch correction results after application to mixed-sex or sex-stratified cohorts. Additionally, we identify two analytical methods suitable for XY chromosome data, the choice between which should be guided by the research question of interest, and we performed a proof-of-concept analysis studying differential DNAme on the X and Y chromosome in the context of placental acute chorioamnionitis. Finally, we provide an annotation of probe types that may be desirable to filter in X and Y chromosome analyses, including probes in repetitive elements, the X-transposed region, and cancer-testis gene promoters. CONCLUSION: While there may be no single “best” approach for analyzing DNAme array data from the X and Y chromosome, analysts must consider key factors during processing and analysis of sex chromosome data to accommodate the underlying biology of these chromosomes, and the technical limitations of DNA methylation arrays. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13072-022-00477-0. BioMed Central 2023-01-07 /pmc/articles/PMC9825011/ /pubmed/36609459 http://dx.doi.org/10.1186/s13072-022-00477-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Inkster, Amy M. Wong, Martin T. Matthews, Allison M. Brown, Carolyn J. Robinson, Wendy P. Who’s afraid of the X? Incorporating the X and Y chromosomes into the analysis of DNA methylation array data |
title | Who’s afraid of the X? Incorporating the X and Y chromosomes into the analysis of DNA methylation array data |
title_full | Who’s afraid of the X? Incorporating the X and Y chromosomes into the analysis of DNA methylation array data |
title_fullStr | Who’s afraid of the X? Incorporating the X and Y chromosomes into the analysis of DNA methylation array data |
title_full_unstemmed | Who’s afraid of the X? Incorporating the X and Y chromosomes into the analysis of DNA methylation array data |
title_short | Who’s afraid of the X? Incorporating the X and Y chromosomes into the analysis of DNA methylation array data |
title_sort | who’s afraid of the x? incorporating the x and y chromosomes into the analysis of dna methylation array data |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825011/ https://www.ncbi.nlm.nih.gov/pubmed/36609459 http://dx.doi.org/10.1186/s13072-022-00477-0 |
work_keys_str_mv | AT inksteramym whosafraidofthexincorporatingthexandychromosomesintotheanalysisofdnamethylationarraydata AT wongmartint whosafraidofthexincorporatingthexandychromosomesintotheanalysisofdnamethylationarraydata AT matthewsallisonm whosafraidofthexincorporatingthexandychromosomesintotheanalysisofdnamethylationarraydata AT browncarolynj whosafraidofthexincorporatingthexandychromosomesintotheanalysisofdnamethylationarraydata AT robinsonwendyp whosafraidofthexincorporatingthexandychromosomesintotheanalysisofdnamethylationarraydata |