Cargando…
Accurate ethnicity prediction from placental DNA methylation data
BACKGROUND: The influence of genetics on variation in DNA methylation (DNAme) is well documented. Yet confounding from population stratification is often unaccounted for in DNAme association studies. Existing approaches to address confounding by population stratification using DNAme data may not gen...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6688210/ https://www.ncbi.nlm.nih.gov/pubmed/31399127 http://dx.doi.org/10.1186/s13072-019-0296-3 |
_version_ | 1783442838591111168 |
---|---|
author | Yuan, Victor Price, E. Magda Del Gobbo, Giulia Mostafavi, Sara Cox, Brian Binder, Alexandra M. Michels, Karin B. Marsit, Carmen Robinson, Wendy P. |
author_facet | Yuan, Victor Price, E. Magda Del Gobbo, Giulia Mostafavi, Sara Cox, Brian Binder, Alexandra M. Michels, Karin B. Marsit, Carmen Robinson, Wendy P. |
author_sort | Yuan, Victor |
collection | PubMed |
description | BACKGROUND: The influence of genetics on variation in DNA methylation (DNAme) is well documented. Yet confounding from population stratification is often unaccounted for in DNAme association studies. Existing approaches to address confounding by population stratification using DNAme data may not generalize to populations or tissues outside those in which they were developed. To aid future placental DNAme studies in assessing population stratification, we developed an ethnicity classifier, PlaNET (Placental DNAme Elastic Net Ethnicity Tool), using five cohorts with Infinium Human Methylation 450k BeadChip array (HM450k) data from placental samples that is also compatible with the newer EPIC platform. RESULTS: Data from 509 placental samples were used to develop PlaNET and show that it accurately predicts (accuracy = 0.938, kappa = 0.823) major classes of self-reported ethnicity/race (African: n = 58, Asian: n = 53, Caucasian: n = 389), and produces ethnicity probabilities that are highly correlated with genetic ancestry inferred from genome-wide SNP arrays (> 2.5 million SNP) and ancestry informative markers (n = 50 SNPs). PlaNET’s ethnicity classification relies on 1860 HM450K microarray sites, and over half of these were linked to nearby genetic polymorphisms (n = 955). Our placental-optimized method outperforms existing approaches in assessing population stratification in placental samples from individuals of Asian, African, and Caucasian ethnicities. CONCLUSION: PlaNET provides an improved approach to address population stratification in placental DNAme association studies. The method can be applied to predict ethnicity as a discrete or continuous variable and will be especially useful when self-reported ethnicity information is missing and genotyping markers are unavailable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13072-019-0296-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6688210 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-66882102019-08-14 Accurate ethnicity prediction from placental DNA methylation data Yuan, Victor Price, E. Magda Del Gobbo, Giulia Mostafavi, Sara Cox, Brian Binder, Alexandra M. Michels, Karin B. Marsit, Carmen Robinson, Wendy P. Epigenetics Chromatin Methodology BACKGROUND: The influence of genetics on variation in DNA methylation (DNAme) is well documented. Yet confounding from population stratification is often unaccounted for in DNAme association studies. Existing approaches to address confounding by population stratification using DNAme data may not generalize to populations or tissues outside those in which they were developed. To aid future placental DNAme studies in assessing population stratification, we developed an ethnicity classifier, PlaNET (Placental DNAme Elastic Net Ethnicity Tool), using five cohorts with Infinium Human Methylation 450k BeadChip array (HM450k) data from placental samples that is also compatible with the newer EPIC platform. RESULTS: Data from 509 placental samples were used to develop PlaNET and show that it accurately predicts (accuracy = 0.938, kappa = 0.823) major classes of self-reported ethnicity/race (African: n = 58, Asian: n = 53, Caucasian: n = 389), and produces ethnicity probabilities that are highly correlated with genetic ancestry inferred from genome-wide SNP arrays (> 2.5 million SNP) and ancestry informative markers (n = 50 SNPs). PlaNET’s ethnicity classification relies on 1860 HM450K microarray sites, and over half of these were linked to nearby genetic polymorphisms (n = 955). Our placental-optimized method outperforms existing approaches in assessing population stratification in placental samples from individuals of Asian, African, and Caucasian ethnicities. CONCLUSION: PlaNET provides an improved approach to address population stratification in placental DNAme association studies. The method can be applied to predict ethnicity as a discrete or continuous variable and will be especially useful when self-reported ethnicity information is missing and genotyping markers are unavailable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13072-019-0296-3) contains supplementary material, which is available to authorized users. BioMed Central 2019-08-09 /pmc/articles/PMC6688210/ /pubmed/31399127 http://dx.doi.org/10.1186/s13072-019-0296-3 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Yuan, Victor Price, E. Magda Del Gobbo, Giulia Mostafavi, Sara Cox, Brian Binder, Alexandra M. Michels, Karin B. Marsit, Carmen Robinson, Wendy P. Accurate ethnicity prediction from placental DNA methylation data |
title | Accurate ethnicity prediction from placental DNA methylation data |
title_full | Accurate ethnicity prediction from placental DNA methylation data |
title_fullStr | Accurate ethnicity prediction from placental DNA methylation data |
title_full_unstemmed | Accurate ethnicity prediction from placental DNA methylation data |
title_short | Accurate ethnicity prediction from placental DNA methylation data |
title_sort | accurate ethnicity prediction from placental dna methylation data |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6688210/ https://www.ncbi.nlm.nih.gov/pubmed/31399127 http://dx.doi.org/10.1186/s13072-019-0296-3 |
work_keys_str_mv | AT yuanvictor accurateethnicitypredictionfromplacentaldnamethylationdata AT priceemagda accurateethnicitypredictionfromplacentaldnamethylationdata AT delgobbogiulia accurateethnicitypredictionfromplacentaldnamethylationdata AT mostafavisara accurateethnicitypredictionfromplacentaldnamethylationdata AT coxbrian accurateethnicitypredictionfromplacentaldnamethylationdata AT binderalexandram accurateethnicitypredictionfromplacentaldnamethylationdata AT michelskarinb accurateethnicitypredictionfromplacentaldnamethylationdata AT marsitcarmen accurateethnicitypredictionfromplacentaldnamethylationdata AT robinsonwendyp accurateethnicitypredictionfromplacentaldnamethylationdata |