Cargando…

Accurate ethnicity prediction from placental DNA methylation data

BACKGROUND: The influence of genetics on variation in DNA methylation (DNAme) is well documented. Yet confounding from population stratification is often unaccounted for in DNAme association studies. Existing approaches to address confounding by population stratification using DNAme data may not gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Yuan, Victor, Price, E. Magda, Del Gobbo, Giulia, Mostafavi, Sara, Cox, Brian, Binder, Alexandra M., Michels, Karin B., Marsit, Carmen, Robinson, Wendy P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6688210/
https://www.ncbi.nlm.nih.gov/pubmed/31399127
http://dx.doi.org/10.1186/s13072-019-0296-3
_version_ 1783442838591111168
author Yuan, Victor
Price, E. Magda
Del Gobbo, Giulia
Mostafavi, Sara
Cox, Brian
Binder, Alexandra M.
Michels, Karin B.
Marsit, Carmen
Robinson, Wendy P.
author_facet Yuan, Victor
Price, E. Magda
Del Gobbo, Giulia
Mostafavi, Sara
Cox, Brian
Binder, Alexandra M.
Michels, Karin B.
Marsit, Carmen
Robinson, Wendy P.
author_sort Yuan, Victor
collection PubMed
description BACKGROUND: The influence of genetics on variation in DNA methylation (DNAme) is well documented. Yet confounding from population stratification is often unaccounted for in DNAme association studies. Existing approaches to address confounding by population stratification using DNAme data may not generalize to populations or tissues outside those in which they were developed. To aid future placental DNAme studies in assessing population stratification, we developed an ethnicity classifier, PlaNET (Placental DNAme Elastic Net Ethnicity Tool), using five cohorts with Infinium Human Methylation 450k BeadChip array (HM450k) data from placental samples that is also compatible with the newer EPIC platform. RESULTS: Data from 509 placental samples were used to develop PlaNET and show that it accurately predicts (accuracy = 0.938, kappa = 0.823) major classes of self-reported ethnicity/race (African: n = 58, Asian: n = 53, Caucasian: n = 389), and produces ethnicity probabilities that are highly correlated with genetic ancestry inferred from genome-wide SNP arrays (> 2.5 million SNP) and ancestry informative markers (n = 50 SNPs). PlaNET’s ethnicity classification relies on 1860 HM450K microarray sites, and over half of these were linked to nearby genetic polymorphisms (n = 955). Our placental-optimized method outperforms existing approaches in assessing population stratification in placental samples from individuals of Asian, African, and Caucasian ethnicities. CONCLUSION: PlaNET provides an improved approach to address population stratification in placental DNAme association studies. The method can be applied to predict ethnicity as a discrete or continuous variable and will be especially useful when self-reported ethnicity information is missing and genotyping markers are unavailable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13072-019-0296-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6688210
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66882102019-08-14 Accurate ethnicity prediction from placental DNA methylation data Yuan, Victor Price, E. Magda Del Gobbo, Giulia Mostafavi, Sara Cox, Brian Binder, Alexandra M. Michels, Karin B. Marsit, Carmen Robinson, Wendy P. Epigenetics Chromatin Methodology BACKGROUND: The influence of genetics on variation in DNA methylation (DNAme) is well documented. Yet confounding from population stratification is often unaccounted for in DNAme association studies. Existing approaches to address confounding by population stratification using DNAme data may not generalize to populations or tissues outside those in which they were developed. To aid future placental DNAme studies in assessing population stratification, we developed an ethnicity classifier, PlaNET (Placental DNAme Elastic Net Ethnicity Tool), using five cohorts with Infinium Human Methylation 450k BeadChip array (HM450k) data from placental samples that is also compatible with the newer EPIC platform. RESULTS: Data from 509 placental samples were used to develop PlaNET and show that it accurately predicts (accuracy = 0.938, kappa = 0.823) major classes of self-reported ethnicity/race (African: n = 58, Asian: n = 53, Caucasian: n = 389), and produces ethnicity probabilities that are highly correlated with genetic ancestry inferred from genome-wide SNP arrays (> 2.5 million SNP) and ancestry informative markers (n = 50 SNPs). PlaNET’s ethnicity classification relies on 1860 HM450K microarray sites, and over half of these were linked to nearby genetic polymorphisms (n = 955). Our placental-optimized method outperforms existing approaches in assessing population stratification in placental samples from individuals of Asian, African, and Caucasian ethnicities. CONCLUSION: PlaNET provides an improved approach to address population stratification in placental DNAme association studies. The method can be applied to predict ethnicity as a discrete or continuous variable and will be especially useful when self-reported ethnicity information is missing and genotyping markers are unavailable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13072-019-0296-3) contains supplementary material, which is available to authorized users. BioMed Central 2019-08-09 /pmc/articles/PMC6688210/ /pubmed/31399127 http://dx.doi.org/10.1186/s13072-019-0296-3 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Yuan, Victor
Price, E. Magda
Del Gobbo, Giulia
Mostafavi, Sara
Cox, Brian
Binder, Alexandra M.
Michels, Karin B.
Marsit, Carmen
Robinson, Wendy P.
Accurate ethnicity prediction from placental DNA methylation data
title Accurate ethnicity prediction from placental DNA methylation data
title_full Accurate ethnicity prediction from placental DNA methylation data
title_fullStr Accurate ethnicity prediction from placental DNA methylation data
title_full_unstemmed Accurate ethnicity prediction from placental DNA methylation data
title_short Accurate ethnicity prediction from placental DNA methylation data
title_sort accurate ethnicity prediction from placental dna methylation data
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6688210/
https://www.ncbi.nlm.nih.gov/pubmed/31399127
http://dx.doi.org/10.1186/s13072-019-0296-3
work_keys_str_mv AT yuanvictor accurateethnicitypredictionfromplacentaldnamethylationdata
AT priceemagda accurateethnicitypredictionfromplacentaldnamethylationdata
AT delgobbogiulia accurateethnicitypredictionfromplacentaldnamethylationdata
AT mostafavisara accurateethnicitypredictionfromplacentaldnamethylationdata
AT coxbrian accurateethnicitypredictionfromplacentaldnamethylationdata
AT binderalexandram accurateethnicitypredictionfromplacentaldnamethylationdata
AT michelskarinb accurateethnicitypredictionfromplacentaldnamethylationdata
AT marsitcarmen accurateethnicitypredictionfromplacentaldnamethylationdata
AT robinsonwendyp accurateethnicitypredictionfromplacentaldnamethylationdata