Cargando…

Genetic Ancestry Estimates within Dutch Family Units and Across Genotyping Arrays: Insights from Empirical Analysis Using Two Estimation Methods

Accurate inference of genetic ancestry is crucial for population-based association studies, accounting for population heterogeneity and structure. This study analyzes genome-wide SNP data from the Netherlands Twin Register to compare genetic ancestry estimates. The focus is on the comparison of ance...

Descripción completa

Detalles Bibliográficos
Autores principales: Beck, Jeffrey J., Ahmed, Talitha, Finnicum, Casey T., Zwinderman, Koos, Ehli, Erik A., Boomsma, Dorret I., Hottenga, Jouke Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10379078/
https://www.ncbi.nlm.nih.gov/pubmed/37510400
http://dx.doi.org/10.3390/genes14071497
_version_ 1785079924265058304
author Beck, Jeffrey J.
Ahmed, Talitha
Finnicum, Casey T.
Zwinderman, Koos
Ehli, Erik A.
Boomsma, Dorret I.
Hottenga, Jouke Jan
author_facet Beck, Jeffrey J.
Ahmed, Talitha
Finnicum, Casey T.
Zwinderman, Koos
Ehli, Erik A.
Boomsma, Dorret I.
Hottenga, Jouke Jan
author_sort Beck, Jeffrey J.
collection PubMed
description Accurate inference of genetic ancestry is crucial for population-based association studies, accounting for population heterogeneity and structure. This study analyzes genome-wide SNP data from the Netherlands Twin Register to compare genetic ancestry estimates. The focus is on the comparison of ancestry estimates between family members and individuals genotyped on multiple arrays (Affymetrix 6.0, Affymetrix Axiom, and Illumina GSA). Two conventional methods, principal component analysis and ADMIXTURE, were implemented to estimate ancestry, each serving its specific purpose, rather than for direct comparison. The results reveal that as the degree of genetic relatedness decreases, the Euclidean distances of genetic ancestry estimates between family members significantly increase (empirical p < 0.001), regardless of the estimation method and genotyping array. Ancestry estimates among individuals genotyped on multiple arrays also show statistically significant differences (empirical p < 0.001). Additionally, this study investigates the relationship between the ancestry estimates of non-identical twin offspring with ancestrally diverse parents and those with ancestrally similar parents. The results indicate a statistically significant weak correlation between the variation in ancestry estimates among offspring and differences in ancestry estimates among parents (Spearman’s rho: 0.07, p = 0.005). This study highlights the utility of current methods in inferring genetic ancestry, emphasizing the importance of reference population composition in determining ancestry estimates.
format Online
Article
Text
id pubmed-10379078
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103790782023-07-29 Genetic Ancestry Estimates within Dutch Family Units and Across Genotyping Arrays: Insights from Empirical Analysis Using Two Estimation Methods Beck, Jeffrey J. Ahmed, Talitha Finnicum, Casey T. Zwinderman, Koos Ehli, Erik A. Boomsma, Dorret I. Hottenga, Jouke Jan Genes (Basel) Article Accurate inference of genetic ancestry is crucial for population-based association studies, accounting for population heterogeneity and structure. This study analyzes genome-wide SNP data from the Netherlands Twin Register to compare genetic ancestry estimates. The focus is on the comparison of ancestry estimates between family members and individuals genotyped on multiple arrays (Affymetrix 6.0, Affymetrix Axiom, and Illumina GSA). Two conventional methods, principal component analysis and ADMIXTURE, were implemented to estimate ancestry, each serving its specific purpose, rather than for direct comparison. The results reveal that as the degree of genetic relatedness decreases, the Euclidean distances of genetic ancestry estimates between family members significantly increase (empirical p < 0.001), regardless of the estimation method and genotyping array. Ancestry estimates among individuals genotyped on multiple arrays also show statistically significant differences (empirical p < 0.001). Additionally, this study investigates the relationship between the ancestry estimates of non-identical twin offspring with ancestrally diverse parents and those with ancestrally similar parents. The results indicate a statistically significant weak correlation between the variation in ancestry estimates among offspring and differences in ancestry estimates among parents (Spearman’s rho: 0.07, p = 0.005). This study highlights the utility of current methods in inferring genetic ancestry, emphasizing the importance of reference population composition in determining ancestry estimates. MDPI 2023-07-22 /pmc/articles/PMC10379078/ /pubmed/37510400 http://dx.doi.org/10.3390/genes14071497 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Beck, Jeffrey J.
Ahmed, Talitha
Finnicum, Casey T.
Zwinderman, Koos
Ehli, Erik A.
Boomsma, Dorret I.
Hottenga, Jouke Jan
Genetic Ancestry Estimates within Dutch Family Units and Across Genotyping Arrays: Insights from Empirical Analysis Using Two Estimation Methods
title Genetic Ancestry Estimates within Dutch Family Units and Across Genotyping Arrays: Insights from Empirical Analysis Using Two Estimation Methods
title_full Genetic Ancestry Estimates within Dutch Family Units and Across Genotyping Arrays: Insights from Empirical Analysis Using Two Estimation Methods
title_fullStr Genetic Ancestry Estimates within Dutch Family Units and Across Genotyping Arrays: Insights from Empirical Analysis Using Two Estimation Methods
title_full_unstemmed Genetic Ancestry Estimates within Dutch Family Units and Across Genotyping Arrays: Insights from Empirical Analysis Using Two Estimation Methods
title_short Genetic Ancestry Estimates within Dutch Family Units and Across Genotyping Arrays: Insights from Empirical Analysis Using Two Estimation Methods
title_sort genetic ancestry estimates within dutch family units and across genotyping arrays: insights from empirical analysis using two estimation methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10379078/
https://www.ncbi.nlm.nih.gov/pubmed/37510400
http://dx.doi.org/10.3390/genes14071497
work_keys_str_mv AT beckjeffreyj geneticancestryestimateswithindutchfamilyunitsandacrossgenotypingarraysinsightsfromempiricalanalysisusingtwoestimationmethods
AT ahmedtalitha geneticancestryestimateswithindutchfamilyunitsandacrossgenotypingarraysinsightsfromempiricalanalysisusingtwoestimationmethods
AT finnicumcaseyt geneticancestryestimateswithindutchfamilyunitsandacrossgenotypingarraysinsightsfromempiricalanalysisusingtwoestimationmethods
AT zwindermankoos geneticancestryestimateswithindutchfamilyunitsandacrossgenotypingarraysinsightsfromempiricalanalysisusingtwoestimationmethods
AT ehlierika geneticancestryestimateswithindutchfamilyunitsandacrossgenotypingarraysinsightsfromempiricalanalysisusingtwoestimationmethods
AT boomsmadorreti geneticancestryestimateswithindutchfamilyunitsandacrossgenotypingarraysinsightsfromempiricalanalysisusingtwoestimationmethods
AT hottengajoukejan geneticancestryestimateswithindutchfamilyunitsandacrossgenotypingarraysinsightsfromempiricalanalysisusingtwoestimationmethods