Enhancements to the ADMIXTURE algorithm for individual ancestry estimation

BACKGROUND: The estimation of individual ancestry from genetic data has become essential to applied population genetics and genetic epidemiology. Software programs for calculating ancestry estimates have become essential tools in the geneticist's analytic arsenal. RESULTS: Here we describe four...

Bibliographic Details
Main authors: Alexander, David H; Lange, Kenneth
Format: Online Article Text
Language: English
Published: BioMed Central 2011
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3146885/
https://www.ncbi.nlm.nih.gov/pubmed/21682921
http://dx.doi.org/10.1186/1471-2105-12-246
author Alexander, David H
Lange, Kenneth
collection PubMed
description BACKGROUND: The estimation of individual ancestry from genetic data has become essential to applied population genetics and genetic epidemiology. Software programs for calculating ancestry estimates have become essential tools in the geneticist's analytic arsenal. RESULTS: Here we describe four enhancements to ADMIXTURE, a high-performance tool for estimating individual ancestries and population allele frequencies from SNP (single nucleotide polymorphism) data. First, ADMIXTURE can be used to estimate the number of underlying populations through cross-validation. Second, individuals of known ancestry can be exploited in supervised learning to yield more precise ancestry estimates. Third, by penalizing small admixture coefficients for each individual, one can encourage model parsimony, often yielding more interpretable results for small datasets or datasets with large numbers of ancestral populations. Finally, by exploiting multiple processors, large datasets can be analyzed even more rapidly. CONCLUSIONS: The enhancements we have described make ADMIXTURE a more accurate, efficient, and versatile tool for ancestry estimation.
format Online
Article
Text
id pubmed-3146885
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3146885 2011-07-31 Enhancements to the ADMIXTURE algorithm for individual ancestry estimation Alexander, David H Lange, Kenneth BMC Bioinformatics Software BACKGROUND: The estimation of individual ancestry from genetic data has become essential to applied population genetics and genetic epidemiology. Software programs for calculating ancestry estimates have become essential tools in the geneticist's analytic arsenal. RESULTS: Here we describe four enhancements to ADMIXTURE, a high-performance tool for estimating individual ancestries and population allele frequencies from SNP (single nucleotide polymorphism) data. First, ADMIXTURE can be used to estimate the number of underlying populations through cross-validation. Second, individuals of known ancestry can be exploited in supervised learning to yield more precise ancestry estimates. Third, by penalizing small admixture coefficients for each individual, one can encourage model parsimony, often yielding more interpretable results for small datasets or datasets with large numbers of ancestral populations. Finally, by exploiting multiple processors, large datasets can be analyzed even more rapidly. CONCLUSIONS: The enhancements we have described make ADMIXTURE a more accurate, efficient, and versatile tool for ancestry estimation. BioMed Central 2011-06-18 /pmc/articles/PMC3146885/ /pubmed/21682921 http://dx.doi.org/10.1186/1471-2105-12-246 Text en Copyright © 2011 Alexander and Lange; licensee BioMed Central Ltd. https://creativecommons.org/licenses/by/2.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Enhancements to the ADMIXTURE algorithm for individual ancestry estimation
topic Software