Cargando…

AWclust: point-and-click software for non-parametric population structure analysis

BACKGROUND: Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in sample population individuals. However, the as...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Xiaoyi, Starmer, Joshua D
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2253519/
https://www.ncbi.nlm.nih.gov/pubmed/18237431
http://dx.doi.org/10.1186/1471-2105-9-77
_version_ 1782151111984021504
author Gao, Xiaoyi
Starmer, Joshua D
author_facet Gao, Xiaoyi
Starmer, Joshua D
author_sort Gao, Xiaoyi
collection PubMed
description BACKGROUND: Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in sample population individuals. However, the assumptions may not hold and allele frequency estimation may not be accurate in some data sets. The improved version of STRUCTURE (version 2.1) can incorporate linkage information among loci but is still sensitive to high background linkage disequilibrium. Nowadays, large-scale single nucleotide polymorphisms (SNPs) are becoming popular in genetic studies. Therefore, it is imperative to have software that makes full use of these genetic data to generate inference even when model assumptions do not hold or allele frequency estimation suffers from high variation. RESULTS: We have developed point-and-click software for non-parametric population structure analysis distributed as an R package. The software takes advantage of the large number of SNPs available to categorize individuals into ethnically similar clusters and it does not require assumptions about population models. Nor does it estimate allele frequencies. Moreover, this software can also infer the optimal number of populations. CONCLUSION: Our software tool employs non-parametric approaches to assign individuals to clusters using SNPs. It provides efficient computation and an intuitive way for researchers to explore ethnic relationships among individuals. It can be complementary to parametric approaches in population structure analysis.
format Text
id pubmed-2253519
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22535192008-02-23 AWclust: point-and-click software for non-parametric population structure analysis Gao, Xiaoyi Starmer, Joshua D BMC Bioinformatics Software BACKGROUND: Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in sample population individuals. However, the assumptions may not hold and allele frequency estimation may not be accurate in some data sets. The improved version of STRUCTURE (version 2.1) can incorporate linkage information among loci but is still sensitive to high background linkage disequilibrium. Nowadays, large-scale single nucleotide polymorphisms (SNPs) are becoming popular in genetic studies. Therefore, it is imperative to have software that makes full use of these genetic data to generate inference even when model assumptions do not hold or allele frequency estimation suffers from high variation. RESULTS: We have developed point-and-click software for non-parametric population structure analysis distributed as an R package. The software takes advantage of the large number of SNPs available to categorize individuals into ethnically similar clusters and it does not require assumptions about population models. Nor does it estimate allele frequencies. Moreover, this software can also infer the optimal number of populations. CONCLUSION: Our software tool employs non-parametric approaches to assign individuals to clusters using SNPs. It provides efficient computation and an intuitive way for researchers to explore ethnic relationships among individuals. It can be complementary to parametric approaches in population structure analysis. BioMed Central 2008-01-31 /pmc/articles/PMC2253519/ /pubmed/18237431 http://dx.doi.org/10.1186/1471-2105-9-77 Text en Copyright © 2008 Gao and Starmer; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Gao, Xiaoyi
Starmer, Joshua D
AWclust: point-and-click software for non-parametric population structure analysis
title AWclust: point-and-click software for non-parametric population structure analysis
title_full AWclust: point-and-click software for non-parametric population structure analysis
title_fullStr AWclust: point-and-click software for non-parametric population structure analysis
title_full_unstemmed AWclust: point-and-click software for non-parametric population structure analysis
title_short AWclust: point-and-click software for non-parametric population structure analysis
title_sort awclust: point-and-click software for non-parametric population structure analysis
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2253519/
https://www.ncbi.nlm.nih.gov/pubmed/18237431
http://dx.doi.org/10.1186/1471-2105-9-77
work_keys_str_mv AT gaoxiaoyi awclustpointandclicksoftwarefornonparametricpopulationstructureanalysis
AT starmerjoshuad awclustpointandclicksoftwarefornonparametricpopulationstructureanalysis