Cargando…
AWclust: point-and-click software for non-parametric population structure analysis
BACKGROUND: Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in sample population individuals. However, the as...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2253519/ https://www.ncbi.nlm.nih.gov/pubmed/18237431 http://dx.doi.org/10.1186/1471-2105-9-77 |
_version_ | 1782151111984021504 |
---|---|
author | Gao, Xiaoyi Starmer, Joshua D |
author_facet | Gao, Xiaoyi Starmer, Joshua D |
author_sort | Gao, Xiaoyi |
collection | PubMed |
description | BACKGROUND: Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in sample population individuals. However, the assumptions may not hold and allele frequency estimation may not be accurate in some data sets. The improved version of STRUCTURE (version 2.1) can incorporate linkage information among loci but is still sensitive to high background linkage disequilibrium. Nowadays, large-scale single nucleotide polymorphisms (SNPs) are becoming popular in genetic studies. Therefore, it is imperative to have software that makes full use of these genetic data to generate inference even when model assumptions do not hold or allele frequency estimation suffers from high variation. RESULTS: We have developed point-and-click software for non-parametric population structure analysis distributed as an R package. The software takes advantage of the large number of SNPs available to categorize individuals into ethnically similar clusters and it does not require assumptions about population models. Nor does it estimate allele frequencies. Moreover, this software can also infer the optimal number of populations. CONCLUSION: Our software tool employs non-parametric approaches to assign individuals to clusters using SNPs. It provides efficient computation and an intuitive way for researchers to explore ethnic relationships among individuals. It can be complementary to parametric approaches in population structure analysis. |
format | Text |
id | pubmed-2253519 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-22535192008-02-23 AWclust: point-and-click software for non-parametric population structure analysis Gao, Xiaoyi Starmer, Joshua D BMC Bioinformatics Software BACKGROUND: Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in sample population individuals. However, the assumptions may not hold and allele frequency estimation may not be accurate in some data sets. The improved version of STRUCTURE (version 2.1) can incorporate linkage information among loci but is still sensitive to high background linkage disequilibrium. Nowadays, large-scale single nucleotide polymorphisms (SNPs) are becoming popular in genetic studies. Therefore, it is imperative to have software that makes full use of these genetic data to generate inference even when model assumptions do not hold or allele frequency estimation suffers from high variation. RESULTS: We have developed point-and-click software for non-parametric population structure analysis distributed as an R package. The software takes advantage of the large number of SNPs available to categorize individuals into ethnically similar clusters and it does not require assumptions about population models. Nor does it estimate allele frequencies. Moreover, this software can also infer the optimal number of populations. CONCLUSION: Our software tool employs non-parametric approaches to assign individuals to clusters using SNPs. It provides efficient computation and an intuitive way for researchers to explore ethnic relationships among individuals. It can be complementary to parametric approaches in population structure analysis. BioMed Central 2008-01-31 /pmc/articles/PMC2253519/ /pubmed/18237431 http://dx.doi.org/10.1186/1471-2105-9-77 Text en Copyright © 2008 Gao and Starmer; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Gao, Xiaoyi Starmer, Joshua D AWclust: point-and-click software for non-parametric population structure analysis |
title | AWclust: point-and-click software for non-parametric population structure analysis |
title_full | AWclust: point-and-click software for non-parametric population structure analysis |
title_fullStr | AWclust: point-and-click software for non-parametric population structure analysis |
title_full_unstemmed | AWclust: point-and-click software for non-parametric population structure analysis |
title_short | AWclust: point-and-click software for non-parametric population structure analysis |
title_sort | awclust: point-and-click software for non-parametric population structure analysis |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2253519/ https://www.ncbi.nlm.nih.gov/pubmed/18237431 http://dx.doi.org/10.1186/1471-2105-9-77 |
work_keys_str_mv | AT gaoxiaoyi awclustpointandclicksoftwarefornonparametricpopulationstructureanalysis AT starmerjoshuad awclustpointandclicksoftwarefornonparametricpopulationstructureanalysis |