Cargando…
A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST)
BACKGROUND: Large-scale genome-wide association studies are promising for unraveling the genetic basis of complex diseases. Population structure is a potential problem, the effects of which on genetic association studies are controversial. The first step to systematically quantify the effects of pop...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2652468/ https://www.ncbi.nlm.nih.gov/pubmed/19284702 http://dx.doi.org/10.1186/1756-0500-2-21 |
_version_ | 1782165245275406336 |
---|---|
author | Xu, Hongyan Sarkar, Bayazid George, Varghese |
author_facet | Xu, Hongyan Sarkar, Bayazid George, Varghese |
author_sort | Xu, Hongyan |
collection | PubMed |
description | BACKGROUND: Large-scale genome-wide association studies are promising for unraveling the genetic basis of complex diseases. Population structure is a potential problem, the effects of which on genetic association studies are controversial. The first step to systematically quantify the effects of population structure is to choose an appropriate measure of population structure for human data. The commonly used measure is Wright's F(ST). For a set of subpopulations it is generally assumed to be one value of F(ST). However, the estimates could be different for distinct loci. Since population structure is a concept at the population level, a measure of population structure that utilized the information across loci would be desirable. FINDINGS: In this study we propose an adjusted C parameter according to the sample size from each sub-population. The new measure C is based on the c parameter proposed for SNP data, which was assumed to be subpopulation-specific and common for all loci. In this study, we performed extensive simulations of samples with varying levels of population structure to investigate the properties and relationships of both measures. It is found that the two measures generally agree well. CONCLUSION: The new measure simultaneously uses the marker information across the genome. It has the advantage of easy interpretation as one measure of population structure and yet can also assess population differentiation. |
format | Text |
id | pubmed-2652468 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26524682009-03-09 A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST) Xu, Hongyan Sarkar, Bayazid George, Varghese BMC Res Notes Short Report BACKGROUND: Large-scale genome-wide association studies are promising for unraveling the genetic basis of complex diseases. Population structure is a potential problem, the effects of which on genetic association studies are controversial. The first step to systematically quantify the effects of population structure is to choose an appropriate measure of population structure for human data. The commonly used measure is Wright's F(ST). For a set of subpopulations it is generally assumed to be one value of F(ST). However, the estimates could be different for distinct loci. Since population structure is a concept at the population level, a measure of population structure that utilized the information across loci would be desirable. FINDINGS: In this study we propose an adjusted C parameter according to the sample size from each sub-population. The new measure C is based on the c parameter proposed for SNP data, which was assumed to be subpopulation-specific and common for all loci. In this study, we performed extensive simulations of samples with varying levels of population structure to investigate the properties and relationships of both measures. It is found that the two measures generally agree well. CONCLUSION: The new measure simultaneously uses the marker information across the genome. It has the advantage of easy interpretation as one measure of population structure and yet can also assess population differentiation. BioMed Central 2009-02-06 /pmc/articles/PMC2652468/ /pubmed/19284702 http://dx.doi.org/10.1186/1756-0500-2-21 Text en Copyright © 2009 Xu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Short Report Xu, Hongyan Sarkar, Bayazid George, Varghese A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST) |
title | A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST) |
title_full | A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST) |
title_fullStr | A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST) |
title_full_unstemmed | A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST) |
title_short | A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST) |
title_sort | new measure of population structure using multiple single nucleotide polymorphisms and its relationship with f(st) |
topic | Short Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2652468/ https://www.ncbi.nlm.nih.gov/pubmed/19284702 http://dx.doi.org/10.1186/1756-0500-2-21 |
work_keys_str_mv | AT xuhongyan anewmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst AT sarkarbayazid anewmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst AT georgevarghese anewmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst AT xuhongyan newmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst AT sarkarbayazid newmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst AT georgevarghese newmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst |