Cargando…

A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST)

BACKGROUND: Large-scale genome-wide association studies are promising for unraveling the genetic basis of complex diseases. Population structure is a potential problem, the effects of which on genetic association studies are controversial. The first step to systematically quantify the effects of pop...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Hongyan, Sarkar, Bayazid, George, Varghese
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2652468/
https://www.ncbi.nlm.nih.gov/pubmed/19284702
http://dx.doi.org/10.1186/1756-0500-2-21
_version_ 1782165245275406336
author Xu, Hongyan
Sarkar, Bayazid
George, Varghese
author_facet Xu, Hongyan
Sarkar, Bayazid
George, Varghese
author_sort Xu, Hongyan
collection PubMed
description BACKGROUND: Large-scale genome-wide association studies are promising for unraveling the genetic basis of complex diseases. Population structure is a potential problem, the effects of which on genetic association studies are controversial. The first step to systematically quantify the effects of population structure is to choose an appropriate measure of population structure for human data. The commonly used measure is Wright's F(ST). For a set of subpopulations it is generally assumed to be one value of F(ST). However, the estimates could be different for distinct loci. Since population structure is a concept at the population level, a measure of population structure that utilized the information across loci would be desirable. FINDINGS: In this study we propose an adjusted C parameter according to the sample size from each sub-population. The new measure C is based on the c parameter proposed for SNP data, which was assumed to be subpopulation-specific and common for all loci. In this study, we performed extensive simulations of samples with varying levels of population structure to investigate the properties and relationships of both measures. It is found that the two measures generally agree well. CONCLUSION: The new measure simultaneously uses the marker information across the genome. It has the advantage of easy interpretation as one measure of population structure and yet can also assess population differentiation.
format Text
id pubmed-2652468
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26524682009-03-09 A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST) Xu, Hongyan Sarkar, Bayazid George, Varghese BMC Res Notes Short Report BACKGROUND: Large-scale genome-wide association studies are promising for unraveling the genetic basis of complex diseases. Population structure is a potential problem, the effects of which on genetic association studies are controversial. The first step to systematically quantify the effects of population structure is to choose an appropriate measure of population structure for human data. The commonly used measure is Wright's F(ST). For a set of subpopulations it is generally assumed to be one value of F(ST). However, the estimates could be different for distinct loci. Since population structure is a concept at the population level, a measure of population structure that utilized the information across loci would be desirable. FINDINGS: In this study we propose an adjusted C parameter according to the sample size from each sub-population. The new measure C is based on the c parameter proposed for SNP data, which was assumed to be subpopulation-specific and common for all loci. In this study, we performed extensive simulations of samples with varying levels of population structure to investigate the properties and relationships of both measures. It is found that the two measures generally agree well. CONCLUSION: The new measure simultaneously uses the marker information across the genome. It has the advantage of easy interpretation as one measure of population structure and yet can also assess population differentiation. BioMed Central 2009-02-06 /pmc/articles/PMC2652468/ /pubmed/19284702 http://dx.doi.org/10.1186/1756-0500-2-21 Text en Copyright © 2009 Xu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Short Report
Xu, Hongyan
Sarkar, Bayazid
George, Varghese
A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST)
title A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST)
title_full A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST)
title_fullStr A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST)
title_full_unstemmed A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST)
title_short A new measure of population structure using multiple single nucleotide polymorphisms and its relationship with F(ST)
title_sort new measure of population structure using multiple single nucleotide polymorphisms and its relationship with f(st)
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2652468/
https://www.ncbi.nlm.nih.gov/pubmed/19284702
http://dx.doi.org/10.1186/1756-0500-2-21
work_keys_str_mv AT xuhongyan anewmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst
AT sarkarbayazid anewmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst
AT georgevarghese anewmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst
AT xuhongyan newmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst
AT sarkarbayazid newmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst
AT georgevarghese newmeasureofpopulationstructureusingmultiplesinglenucleotidepolymorphismsanditsrelationshipwithfst