Cargando…

A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations

Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geogra...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Chaolong, Zöllner, Sebastian, Rosenberg, Noah A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3426559/
https://www.ncbi.nlm.nih.gov/pubmed/22927824
http://dx.doi.org/10.1371/journal.pgen.1002886
_version_ 1782241529054625792
author Wang, Chaolong
Zöllner, Sebastian
Rosenberg, Noah A.
author_facet Wang, Chaolong
Zöllner, Sebastian
Rosenberg, Noah A.
author_sort Wang, Chaolong
collection PubMed
description Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure.
format Online
Article
Text
id pubmed-3426559
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34265592012-08-27 A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations Wang, Chaolong Zöllner, Sebastian Rosenberg, Noah A. PLoS Genet Research Article Multivariate statistical techniques such as principal components analysis (PCA) and multidimensional scaling (MDS) have been widely used to summarize the structure of human genetic variation, often in easily visualized two-dimensional maps. Many recent studies have reported similarity between geographic maps of population locations and MDS or PCA maps of genetic variation inferred from single-nucleotide polymorphisms (SNPs). However, this similarity has been evident primarily in a qualitative sense; and, because different multivariate techniques and marker sets have been used in different studies, it has not been possible to formally compare genetic variation datasets in terms of their levels of similarity with geography. In this study, using genome-wide SNP data from 128 populations worldwide, we perform a systematic analysis to quantitatively evaluate the similarity of genes and geography in different geographic regions. For each of a series of regions, we apply a Procrustes analysis approach to find an optimal transformation that maximizes the similarity between PCA maps of genetic variation and geographic maps of population locations. We consider examples in Europe, Sub-Saharan Africa, Asia, East Asia, and Central/South Asia, as well as in a worldwide sample, finding that significant similarity between genes and geography exists in general at different geographic levels. The similarity is highest in our examples for Asia and, once highly distinctive populations have been removed, Sub-Saharan Africa. Our results provide a quantitative assessment of the geographic structure of human genetic variation worldwide, supporting the view that geography plays a strong role in giving rise to human population structure. Public Library of Science 2012-08-23 /pmc/articles/PMC3426559/ /pubmed/22927824 http://dx.doi.org/10.1371/journal.pgen.1002886 Text en © 2012 Wang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Wang, Chaolong
Zöllner, Sebastian
Rosenberg, Noah A.
A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations
title A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations
title_full A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations
title_fullStr A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations
title_full_unstemmed A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations
title_short A Quantitative Comparison of the Similarity between Genes and Geography in Worldwide Human Populations
title_sort quantitative comparison of the similarity between genes and geography in worldwide human populations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3426559/
https://www.ncbi.nlm.nih.gov/pubmed/22927824
http://dx.doi.org/10.1371/journal.pgen.1002886
work_keys_str_mv AT wangchaolong aquantitativecomparisonofthesimilaritybetweengenesandgeographyinworldwidehumanpopulations
AT zollnersebastian aquantitativecomparisonofthesimilaritybetweengenesandgeographyinworldwidehumanpopulations
AT rosenbergnoaha aquantitativecomparisonofthesimilaritybetweengenesandgeographyinworldwidehumanpopulations
AT wangchaolong quantitativecomparisonofthesimilaritybetweengenesandgeographyinworldwidehumanpopulations
AT zollnersebastian quantitativecomparisonofthesimilaritybetweengenesandgeographyinworldwidehumanpopulations
AT rosenbergnoaha quantitativecomparisonofthesimilaritybetweengenesandgeographyinworldwidehumanpopulations