Cargando…

Laplacian Eigenfunctions Learn Population Structure

Principal components analysis has been used for decades to summarize genetic variation across geographic regions and to infer population migration history. More recently, with the advent of genome-wide association studies of complex traits, it has become a commonly-used tool for detection and correc...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Jun, Niyogi, Partha, McPeek, Mary Sara
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2779848/
https://www.ncbi.nlm.nih.gov/pubmed/19956572
http://dx.doi.org/10.1371/journal.pone.0007928
_version_ 1782174438639271936
author Zhang, Jun
Niyogi, Partha
McPeek, Mary Sara
author_facet Zhang, Jun
Niyogi, Partha
McPeek, Mary Sara
author_sort Zhang, Jun
collection PubMed
description Principal components analysis has been used for decades to summarize genetic variation across geographic regions and to infer population migration history. More recently, with the advent of genome-wide association studies of complex traits, it has become a commonly-used tool for detection and correction of confounding due to population structure. However, principal components are generally sensitive to outliers. Recently there has also been concern about its interpretation. Motivated from geometric learning, we describe a method based on spectral graph theory. Regarding each study subject as a node with suitably defined weights for its edges to close neighbors, one can form a weighted graph. We suggest using the spectrum of the associated graph Laplacian operator, namely, Laplacian eigenfunctions, to infer population structure. In simulations and real data on a ring species of birds, Laplacian eigenfunctions reveal more meaningful and less noisy structure of the underlying population, compared with principal components. The proposed approach is simple and computationally fast. It is expected to become a promising and basic method for population genetics and disease association studies.
format Text
id pubmed-2779848
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-27798482009-12-03 Laplacian Eigenfunctions Learn Population Structure Zhang, Jun Niyogi, Partha McPeek, Mary Sara PLoS One Research Article Principal components analysis has been used for decades to summarize genetic variation across geographic regions and to infer population migration history. More recently, with the advent of genome-wide association studies of complex traits, it has become a commonly-used tool for detection and correction of confounding due to population structure. However, principal components are generally sensitive to outliers. Recently there has also been concern about its interpretation. Motivated from geometric learning, we describe a method based on spectral graph theory. Regarding each study subject as a node with suitably defined weights for its edges to close neighbors, one can form a weighted graph. We suggest using the spectrum of the associated graph Laplacian operator, namely, Laplacian eigenfunctions, to infer population structure. In simulations and real data on a ring species of birds, Laplacian eigenfunctions reveal more meaningful and less noisy structure of the underlying population, compared with principal components. The proposed approach is simple and computationally fast. It is expected to become a promising and basic method for population genetics and disease association studies. Public Library of Science 2009-12-01 /pmc/articles/PMC2779848/ /pubmed/19956572 http://dx.doi.org/10.1371/journal.pone.0007928 Text en Zhang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhang, Jun
Niyogi, Partha
McPeek, Mary Sara
Laplacian Eigenfunctions Learn Population Structure
title Laplacian Eigenfunctions Learn Population Structure
title_full Laplacian Eigenfunctions Learn Population Structure
title_fullStr Laplacian Eigenfunctions Learn Population Structure
title_full_unstemmed Laplacian Eigenfunctions Learn Population Structure
title_short Laplacian Eigenfunctions Learn Population Structure
title_sort laplacian eigenfunctions learn population structure
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2779848/
https://www.ncbi.nlm.nih.gov/pubmed/19956572
http://dx.doi.org/10.1371/journal.pone.0007928
work_keys_str_mv AT zhangjun laplacianeigenfunctionslearnpopulationstructure
AT niyogipartha laplacianeigenfunctionslearnpopulationstructure
AT mcpeekmarysara laplacianeigenfunctionslearnpopulationstructure