Cargando…

Graphic analysis of population structure on genome-wide rheumatoid arthritis data

Principal-component analysis (PCA) has been used for decades to summarize the human genetic variation across geographic regions and to infer population migration history. Reduction of spurious associations due to population structure is crucial for the success of disease association studies. Recentl...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Jun, Weng, Chunhua, Niyogi, Partha
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795882/
https://www.ncbi.nlm.nih.gov/pubmed/20017975
_version_ 1782175461381505024
author Zhang, Jun
Weng, Chunhua
Niyogi, Partha
author_facet Zhang, Jun
Weng, Chunhua
Niyogi, Partha
author_sort Zhang, Jun
collection PubMed
description Principal-component analysis (PCA) has been used for decades to summarize the human genetic variation across geographic regions and to infer population migration history. Reduction of spurious associations due to population structure is crucial for the success of disease association studies. Recently, PCA has also become a popular method for detecting population structure and correction of population stratification in disease association studies. Inspired by manifold learning, we propose a novel method based on spectral graph theory. Regarding each study subject as a node with suitably defined weights for its edges to close neighbors, one can form a weighted graph. We suggest using the spectrum of the associated graph Laplacian operator, namely, Laplacian eigenfunctions, to infer population structures instead of principal components (PCs). For the whole genome-wide association data for the North American Rheumatoid Arthritis Consortium (NARAC) provided by Genetic Workshop Analysis 16, Laplacian eigenfunctions revealed more meaningful structures of the underlying population than PCA. The proposed method has connection to PCA, and it naturally includes PCA as a special case. Our simple method is computationally fast and is suitable for disease studies at the genome-wide scale.
format Text
id pubmed-2795882
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27958822009-12-18 Graphic analysis of population structure on genome-wide rheumatoid arthritis data Zhang, Jun Weng, Chunhua Niyogi, Partha BMC Proc Proceedings Principal-component analysis (PCA) has been used for decades to summarize the human genetic variation across geographic regions and to infer population migration history. Reduction of spurious associations due to population structure is crucial for the success of disease association studies. Recently, PCA has also become a popular method for detecting population structure and correction of population stratification in disease association studies. Inspired by manifold learning, we propose a novel method based on spectral graph theory. Regarding each study subject as a node with suitably defined weights for its edges to close neighbors, one can form a weighted graph. We suggest using the spectrum of the associated graph Laplacian operator, namely, Laplacian eigenfunctions, to infer population structures instead of principal components (PCs). For the whole genome-wide association data for the North American Rheumatoid Arthritis Consortium (NARAC) provided by Genetic Workshop Analysis 16, Laplacian eigenfunctions revealed more meaningful structures of the underlying population than PCA. The proposed method has connection to PCA, and it naturally includes PCA as a special case. Our simple method is computationally fast and is suitable for disease studies at the genome-wide scale. BioMed Central 2009-12-15 /pmc/articles/PMC2795882/ /pubmed/20017975 Text en Copyright ©2009 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Zhang, Jun
Weng, Chunhua
Niyogi, Partha
Graphic analysis of population structure on genome-wide rheumatoid arthritis data
title Graphic analysis of population structure on genome-wide rheumatoid arthritis data
title_full Graphic analysis of population structure on genome-wide rheumatoid arthritis data
title_fullStr Graphic analysis of population structure on genome-wide rheumatoid arthritis data
title_full_unstemmed Graphic analysis of population structure on genome-wide rheumatoid arthritis data
title_short Graphic analysis of population structure on genome-wide rheumatoid arthritis data
title_sort graphic analysis of population structure on genome-wide rheumatoid arthritis data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2795882/
https://www.ncbi.nlm.nih.gov/pubmed/20017975
work_keys_str_mv AT zhangjun graphicanalysisofpopulationstructureongenomewiderheumatoidarthritisdata
AT wengchunhua graphicanalysisofpopulationstructureongenomewiderheumatoidarthritisdata
AT niyogipartha graphicanalysisofpopulationstructureongenomewiderheumatoidarthritisdata