Cargando…

Inference of kinship using spatial distributions of SNPs for genome-wide association studies

BACKGROUND: Genome-wide association studies (GWASs) are powerful in identifying genetic loci which cause complex traits of common diseases. However, it is well known that inappropriately accounting for pedigree or population structure leads to spurious associations. GWASs have often encountered incr...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Hyokyeong, Chen, Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4873983/
https://www.ncbi.nlm.nih.gov/pubmed/27206321
http://dx.doi.org/10.1186/s12864-016-2696-0
_version_ 1782432978545147904
author Lee, Hyokyeong
Chen, Liang
author_facet Lee, Hyokyeong
Chen, Liang
author_sort Lee, Hyokyeong
collection PubMed
description BACKGROUND: Genome-wide association studies (GWASs) are powerful in identifying genetic loci which cause complex traits of common diseases. However, it is well known that inappropriately accounting for pedigree or population structure leads to spurious associations. GWASs have often encountered increased type I error rates due to the correlated genotypes of cryptically related individuals or subgroups. Therefore, accurate pedigree information is crucial for successful GWASs. RESULTS: We propose a distance-based method KIND to estimate kinship coefficients among individuals. Our method utilizes the spatial distribution of SNPs in the genome that represents how far each minor-allele variant is located from its neighboring minor-allele variants. The SNP distribution of each individual was presented in a feature vector in Euclidean space, and then the kinship coefficient was inferred from the two vectors of each individual pair. We demonstrate that the distance information can measure the similarity of genetic variants of individuals accurately and efficiently. We applied our method to a synthetic data set and two real data sets (i.e. the HapMap phase III and the 1000 genomes data). We investigated the estimation accuracy of kinship coefficients not only within homogeneous populations but also for a population with extreme stratification. CONCLUSIONS: Our method KIND usually produces more accurate and more robust kinship coefficient estimates than existing methods especially for populations with extreme stratification. It can serve as an important and very efficient tool for GWASs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2696-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4873983
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48739832016-05-21 Inference of kinship using spatial distributions of SNPs for genome-wide association studies Lee, Hyokyeong Chen, Liang BMC Genomics Methodology Article BACKGROUND: Genome-wide association studies (GWASs) are powerful in identifying genetic loci which cause complex traits of common diseases. However, it is well known that inappropriately accounting for pedigree or population structure leads to spurious associations. GWASs have often encountered increased type I error rates due to the correlated genotypes of cryptically related individuals or subgroups. Therefore, accurate pedigree information is crucial for successful GWASs. RESULTS: We propose a distance-based method KIND to estimate kinship coefficients among individuals. Our method utilizes the spatial distribution of SNPs in the genome that represents how far each minor-allele variant is located from its neighboring minor-allele variants. The SNP distribution of each individual was presented in a feature vector in Euclidean space, and then the kinship coefficient was inferred from the two vectors of each individual pair. We demonstrate that the distance information can measure the similarity of genetic variants of individuals accurately and efficiently. We applied our method to a synthetic data set and two real data sets (i.e. the HapMap phase III and the 1000 genomes data). We investigated the estimation accuracy of kinship coefficients not only within homogeneous populations but also for a population with extreme stratification. CONCLUSIONS: Our method KIND usually produces more accurate and more robust kinship coefficient estimates than existing methods especially for populations with extreme stratification. It can serve as an important and very efficient tool for GWASs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2696-0) contains supplementary material, which is available to authorized users. BioMed Central 2016-05-20 /pmc/articles/PMC4873983/ /pubmed/27206321 http://dx.doi.org/10.1186/s12864-016-2696-0 Text en © Lee and Chen. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Lee, Hyokyeong
Chen, Liang
Inference of kinship using spatial distributions of SNPs for genome-wide association studies
title Inference of kinship using spatial distributions of SNPs for genome-wide association studies
title_full Inference of kinship using spatial distributions of SNPs for genome-wide association studies
title_fullStr Inference of kinship using spatial distributions of SNPs for genome-wide association studies
title_full_unstemmed Inference of kinship using spatial distributions of SNPs for genome-wide association studies
title_short Inference of kinship using spatial distributions of SNPs for genome-wide association studies
title_sort inference of kinship using spatial distributions of snps for genome-wide association studies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4873983/
https://www.ncbi.nlm.nih.gov/pubmed/27206321
http://dx.doi.org/10.1186/s12864-016-2696-0
work_keys_str_mv AT leehyokyeong inferenceofkinshipusingspatialdistributionsofsnpsforgenomewideassociationstudies
AT chenliang inferenceofkinshipusingspatialdistributionsofsnpsforgenomewideassociationstudies