Cargando…
Inference of kinship using spatial distributions of SNPs for genome-wide association studies
BACKGROUND: Genome-wide association studies (GWASs) are powerful in identifying genetic loci which cause complex traits of common diseases. However, it is well known that inappropriately accounting for pedigree or population structure leads to spurious associations. GWASs have often encountered incr...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4873983/ https://www.ncbi.nlm.nih.gov/pubmed/27206321 http://dx.doi.org/10.1186/s12864-016-2696-0 |
_version_ | 1782432978545147904 |
---|---|
author | Lee, Hyokyeong Chen, Liang |
author_facet | Lee, Hyokyeong Chen, Liang |
author_sort | Lee, Hyokyeong |
collection | PubMed |
description | BACKGROUND: Genome-wide association studies (GWASs) are powerful in identifying genetic loci which cause complex traits of common diseases. However, it is well known that inappropriately accounting for pedigree or population structure leads to spurious associations. GWASs have often encountered increased type I error rates due to the correlated genotypes of cryptically related individuals or subgroups. Therefore, accurate pedigree information is crucial for successful GWASs. RESULTS: We propose a distance-based method KIND to estimate kinship coefficients among individuals. Our method utilizes the spatial distribution of SNPs in the genome that represents how far each minor-allele variant is located from its neighboring minor-allele variants. The SNP distribution of each individual was presented in a feature vector in Euclidean space, and then the kinship coefficient was inferred from the two vectors of each individual pair. We demonstrate that the distance information can measure the similarity of genetic variants of individuals accurately and efficiently. We applied our method to a synthetic data set and two real data sets (i.e. the HapMap phase III and the 1000 genomes data). We investigated the estimation accuracy of kinship coefficients not only within homogeneous populations but also for a population with extreme stratification. CONCLUSIONS: Our method KIND usually produces more accurate and more robust kinship coefficient estimates than existing methods especially for populations with extreme stratification. It can serve as an important and very efficient tool for GWASs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2696-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4873983 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48739832016-05-21 Inference of kinship using spatial distributions of SNPs for genome-wide association studies Lee, Hyokyeong Chen, Liang BMC Genomics Methodology Article BACKGROUND: Genome-wide association studies (GWASs) are powerful in identifying genetic loci which cause complex traits of common diseases. However, it is well known that inappropriately accounting for pedigree or population structure leads to spurious associations. GWASs have often encountered increased type I error rates due to the correlated genotypes of cryptically related individuals or subgroups. Therefore, accurate pedigree information is crucial for successful GWASs. RESULTS: We propose a distance-based method KIND to estimate kinship coefficients among individuals. Our method utilizes the spatial distribution of SNPs in the genome that represents how far each minor-allele variant is located from its neighboring minor-allele variants. The SNP distribution of each individual was presented in a feature vector in Euclidean space, and then the kinship coefficient was inferred from the two vectors of each individual pair. We demonstrate that the distance information can measure the similarity of genetic variants of individuals accurately and efficiently. We applied our method to a synthetic data set and two real data sets (i.e. the HapMap phase III and the 1000 genomes data). We investigated the estimation accuracy of kinship coefficients not only within homogeneous populations but also for a population with extreme stratification. CONCLUSIONS: Our method KIND usually produces more accurate and more robust kinship coefficient estimates than existing methods especially for populations with extreme stratification. It can serve as an important and very efficient tool for GWASs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2696-0) contains supplementary material, which is available to authorized users. BioMed Central 2016-05-20 /pmc/articles/PMC4873983/ /pubmed/27206321 http://dx.doi.org/10.1186/s12864-016-2696-0 Text en © Lee and Chen. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Lee, Hyokyeong Chen, Liang Inference of kinship using spatial distributions of SNPs for genome-wide association studies |
title | Inference of kinship using spatial distributions of SNPs for genome-wide association studies |
title_full | Inference of kinship using spatial distributions of SNPs for genome-wide association studies |
title_fullStr | Inference of kinship using spatial distributions of SNPs for genome-wide association studies |
title_full_unstemmed | Inference of kinship using spatial distributions of SNPs for genome-wide association studies |
title_short | Inference of kinship using spatial distributions of SNPs for genome-wide association studies |
title_sort | inference of kinship using spatial distributions of snps for genome-wide association studies |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4873983/ https://www.ncbi.nlm.nih.gov/pubmed/27206321 http://dx.doi.org/10.1186/s12864-016-2696-0 |
work_keys_str_mv | AT leehyokyeong inferenceofkinshipusingspatialdistributionsofsnpsforgenomewideassociationstudies AT chenliang inferenceofkinshipusingspatialdistributionsofsnpsforgenomewideassociationstudies |