Cargando…
Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model
BACKGROUND: Genome-wide association studies have been successful in identifying common genetic variants for human diseases. However, much of the heritable variation associated with diseases such as Parkinson’s disease remains unknown suggesting that many more risk loci are yet to be identified. Rare...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428531/ https://www.ncbi.nlm.nih.gov/pubmed/25175702 http://dx.doi.org/10.1186/s12929-014-0088-9 |
_version_ | 1782370910767939584 |
---|---|
author | Lin, Ying-Chao Hsieh, Ai-Ru Hsiao, Ching-Lin Wu, Shang-Jung Wang, Hui-Min Lian, Ie-Bin Fann, Cathy SJ |
author_facet | Lin, Ying-Chao Hsieh, Ai-Ru Hsiao, Ching-Lin Wu, Shang-Jung Wang, Hui-Min Lian, Ie-Bin Fann, Cathy SJ |
author_sort | Lin, Ying-Chao |
collection | PubMed |
description | BACKGROUND: Genome-wide association studies have been successful in identifying common genetic variants for human diseases. However, much of the heritable variation associated with diseases such as Parkinson’s disease remains unknown suggesting that many more risk loci are yet to be identified. Rare variants have become important in disease association studies for explaining missing heritability. Methods for detecting this type of association require prior knowledge on candidate genes and combining variants within the region. These methods may suffer from power loss in situations with many neutral variants or causal variants with opposite effects. RESULTS: We propose a method capable of scanning genetic variants to identify the region most likely harbouring disease gene with rare and/or common causal variants. Our method assigns a score at each individual variant based on our scoring system. It uses aggregate scores to identify the region with disease association. We evaluate performance by simulation based on 1000 Genomes sequencing data and compare with three commonly used methods. We use a Parkinson’s disease case–control dataset as a model to demonstrate the application of our method. Our method has better power than CMC and WSS and similar power to SKAT-O with well-controlled type I error under simulation based on 1000 Genomes sequencing data. In real data analysis, we confirm the association of α-synuclein gene (SNCA) with Parkinson’s disease (p = 0.005). We further identify association with hyaluronan synthase 2 (HAS2, p = 0.028) and kringle containing transmembrane protein 1 (KREMEN1, p = 0.006). KREMEN1 is associated with Wnt signalling pathway which has been shown to play an important role for neurodegeneration in Parkinson’s disease. CONCLUSIONS: Our method is time efficient and less sensitive to inclusion of neutral variants and direction effect of causal variants. It can narrow down a genomic region or a chromosome to a disease associated region. Using Parkinson’s disease as a model, our method not only confirms association for a known gene but also identifies two genes previously found by other studies. In spite of many existing methods, we conclude that our method serves as an efficient alternative for exploring genomic data containing both rare and common variants. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12929-014-0088-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4428531 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44285312015-05-13 Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model Lin, Ying-Chao Hsieh, Ai-Ru Hsiao, Ching-Lin Wu, Shang-Jung Wang, Hui-Min Lian, Ie-Bin Fann, Cathy SJ J Biomed Sci Research BACKGROUND: Genome-wide association studies have been successful in identifying common genetic variants for human diseases. However, much of the heritable variation associated with diseases such as Parkinson’s disease remains unknown suggesting that many more risk loci are yet to be identified. Rare variants have become important in disease association studies for explaining missing heritability. Methods for detecting this type of association require prior knowledge on candidate genes and combining variants within the region. These methods may suffer from power loss in situations with many neutral variants or causal variants with opposite effects. RESULTS: We propose a method capable of scanning genetic variants to identify the region most likely harbouring disease gene with rare and/or common causal variants. Our method assigns a score at each individual variant based on our scoring system. It uses aggregate scores to identify the region with disease association. We evaluate performance by simulation based on 1000 Genomes sequencing data and compare with three commonly used methods. We use a Parkinson’s disease case–control dataset as a model to demonstrate the application of our method. Our method has better power than CMC and WSS and similar power to SKAT-O with well-controlled type I error under simulation based on 1000 Genomes sequencing data. In real data analysis, we confirm the association of α-synuclein gene (SNCA) with Parkinson’s disease (p = 0.005). We further identify association with hyaluronan synthase 2 (HAS2, p = 0.028) and kringle containing transmembrane protein 1 (KREMEN1, p = 0.006). KREMEN1 is associated with Wnt signalling pathway which has been shown to play an important role for neurodegeneration in Parkinson’s disease. CONCLUSIONS: Our method is time efficient and less sensitive to inclusion of neutral variants and direction effect of causal variants. It can narrow down a genomic region or a chromosome to a disease associated region. Using Parkinson’s disease as a model, our method not only confirms association for a known gene but also identifies two genes previously found by other studies. In spite of many existing methods, we conclude that our method serves as an efficient alternative for exploring genomic data containing both rare and common variants. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12929-014-0088-9) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-30 /pmc/articles/PMC4428531/ /pubmed/25175702 http://dx.doi.org/10.1186/s12929-014-0088-9 Text en © Lin et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Lin, Ying-Chao Hsieh, Ai-Ru Hsiao, Ching-Lin Wu, Shang-Jung Wang, Hui-Min Lian, Ie-Bin Fann, Cathy SJ Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model |
title | Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model |
title_full | Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model |
title_fullStr | Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model |
title_full_unstemmed | Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model |
title_short | Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model |
title_sort | identifying rare and common disease associated variants in genomic data using parkinson’s disease as a model |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428531/ https://www.ncbi.nlm.nih.gov/pubmed/25175702 http://dx.doi.org/10.1186/s12929-014-0088-9 |
work_keys_str_mv | AT linyingchao identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel AT hsiehairu identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel AT hsiaochinglin identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel AT wushangjung identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel AT wanghuimin identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel AT lianiebin identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel AT fanncathysj identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel |