Cargando…

Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model

BACKGROUND: Genome-wide association studies have been successful in identifying common genetic variants for human diseases. However, much of the heritable variation associated with diseases such as Parkinson’s disease remains unknown suggesting that many more risk loci are yet to be identified. Rare...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Ying-Chao, Hsieh, Ai-Ru, Hsiao, Ching-Lin, Wu, Shang-Jung, Wang, Hui-Min, Lian, Ie-Bin, Fann, Cathy SJ
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428531/
https://www.ncbi.nlm.nih.gov/pubmed/25175702
http://dx.doi.org/10.1186/s12929-014-0088-9
_version_ 1782370910767939584
author Lin, Ying-Chao
Hsieh, Ai-Ru
Hsiao, Ching-Lin
Wu, Shang-Jung
Wang, Hui-Min
Lian, Ie-Bin
Fann, Cathy SJ
author_facet Lin, Ying-Chao
Hsieh, Ai-Ru
Hsiao, Ching-Lin
Wu, Shang-Jung
Wang, Hui-Min
Lian, Ie-Bin
Fann, Cathy SJ
author_sort Lin, Ying-Chao
collection PubMed
description BACKGROUND: Genome-wide association studies have been successful in identifying common genetic variants for human diseases. However, much of the heritable variation associated with diseases such as Parkinson’s disease remains unknown suggesting that many more risk loci are yet to be identified. Rare variants have become important in disease association studies for explaining missing heritability. Methods for detecting this type of association require prior knowledge on candidate genes and combining variants within the region. These methods may suffer from power loss in situations with many neutral variants or causal variants with opposite effects. RESULTS: We propose a method capable of scanning genetic variants to identify the region most likely harbouring disease gene with rare and/or common causal variants. Our method assigns a score at each individual variant based on our scoring system. It uses aggregate scores to identify the region with disease association. We evaluate performance by simulation based on 1000 Genomes sequencing data and compare with three commonly used methods. We use a Parkinson’s disease case–control dataset as a model to demonstrate the application of our method. Our method has better power than CMC and WSS and similar power to SKAT-O with well-controlled type I error under simulation based on 1000 Genomes sequencing data. In real data analysis, we confirm the association of α-synuclein gene (SNCA) with Parkinson’s disease (p = 0.005). We further identify association with hyaluronan synthase 2 (HAS2, p = 0.028) and kringle containing transmembrane protein 1 (KREMEN1, p = 0.006). KREMEN1 is associated with Wnt signalling pathway which has been shown to play an important role for neurodegeneration in Parkinson’s disease. CONCLUSIONS: Our method is time efficient and less sensitive to inclusion of neutral variants and direction effect of causal variants. It can narrow down a genomic region or a chromosome to a disease associated region. Using Parkinson’s disease as a model, our method not only confirms association for a known gene but also identifies two genes previously found by other studies. In spite of many existing methods, we conclude that our method serves as an efficient alternative for exploring genomic data containing both rare and common variants. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12929-014-0088-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4428531
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44285312015-05-13 Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model Lin, Ying-Chao Hsieh, Ai-Ru Hsiao, Ching-Lin Wu, Shang-Jung Wang, Hui-Min Lian, Ie-Bin Fann, Cathy SJ J Biomed Sci Research BACKGROUND: Genome-wide association studies have been successful in identifying common genetic variants for human diseases. However, much of the heritable variation associated with diseases such as Parkinson’s disease remains unknown suggesting that many more risk loci are yet to be identified. Rare variants have become important in disease association studies for explaining missing heritability. Methods for detecting this type of association require prior knowledge on candidate genes and combining variants within the region. These methods may suffer from power loss in situations with many neutral variants or causal variants with opposite effects. RESULTS: We propose a method capable of scanning genetic variants to identify the region most likely harbouring disease gene with rare and/or common causal variants. Our method assigns a score at each individual variant based on our scoring system. It uses aggregate scores to identify the region with disease association. We evaluate performance by simulation based on 1000 Genomes sequencing data and compare with three commonly used methods. We use a Parkinson’s disease case–control dataset as a model to demonstrate the application of our method. Our method has better power than CMC and WSS and similar power to SKAT-O with well-controlled type I error under simulation based on 1000 Genomes sequencing data. In real data analysis, we confirm the association of α-synuclein gene (SNCA) with Parkinson’s disease (p = 0.005). We further identify association with hyaluronan synthase 2 (HAS2, p = 0.028) and kringle containing transmembrane protein 1 (KREMEN1, p = 0.006). KREMEN1 is associated with Wnt signalling pathway which has been shown to play an important role for neurodegeneration in Parkinson’s disease. CONCLUSIONS: Our method is time efficient and less sensitive to inclusion of neutral variants and direction effect of causal variants. It can narrow down a genomic region or a chromosome to a disease associated region. Using Parkinson’s disease as a model, our method not only confirms association for a known gene but also identifies two genes previously found by other studies. In spite of many existing methods, we conclude that our method serves as an efficient alternative for exploring genomic data containing both rare and common variants. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12929-014-0088-9) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-30 /pmc/articles/PMC4428531/ /pubmed/25175702 http://dx.doi.org/10.1186/s12929-014-0088-9 Text en © Lin et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lin, Ying-Chao
Hsieh, Ai-Ru
Hsiao, Ching-Lin
Wu, Shang-Jung
Wang, Hui-Min
Lian, Ie-Bin
Fann, Cathy SJ
Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model
title Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model
title_full Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model
title_fullStr Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model
title_full_unstemmed Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model
title_short Identifying rare and common disease associated variants in genomic data using Parkinson’s disease as a model
title_sort identifying rare and common disease associated variants in genomic data using parkinson’s disease as a model
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428531/
https://www.ncbi.nlm.nih.gov/pubmed/25175702
http://dx.doi.org/10.1186/s12929-014-0088-9
work_keys_str_mv AT linyingchao identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel
AT hsiehairu identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel
AT hsiaochinglin identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel
AT wushangjung identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel
AT wanghuimin identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel
AT lianiebin identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel
AT fanncathysj identifyingrareandcommondiseaseassociatedvariantsingenomicdatausingparkinsonsdiseaseasamodel