
Identification of causal genes for complex traits

Motivation: Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider ‘causal variants’ as variants which are responsible for the association signa...


Bibliographic Details
Main Authors: Hormozdiari, Farhad, Kichaev, Gleb, Yang, Wen-Yun, Pasaniuc, Bogdan, Eskin, Eleazar
Format: Online Article Text
Language: English
Published: Oxford University Press 2015
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542778/
https://www.ncbi.nlm.nih.gov/pubmed/26072484
http://dx.doi.org/10.1093/bioinformatics/btv240
author Hormozdiari, Farhad
Kichaev, Gleb
Yang, Wen-Yun
Pasaniuc, Bogdan
Eskin, Eleazar
collection PubMed
description Motivation: Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants have been validated as causal. We consider ‘causal variants’ to be variants that are responsible for the association signal at a locus. Whereas association studies benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many variants that are closely correlated because of LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. Results: In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation but also has, on average, a 10% higher recall rate than existing approaches. We validate our method using real mouse high-density lipoprotein (HDL) data and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Availability and implementation: Software is freely available for download at genetics.cs.ucla.edu/caviar. Contact: eeskin@cs.ucla.edu
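The idea of a minimally sized gene set that captures the causal genes with probability ρ can be illustrated with a small sketch. This is not the CAVIAR-Gene algorithm: it assumes, for illustration only, that each gene already has a posterior probability of being the single gene that harbors a causal variant, so the probabilities are mutually exclusive and can simply be summed. The function name `rho_confidence_gene_set`, the gene labels other than Apoa2, and all numeric values are hypothetical.

```python
from typing import Dict, List


def rho_confidence_gene_set(posteriors: Dict[str, float], rho: float) -> List[str]:
    """Return a smallest set of genes whose summed posterior mass reaches rho.

    Assumption (not taken from the paper): each value is the probability that
    the gene is the only one harboring a causal variant, so the events are
    mutually exclusive and their probabilities add up.
    """
    if not 0.0 < rho <= 1.0:
        raise ValueError("rho must be in (0, 1]")

    # Taking genes in order of decreasing probability reaches the target
    # mass with the fewest genes under the single-causal-gene assumption.
    ranked = sorted(posteriors.items(), key=lambda item: item[1], reverse=True)

    selected: List[str] = []
    mass = 0.0
    for gene, prob in ranked:
        if mass >= rho:
            break
        selected.append(gene)
        mass += prob

    if mass < rho:
        raise ValueError("total posterior mass is below rho")
    return selected


# Hypothetical posteriors, made up for illustration (not results from the paper).
example = {"Apoa2": 0.60, "GeneB": 0.25, "GeneC": 0.10, "GeneD": 0.05}
print(rho_confidence_gene_set(example, rho=0.80))
```

Raising ρ enlarges the returned set, which mirrors the trade-off in the abstract: a higher confidence level means more genes to test for functionality.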
format Online
Article
Text
id pubmed-4542778
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-4542778 2015-08-25 Identification of causal genes for complex traits Hormozdiari, Farhad Kichaev, Gleb Yang, Wen-Yun Pasaniuc, Bogdan Eskin, Eleazar Bioinformatics Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland Oxford University Press 2015-06-15 2015-06-10 /pmc/articles/PMC4542778/ /pubmed/26072484 http://dx.doi.org/10.1093/bioinformatics/btv240 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title Identification of causal genes for complex traits
topic Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland