Cargando…
Identification of causal genes for complex traits
Motivation: Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider ‘causal variants’ as variants which are responsible for the association signa...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542778/ https://www.ncbi.nlm.nih.gov/pubmed/26072484 http://dx.doi.org/10.1093/bioinformatics/btv240 |
_version_ | 1782386561101332480 |
---|---|
author | Hormozdiari, Farhad Kichaev, Gleb Yang, Wen-Yun Pasaniuc, Bogdan Eskin, Eleazar |
author_facet | Hormozdiari, Farhad Kichaev, Gleb Yang, Wen-Yun Pasaniuc, Bogdan Eskin, Eleazar |
author_sort | Hormozdiari, Farhad |
collection | PubMed |
description | Motivation: Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider ‘causal variants’ as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. Results: In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also have an average of 10% higher recall rate compared with the existing approaches. We validate our method using a real mouse high-density lipoprotein data (HDL) and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Availability and implementation: Software is freely available for download at genetics.cs.ucla.edu/caviar. Contact: eeskin@cs.ucla.edu |
format | Online Article Text |
id | pubmed-4542778 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-45427782015-08-25 Identification of causal genes for complex traits Hormozdiari, Farhad Kichaev, Gleb Yang, Wen-Yun Pasaniuc, Bogdan Eskin, Eleazar Bioinformatics Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland Motivation: Although genome-wide association studies (GWAS) have identified thousands of variants associated with common diseases and complex traits, only a handful of these variants are validated to be causal. We consider ‘causal variants’ as variants which are responsible for the association signal at a locus. As opposed to association studies that benefit from linkage disequilibrium (LD), the main challenge in identifying causal variants at associated loci lies in distinguishing among the many closely correlated variants due to LD. This is particularly important for model organisms such as inbred mice, where LD extends much further than in human populations, resulting in large stretches of the genome with significantly associated variants. Furthermore, these model organisms are highly structured and require correction for population structure to remove potential spurious associations. Results: In this work, we propose CAVIAR-Gene (CAusal Variants Identification in Associated Regions), a novel method that is able to operate across large LD regions of the genome while also correcting for population structure. A key feature of our approach is that it provides as output a minimally sized set of genes that captures the genes which harbor causal variants with probability ρ. Through extensive simulations, we demonstrate that our method not only speeds up computation, but also have an average of 10% higher recall rate compared with the existing approaches. We validate our method using a real mouse high-density lipoprotein data (HDL) and show that CAVIAR-Gene is able to identify Apoa2 (a gene known to harbor causal variants for HDL), while reducing the number of genes that need to be tested for functionality by a factor of 2. Availability and implementation: Software is freely available for download at genetics.cs.ucla.edu/caviar. Contact: eeskin@cs.ucla.edu Oxford University Press 2015-06-15 2015-06-10 /pmc/articles/PMC4542778/ /pubmed/26072484 http://dx.doi.org/10.1093/bioinformatics/btv240 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/),which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland Hormozdiari, Farhad Kichaev, Gleb Yang, Wen-Yun Pasaniuc, Bogdan Eskin, Eleazar Identification of causal genes for complex traits |
title | Identification of causal genes for complex traits |
title_full | Identification of causal genes for complex traits |
title_fullStr | Identification of causal genes for complex traits |
title_full_unstemmed | Identification of causal genes for complex traits |
title_short | Identification of causal genes for complex traits |
title_sort | identification of causal genes for complex traits |
topic | Ismb/Eccb 2015 Proceedings Papers Committee July 10 to July 14, 2015, Dublin, Ireland |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4542778/ https://www.ncbi.nlm.nih.gov/pubmed/26072484 http://dx.doi.org/10.1093/bioinformatics/btv240 |
work_keys_str_mv | AT hormozdiarifarhad identificationofcausalgenesforcomplextraits AT kichaevgleb identificationofcausalgenesforcomplextraits AT yangwenyun identificationofcausalgenesforcomplextraits AT pasaniucbogdan identificationofcausalgenesforcomplextraits AT eskineleazar identificationofcausalgenesforcomplextraits |