Cargando…

Applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eQTLs and increase the number of eGenes

MOTIVATION: There is recent interest in using gene expression data to contextualize findings from traditional genome-wide association studies (GWAS). Conditioned on a tissue, expression quantitative trait loci (eQTLs) are genetic variants associated with gene expression, and eGenes are genes whose e...

Descripción completa

Detalles Bibliográficos
Autores principales: Duong, Dat, Gai, Lisa, Snir, Sagi, Kang, Eun Yong, Han, Buhm, Sul, Jae Hoon, Eskin, Eleazar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870567/
https://www.ncbi.nlm.nih.gov/pubmed/28881962
http://dx.doi.org/10.1093/bioinformatics/btx227
_version_ 1783309511336919040
author Duong, Dat
Gai, Lisa
Snir, Sagi
Kang, Eun Yong
Han, Buhm
Sul, Jae Hoon
Eskin, Eleazar
author_facet Duong, Dat
Gai, Lisa
Snir, Sagi
Kang, Eun Yong
Han, Buhm
Sul, Jae Hoon
Eskin, Eleazar
author_sort Duong, Dat
collection PubMed
description MOTIVATION: There is recent interest in using gene expression data to contextualize findings from traditional genome-wide association studies (GWAS). Conditioned on a tissue, expression quantitative trait loci (eQTLs) are genetic variants associated with gene expression, and eGenes are genes whose expression levels are associated with genetic variants. eQTLs and eGenes provide great supporting evidence for GWAS hits and important insights into the regulatory pathways involved in many diseases. When a significant variant or a candidate gene identified by GWAS is also an eQTL or eGene, there is strong evidence to further study this variant or gene. Multi-tissue gene expression datasets like the Gene Tissue Expression (GTEx) data are used to find eQTLs and eGenes. Unfortunately, these datasets often have small sample sizes in some tissues. For this reason, there have been many meta-analysis methods designed to combine gene expression data across many tissues to increase power for finding eQTLs and eGenes. However, these existing techniques are not scalable to datasets containing many tissues, like the GTEx data. Furthermore, these methods ignore a biological insight that the same variant may be associated with the same gene across similar tissues. RESULTS: We introduce a meta-analysis model that addresses these problems in existing methods. We focus on the problem of finding eGenes in gene expression data from many tissues, and show that our model is better than other types of meta-analyses. AVAILABILITY AND IMPLEMENTATION: Source code is at https://github.com/datduong/RECOV. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5870567
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58705672018-04-05 Applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eQTLs and increase the number of eGenes Duong, Dat Gai, Lisa Snir, Sagi Kang, Eun Yong Han, Buhm Sul, Jae Hoon Eskin, Eleazar Bioinformatics Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017 MOTIVATION: There is recent interest in using gene expression data to contextualize findings from traditional genome-wide association studies (GWAS). Conditioned on a tissue, expression quantitative trait loci (eQTLs) are genetic variants associated with gene expression, and eGenes are genes whose expression levels are associated with genetic variants. eQTLs and eGenes provide great supporting evidence for GWAS hits and important insights into the regulatory pathways involved in many diseases. When a significant variant or a candidate gene identified by GWAS is also an eQTL or eGene, there is strong evidence to further study this variant or gene. Multi-tissue gene expression datasets like the Gene Tissue Expression (GTEx) data are used to find eQTLs and eGenes. Unfortunately, these datasets often have small sample sizes in some tissues. For this reason, there have been many meta-analysis methods designed to combine gene expression data across many tissues to increase power for finding eQTLs and eGenes. However, these existing techniques are not scalable to datasets containing many tissues, like the GTEx data. Furthermore, these methods ignore a biological insight that the same variant may be associated with the same gene across similar tissues. RESULTS: We introduce a meta-analysis model that addresses these problems in existing methods. We focus on the problem of finding eGenes in gene expression data from many tissues, and show that our model is better than other types of meta-analyses. AVAILABILITY AND IMPLEMENTATION: Source code is at https://github.com/datduong/RECOV. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-07-15 2017-07-12 /pmc/articles/PMC5870567/ /pubmed/28881962 http://dx.doi.org/10.1093/bioinformatics/btx227 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017
Duong, Dat
Gai, Lisa
Snir, Sagi
Kang, Eun Yong
Han, Buhm
Sul, Jae Hoon
Eskin, Eleazar
Applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eQTLs and increase the number of eGenes
title Applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eQTLs and increase the number of eGenes
title_full Applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eQTLs and increase the number of eGenes
title_fullStr Applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eQTLs and increase the number of eGenes
title_full_unstemmed Applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eQTLs and increase the number of eGenes
title_short Applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eQTLs and increase the number of eGenes
title_sort applying meta-analysis to genotype-tissue expression data from multiple tissues to identify eqtls and increase the number of egenes
topic Ismb/Eccb 2017: The 25th Annual Conference Intelligent Systems for Molecular Biology Held Jointly with the 16th Annual European Conference on Computational Biology, Prague, Czech Republic, July 21–25, 2017
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870567/
https://www.ncbi.nlm.nih.gov/pubmed/28881962
http://dx.doi.org/10.1093/bioinformatics/btx227
work_keys_str_mv AT duongdat applyingmetaanalysistogenotypetissueexpressiondatafrommultipletissuestoidentifyeqtlsandincreasethenumberofegenes
AT gailisa applyingmetaanalysistogenotypetissueexpressiondatafrommultipletissuestoidentifyeqtlsandincreasethenumberofegenes
AT snirsagi applyingmetaanalysistogenotypetissueexpressiondatafrommultipletissuestoidentifyeqtlsandincreasethenumberofegenes
AT kangeunyong applyingmetaanalysistogenotypetissueexpressiondatafrommultipletissuestoidentifyeqtlsandincreasethenumberofegenes
AT hanbuhm applyingmetaanalysistogenotypetissueexpressiondatafrommultipletissuestoidentifyeqtlsandincreasethenumberofegenes
AT suljaehoon applyingmetaanalysistogenotypetissueexpressiondatafrommultipletissuestoidentifyeqtlsandincreasethenumberofegenes
AT eskineleazar applyingmetaanalysistogenotypetissueexpressiondatafrommultipletissuestoidentifyeqtlsandincreasethenumberofegenes