Cargando…

Simultaneous inferences based on empirical Bayes methods and false discovery rates ineQTL data analysis

BACKGROUND: Genome-wide association studies (GWAS) have identified hundreds of genetic variants associated with complex human diseases, clinical conditions and traits. Genetic mapping of expression quantitative trait loci (eQTLs) is providing us with novel functional effects of thousands of single n...

Descripción completa

Detalles Bibliográficos
Autores principales: Chakraborty, Arindom, Jiang, Guanglong, Boustani, Malaz, Liu, Yunlong, Skaar, Todd, Li, Lang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4042241/
https://www.ncbi.nlm.nih.gov/pubmed/24564682
http://dx.doi.org/10.1186/1471-2164-14-S8-S8
_version_ 1782318779036860416
author Chakraborty, Arindom
Jiang, Guanglong
Boustani, Malaz
Liu, Yunlong
Skaar, Todd
Li, Lang
author_facet Chakraborty, Arindom
Jiang, Guanglong
Boustani, Malaz
Liu, Yunlong
Skaar, Todd
Li, Lang
author_sort Chakraborty, Arindom
collection PubMed
description BACKGROUND: Genome-wide association studies (GWAS) have identified hundreds of genetic variants associated with complex human diseases, clinical conditions and traits. Genetic mapping of expression quantitative trait loci (eQTLs) is providing us with novel functional effects of thousands of single nucleotide polymorphisms (SNPs). In a classical quantitative trail loci (QTL) mapping problem multiple tests are done to assess whether one trait is associated with a number of loci. In contrast to QTL studies, thousands of traits are measured alongwith thousands of gene expressions in an eQTL study. For such a study, a huge number of tests have to be performed ([Formula: see text]). This extreme multiplicity gives rise to many computational and statistical problems. In this paper we have tried to address these issues using two closely related inferential approaches: an empirical Bayes method that bears the Bayesian flavor without having much a priori knowledge and the frequentist method of false discovery rates. A three-component t-mixture model has been used for the parametric empirical Bayes (PEB) method. Inferences have been obtained using Expectation/Conditional Maximization Either (ECME) algorithm. A simulation study has also been performed and has been compared with a nonparametric empirical Bayes (NPEB) alternative. RESULTS: The results show that PEB has an edge over NPEB. The proposed methodology has been applied to human liver cohort (LHC) data. Our method enables to discover more significant SNPs with FDR<10% compared to the previous study done by Yang et al. (Genome Research, 2010). CONCLUSIONS: In contrast to previously available methods based on p-values, the empirical Bayes method uses local false discovery rate (lfdr) as the threshold. This method controls false positive rate.
format Online
Article
Text
id pubmed-4042241
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40422412014-06-04 Simultaneous inferences based on empirical Bayes methods and false discovery rates ineQTL data analysis Chakraborty, Arindom Jiang, Guanglong Boustani, Malaz Liu, Yunlong Skaar, Todd Li, Lang BMC Genomics Research BACKGROUND: Genome-wide association studies (GWAS) have identified hundreds of genetic variants associated with complex human diseases, clinical conditions and traits. Genetic mapping of expression quantitative trait loci (eQTLs) is providing us with novel functional effects of thousands of single nucleotide polymorphisms (SNPs). In a classical quantitative trail loci (QTL) mapping problem multiple tests are done to assess whether one trait is associated with a number of loci. In contrast to QTL studies, thousands of traits are measured alongwith thousands of gene expressions in an eQTL study. For such a study, a huge number of tests have to be performed ([Formula: see text]). This extreme multiplicity gives rise to many computational and statistical problems. In this paper we have tried to address these issues using two closely related inferential approaches: an empirical Bayes method that bears the Bayesian flavor without having much a priori knowledge and the frequentist method of false discovery rates. A three-component t-mixture model has been used for the parametric empirical Bayes (PEB) method. Inferences have been obtained using Expectation/Conditional Maximization Either (ECME) algorithm. A simulation study has also been performed and has been compared with a nonparametric empirical Bayes (NPEB) alternative. RESULTS: The results show that PEB has an edge over NPEB. The proposed methodology has been applied to human liver cohort (LHC) data. Our method enables to discover more significant SNPs with FDR<10% compared to the previous study done by Yang et al. (Genome Research, 2010). CONCLUSIONS: In contrast to previously available methods based on p-values, the empirical Bayes method uses local false discovery rate (lfdr) as the threshold. This method controls false positive rate. BioMed Central 2013-12-09 /pmc/articles/PMC4042241/ /pubmed/24564682 http://dx.doi.org/10.1186/1471-2164-14-S8-S8 Text en Copyright © 2013 Chakraborty et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Chakraborty, Arindom
Jiang, Guanglong
Boustani, Malaz
Liu, Yunlong
Skaar, Todd
Li, Lang
Simultaneous inferences based on empirical Bayes methods and false discovery rates ineQTL data analysis
title Simultaneous inferences based on empirical Bayes methods and false discovery rates ineQTL data analysis
title_full Simultaneous inferences based on empirical Bayes methods and false discovery rates ineQTL data analysis
title_fullStr Simultaneous inferences based on empirical Bayes methods and false discovery rates ineQTL data analysis
title_full_unstemmed Simultaneous inferences based on empirical Bayes methods and false discovery rates ineQTL data analysis
title_short Simultaneous inferences based on empirical Bayes methods and false discovery rates ineQTL data analysis
title_sort simultaneous inferences based on empirical bayes methods and false discovery rates ineqtl data analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4042241/
https://www.ncbi.nlm.nih.gov/pubmed/24564682
http://dx.doi.org/10.1186/1471-2164-14-S8-S8
work_keys_str_mv AT chakrabortyarindom simultaneousinferencesbasedonempiricalbayesmethodsandfalsediscoveryratesineqtldataanalysis
AT jiangguanglong simultaneousinferencesbasedonempiricalbayesmethodsandfalsediscoveryratesineqtldataanalysis
AT boustanimalaz simultaneousinferencesbasedonempiricalbayesmethodsandfalsediscoveryratesineqtldataanalysis
AT liuyunlong simultaneousinferencesbasedonempiricalbayesmethodsandfalsediscoveryratesineqtldataanalysis
AT skaartodd simultaneousinferencesbasedonempiricalbayesmethodsandfalsediscoveryratesineqtldataanalysis
AT lilang simultaneousinferencesbasedonempiricalbayesmethodsandfalsediscoveryratesineqtldataanalysis