Cargando…

A Bayesian variable selection procedure to rank overlapping gene sets

BACKGROUND: Genome-wide expression profiling using microarrays or sequence-based technologies allows us to identify genes and genetic pathways whose expression patterns influence complex traits. Different methods to prioritize gene sets, such as the genes in a given molecular pathway, have been desc...

Descripción completa

Detalles Bibliográficos
Autores principales: Skarman, Axel, Shariati, Mohammad, Jans, Luc, Jiang, Li, Sørensen, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3434019/
https://www.ncbi.nlm.nih.gov/pubmed/22554182
http://dx.doi.org/10.1186/1471-2105-13-73
_version_ 1782242373804228608
author Skarman, Axel
Shariati, Mohammad
Jans, Luc
Jiang, Li
Sørensen, Peter
author_facet Skarman, Axel
Shariati, Mohammad
Jans, Luc
Jiang, Li
Sørensen, Peter
author_sort Skarman, Axel
collection PubMed
description BACKGROUND: Genome-wide expression profiling using microarrays or sequence-based technologies allows us to identify genes and genetic pathways whose expression patterns influence complex traits. Different methods to prioritize gene sets, such as the genes in a given molecular pathway, have been described. In many cases, these methods test one gene set at a time, and therefore do not consider overlaps among the pathways. Here, we present a Bayesian variable selection method to prioritize gene sets that overcomes this limitation by considering all gene sets simultaneously. We applied Bayesian variable selection to differential expression to prioritize the molecular and genetic pathways involved in the responses to Escherichia coli infection in Danish Holstein cows. RESULTS: We used a Bayesian variable selection method to prioritize Kyoto Encyclopedia of Genes and Genomes pathways. We used our data to study how the variable selection method was affected by overlaps among the pathways. In addition, we compared our approach to another that ignores the overlaps, and studied the differences in the prioritization. The variable selection method was robust to a change in prior probability and stable given a limited number of observations. CONCLUSIONS: Bayesian variable selection is a useful way to prioritize gene sets while considering their overlaps. Ignoring the overlaps gives different and possibly misleading results. Additional procedures may be needed in cases of highly overlapping pathways that are hard to prioritize.
format Online
Article
Text
id pubmed-3434019
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34340192012-09-10 A Bayesian variable selection procedure to rank overlapping gene sets Skarman, Axel Shariati, Mohammad Jans, Luc Jiang, Li Sørensen, Peter BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide expression profiling using microarrays or sequence-based technologies allows us to identify genes and genetic pathways whose expression patterns influence complex traits. Different methods to prioritize gene sets, such as the genes in a given molecular pathway, have been described. In many cases, these methods test one gene set at a time, and therefore do not consider overlaps among the pathways. Here, we present a Bayesian variable selection method to prioritize gene sets that overcomes this limitation by considering all gene sets simultaneously. We applied Bayesian variable selection to differential expression to prioritize the molecular and genetic pathways involved in the responses to Escherichia coli infection in Danish Holstein cows. RESULTS: We used a Bayesian variable selection method to prioritize Kyoto Encyclopedia of Genes and Genomes pathways. We used our data to study how the variable selection method was affected by overlaps among the pathways. In addition, we compared our approach to another that ignores the overlaps, and studied the differences in the prioritization. The variable selection method was robust to a change in prior probability and stable given a limited number of observations. CONCLUSIONS: Bayesian variable selection is a useful way to prioritize gene sets while considering their overlaps. Ignoring the overlaps gives different and possibly misleading results. Additional procedures may be needed in cases of highly overlapping pathways that are hard to prioritize. BioMed Central 2012-05-03 /pmc/articles/PMC3434019/ /pubmed/22554182 http://dx.doi.org/10.1186/1471-2105-13-73 Text en Copyright ©2012 Skarman et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Skarman, Axel
Shariati, Mohammad
Jans, Luc
Jiang, Li
Sørensen, Peter
A Bayesian variable selection procedure to rank overlapping gene sets
title A Bayesian variable selection procedure to rank overlapping gene sets
title_full A Bayesian variable selection procedure to rank overlapping gene sets
title_fullStr A Bayesian variable selection procedure to rank overlapping gene sets
title_full_unstemmed A Bayesian variable selection procedure to rank overlapping gene sets
title_short A Bayesian variable selection procedure to rank overlapping gene sets
title_sort bayesian variable selection procedure to rank overlapping gene sets
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3434019/
https://www.ncbi.nlm.nih.gov/pubmed/22554182
http://dx.doi.org/10.1186/1471-2105-13-73
work_keys_str_mv AT skarmanaxel abayesianvariableselectionproceduretorankoverlappinggenesets
AT shariatimohammad abayesianvariableselectionproceduretorankoverlappinggenesets
AT jansluc abayesianvariableselectionproceduretorankoverlappinggenesets
AT jiangli abayesianvariableselectionproceduretorankoverlappinggenesets
AT sørensenpeter abayesianvariableselectionproceduretorankoverlappinggenesets
AT skarmanaxel bayesianvariableselectionproceduretorankoverlappinggenesets
AT shariatimohammad bayesianvariableselectionproceduretorankoverlappinggenesets
AT jansluc bayesianvariableselectionproceduretorankoverlappinggenesets
AT jiangli bayesianvariableselectionproceduretorankoverlappinggenesets
AT sørensenpeter bayesianvariableselectionproceduretorankoverlappinggenesets