Cargando…
A strategy for evaluating pathway analysis methods
BACKGROUND: Researchers have previously developed a multitude of methods designed to identify biological pathways associated with specific clinical or experimental conditions of interest, with the aim of facilitating biological interpretation of high-throughput data. Before practically applying such...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5640951/ https://www.ncbi.nlm.nih.gov/pubmed/29029625 http://dx.doi.org/10.1186/s12859-017-1866-7 |
_version_ | 1783271127032791040 |
---|---|
author | Yu, Chenggang Woo, Hyung Jun Yu, Xueping Oyama, Tatsuya Wallqvist, Anders Reifman, Jaques |
author_facet | Yu, Chenggang Woo, Hyung Jun Yu, Xueping Oyama, Tatsuya Wallqvist, Anders Reifman, Jaques |
author_sort | Yu, Chenggang |
collection | PubMed |
description | BACKGROUND: Researchers have previously developed a multitude of methods designed to identify biological pathways associated with specific clinical or experimental conditions of interest, with the aim of facilitating biological interpretation of high-throughput data. Before practically applying such pathway analysis (PA) methods, we must first evaluate their performance and reliability, using datasets where the pathways perturbed by the conditions of interest have been well characterized in advance. However, such ‘ground truths’ (or gold standards) are often unavailable. Furthermore, previous evaluation strategies that have focused on defining ‘true answers’ are unable to systematically and objectively assess PA methods under a wide range of conditions. RESULTS: In this work, we propose a novel strategy for evaluating PA methods independently of any gold standard, either established or assumed. The strategy involves the use of two mutually complementary metrics, recall and discrimination. Recall measures the consistency of the perturbed pathways identified by applying a particular analysis method to an original large dataset and those identified by the same method to a sub-dataset of the original dataset. In contrast, discrimination measures specificity—the degree to which the perturbed pathways identified by a particular method to a dataset from one experiment differ from those identifying by the same method to a dataset from a different experiment. We used these metrics and 24 datasets to evaluate six widely used PA methods. The results highlighted the common challenge in reliably identifying significant pathways from small datasets. Importantly, we confirmed the effectiveness of our proposed dual-metric strategy by showing that previous comparative studies corroborate the performance evaluations of the six methods obtained by our strategy. CONCLUSIONS: Unlike any previously proposed strategy for evaluating the performance of PA methods, our dual-metric strategy does not rely on any ground truth, either established or assumed, of the pathways perturbed by a specific clinical or experimental condition. As such, our strategy allows researchers to systematically and objectively evaluate pathway analysis methods by employing any number of datasets for a variety of conditions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1866-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5640951 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-56409512017-10-18 A strategy for evaluating pathway analysis methods Yu, Chenggang Woo, Hyung Jun Yu, Xueping Oyama, Tatsuya Wallqvist, Anders Reifman, Jaques BMC Bioinformatics Methodology Article BACKGROUND: Researchers have previously developed a multitude of methods designed to identify biological pathways associated with specific clinical or experimental conditions of interest, with the aim of facilitating biological interpretation of high-throughput data. Before practically applying such pathway analysis (PA) methods, we must first evaluate their performance and reliability, using datasets where the pathways perturbed by the conditions of interest have been well characterized in advance. However, such ‘ground truths’ (or gold standards) are often unavailable. Furthermore, previous evaluation strategies that have focused on defining ‘true answers’ are unable to systematically and objectively assess PA methods under a wide range of conditions. RESULTS: In this work, we propose a novel strategy for evaluating PA methods independently of any gold standard, either established or assumed. The strategy involves the use of two mutually complementary metrics, recall and discrimination. Recall measures the consistency of the perturbed pathways identified by applying a particular analysis method to an original large dataset and those identified by the same method to a sub-dataset of the original dataset. In contrast, discrimination measures specificity—the degree to which the perturbed pathways identified by a particular method to a dataset from one experiment differ from those identifying by the same method to a dataset from a different experiment. We used these metrics and 24 datasets to evaluate six widely used PA methods. The results highlighted the common challenge in reliably identifying significant pathways from small datasets. Importantly, we confirmed the effectiveness of our proposed dual-metric strategy by showing that previous comparative studies corroborate the performance evaluations of the six methods obtained by our strategy. CONCLUSIONS: Unlike any previously proposed strategy for evaluating the performance of PA methods, our dual-metric strategy does not rely on any ground truth, either established or assumed, of the pathways perturbed by a specific clinical or experimental condition. As such, our strategy allows researchers to systematically and objectively evaluate pathway analysis methods by employing any number of datasets for a variety of conditions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-017-1866-7) contains supplementary material, which is available to authorized users. BioMed Central 2017-10-13 /pmc/articles/PMC5640951/ /pubmed/29029625 http://dx.doi.org/10.1186/s12859-017-1866-7 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Yu, Chenggang Woo, Hyung Jun Yu, Xueping Oyama, Tatsuya Wallqvist, Anders Reifman, Jaques A strategy for evaluating pathway analysis methods |
title | A strategy for evaluating pathway analysis methods |
title_full | A strategy for evaluating pathway analysis methods |
title_fullStr | A strategy for evaluating pathway analysis methods |
title_full_unstemmed | A strategy for evaluating pathway analysis methods |
title_short | A strategy for evaluating pathway analysis methods |
title_sort | strategy for evaluating pathway analysis methods |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5640951/ https://www.ncbi.nlm.nih.gov/pubmed/29029625 http://dx.doi.org/10.1186/s12859-017-1866-7 |
work_keys_str_mv | AT yuchenggang astrategyforevaluatingpathwayanalysismethods AT woohyungjun astrategyforevaluatingpathwayanalysismethods AT yuxueping astrategyforevaluatingpathwayanalysismethods AT oyamatatsuya astrategyforevaluatingpathwayanalysismethods AT wallqvistanders astrategyforevaluatingpathwayanalysismethods AT reifmanjaques astrategyforevaluatingpathwayanalysismethods AT yuchenggang strategyforevaluatingpathwayanalysismethods AT woohyungjun strategyforevaluatingpathwayanalysismethods AT yuxueping strategyforevaluatingpathwayanalysismethods AT oyamatatsuya strategyforevaluatingpathwayanalysismethods AT wallqvistanders strategyforevaluatingpathwayanalysismethods AT reifmanjaques strategyforevaluatingpathwayanalysismethods |