Cargando…
A large-scale benchmark of gene prioritization methods
In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to prov...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5399445/ https://www.ncbi.nlm.nih.gov/pubmed/28429739 http://dx.doi.org/10.1038/srep46598 |
_version_ | 1783230648785305600 |
---|---|
author | Guala, Dimitri Sonnhammer, Erik L. L. |
author_facet | Guala, Dimitri Sonnhammer, Erik L. L. |
author_sort | Guala, Dimitri |
collection | PubMed |
description | In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate the performance of gene prioritization tools, a strategy for retrospective benchmarking has been missing, and new tools usually only provide internal validations. The Gene Ontology(GO) contains genes clustered around annotation terms. This intrinsic property of GO can be utilized in construction of robust benchmarks, objective to the problem domain. We demonstrate how this can be achieved for network-based gene prioritization tools, utilizing the FunCoup network. We use cross-validation and a set of appropriate performance measures to compare state-of-the-art gene prioritization algorithms: three based on network diffusion, NetRank and two implementations of Random Walk with Restart, and MaxLink that utilizes network neighborhood. Our benchmark suite provides a systematic and objective way to compare the multitude of available and future gene prioritization tools, enabling researchers to select the best gene prioritization tool for the task at hand, and helping to guide the development of more accurate methods. |
format | Online Article Text |
id | pubmed-5399445 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53994452017-04-21 A large-scale benchmark of gene prioritization methods Guala, Dimitri Sonnhammer, Erik L. L. Sci Rep Article In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate the performance of gene prioritization tools, a strategy for retrospective benchmarking has been missing, and new tools usually only provide internal validations. The Gene Ontology(GO) contains genes clustered around annotation terms. This intrinsic property of GO can be utilized in construction of robust benchmarks, objective to the problem domain. We demonstrate how this can be achieved for network-based gene prioritization tools, utilizing the FunCoup network. We use cross-validation and a set of appropriate performance measures to compare state-of-the-art gene prioritization algorithms: three based on network diffusion, NetRank and two implementations of Random Walk with Restart, and MaxLink that utilizes network neighborhood. Our benchmark suite provides a systematic and objective way to compare the multitude of available and future gene prioritization tools, enabling researchers to select the best gene prioritization tool for the task at hand, and helping to guide the development of more accurate methods. Nature Publishing Group 2017-04-21 /pmc/articles/PMC5399445/ /pubmed/28429739 http://dx.doi.org/10.1038/srep46598 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Guala, Dimitri Sonnhammer, Erik L. L. A large-scale benchmark of gene prioritization methods |
title | A large-scale benchmark of gene prioritization methods |
title_full | A large-scale benchmark of gene prioritization methods |
title_fullStr | A large-scale benchmark of gene prioritization methods |
title_full_unstemmed | A large-scale benchmark of gene prioritization methods |
title_short | A large-scale benchmark of gene prioritization methods |
title_sort | large-scale benchmark of gene prioritization methods |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5399445/ https://www.ncbi.nlm.nih.gov/pubmed/28429739 http://dx.doi.org/10.1038/srep46598 |
work_keys_str_mv | AT gualadimitri alargescalebenchmarkofgeneprioritizationmethods AT sonnhammererikll alargescalebenchmarkofgeneprioritizationmethods AT gualadimitri largescalebenchmarkofgeneprioritizationmethods AT sonnhammererikll largescalebenchmarkofgeneprioritizationmethods |