Cargando…

A large-scale benchmark of gene prioritization methods

In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to prov...

Descripción completa

Detalles Bibliográficos
Autores principales: Guala, Dimitri, Sonnhammer, Erik L. L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5399445/
https://www.ncbi.nlm.nih.gov/pubmed/28429739
http://dx.doi.org/10.1038/srep46598
_version_ 1783230648785305600
author Guala, Dimitri
Sonnhammer, Erik L. L.
author_facet Guala, Dimitri
Sonnhammer, Erik L. L.
author_sort Guala, Dimitri
collection PubMed
description In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate the performance of gene prioritization tools, a strategy for retrospective benchmarking has been missing, and new tools usually only provide internal validations. The Gene Ontology(GO) contains genes clustered around annotation terms. This intrinsic property of GO can be utilized in construction of robust benchmarks, objective to the problem domain. We demonstrate how this can be achieved for network-based gene prioritization tools, utilizing the FunCoup network. We use cross-validation and a set of appropriate performance measures to compare state-of-the-art gene prioritization algorithms: three based on network diffusion, NetRank and two implementations of Random Walk with Restart, and MaxLink that utilizes network neighborhood. Our benchmark suite provides a systematic and objective way to compare the multitude of available and future gene prioritization tools, enabling researchers to select the best gene prioritization tool for the task at hand, and helping to guide the development of more accurate methods.
format Online
Article
Text
id pubmed-5399445
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53994452017-04-21 A large-scale benchmark of gene prioritization methods Guala, Dimitri Sonnhammer, Erik L. L. Sci Rep Article In order to maximize the use of results from high-throughput experimental studies, e.g. GWAS, for identification and diagnostics of new disease-associated genes, it is important to have properly analyzed and benchmarked gene prioritization tools. While prospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate the performance of gene prioritization tools, a strategy for retrospective benchmarking has been missing, and new tools usually only provide internal validations. The Gene Ontology(GO) contains genes clustered around annotation terms. This intrinsic property of GO can be utilized in construction of robust benchmarks, objective to the problem domain. We demonstrate how this can be achieved for network-based gene prioritization tools, utilizing the FunCoup network. We use cross-validation and a set of appropriate performance measures to compare state-of-the-art gene prioritization algorithms: three based on network diffusion, NetRank and two implementations of Random Walk with Restart, and MaxLink that utilizes network neighborhood. Our benchmark suite provides a systematic and objective way to compare the multitude of available and future gene prioritization tools, enabling researchers to select the best gene prioritization tool for the task at hand, and helping to guide the development of more accurate methods. Nature Publishing Group 2017-04-21 /pmc/articles/PMC5399445/ /pubmed/28429739 http://dx.doi.org/10.1038/srep46598 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Guala, Dimitri
Sonnhammer, Erik L. L.
A large-scale benchmark of gene prioritization methods
title A large-scale benchmark of gene prioritization methods
title_full A large-scale benchmark of gene prioritization methods
title_fullStr A large-scale benchmark of gene prioritization methods
title_full_unstemmed A large-scale benchmark of gene prioritization methods
title_short A large-scale benchmark of gene prioritization methods
title_sort large-scale benchmark of gene prioritization methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5399445/
https://www.ncbi.nlm.nih.gov/pubmed/28429739
http://dx.doi.org/10.1038/srep46598
work_keys_str_mv AT gualadimitri alargescalebenchmarkofgeneprioritizationmethods
AT sonnhammererikll alargescalebenchmarkofgeneprioritizationmethods
AT gualadimitri largescalebenchmarkofgeneprioritizationmethods
AT sonnhammererikll largescalebenchmarkofgeneprioritizationmethods