Cargando…

FocusHeuristics – expression-data-driven network optimization and disease gene prediction

To identify genes contributing to disease phenotypes remains a challenge for bioinformatics. Static knowledge on biological networks is often combined with the dynamics observed in gene expression levels over disease development, to find markers for diagnostics and therapy, and also putative disease...

Descripción completa

Detalles Bibliográficos
Autores principales: Ernst, Mathias, Du, Yang, Warsow, Gregor, Hamed, Mohamed, Endlich, Nicole, Endlich, Karlhans, Murua Escobar, Hugo, Sklarz, Lisa-Madeleine, Sender, Sina, Junghanß, Christian, Möller, Steffen, Fuellen, Georg, Struckmann, Stephan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5311990/
https://www.ncbi.nlm.nih.gov/pubmed/28205611
http://dx.doi.org/10.1038/srep42638
_version_ 1782508126663081984
author Ernst, Mathias
Du, Yang
Warsow, Gregor
Hamed, Mohamed
Endlich, Nicole
Endlich, Karlhans
Murua Escobar, Hugo
Sklarz, Lisa-Madeleine
Sender, Sina
Junghanß, Christian
Möller, Steffen
Fuellen, Georg
Struckmann, Stephan
author_facet Ernst, Mathias
Du, Yang
Warsow, Gregor
Hamed, Mohamed
Endlich, Nicole
Endlich, Karlhans
Murua Escobar, Hugo
Sklarz, Lisa-Madeleine
Sender, Sina
Junghanß, Christian
Möller, Steffen
Fuellen, Georg
Struckmann, Stephan
author_sort Ernst, Mathias
collection PubMed
description To identify genes contributing to disease phenotypes remains a challenge for bioinformatics. Static knowledge on biological networks is often combined with the dynamics observed in gene expression levels over disease development, to find markers for diagnostics and therapy, and also putative disease-modulatory drug targets and drugs. The basis of current methods ranges from a focus on expression-levels (Limma) to concentrating on network characteristics (PageRank, HITS/Authority Score), and both (DeMAND, Local Radiality). We present an integrative approach (the FocusHeuristics) that is thoroughly evaluated based on public expression data and molecular disease characteristics provided by DisGeNet. The FocusHeuristics combines three scores, i.e. the log fold change and another two, based on the sum and difference of log fold changes of genes/proteins linked in a network. A gene is kept when one of the scores to which it contributes is above a threshold. Our FocusHeuristics is both, a predictor for gene-disease-association and a bioinformatics method to reduce biological networks to their disease-relevant parts, by highlighting the dynamics observed in expression data. The FocusHeuristics is slightly, but significantly better than other methods by its more successful identification of disease-associated genes measured by AUC, and it delivers mechanistic explanations for its choice of genes.
format Online
Article
Text
id pubmed-5311990
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53119902017-02-23 FocusHeuristics – expression-data-driven network optimization and disease gene prediction Ernst, Mathias Du, Yang Warsow, Gregor Hamed, Mohamed Endlich, Nicole Endlich, Karlhans Murua Escobar, Hugo Sklarz, Lisa-Madeleine Sender, Sina Junghanß, Christian Möller, Steffen Fuellen, Georg Struckmann, Stephan Sci Rep Article To identify genes contributing to disease phenotypes remains a challenge for bioinformatics. Static knowledge on biological networks is often combined with the dynamics observed in gene expression levels over disease development, to find markers for diagnostics and therapy, and also putative disease-modulatory drug targets and drugs. The basis of current methods ranges from a focus on expression-levels (Limma) to concentrating on network characteristics (PageRank, HITS/Authority Score), and both (DeMAND, Local Radiality). We present an integrative approach (the FocusHeuristics) that is thoroughly evaluated based on public expression data and molecular disease characteristics provided by DisGeNet. The FocusHeuristics combines three scores, i.e. the log fold change and another two, based on the sum and difference of log fold changes of genes/proteins linked in a network. A gene is kept when one of the scores to which it contributes is above a threshold. Our FocusHeuristics is both, a predictor for gene-disease-association and a bioinformatics method to reduce biological networks to their disease-relevant parts, by highlighting the dynamics observed in expression data. The FocusHeuristics is slightly, but significantly better than other methods by its more successful identification of disease-associated genes measured by AUC, and it delivers mechanistic explanations for its choice of genes. Nature Publishing Group 2017-02-16 /pmc/articles/PMC5311990/ /pubmed/28205611 http://dx.doi.org/10.1038/srep42638 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Ernst, Mathias
Du, Yang
Warsow, Gregor
Hamed, Mohamed
Endlich, Nicole
Endlich, Karlhans
Murua Escobar, Hugo
Sklarz, Lisa-Madeleine
Sender, Sina
Junghanß, Christian
Möller, Steffen
Fuellen, Georg
Struckmann, Stephan
FocusHeuristics – expression-data-driven network optimization and disease gene prediction
title FocusHeuristics – expression-data-driven network optimization and disease gene prediction
title_full FocusHeuristics – expression-data-driven network optimization and disease gene prediction
title_fullStr FocusHeuristics – expression-data-driven network optimization and disease gene prediction
title_full_unstemmed FocusHeuristics – expression-data-driven network optimization and disease gene prediction
title_short FocusHeuristics – expression-data-driven network optimization and disease gene prediction
title_sort focusheuristics – expression-data-driven network optimization and disease gene prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5311990/
https://www.ncbi.nlm.nih.gov/pubmed/28205611
http://dx.doi.org/10.1038/srep42638
work_keys_str_mv AT ernstmathias focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT duyang focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT warsowgregor focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT hamedmohamed focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT endlichnicole focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT endlichkarlhans focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT muruaescobarhugo focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT sklarzlisamadeleine focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT sendersina focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT junghanßchristian focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT mollersteffen focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT fuellengeorg focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction
AT struckmannstephan focusheuristicsexpressiondatadrivennetworkoptimizationanddiseasegeneprediction