Cargando…

Functional networks inference from rule-based machine learning models

BACKGROUND: Functional networks play an important role in the analysis of biological processes and systems. The inference of these networks from high-throughput (-omics) data is an area of intense research. So far, the similarity-based inference paradigm (e.g. gene co-expression) has been the most p...

Descripción completa

Detalles Bibliográficos
Autores principales: Lazzarini, Nicola, Widera, Paweł, Williamson, Stuart, Heer, Rakesh, Krasnogor, Natalio, Bacardit, Jaume
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5011349/
https://www.ncbi.nlm.nih.gov/pubmed/27597880
http://dx.doi.org/10.1186/s13040-016-0106-4
_version_ 1782451814397902848
author Lazzarini, Nicola
Widera, Paweł,
Williamson, Stuart
Heer, Rakesh
Krasnogor, Natalio
Bacardit, Jaume
author_facet Lazzarini, Nicola
Widera, Paweł,
Williamson, Stuart
Heer, Rakesh
Krasnogor, Natalio
Bacardit, Jaume
author_sort Lazzarini, Nicola
collection PubMed
description BACKGROUND: Functional networks play an important role in the analysis of biological processes and systems. The inference of these networks from high-throughput (-omics) data is an area of intense research. So far, the similarity-based inference paradigm (e.g. gene co-expression) has been the most popular approach. It assumes a functional relationship between genes which are expressed at similar levels across different samples. An alternative to this paradigm is the inference of relationships from the structure of machine learning models. These models are able to capture complex relationships between variables, that often are different/complementary to the similarity-based methods. RESULTS: We propose a protocol to infer functional networks from machine learning models, called FuNeL. It assumes, that genes used together within a rule-based machine learning model to classify the samples, might also be functionally related at a biological level. The protocol is first tested on synthetic datasets and then evaluated on a test suite of 8 real-world datasets related to human cancer. The networks inferred from the real-world data are compared against gene co-expression networks of equal size, generated with 3 different methods. The comparison is performed from two different points of view. We analyse the enriched biological terms in the set of network nodes and the relationships between known disease-associated genes in a context of the network topology. The comparison confirms both the biological relevance and the complementary character of the knowledge captured by the FuNeL networks in relation to similarity-based methods and demonstrates its potential to identify known disease associations as core elements of the network. Finally, using a prostate cancer dataset as a case study, we confirm that the biological knowledge captured by our method is relevant to the disease and consistent with the specialised literature and with an independent dataset not used in the inference process. AVAILABILITY: The implementation of our network inference protocol is available at: http://ico2s.org/software/funel.html ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-016-0106-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5011349
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50113492016-09-06 Functional networks inference from rule-based machine learning models Lazzarini, Nicola Widera, Paweł, Williamson, Stuart Heer, Rakesh Krasnogor, Natalio Bacardit, Jaume BioData Min Methodology BACKGROUND: Functional networks play an important role in the analysis of biological processes and systems. The inference of these networks from high-throughput (-omics) data is an area of intense research. So far, the similarity-based inference paradigm (e.g. gene co-expression) has been the most popular approach. It assumes a functional relationship between genes which are expressed at similar levels across different samples. An alternative to this paradigm is the inference of relationships from the structure of machine learning models. These models are able to capture complex relationships between variables, that often are different/complementary to the similarity-based methods. RESULTS: We propose a protocol to infer functional networks from machine learning models, called FuNeL. It assumes, that genes used together within a rule-based machine learning model to classify the samples, might also be functionally related at a biological level. The protocol is first tested on synthetic datasets and then evaluated on a test suite of 8 real-world datasets related to human cancer. The networks inferred from the real-world data are compared against gene co-expression networks of equal size, generated with 3 different methods. The comparison is performed from two different points of view. We analyse the enriched biological terms in the set of network nodes and the relationships between known disease-associated genes in a context of the network topology. The comparison confirms both the biological relevance and the complementary character of the knowledge captured by the FuNeL networks in relation to similarity-based methods and demonstrates its potential to identify known disease associations as core elements of the network. Finally, using a prostate cancer dataset as a case study, we confirm that the biological knowledge captured by our method is relevant to the disease and consistent with the specialised literature and with an independent dataset not used in the inference process. AVAILABILITY: The implementation of our network inference protocol is available at: http://ico2s.org/software/funel.html ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-016-0106-4) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-05 /pmc/articles/PMC5011349/ /pubmed/27597880 http://dx.doi.org/10.1186/s13040-016-0106-4 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Lazzarini, Nicola
Widera, Paweł,
Williamson, Stuart
Heer, Rakesh
Krasnogor, Natalio
Bacardit, Jaume
Functional networks inference from rule-based machine learning models
title Functional networks inference from rule-based machine learning models
title_full Functional networks inference from rule-based machine learning models
title_fullStr Functional networks inference from rule-based machine learning models
title_full_unstemmed Functional networks inference from rule-based machine learning models
title_short Functional networks inference from rule-based machine learning models
title_sort functional networks inference from rule-based machine learning models
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5011349/
https://www.ncbi.nlm.nih.gov/pubmed/27597880
http://dx.doi.org/10.1186/s13040-016-0106-4
work_keys_str_mv AT lazzarininicola functionalnetworksinferencefromrulebasedmachinelearningmodels
AT widerapaweł functionalnetworksinferencefromrulebasedmachinelearningmodels
AT williamsonstuart functionalnetworksinferencefromrulebasedmachinelearningmodels
AT heerrakesh functionalnetworksinferencefromrulebasedmachinelearningmodels
AT krasnogornatalio functionalnetworksinferencefromrulebasedmachinelearningmodels
AT bacarditjaume functionalnetworksinferencefromrulebasedmachinelearningmodels