Cargando…
Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge
BACKGROUND: Combining different sources of knowledge to build improved structure activity relationship models is not easy owing to the variety of knowledge formats and the absence of a common framework to interoperate between learning techniques. Most of the current approaches address this problem b...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4048587/ https://www.ncbi.nlm.nih.gov/pubmed/24959206 http://dx.doi.org/10.1186/1758-2946-6-21 |
_version_ | 1782480552906981376 |
---|---|
author | Hanser, Thierry Barber, Chris Rosser, Edward Vessey, Jonathan D Webb, Samuel J Werner, Stéphane |
author_facet | Hanser, Thierry Barber, Chris Rosser, Edward Vessey, Jonathan D Webb, Samuel J Werner, Stéphane |
author_sort | Hanser, Thierry |
collection | PubMed |
description | BACKGROUND: Combining different sources of knowledge to build improved structure activity relationship models is not easy owing to the variety of knowledge formats and the absence of a common framework to interoperate between learning techniques. Most of the current approaches address this problem by using consensus models that operate at the prediction level. We explore the possibility to directly combine these sources at the knowledge level, with the aim to harvest potentially increased synergy at an earlier stage. Our goal is to design a general methodology to facilitate knowledge discovery and produce accurate and interpretable models. RESULTS: To combine models at the knowledge level, we propose to decouple the learning phase from the knowledge application phase using a pivot representation (lingua franca) based on the concept of hypothesis. A hypothesis is a simple and interpretable knowledge unit. Regardless of its origin, knowledge is broken down into a collection of hypotheses. These hypotheses are subsequently organised into hierarchical network. This unification permits to combine different sources of knowledge into a common formalised framework. The approach allows us to create a synergistic system between different forms of knowledge and new algorithms can be applied to leverage this unified model. This first article focuses on the general principle of the Self Organising Hypothesis Network (SOHN) approach in the context of binary classification problems along with an illustrative application to the prediction of mutagenicity. CONCLUSION: It is possible to represent knowledge in the unified form of a hypothesis network allowing interpretable predictions with performances comparable to mainstream machine learning techniques. This new approach offers the potential to combine knowledge from different sources into a common framework in which high level reasoning and meta-learning can be applied; these latter perspectives will be explored in future work. |
format | Online Article Text |
id | pubmed-4048587 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40485872014-06-23 Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge Hanser, Thierry Barber, Chris Rosser, Edward Vessey, Jonathan D Webb, Samuel J Werner, Stéphane J Cheminform Research Article BACKGROUND: Combining different sources of knowledge to build improved structure activity relationship models is not easy owing to the variety of knowledge formats and the absence of a common framework to interoperate between learning techniques. Most of the current approaches address this problem by using consensus models that operate at the prediction level. We explore the possibility to directly combine these sources at the knowledge level, with the aim to harvest potentially increased synergy at an earlier stage. Our goal is to design a general methodology to facilitate knowledge discovery and produce accurate and interpretable models. RESULTS: To combine models at the knowledge level, we propose to decouple the learning phase from the knowledge application phase using a pivot representation (lingua franca) based on the concept of hypothesis. A hypothesis is a simple and interpretable knowledge unit. Regardless of its origin, knowledge is broken down into a collection of hypotheses. These hypotheses are subsequently organised into hierarchical network. This unification permits to combine different sources of knowledge into a common formalised framework. The approach allows us to create a synergistic system between different forms of knowledge and new algorithms can be applied to leverage this unified model. This first article focuses on the general principle of the Self Organising Hypothesis Network (SOHN) approach in the context of binary classification problems along with an illustrative application to the prediction of mutagenicity. CONCLUSION: It is possible to represent knowledge in the unified form of a hypothesis network allowing interpretable predictions with performances comparable to mainstream machine learning techniques. This new approach offers the potential to combine knowledge from different sources into a common framework in which high level reasoning and meta-learning can be applied; these latter perspectives will be explored in future work. BioMed Central 2014-05-08 /pmc/articles/PMC4048587/ /pubmed/24959206 http://dx.doi.org/10.1186/1758-2946-6-21 Text en Copyright © 2014 Hanser et al.; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Hanser, Thierry Barber, Chris Rosser, Edward Vessey, Jonathan D Webb, Samuel J Werner, Stéphane Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge |
title | Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge |
title_full | Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge |
title_fullStr | Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge |
title_full_unstemmed | Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge |
title_short | Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge |
title_sort | self organising hypothesis networks: a new approach for representing and structuring sar knowledge |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4048587/ https://www.ncbi.nlm.nih.gov/pubmed/24959206 http://dx.doi.org/10.1186/1758-2946-6-21 |
work_keys_str_mv | AT hanserthierry selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge AT barberchris selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge AT rosseredward selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge AT vesseyjonathand selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge AT webbsamuelj selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge AT wernerstephane selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge |