Cargando…

Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge

BACKGROUND: Combining different sources of knowledge to build improved structure activity relationship models is not easy owing to the variety of knowledge formats and the absence of a common framework to interoperate between learning techniques. Most of the current approaches address this problem b...

Descripción completa

Detalles Bibliográficos
Autores principales: Hanser, Thierry, Barber, Chris, Rosser, Edward, Vessey, Jonathan D, Webb, Samuel J, Werner, Stéphane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4048587/
https://www.ncbi.nlm.nih.gov/pubmed/24959206
http://dx.doi.org/10.1186/1758-2946-6-21
_version_ 1782480552906981376
author Hanser, Thierry
Barber, Chris
Rosser, Edward
Vessey, Jonathan D
Webb, Samuel J
Werner, Stéphane
author_facet Hanser, Thierry
Barber, Chris
Rosser, Edward
Vessey, Jonathan D
Webb, Samuel J
Werner, Stéphane
author_sort Hanser, Thierry
collection PubMed
description BACKGROUND: Combining different sources of knowledge to build improved structure activity relationship models is not easy owing to the variety of knowledge formats and the absence of a common framework to interoperate between learning techniques. Most of the current approaches address this problem by using consensus models that operate at the prediction level. We explore the possibility to directly combine these sources at the knowledge level, with the aim to harvest potentially increased synergy at an earlier stage. Our goal is to design a general methodology to facilitate knowledge discovery and produce accurate and interpretable models. RESULTS: To combine models at the knowledge level, we propose to decouple the learning phase from the knowledge application phase using a pivot representation (lingua franca) based on the concept of hypothesis. A hypothesis is a simple and interpretable knowledge unit. Regardless of its origin, knowledge is broken down into a collection of hypotheses. These hypotheses are subsequently organised into hierarchical network. This unification permits to combine different sources of knowledge into a common formalised framework. The approach allows us to create a synergistic system between different forms of knowledge and new algorithms can be applied to leverage this unified model. This first article focuses on the general principle of the Self Organising Hypothesis Network (SOHN) approach in the context of binary classification problems along with an illustrative application to the prediction of mutagenicity. CONCLUSION: It is possible to represent knowledge in the unified form of a hypothesis network allowing interpretable predictions with performances comparable to mainstream machine learning techniques. This new approach offers the potential to combine knowledge from different sources into a common framework in which high level reasoning and meta-learning can be applied; these latter perspectives will be explored in future work.
format Online
Article
Text
id pubmed-4048587
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40485872014-06-23 Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge Hanser, Thierry Barber, Chris Rosser, Edward Vessey, Jonathan D Webb, Samuel J Werner, Stéphane J Cheminform Research Article BACKGROUND: Combining different sources of knowledge to build improved structure activity relationship models is not easy owing to the variety of knowledge formats and the absence of a common framework to interoperate between learning techniques. Most of the current approaches address this problem by using consensus models that operate at the prediction level. We explore the possibility to directly combine these sources at the knowledge level, with the aim to harvest potentially increased synergy at an earlier stage. Our goal is to design a general methodology to facilitate knowledge discovery and produce accurate and interpretable models. RESULTS: To combine models at the knowledge level, we propose to decouple the learning phase from the knowledge application phase using a pivot representation (lingua franca) based on the concept of hypothesis. A hypothesis is a simple and interpretable knowledge unit. Regardless of its origin, knowledge is broken down into a collection of hypotheses. These hypotheses are subsequently organised into hierarchical network. This unification permits to combine different sources of knowledge into a common formalised framework. The approach allows us to create a synergistic system between different forms of knowledge and new algorithms can be applied to leverage this unified model. This first article focuses on the general principle of the Self Organising Hypothesis Network (SOHN) approach in the context of binary classification problems along with an illustrative application to the prediction of mutagenicity. CONCLUSION: It is possible to represent knowledge in the unified form of a hypothesis network allowing interpretable predictions with performances comparable to mainstream machine learning techniques. This new approach offers the potential to combine knowledge from different sources into a common framework in which high level reasoning and meta-learning can be applied; these latter perspectives will be explored in future work. BioMed Central 2014-05-08 /pmc/articles/PMC4048587/ /pubmed/24959206 http://dx.doi.org/10.1186/1758-2946-6-21 Text en Copyright © 2014 Hanser et al.; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Hanser, Thierry
Barber, Chris
Rosser, Edward
Vessey, Jonathan D
Webb, Samuel J
Werner, Stéphane
Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge
title Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge
title_full Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge
title_fullStr Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge
title_full_unstemmed Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge
title_short Self organising hypothesis networks: a new approach for representing and structuring SAR knowledge
title_sort self organising hypothesis networks: a new approach for representing and structuring sar knowledge
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4048587/
https://www.ncbi.nlm.nih.gov/pubmed/24959206
http://dx.doi.org/10.1186/1758-2946-6-21
work_keys_str_mv AT hanserthierry selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge
AT barberchris selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge
AT rosseredward selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge
AT vesseyjonathand selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge
AT webbsamuelj selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge
AT wernerstephane selforganisinghypothesisnetworksanewapproachforrepresentingandstructuringsarknowledge