Cargando…

Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain

BACKGROUND: Lately, ontologies have become a fundamental building block in the process of formalising and storing complex biomedical information. With the currently existing wealth of formalised knowledge, the ability to discover implicit relationships between different ontological concepts becomes...

Descripción completa

Detalles Bibliográficos
Autores principales: Paul, Razan, Groza, Tudor, Hunter, Jane, Zankl, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3936824/
https://www.ncbi.nlm.nih.gov/pubmed/24499729
http://dx.doi.org/10.1186/2041-1480-5-8
_version_ 1782305371320221696
author Paul, Razan
Groza, Tudor
Hunter, Jane
Zankl, Andreas
author_facet Paul, Razan
Groza, Tudor
Hunter, Jane
Zankl, Andreas
author_sort Paul, Razan
collection PubMed
description BACKGROUND: Lately, ontologies have become a fundamental building block in the process of formalising and storing complex biomedical information. With the currently existing wealth of formalised knowledge, the ability to discover implicit relationships between different ontological concepts becomes particularly important. One of the most widely used methods to achieve this is association rule mining. However, while previous research exists on applying traditional association rule mining on ontologies, no approach has, to date, exploited the advantages brought by using the structure of these ontologies in computing rule interestingness measures. RESULTS: We introduce a method that combines concept similarity metrics, formulated using the intrinsic structure of a given ontology, with traditional interestingness measures to compute semantic interestingness measures in the process of association rule mining. We apply the method in our domain of interest – bone dysplasias – using the core ontologies characterising it and an annotated dataset of patient clinical summaries, with the goal of discovering implicit relationships between clinical features and disorders. Experimental results show that, using the above mentioned dataset and a voting strategy classification evaluation, the best scoring traditional interestingness measure achieves an accuracy of 57.33%, while the best scoring semantic interestingness measure achieves an accuracy of 64.38%, both at the recall cut-off point 5. CONCLUSIONS: Semantic interestingness measures outperform the traditional ones, and hence show that they are able to exploit the semantic similarities inherently present between ontological concepts. Nevertheless, this is dependent on the domain, and implicitly, on the semantic similarity metric chosen to model it.
format Online
Article
Text
id pubmed-3936824
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39368242014-03-06 Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain Paul, Razan Groza, Tudor Hunter, Jane Zankl, Andreas J Biomed Semantics Research BACKGROUND: Lately, ontologies have become a fundamental building block in the process of formalising and storing complex biomedical information. With the currently existing wealth of formalised knowledge, the ability to discover implicit relationships between different ontological concepts becomes particularly important. One of the most widely used methods to achieve this is association rule mining. However, while previous research exists on applying traditional association rule mining on ontologies, no approach has, to date, exploited the advantages brought by using the structure of these ontologies in computing rule interestingness measures. RESULTS: We introduce a method that combines concept similarity metrics, formulated using the intrinsic structure of a given ontology, with traditional interestingness measures to compute semantic interestingness measures in the process of association rule mining. We apply the method in our domain of interest – bone dysplasias – using the core ontologies characterising it and an annotated dataset of patient clinical summaries, with the goal of discovering implicit relationships between clinical features and disorders. Experimental results show that, using the above mentioned dataset and a voting strategy classification evaluation, the best scoring traditional interestingness measure achieves an accuracy of 57.33%, while the best scoring semantic interestingness measure achieves an accuracy of 64.38%, both at the recall cut-off point 5. CONCLUSIONS: Semantic interestingness measures outperform the traditional ones, and hence show that they are able to exploit the semantic similarities inherently present between ontological concepts. Nevertheless, this is dependent on the domain, and implicitly, on the semantic similarity metric chosen to model it. BioMed Central 2014-02-05 /pmc/articles/PMC3936824/ /pubmed/24499729 http://dx.doi.org/10.1186/2041-1480-5-8 Text en Copyright © 2014 Paul et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Paul, Razan
Groza, Tudor
Hunter, Jane
Zankl, Andreas
Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain
title Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain
title_full Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain
title_fullStr Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain
title_full_unstemmed Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain
title_short Semantic interestingness measures for discovering association rules in the skeletal dysplasia domain
title_sort semantic interestingness measures for discovering association rules in the skeletal dysplasia domain
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3936824/
https://www.ncbi.nlm.nih.gov/pubmed/24499729
http://dx.doi.org/10.1186/2041-1480-5-8
work_keys_str_mv AT paulrazan semanticinterestingnessmeasuresfordiscoveringassociationrulesintheskeletaldysplasiadomain
AT grozatudor semanticinterestingnessmeasuresfordiscoveringassociationrulesintheskeletaldysplasiadomain
AT hunterjane semanticinterestingnessmeasuresfordiscoveringassociationrulesintheskeletaldysplasiadomain
AT zanklandreas semanticinterestingnessmeasuresfordiscoveringassociationrulesintheskeletaldysplasiadomain