Cargando…
Non-redundant association rules between diseases and medications: an automated method for knowledge base construction
BACKGROUND: The widespread use of electronic health records (EHRs) has generated massive clinical data storage. Association rules mining is a feasible technique to convert this large amount of data into usable knowledge for clinical decision making, research or billing. We present a data driven meth...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4415340/ https://www.ncbi.nlm.nih.gov/pubmed/25888890 http://dx.doi.org/10.1186/s12911-015-0151-9 |
_version_ | 1782369059278422016 |
---|---|
author | Séverac, François Sauleau, Erik A Meyer, Nicolas Lefèvre, Hassina Nisand, Gabriel Jay, Nicolas |
author_facet | Séverac, François Sauleau, Erik A Meyer, Nicolas Lefèvre, Hassina Nisand, Gabriel Jay, Nicolas |
author_sort | Séverac, François |
collection | PubMed |
description | BACKGROUND: The widespread use of electronic health records (EHRs) has generated massive clinical data storage. Association rules mining is a feasible technique to convert this large amount of data into usable knowledge for clinical decision making, research or billing. We present a data driven method to create a knowledge base linking medications to pathological conditions through their therapeutic indications from elements within the EHRs. METHODS: Association rules were created from the data of patients hospitalised between May 2012 and May 2013 in the department of Cardiology at the University Hospital of Strasbourg. Medications were extracted from the medication list, and the pathological conditions were extracted from the discharge summaries using a natural language processing tool. Association rules were generated along with different interestingness measures: chi square, lift, conviction, dependency, novelty and satisfaction. All medication-disease pairs were compared to the Summary of Product Characteristics, which is the gold standard. A score based on the other interestingness measures was created to filter the best rules, and the indices were calculated for the different interestingness measures. RESULTS: After the evaluation against the gold standard, a list of accurate association rules was successfully retrieved. Dependency represents the best recall (0.76). Our score exhibited higher exactness (0.84) and precision (0.27) than all of the others interestingness measures. Further reductions in noise produced by this method must be performed to improve the classification precision. CONCLUSIONS: Association rules mining using the unstructured elements of the EHR is a feasible technique to identify clinically accurate associations between medications and pathological conditions. |
format | Online Article Text |
id | pubmed-4415340 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44153402015-05-01 Non-redundant association rules between diseases and medications: an automated method for knowledge base construction Séverac, François Sauleau, Erik A Meyer, Nicolas Lefèvre, Hassina Nisand, Gabriel Jay, Nicolas BMC Med Inform Decis Mak Research Article BACKGROUND: The widespread use of electronic health records (EHRs) has generated massive clinical data storage. Association rules mining is a feasible technique to convert this large amount of data into usable knowledge for clinical decision making, research or billing. We present a data driven method to create a knowledge base linking medications to pathological conditions through their therapeutic indications from elements within the EHRs. METHODS: Association rules were created from the data of patients hospitalised between May 2012 and May 2013 in the department of Cardiology at the University Hospital of Strasbourg. Medications were extracted from the medication list, and the pathological conditions were extracted from the discharge summaries using a natural language processing tool. Association rules were generated along with different interestingness measures: chi square, lift, conviction, dependency, novelty and satisfaction. All medication-disease pairs were compared to the Summary of Product Characteristics, which is the gold standard. A score based on the other interestingness measures was created to filter the best rules, and the indices were calculated for the different interestingness measures. RESULTS: After the evaluation against the gold standard, a list of accurate association rules was successfully retrieved. Dependency represents the best recall (0.76). Our score exhibited higher exactness (0.84) and precision (0.27) than all of the others interestingness measures. Further reductions in noise produced by this method must be performed to improve the classification precision. CONCLUSIONS: Association rules mining using the unstructured elements of the EHR is a feasible technique to identify clinically accurate associations between medications and pathological conditions. BioMed Central 2015-04-15 /pmc/articles/PMC4415340/ /pubmed/25888890 http://dx.doi.org/10.1186/s12911-015-0151-9 Text en © Séverac et al.; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Séverac, François Sauleau, Erik A Meyer, Nicolas Lefèvre, Hassina Nisand, Gabriel Jay, Nicolas Non-redundant association rules between diseases and medications: an automated method for knowledge base construction |
title | Non-redundant association rules between diseases and medications: an automated method for knowledge base construction |
title_full | Non-redundant association rules between diseases and medications: an automated method for knowledge base construction |
title_fullStr | Non-redundant association rules between diseases and medications: an automated method for knowledge base construction |
title_full_unstemmed | Non-redundant association rules between diseases and medications: an automated method for knowledge base construction |
title_short | Non-redundant association rules between diseases and medications: an automated method for knowledge base construction |
title_sort | non-redundant association rules between diseases and medications: an automated method for knowledge base construction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4415340/ https://www.ncbi.nlm.nih.gov/pubmed/25888890 http://dx.doi.org/10.1186/s12911-015-0151-9 |
work_keys_str_mv | AT severacfrancois nonredundantassociationrulesbetweendiseasesandmedicationsanautomatedmethodforknowledgebaseconstruction AT sauleauerika nonredundantassociationrulesbetweendiseasesandmedicationsanautomatedmethodforknowledgebaseconstruction AT meyernicolas nonredundantassociationrulesbetweendiseasesandmedicationsanautomatedmethodforknowledgebaseconstruction AT lefevrehassina nonredundantassociationrulesbetweendiseasesandmedicationsanautomatedmethodforknowledgebaseconstruction AT nisandgabriel nonredundantassociationrulesbetweendiseasesandmedicationsanautomatedmethodforknowledgebaseconstruction AT jaynicolas nonredundantassociationrulesbetweendiseasesandmedicationsanautomatedmethodforknowledgebaseconstruction |