Cargando…

Comparison of Machine Learning Algorithms for Classification of the Sentences in Three Clinical Practice Guidelines

OBJECTIVES: Clinical Practice Guidelines (CPGs) are an effective tool for minimizing the gap between a physician's clinical decision and medical evidence and for modeling the systematic and standardized pathway used to provide better medical treatment to patients. METHODS: In this study, senten...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Mi Hwa, Lee, Young Ho, Kang, Un Gu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society of Medical Informatics 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3633167/
https://www.ncbi.nlm.nih.gov/pubmed/23626914
http://dx.doi.org/10.4258/hir.2013.19.1.16
_version_ 1782266941594927104
author Song, Mi Hwa
Lee, Young Ho
Kang, Un Gu
author_facet Song, Mi Hwa
Lee, Young Ho
Kang, Un Gu
author_sort Song, Mi Hwa
collection PubMed
description OBJECTIVES: Clinical Practice Guidelines (CPGs) are an effective tool for minimizing the gap between a physician's clinical decision and medical evidence and for modeling the systematic and standardized pathway used to provide better medical treatment to patients. METHODS: In this study, sentences within the clinical guidelines are categorized according to a classification system. We used three clinical guidelines that incorporated knowledge from medical experts in the field of family medicine. These were the seventh report of the Joint National Committee (JNC7) on Prevention, Detection, Evaluation, and Treatment of High Blood Pressure from the National Heart, Lung, and Blood Institute; the third report of the National Cholesterol Education Program (NCEP) Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults from the same institution; and the Standards of Medical Care in Diabetes 2010 report from the American Diabetes Association. Three annotators each tagged 346 sentences hand-chosen from these three clinical guidelines. The three annotators then carried out cross-validations of the tagged corpus. We also used various machine learning-based classifiers for sentence classification. RESULTS: We conducted experiments using real-valued features and token units, as well as a Boolean feature. The results showed that the combination of maximum entropy-based learning and information gain-based feature extraction gave the best classification performance (over 98% f-measure) in four sentence categories. CONCLUSIONS: This result confirmed the contribution of the feature reduction algorithm and optimal technique for very sparse feature spaces, such as the sentence classification problem in the clinical guideline document.
format Online
Article
Text
id pubmed-3633167
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Korean Society of Medical Informatics
record_format MEDLINE/PubMed
spelling pubmed-36331672013-04-26 Comparison of Machine Learning Algorithms for Classification of the Sentences in Three Clinical Practice Guidelines Song, Mi Hwa Lee, Young Ho Kang, Un Gu Healthc Inform Res Original Article OBJECTIVES: Clinical Practice Guidelines (CPGs) are an effective tool for minimizing the gap between a physician's clinical decision and medical evidence and for modeling the systematic and standardized pathway used to provide better medical treatment to patients. METHODS: In this study, sentences within the clinical guidelines are categorized according to a classification system. We used three clinical guidelines that incorporated knowledge from medical experts in the field of family medicine. These were the seventh report of the Joint National Committee (JNC7) on Prevention, Detection, Evaluation, and Treatment of High Blood Pressure from the National Heart, Lung, and Blood Institute; the third report of the National Cholesterol Education Program (NCEP) Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults from the same institution; and the Standards of Medical Care in Diabetes 2010 report from the American Diabetes Association. Three annotators each tagged 346 sentences hand-chosen from these three clinical guidelines. The three annotators then carried out cross-validations of the tagged corpus. We also used various machine learning-based classifiers for sentence classification. RESULTS: We conducted experiments using real-valued features and token units, as well as a Boolean feature. The results showed that the combination of maximum entropy-based learning and information gain-based feature extraction gave the best classification performance (over 98% f-measure) in four sentence categories. CONCLUSIONS: This result confirmed the contribution of the feature reduction algorithm and optimal technique for very sparse feature spaces, such as the sentence classification problem in the clinical guideline document. Korean Society of Medical Informatics 2013-03 2013-03-31 /pmc/articles/PMC3633167/ /pubmed/23626914 http://dx.doi.org/10.4258/hir.2013.19.1.16 Text en © 2013 The Korean Society of Medical Informatics http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Song, Mi Hwa
Lee, Young Ho
Kang, Un Gu
Comparison of Machine Learning Algorithms for Classification of the Sentences in Three Clinical Practice Guidelines
title Comparison of Machine Learning Algorithms for Classification of the Sentences in Three Clinical Practice Guidelines
title_full Comparison of Machine Learning Algorithms for Classification of the Sentences in Three Clinical Practice Guidelines
title_fullStr Comparison of Machine Learning Algorithms for Classification of the Sentences in Three Clinical Practice Guidelines
title_full_unstemmed Comparison of Machine Learning Algorithms for Classification of the Sentences in Three Clinical Practice Guidelines
title_short Comparison of Machine Learning Algorithms for Classification of the Sentences in Three Clinical Practice Guidelines
title_sort comparison of machine learning algorithms for classification of the sentences in three clinical practice guidelines
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3633167/
https://www.ncbi.nlm.nih.gov/pubmed/23626914
http://dx.doi.org/10.4258/hir.2013.19.1.16
work_keys_str_mv AT songmihwa comparisonofmachinelearningalgorithmsforclassificationofthesentencesinthreeclinicalpracticeguidelines
AT leeyoungho comparisonofmachinelearningalgorithmsforclassificationofthesentencesinthreeclinicalpracticeguidelines
AT kangungu comparisonofmachinelearningalgorithmsforclassificationofthesentencesinthreeclinicalpracticeguidelines