Cargando…

Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation

BACKGROUND: Automatic text summarization (ATS) enables users to retrieve meaningful evidence from big data of biomedical repositories to make complex clinical decisions. Deep neural and recurrent networks outperform traditional machine-learning techniques in areas of natural language processing and...

Descripción completa

Detalles Bibliográficos
Autores principales:	Afzal, Muhammad, Alam, Fakhare, Malik, Khalid Mahmood, Malik, Ghaus M
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2020
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7647812/ https://www.ncbi.nlm.nih.gov/pubmed/33095174 http://dx.doi.org/10.2196/19810

_version_	1783606986476093440
author	Afzal, Muhammad Alam, Fakhare Malik, Khalid Mahmood Malik, Ghaus M
author_facet	Afzal, Muhammad Alam, Fakhare Malik, Khalid Mahmood Malik, Ghaus M
author_sort	Afzal, Muhammad
collection	PubMed
description	BACKGROUND: Automatic text summarization (ATS) enables users to retrieve meaningful evidence from big data of biomedical repositories to make complex clinical decisions. Deep neural and recurrent networks outperform traditional machine-learning techniques in areas of natural language processing and computer vision; however, they are yet to be explored in the ATS domain, particularly for medical text summarization. OBJECTIVE: Traditional approaches in ATS for biomedical text suffer from fundamental issues such as an inability to capture clinical context, quality of evidence, and purpose-driven selection of passages for the summary. We aimed to circumvent these limitations through achieving precise, succinct, and coherent information extraction from credible published biomedical resources, and to construct a simplified summary containing the most informative content that can offer a review particular to clinical needs. METHODS: In our proposed approach, we introduce a novel framework, termed Biomed-Summarizer, that provides quality-aware Patient/Problem, Intervention, Comparison, and Outcome (PICO)-based intelligent and context-enabled summarization of biomedical text. Biomed-Summarizer integrates the prognosis quality recognition model with a clinical context–aware model to locate text sequences in the body of a biomedical article for use in the final summary. First, we developed a deep neural network binary classifier for quality recognition to acquire scientifically sound studies and filter out others. Second, we developed a bidirectional long-short term memory recurrent neural network as a clinical context–aware classifier, which was trained on semantically enriched features generated using a word-embedding tokenizer for identification of meaningful sentences representing PICO text sequences. Third, we calculated the similarity between query and PICO text sequences using Jaccard similarity with semantic enrichments, where the semantic enrichments are obtained using medical ontologies. Last, we generated a representative summary from the high-scoring PICO sequences aggregated by study type, publication credibility, and freshness score. RESULTS: Evaluation of the prognosis quality recognition model using a large dataset of biomedical literature related to intracranial aneurysm showed an accuracy of 95.41% (2562/2686) in terms of recognizing quality articles. The clinical context–aware multiclass classifier outperformed the traditional machine-learning algorithms, including support vector machine, gradient boosted tree, linear regression, K-nearest neighbor, and naïve Bayes, by achieving 93% (16127/17341) accuracy for classifying five categories: aim, population, intervention, results, and outcome. The semantic similarity algorithm achieved a significant Pearson correlation coefficient of 0.61 (0-1 scale) on a well-known BIOSSES dataset (with 100 pair sentences) after semantic enrichment, representing an improvement of 8.9% over baseline Jaccard similarity. Finally, we found a highly positive correlation among the evaluations performed by three domain experts concerning different metrics, suggesting that the automated summarization is satisfactory. CONCLUSIONS: By employing the proposed method Biomed-Summarizer, high accuracy in ATS was achieved, enabling seamless curation of research evidence from the biomedical literature to use for clinical decision-making.
format	Online Article Text
id	pubmed-7647812
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-76478122020-11-17 Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation Afzal, Muhammad Alam, Fakhare Malik, Khalid Mahmood Malik, Ghaus M J Med Internet Res Original Paper BACKGROUND: Automatic text summarization (ATS) enables users to retrieve meaningful evidence from big data of biomedical repositories to make complex clinical decisions. Deep neural and recurrent networks outperform traditional machine-learning techniques in areas of natural language processing and computer vision; however, they are yet to be explored in the ATS domain, particularly for medical text summarization. OBJECTIVE: Traditional approaches in ATS for biomedical text suffer from fundamental issues such as an inability to capture clinical context, quality of evidence, and purpose-driven selection of passages for the summary. We aimed to circumvent these limitations through achieving precise, succinct, and coherent information extraction from credible published biomedical resources, and to construct a simplified summary containing the most informative content that can offer a review particular to clinical needs. METHODS: In our proposed approach, we introduce a novel framework, termed Biomed-Summarizer, that provides quality-aware Patient/Problem, Intervention, Comparison, and Outcome (PICO)-based intelligent and context-enabled summarization of biomedical text. Biomed-Summarizer integrates the prognosis quality recognition model with a clinical context–aware model to locate text sequences in the body of a biomedical article for use in the final summary. First, we developed a deep neural network binary classifier for quality recognition to acquire scientifically sound studies and filter out others. Second, we developed a bidirectional long-short term memory recurrent neural network as a clinical context–aware classifier, which was trained on semantically enriched features generated using a word-embedding tokenizer for identification of meaningful sentences representing PICO text sequences. Third, we calculated the similarity between query and PICO text sequences using Jaccard similarity with semantic enrichments, where the semantic enrichments are obtained using medical ontologies. Last, we generated a representative summary from the high-scoring PICO sequences aggregated by study type, publication credibility, and freshness score. RESULTS: Evaluation of the prognosis quality recognition model using a large dataset of biomedical literature related to intracranial aneurysm showed an accuracy of 95.41% (2562/2686) in terms of recognizing quality articles. The clinical context–aware multiclass classifier outperformed the traditional machine-learning algorithms, including support vector machine, gradient boosted tree, linear regression, K-nearest neighbor, and naïve Bayes, by achieving 93% (16127/17341) accuracy for classifying five categories: aim, population, intervention, results, and outcome. The semantic similarity algorithm achieved a significant Pearson correlation coefficient of 0.61 (0-1 scale) on a well-known BIOSSES dataset (with 100 pair sentences) after semantic enrichment, representing an improvement of 8.9% over baseline Jaccard similarity. Finally, we found a highly positive correlation among the evaluations performed by three domain experts concerning different metrics, suggesting that the automated summarization is satisfactory. CONCLUSIONS: By employing the proposed method Biomed-Summarizer, high accuracy in ATS was achieved, enabling seamless curation of research evidence from the biomedical literature to use for clinical decision-making. JMIR Publications 2020-10-23 /pmc/articles/PMC7647812/ /pubmed/33095174 http://dx.doi.org/10.2196/19810 Text en ©Muhammad Afzal, Fakhare Alam, Khalid Mahmood Malik, Ghaus M Malik. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 23.10.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Afzal, Muhammad Alam, Fakhare Malik, Khalid Mahmood Malik, Ghaus M Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation
title	Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation
title_full	Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation
title_fullStr	Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation
title_full_unstemmed	Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation
title_short	Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation
title_sort	clinical context–aware biomedical text summarization using deep neural network: model development and validation
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7647812/ https://www.ncbi.nlm.nih.gov/pubmed/33095174 http://dx.doi.org/10.2196/19810
work_keys_str_mv	AT afzalmuhammad clinicalcontextawarebiomedicaltextsummarizationusingdeepneuralnetworkmodeldevelopmentandvalidation AT alamfakhare clinicalcontextawarebiomedicaltextsummarizationusingdeepneuralnetworkmodeldevelopmentandvalidation AT malikkhalidmahmood clinicalcontextawarebiomedicaltextsummarizationusingdeepneuralnetworkmodeldevelopmentandvalidation AT malikghausm clinicalcontextawarebiomedicaltextsummarizationusingdeepneuralnetworkmodeldevelopmentandvalidation

Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation

Ejemplares similares