Cargando…

Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach

BACKGROUND: In most cases, the abstracts of articles in the medical domain are publicly available. Although these are accessible by everyone, they are hard to comprehend for a wider audience due to the complex medical vocabulary. Thus, simplifying these complex abstracts is essential to make medical...

Descripción completa

Detalles Bibliográficos
Autores principales:	Phatak, Atharva, Savage, David W, Ohle, Robert, Smith, Jonathan, Mago, Vijay
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2022
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9719064/ https://www.ncbi.nlm.nih.gov/pubmed/36399375 http://dx.doi.org/10.2196/38095

_version_	1784843233601257472
author	Phatak, Atharva Savage, David W Ohle, Robert Smith, Jonathan Mago, Vijay
author_facet	Phatak, Atharva Savage, David W Ohle, Robert Smith, Jonathan Mago, Vijay
author_sort	Phatak, Atharva
collection	PubMed
description	BACKGROUND: In most cases, the abstracts of articles in the medical domain are publicly available. Although these are accessible by everyone, they are hard to comprehend for a wider audience due to the complex medical vocabulary. Thus, simplifying these complex abstracts is essential to make medical research accessible to the general public. OBJECTIVE: This study aims to develop a deep learning–based text simplification (TS) approach that converts complex medical text into a simpler version while maintaining the quality of the generated text. METHODS: A TS approach using reinforcement learning and transformer–based language models was developed. Relevance reward, Flesch-Kincaid reward, and lexical simplicity reward were optimized to help simplify jargon-dense complex medical paragraphs to their simpler versions while retaining the quality of the text. The model was trained using 3568 complex-simple medical paragraphs and evaluated on 480 paragraphs via the help of automated metrics and human annotation. RESULTS: The proposed method outperformed previous baselines on Flesch-Kincaid scores (11.84) and achieved comparable performance with other baselines when measured using ROUGE-1 (0.39), ROUGE-2 (0.11), and SARI scores (0.40). Manual evaluation showed that percentage agreement between human annotators was more than 70% when factors such as fluency, coherence, and adequacy were considered. CONCLUSIONS: A unique medical TS approach is successfully developed that leverages reinforcement learning and accurately simplifies complex medical paragraphs, thereby increasing their readability. The proposed TS approach can be applied to automatically generate simplified text for complex medical text data, which would enhance the accessibility of biomedical research to a wider audience.
format	Online Article Text
id	pubmed-9719064
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-97190642022-12-04 Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach Phatak, Atharva Savage, David W Ohle, Robert Smith, Jonathan Mago, Vijay JMIR Med Inform Original Paper BACKGROUND: In most cases, the abstracts of articles in the medical domain are publicly available. Although these are accessible by everyone, they are hard to comprehend for a wider audience due to the complex medical vocabulary. Thus, simplifying these complex abstracts is essential to make medical research accessible to the general public. OBJECTIVE: This study aims to develop a deep learning–based text simplification (TS) approach that converts complex medical text into a simpler version while maintaining the quality of the generated text. METHODS: A TS approach using reinforcement learning and transformer–based language models was developed. Relevance reward, Flesch-Kincaid reward, and lexical simplicity reward were optimized to help simplify jargon-dense complex medical paragraphs to their simpler versions while retaining the quality of the text. The model was trained using 3568 complex-simple medical paragraphs and evaluated on 480 paragraphs via the help of automated metrics and human annotation. RESULTS: The proposed method outperformed previous baselines on Flesch-Kincaid scores (11.84) and achieved comparable performance with other baselines when measured using ROUGE-1 (0.39), ROUGE-2 (0.11), and SARI scores (0.40). Manual evaluation showed that percentage agreement between human annotators was more than 70% when factors such as fluency, coherence, and adequacy were considered. CONCLUSIONS: A unique medical TS approach is successfully developed that leverages reinforcement learning and accurately simplifies complex medical paragraphs, thereby increasing their readability. The proposed TS approach can be applied to automatically generate simplified text for complex medical text data, which would enhance the accessibility of biomedical research to a wider audience. JMIR Publications 2022-11-18 /pmc/articles/PMC9719064/ /pubmed/36399375 http://dx.doi.org/10.2196/38095 Text en ©Atharva Phatak, David W Savage, Robert Ohle, Jonathan Smith, Vijay Mago. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 18.11.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Phatak, Atharva Savage, David W Ohle, Robert Smith, Jonathan Mago, Vijay Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach
title	Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach
title_full	Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach
title_fullStr	Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach
title_full_unstemmed	Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach
title_short	Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach
title_sort	medical text simplification using reinforcement learning (teslea): deep learning–based text simplification approach
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9719064/ https://www.ncbi.nlm.nih.gov/pubmed/36399375 http://dx.doi.org/10.2196/38095
work_keys_str_mv	AT phatakatharva medicaltextsimplificationusingreinforcementlearningtesleadeeplearningbasedtextsimplificationapproach AT savagedavidw medicaltextsimplificationusingreinforcementlearningtesleadeeplearningbasedtextsimplificationapproach AT ohlerobert medicaltextsimplificationusingreinforcementlearningtesleadeeplearningbasedtextsimplificationapproach AT smithjonathan medicaltextsimplificationusingreinforcementlearningtesleadeeplearningbasedtextsimplificationapproach AT magovijay medicaltextsimplificationusingreinforcementlearningtesleadeeplearningbasedtextsimplificationapproach

Medical Text Simplification Using Reinforcement Learning (TESLEA): Deep Learning–Based Text Simplification Approach

Ejemplares similares