Cargando…
Neural text generation in regulatory medical writing
Background: A steep increase in new drug applications has increased the overhead of writing technical documents such as medication guides. Natural language processing can contribute to reducing this burden. Objective: To generate medication guides from texts that relate to prescription drug labeling...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950092/ https://www.ncbi.nlm.nih.gov/pubmed/36843925 http://dx.doi.org/10.3389/fphar.2023.1086913 |
_version_ | 1784893086808145920 |
---|---|
author | Meyer, Claudia Adkins, Daniel Pal, Koyena Galici, Ruggero Garcia-Agundez, Augusto Eickhoff , Carsten |
author_facet | Meyer, Claudia Adkins, Daniel Pal, Koyena Galici, Ruggero Garcia-Agundez, Augusto Eickhoff , Carsten |
author_sort | Meyer, Claudia |
collection | PubMed |
description | Background: A steep increase in new drug applications has increased the overhead of writing technical documents such as medication guides. Natural language processing can contribute to reducing this burden. Objective: To generate medication guides from texts that relate to prescription drug labeling information. Materials and Methods: We collected official drug label information from the DailyMed website. We focused on drug labels containing medication guide sections to train and test our model. To construct our training dataset, we aligned “source” text from the document with similar “target” text from the medication guide using three families of alignment techniques: global, manual, and heuristic alignment. The resulting source-target pairs were provided as input to a Pointer Generator Network, an abstractive text summarization model. Results: Global alignment produced the lowest ROUGE scores and relatively poor qualitative results, as running the model frequently resulted in mode collapse. Manual alignment also resulted in mode collapse, albeit higher ROUGE scores than global alignment. Within the family of heuristic alignment approaches, we compared different methods and found BM25-based alignments to produce significantly better summaries (at least 6.8 ROUGE points above the other techniques). This alignment surpassed both the global and manual alignments in terms of ROUGE and qualitative scoring. Conclusion: The results of this study indicate that a heuristic approach to generating inputs for an abstractive summarization model increased ROUGE scores, compared to a global or manual approach when automatically generating biomedical text. Such methods hold the potential to significantly reduce the manual labor burden in medical writing and related disciplines. |
format | Online Article Text |
id | pubmed-9950092 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-99500922023-02-25 Neural text generation in regulatory medical writing Meyer, Claudia Adkins, Daniel Pal, Koyena Galici, Ruggero Garcia-Agundez, Augusto Eickhoff , Carsten Front Pharmacol Pharmacology Background: A steep increase in new drug applications has increased the overhead of writing technical documents such as medication guides. Natural language processing can contribute to reducing this burden. Objective: To generate medication guides from texts that relate to prescription drug labeling information. Materials and Methods: We collected official drug label information from the DailyMed website. We focused on drug labels containing medication guide sections to train and test our model. To construct our training dataset, we aligned “source” text from the document with similar “target” text from the medication guide using three families of alignment techniques: global, manual, and heuristic alignment. The resulting source-target pairs were provided as input to a Pointer Generator Network, an abstractive text summarization model. Results: Global alignment produced the lowest ROUGE scores and relatively poor qualitative results, as running the model frequently resulted in mode collapse. Manual alignment also resulted in mode collapse, albeit higher ROUGE scores than global alignment. Within the family of heuristic alignment approaches, we compared different methods and found BM25-based alignments to produce significantly better summaries (at least 6.8 ROUGE points above the other techniques). This alignment surpassed both the global and manual alignments in terms of ROUGE and qualitative scoring. Conclusion: The results of this study indicate that a heuristic approach to generating inputs for an abstractive summarization model increased ROUGE scores, compared to a global or manual approach when automatically generating biomedical text. Such methods hold the potential to significantly reduce the manual labor burden in medical writing and related disciplines. Frontiers Media S.A. 2023-02-10 /pmc/articles/PMC9950092/ /pubmed/36843925 http://dx.doi.org/10.3389/fphar.2023.1086913 Text en Copyright © 2023 Meyer, Adkins, Pal, Galici, Garcia-Agundez and Eickhoff . https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Pharmacology Meyer, Claudia Adkins, Daniel Pal, Koyena Galici, Ruggero Garcia-Agundez, Augusto Eickhoff , Carsten Neural text generation in regulatory medical writing |
title | Neural text generation in regulatory medical writing |
title_full | Neural text generation in regulatory medical writing |
title_fullStr | Neural text generation in regulatory medical writing |
title_full_unstemmed | Neural text generation in regulatory medical writing |
title_short | Neural text generation in regulatory medical writing |
title_sort | neural text generation in regulatory medical writing |
topic | Pharmacology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950092/ https://www.ncbi.nlm.nih.gov/pubmed/36843925 http://dx.doi.org/10.3389/fphar.2023.1086913 |
work_keys_str_mv | AT meyerclaudia neuraltextgenerationinregulatorymedicalwriting AT adkinsdaniel neuraltextgenerationinregulatorymedicalwriting AT palkoyena neuraltextgenerationinregulatorymedicalwriting AT galiciruggero neuraltextgenerationinregulatorymedicalwriting AT garciaagundezaugusto neuraltextgenerationinregulatorymedicalwriting AT eickhoffcarsten neuraltextgenerationinregulatorymedicalwriting |