Cargando…
Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records
INTRODUCTION: A framework that extracts oncological outcomes from large-scale databases using artificial intelligence (AI) is not well established. Thus, we aimed to develop AI models to extract outcomes in patients with lung cancer using unstructured text data from electronic health records of mult...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Healthcare
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9988800/ https://www.ncbi.nlm.nih.gov/pubmed/36547809 http://dx.doi.org/10.1007/s12325-022-02397-7 |
_version_ | 1784901644019826688 |
---|---|
author | Araki, Kenji Matsumoto, Nobuhiro Togo, Kanae Yonemoto, Naohiro Ohki, Emiko Xu, Linghua Hasegawa, Yoshiyuki Satoh, Daisuke Takemoto, Ryota Miyazaki, Taiga |
author_facet | Araki, Kenji Matsumoto, Nobuhiro Togo, Kanae Yonemoto, Naohiro Ohki, Emiko Xu, Linghua Hasegawa, Yoshiyuki Satoh, Daisuke Takemoto, Ryota Miyazaki, Taiga |
author_sort | Araki, Kenji |
collection | PubMed |
description | INTRODUCTION: A framework that extracts oncological outcomes from large-scale databases using artificial intelligence (AI) is not well established. Thus, we aimed to develop AI models to extract outcomes in patients with lung cancer using unstructured text data from electronic health records of multiple hospitals. METHODS: We constructed AI models (Bidirectional Encoder Representations from Transformers [BERT], Naïve Bayes, and Longformer) for tumor evaluation using the University of Miyazaki Hospital (UMH) database. This data included both structured and unstructured data from progress notes, radiology reports, and discharge summaries. The BERT model was applied to the Life Data Initiative (LDI) data set of six hospitals. Study outcomes included the performance of AI models and time to progression of disease (TTP) for each line of treatment based on the treatment response extracted by AI models. RESULTS: For the UMH data set, the BERT model exhibited higher precision accuracy compared to the Naïve Bayes or the Longformer models, respectively (precision [0.42 vs. 0.47 or 0.22], recall [0.63 vs. 0.46 or 0.33] and F1 scores [0.50 vs. 0.46 or 0.27]). When this BERT model was applied to LDI data, prediction accuracy remained quite similar. The Kaplan–Meier plots of TTP (months) showed similar trends for the first (median 14.9 [95% confidence interval 11.5, 21.1] and 16.8 [12.6, 21.8]), the second (7.8 [6.7, 10.7] and 7.8 [6.7, 10.7]), and the later lines of treatment for the predicted data by the BERT model and the manually curated data. CONCLUSION: We developed AI models to extract treatment responses in patients with lung cancer using a large EHR database; however, the model requires further improvement. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12325-022-02397-7. |
format | Online Article Text |
id | pubmed-9988800 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Healthcare |
record_format | MEDLINE/PubMed |
spelling | pubmed-99888002023-03-08 Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records Araki, Kenji Matsumoto, Nobuhiro Togo, Kanae Yonemoto, Naohiro Ohki, Emiko Xu, Linghua Hasegawa, Yoshiyuki Satoh, Daisuke Takemoto, Ryota Miyazaki, Taiga Adv Ther Original Research INTRODUCTION: A framework that extracts oncological outcomes from large-scale databases using artificial intelligence (AI) is not well established. Thus, we aimed to develop AI models to extract outcomes in patients with lung cancer using unstructured text data from electronic health records of multiple hospitals. METHODS: We constructed AI models (Bidirectional Encoder Representations from Transformers [BERT], Naïve Bayes, and Longformer) for tumor evaluation using the University of Miyazaki Hospital (UMH) database. This data included both structured and unstructured data from progress notes, radiology reports, and discharge summaries. The BERT model was applied to the Life Data Initiative (LDI) data set of six hospitals. Study outcomes included the performance of AI models and time to progression of disease (TTP) for each line of treatment based on the treatment response extracted by AI models. RESULTS: For the UMH data set, the BERT model exhibited higher precision accuracy compared to the Naïve Bayes or the Longformer models, respectively (precision [0.42 vs. 0.47 or 0.22], recall [0.63 vs. 0.46 or 0.33] and F1 scores [0.50 vs. 0.46 or 0.27]). When this BERT model was applied to LDI data, prediction accuracy remained quite similar. The Kaplan–Meier plots of TTP (months) showed similar trends for the first (median 14.9 [95% confidence interval 11.5, 21.1] and 16.8 [12.6, 21.8]), the second (7.8 [6.7, 10.7] and 7.8 [6.7, 10.7]), and the later lines of treatment for the predicted data by the BERT model and the manually curated data. CONCLUSION: We developed AI models to extract treatment responses in patients with lung cancer using a large EHR database; however, the model requires further improvement. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12325-022-02397-7. Springer Healthcare 2022-12-22 2023 /pmc/articles/PMC9988800/ /pubmed/36547809 http://dx.doi.org/10.1007/s12325-022-02397-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/Open AccessThis article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) . |
spellingShingle | Original Research Araki, Kenji Matsumoto, Nobuhiro Togo, Kanae Yonemoto, Naohiro Ohki, Emiko Xu, Linghua Hasegawa, Yoshiyuki Satoh, Daisuke Takemoto, Ryota Miyazaki, Taiga Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records |
title | Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records |
title_full | Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records |
title_fullStr | Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records |
title_full_unstemmed | Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records |
title_short | Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records |
title_sort | developing artificial intelligence models for extracting oncologic outcomes from japanese electronic health records |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9988800/ https://www.ncbi.nlm.nih.gov/pubmed/36547809 http://dx.doi.org/10.1007/s12325-022-02397-7 |
work_keys_str_mv | AT arakikenji developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords AT matsumotonobuhiro developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords AT togokanae developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords AT yonemotonaohiro developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords AT ohkiemiko developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords AT xulinghua developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords AT hasegawayoshiyuki developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords AT satohdaisuke developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords AT takemotoryota developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords AT miyazakitaiga developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords |