Cargando…

Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records

INTRODUCTION: A framework that extracts oncological outcomes from large-scale databases using artificial intelligence (AI) is not well established. Thus, we aimed to develop AI models to extract outcomes in patients with lung cancer using unstructured text data from electronic health records of mult...

Descripción completa

Detalles Bibliográficos
Autores principales: Araki, Kenji, Matsumoto, Nobuhiro, Togo, Kanae, Yonemoto, Naohiro, Ohki, Emiko, Xu, Linghua, Hasegawa, Yoshiyuki, Satoh, Daisuke, Takemoto, Ryota, Miyazaki, Taiga
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Healthcare 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9988800/
https://www.ncbi.nlm.nih.gov/pubmed/36547809
http://dx.doi.org/10.1007/s12325-022-02397-7
_version_ 1784901644019826688
author Araki, Kenji
Matsumoto, Nobuhiro
Togo, Kanae
Yonemoto, Naohiro
Ohki, Emiko
Xu, Linghua
Hasegawa, Yoshiyuki
Satoh, Daisuke
Takemoto, Ryota
Miyazaki, Taiga
author_facet Araki, Kenji
Matsumoto, Nobuhiro
Togo, Kanae
Yonemoto, Naohiro
Ohki, Emiko
Xu, Linghua
Hasegawa, Yoshiyuki
Satoh, Daisuke
Takemoto, Ryota
Miyazaki, Taiga
author_sort Araki, Kenji
collection PubMed
description INTRODUCTION: A framework that extracts oncological outcomes from large-scale databases using artificial intelligence (AI) is not well established. Thus, we aimed to develop AI models to extract outcomes in patients with lung cancer using unstructured text data from electronic health records of multiple hospitals. METHODS: We constructed AI models (Bidirectional Encoder Representations from Transformers [BERT], Naïve Bayes, and Longformer) for tumor evaluation using the University of Miyazaki Hospital (UMH) database. This data included both structured and unstructured data from progress notes, radiology reports, and discharge summaries. The BERT model was applied to the Life Data Initiative (LDI) data set of six hospitals. Study outcomes included the performance of AI models and time to progression of disease (TTP) for each line of treatment based on the treatment response extracted by AI models. RESULTS: For the UMH data set, the BERT model exhibited higher precision accuracy compared to the Naïve Bayes or the Longformer models, respectively (precision [0.42 vs. 0.47 or 0.22], recall [0.63 vs. 0.46 or 0.33] and F1 scores [0.50 vs. 0.46 or 0.27]). When this BERT model was applied to LDI data, prediction accuracy remained quite similar. The Kaplan–Meier plots of TTP (months) showed similar trends for the first (median 14.9 [95% confidence interval 11.5, 21.1] and 16.8 [12.6, 21.8]), the second (7.8 [6.7, 10.7] and 7.8 [6.7, 10.7]), and the later lines of treatment for the predicted data by the BERT model and the manually curated data. CONCLUSION: We developed AI models to extract treatment responses in patients with lung cancer using a large EHR database; however, the model requires further improvement. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12325-022-02397-7.
format Online
Article
Text
id pubmed-9988800
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Healthcare
record_format MEDLINE/PubMed
spelling pubmed-99888002023-03-08 Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records Araki, Kenji Matsumoto, Nobuhiro Togo, Kanae Yonemoto, Naohiro Ohki, Emiko Xu, Linghua Hasegawa, Yoshiyuki Satoh, Daisuke Takemoto, Ryota Miyazaki, Taiga Adv Ther Original Research INTRODUCTION: A framework that extracts oncological outcomes from large-scale databases using artificial intelligence (AI) is not well established. Thus, we aimed to develop AI models to extract outcomes in patients with lung cancer using unstructured text data from electronic health records of multiple hospitals. METHODS: We constructed AI models (Bidirectional Encoder Representations from Transformers [BERT], Naïve Bayes, and Longformer) for tumor evaluation using the University of Miyazaki Hospital (UMH) database. This data included both structured and unstructured data from progress notes, radiology reports, and discharge summaries. The BERT model was applied to the Life Data Initiative (LDI) data set of six hospitals. Study outcomes included the performance of AI models and time to progression of disease (TTP) for each line of treatment based on the treatment response extracted by AI models. RESULTS: For the UMH data set, the BERT model exhibited higher precision accuracy compared to the Naïve Bayes or the Longformer models, respectively (precision [0.42 vs. 0.47 or 0.22], recall [0.63 vs. 0.46 or 0.33] and F1 scores [0.50 vs. 0.46 or 0.27]). When this BERT model was applied to LDI data, prediction accuracy remained quite similar. The Kaplan–Meier plots of TTP (months) showed similar trends for the first (median 14.9 [95% confidence interval 11.5, 21.1] and 16.8 [12.6, 21.8]), the second (7.8 [6.7, 10.7] and 7.8 [6.7, 10.7]), and the later lines of treatment for the predicted data by the BERT model and the manually curated data. CONCLUSION: We developed AI models to extract treatment responses in patients with lung cancer using a large EHR database; however, the model requires further improvement. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12325-022-02397-7. Springer Healthcare 2022-12-22 2023 /pmc/articles/PMC9988800/ /pubmed/36547809 http://dx.doi.org/10.1007/s12325-022-02397-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/Open AccessThis article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Original Research
Araki, Kenji
Matsumoto, Nobuhiro
Togo, Kanae
Yonemoto, Naohiro
Ohki, Emiko
Xu, Linghua
Hasegawa, Yoshiyuki
Satoh, Daisuke
Takemoto, Ryota
Miyazaki, Taiga
Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records
title Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records
title_full Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records
title_fullStr Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records
title_full_unstemmed Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records
title_short Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records
title_sort developing artificial intelligence models for extracting oncologic outcomes from japanese electronic health records
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9988800/
https://www.ncbi.nlm.nih.gov/pubmed/36547809
http://dx.doi.org/10.1007/s12325-022-02397-7
work_keys_str_mv AT arakikenji developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords
AT matsumotonobuhiro developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords
AT togokanae developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords
AT yonemotonaohiro developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords
AT ohkiemiko developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords
AT xulinghua developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords
AT hasegawayoshiyuki developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords
AT satohdaisuke developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords
AT takemotoryota developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords
AT miyazakitaiga developingartificialintelligencemodelsforextractingoncologicoutcomesfromjapaneseelectronichealthrecords