Cargando…
A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record
IMPORTANCE: Large language models (LLMs) have proven useful for extracting data from publicly available sources, but their uses in clinical settings and with clinical data are unknown. OBJECTIVE: To determine the accuracy of data extraction using “Versa Chat,” a chat implementation of the general-pu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491368/ https://www.ncbi.nlm.nih.gov/pubmed/37693398 http://dx.doi.org/10.1101/2023.08.31.23294924 |
_version_ | 1785104046093238272 |
---|---|
author | Ge, Jin Li, Michael Delk, Molly B. Lai, Jennifer C. |
author_facet | Ge, Jin Li, Michael Delk, Molly B. Lai, Jennifer C. |
author_sort | Ge, Jin |
collection | PubMed |
description | IMPORTANCE: Large language models (LLMs) have proven useful for extracting data from publicly available sources, but their uses in clinical settings and with clinical data are unknown. OBJECTIVE: To determine the accuracy of data extraction using “Versa Chat,” a chat implementation of the general-purpose OpenAI gpt-35-turbo LLM model, versus manual chart review for hepatocellular carcinoma (HCC) imaging reports. DESIGN: We engineered a prompt for the data extraction task of six distinct data elements and input 182 abdominal imaging reports that were also manually tagged. We evaluated performance by calculating accuracy, precision, recall, and F1 scores. SETTING/PARTICIPANTS: Cross-sectional abdominal imaging reports of patients diagnosed with hepatocellular carcinoma enrolled in the Functional Assessment in Liver Transplantation (FrAILT) study. |
format | Online Article Text |
id | pubmed-10491368 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-104913682023-09-09 A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record Ge, Jin Li, Michael Delk, Molly B. Lai, Jennifer C. medRxiv Article IMPORTANCE: Large language models (LLMs) have proven useful for extracting data from publicly available sources, but their uses in clinical settings and with clinical data are unknown. OBJECTIVE: To determine the accuracy of data extraction using “Versa Chat,” a chat implementation of the general-purpose OpenAI gpt-35-turbo LLM model, versus manual chart review for hepatocellular carcinoma (HCC) imaging reports. DESIGN: We engineered a prompt for the data extraction task of six distinct data elements and input 182 abdominal imaging reports that were also manually tagged. We evaluated performance by calculating accuracy, precision, recall, and F1 scores. SETTING/PARTICIPANTS: Cross-sectional abdominal imaging reports of patients diagnosed with hepatocellular carcinoma enrolled in the Functional Assessment in Liver Transplantation (FrAILT) study. Cold Spring Harbor Laboratory 2023-09-04 /pmc/articles/PMC10491368/ /pubmed/37693398 http://dx.doi.org/10.1101/2023.08.31.23294924 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Ge, Jin Li, Michael Delk, Molly B. Lai, Jennifer C. A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record |
title | A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record |
title_full | A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record |
title_fullStr | A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record |
title_full_unstemmed | A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record |
title_short | A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record |
title_sort | comparison of large language model versus manual chart review for extraction of data elements from the electronic health record |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491368/ https://www.ncbi.nlm.nih.gov/pubmed/37693398 http://dx.doi.org/10.1101/2023.08.31.23294924 |
work_keys_str_mv | AT gejin acomparisonoflargelanguagemodelversusmanualchartreviewforextractionofdataelementsfromtheelectronichealthrecord AT limichael acomparisonoflargelanguagemodelversusmanualchartreviewforextractionofdataelementsfromtheelectronichealthrecord AT delkmollyb acomparisonoflargelanguagemodelversusmanualchartreviewforextractionofdataelementsfromtheelectronichealthrecord AT laijenniferc acomparisonoflargelanguagemodelversusmanualchartreviewforextractionofdataelementsfromtheelectronichealthrecord AT gejin comparisonoflargelanguagemodelversusmanualchartreviewforextractionofdataelementsfromtheelectronichealthrecord AT limichael comparisonoflargelanguagemodelversusmanualchartreviewforextractionofdataelementsfromtheelectronichealthrecord AT delkmollyb comparisonoflargelanguagemodelversusmanualchartreviewforextractionofdataelementsfromtheelectronichealthrecord AT laijenniferc comparisonoflargelanguagemodelversusmanualchartreviewforextractionofdataelementsfromtheelectronichealthrecord |