
A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record

IMPORTANCE: Large language models (LLMs) have proven useful for extracting data from publicly available sources, but their uses in clinical settings and with clinical data are unknown. OBJECTIVE: To determine the accuracy of data extraction using “Versa Chat,” a chat implementation of the general-pu...


Bibliographic Details
Main Authors: Ge, Jin, Li, Michael, Delk, Molly B., Lai, Jennifer C.
Format: Online Article Text
Language: English
Published: Cold Spring Harbor Laboratory 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491368/
https://www.ncbi.nlm.nih.gov/pubmed/37693398
http://dx.doi.org/10.1101/2023.08.31.23294924
_version_ 1785104046093238272
author Ge, Jin
Li, Michael
Delk, Molly B.
Lai, Jennifer C.
collection PubMed
description IMPORTANCE: Large language models (LLMs) have proven useful for extracting data from publicly available sources, but their uses in clinical settings and with clinical data are unknown. OBJECTIVE: To determine the accuracy of data extraction using “Versa Chat,” a chat implementation of the general-purpose OpenAI gpt-35-turbo LLM model, versus manual chart review for hepatocellular carcinoma (HCC) imaging reports. DESIGN: We engineered a prompt for the data extraction task of six distinct data elements and input 182 abdominal imaging reports that were also manually tagged. We evaluated performance by calculating accuracy, precision, recall, and F1 scores. SETTING/PARTICIPANTS: Cross-sectional abdominal imaging reports of patients diagnosed with hepatocellular carcinoma enrolled in the Functional Assessment in Liver Transplantation (FrAILT) study.
format Online
Article
Text
id pubmed-10491368
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-10491368 2023-09-09 medRxiv Article Cold Spring Harbor Laboratory 2023-09-04 /pmc/articles/PMC10491368/ /pubmed/37693398 http://dx.doi.org/10.1101/2023.08.31.23294924 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/), which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
title A comparison of large language model versus manual chart review for extraction of data elements from the electronic health record
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491368/
https://www.ncbi.nlm.nih.gov/pubmed/37693398
http://dx.doi.org/10.1101/2023.08.31.23294924
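
The description in this record says performance was evaluated by calculating accuracy, precision, recall, and F1 scores against manually tagged reports. As an illustration only, the sketch below shows how those four metrics are conventionally computed for one binary data element. The function name, the `Scores` type, and the sample labels are hypothetical and are not taken from the study, and returning 0.0 when a denominator is zero is an assumed convention rather than the authors' choice.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Scores:
    accuracy: float
    precision: float
    recall: float
    f1: float


def _ratio(numerator: float, denominator: float) -> float:
    # Assumed convention: report 0.0 when a metric is undefined (zero denominator).
    return numerator / denominator if denominator else 0.0


def score_binary(manual: Sequence[bool], extracted: Sequence[bool]) -> Scores:
    """Compare extracted labels with manual chart-review labels (the reference)."""
    if len(manual) != len(extracted):
        raise ValueError("manual and extracted must have the same length")

    # Count the four confusion-matrix cells.
    tp = sum(1 for m, e in zip(manual, extracted) if m and e)
    tn = sum(1 for m, e in zip(manual, extracted) if not m and not e)
    fp = sum(1 for m, e in zip(manual, extracted) if not m and e)
    fn = sum(1 for m, e in zip(manual, extracted) if m and not e)

    precision = _ratio(tp, tp + fp)
    recall = _ratio(tp, tp + fn)
    return Scores(
        accuracy=_ratio(tp + tn, len(manual)),
        precision=precision,
        recall=recall,
        f1=_ratio(2 * precision * recall, precision + recall),
    )


if __name__ == "__main__":
    # Made-up labels for five reports; not data from the study.
    manual_labels = [True, True, False, False, True]
    llm_labels = [True, False, False, True, True]
    print(score_binary(manual_labels, llm_labels))
```

For a study with several data elements, one plausible approach is to call such a function once per element, so that a weak element is not hidden by an aggregate score.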