Cargando…

The utility of ChatGPT in the assessment of literature on the prevention of migraine: an observational, qualitative study

BACKGROUND: It is not known how large language models, such as ChatGPT, can be applied toward the assessment of the efficacy of medications, including in the prevention of migraine, and how it might support those claims with existing medical evidence. METHODS: We queried ChatGPT-3.5 on the efficacy...

Descripción completa

Detalles Bibliográficos
Autores principales: Moskatel, Leon S., Zhang, Niushen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10469750/
https://www.ncbi.nlm.nih.gov/pubmed/37662036
http://dx.doi.org/10.3389/fneur.2023.1225223
_version_ 1785099513222922240
author Moskatel, Leon S.
Zhang, Niushen
author_facet Moskatel, Leon S.
Zhang, Niushen
author_sort Moskatel, Leon S.
collection PubMed
description BACKGROUND: It is not known how large language models, such as ChatGPT, can be applied toward the assessment of the efficacy of medications, including in the prevention of migraine, and how it might support those claims with existing medical evidence. METHODS: We queried ChatGPT-3.5 on the efficacy of 47 medications for the prevention of migraine and then asked it to give citations in support of its assessment. ChatGPT’s evaluations were then compared to their FDA approval status for this indication as well as the American Academy of Neurology 2012 evidence-based guidelines for the prevention of migraine. The citations ChatGPT generated for these evaluations were then assessed to see if they were real papers and if they were relevant to the query. RESULTS: ChatGPT affirmed that the 14 medications that have either received FDA approval for prevention of migraine or AAN Grade A/B evidence were effective for migraine. Its assessments of the other 33 medications were unreliable including suggesting possible efficacy for four medications that have never been used for the prevention of migraine. Critically, only 33/115 (29%) of the papers ChatGPT cited were real, while 76/115 (66%) were “hallucinated” not real papers and 6/115 (5%) shared the names of real papers but had not real citations. CONCLUSION: While ChatGPT produced tailored answers on the efficacy of the queried medications, the results were unreliable and inaccurate because of the overwhelming volume of “hallucinated” articles it generated and cited.
format Online
Article
Text
id pubmed-10469750
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104697502023-09-01 The utility of ChatGPT in the assessment of literature on the prevention of migraine: an observational, qualitative study Moskatel, Leon S. Zhang, Niushen Front Neurol Neurology BACKGROUND: It is not known how large language models, such as ChatGPT, can be applied toward the assessment of the efficacy of medications, including in the prevention of migraine, and how it might support those claims with existing medical evidence. METHODS: We queried ChatGPT-3.5 on the efficacy of 47 medications for the prevention of migraine and then asked it to give citations in support of its assessment. ChatGPT’s evaluations were then compared to their FDA approval status for this indication as well as the American Academy of Neurology 2012 evidence-based guidelines for the prevention of migraine. The citations ChatGPT generated for these evaluations were then assessed to see if they were real papers and if they were relevant to the query. RESULTS: ChatGPT affirmed that the 14 medications that have either received FDA approval for prevention of migraine or AAN Grade A/B evidence were effective for migraine. Its assessments of the other 33 medications were unreliable including suggesting possible efficacy for four medications that have never been used for the prevention of migraine. Critically, only 33/115 (29%) of the papers ChatGPT cited were real, while 76/115 (66%) were “hallucinated” not real papers and 6/115 (5%) shared the names of real papers but had not real citations. CONCLUSION: While ChatGPT produced tailored answers on the efficacy of the queried medications, the results were unreliable and inaccurate because of the overwhelming volume of “hallucinated” articles it generated and cited. Frontiers Media S.A. 2023-08-17 /pmc/articles/PMC10469750/ /pubmed/37662036 http://dx.doi.org/10.3389/fneur.2023.1225223 Text en Copyright © 2023 Moskatel and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neurology
Moskatel, Leon S.
Zhang, Niushen
The utility of ChatGPT in the assessment of literature on the prevention of migraine: an observational, qualitative study
title The utility of ChatGPT in the assessment of literature on the prevention of migraine: an observational, qualitative study
title_full The utility of ChatGPT in the assessment of literature on the prevention of migraine: an observational, qualitative study
title_fullStr The utility of ChatGPT in the assessment of literature on the prevention of migraine: an observational, qualitative study
title_full_unstemmed The utility of ChatGPT in the assessment of literature on the prevention of migraine: an observational, qualitative study
title_short The utility of ChatGPT in the assessment of literature on the prevention of migraine: an observational, qualitative study
title_sort utility of chatgpt in the assessment of literature on the prevention of migraine: an observational, qualitative study
topic Neurology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10469750/
https://www.ncbi.nlm.nih.gov/pubmed/37662036
http://dx.doi.org/10.3389/fneur.2023.1225223
work_keys_str_mv AT moskatelleons theutilityofchatgptintheassessmentofliteratureonthepreventionofmigraineanobservationalqualitativestudy
AT zhangniushen theutilityofchatgptintheassessmentofliteratureonthepreventionofmigraineanobservationalqualitativestudy
AT moskatelleons utilityofchatgptintheassessmentofliteratureonthepreventionofmigraineanobservationalqualitativestudy
AT zhangniushen utilityofchatgptintheassessmentofliteratureonthepreventionofmigraineanobservationalqualitativestudy