Cargando…

Evaluation of the performance of GPT-3.5 and GPT-4 on the Polish Medical Final Examination

The study aimed to evaluate the performance of two Large Language Models (LLMs): ChatGPT (based on GPT-3.5) and GPT-4 with two temperature parameter values, on the Polish Medical Final Examination (MFE). The models were tested on three editions of the MFE from: Spring 2022, Autumn 2022, and Spring 2...

Descripción completa

Detalles Bibliográficos
Autores principales: Rosoł, Maciej, Gąsior, Jakub S., Łaba, Jonasz, Korzeniewski, Kacper, Młyńczak, Marcel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665355/
https://www.ncbi.nlm.nih.gov/pubmed/37993519
http://dx.doi.org/10.1038/s41598-023-46995-z