Cargando…

Evaluation of the performance of GPT-3.5 and GPT-4 on the Polish Medical Final Examination

The study aimed to evaluate the performance of two Large Language Models (LLMs): ChatGPT (based on GPT-3.5) and GPT-4 with two temperature parameter values, on the Polish Medical Final Examination (MFE). The models were tested on three editions of the MFE from: Spring 2022, Autumn 2022, and Spring 2...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rosoł, Maciej, Gąsior, Jakub S., Łaba, Jonasz, Korzeniewski, Kacper, Młyńczak, Marcel
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665355/ https://www.ncbi.nlm.nih.gov/pubmed/37993519 http://dx.doi.org/10.1038/s41598-023-46995-z

Internet

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665355/
https://www.ncbi.nlm.nih.gov/pubmed/37993519
http://dx.doi.org/10.1038/s41598-023-46995-z

Evaluation of the performance of GPT-3.5 and GPT-4 on the Polish Medical Final Examination

Internet

Ejemplares similares