Evaluation of the performance of GPT-3.5 and GPT-4 on the Polish Medical Final Examination
The study aimed to evaluate the performance of two Large Language Models (LLMs): ChatGPT (based on GPT-3.5) and GPT-4 with two temperature parameter values, on the Polish Medical Final Examination (MFE). The models were tested on three editions of the MFE from: Spring 2022, Autumn 2022, and Spring 2...
Main authors: | Rosoł, Maciej; Gąsior, Jakub S.; Łaba, Jonasz; Korzeniewski, Kacper; Młyńczak, Marcel |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Nature Publishing Group UK, 2023 |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665355/ https://www.ncbi.nlm.nih.gov/pubmed/37993519 http://dx.doi.org/10.1038/s41598-023-46995-z |
Similar items
- Performance of GPT-3.5 and GPT-4 on the Japanese Medical Licensing Examination: Comparison Study
  by: Takagi, Soshi, et al.
  Published: (2023)
- Assessing the Performance of GPT-3.5 and GPT-4 on the 2023 Japanese Nursing Examination
  by: Kaneda, Yudai, et al.
  Published: (2023)
- Comparison of ChatGPT-3.5, ChatGPT-4, and Orthopaedic Resident Performance on Orthopaedic Assessment Examinations
  by: Massey, Patrick A., et al.
  Published: (2023)
- Improved Performance of ChatGPT-4 on the OKAP Examination: A Comparative Study with ChatGPT-3.5
  by: Teebagy, Sean, et al.
  Published: (2023)
- Comparative performance of humans versus GPT-4.0 and GPT-3.5 in the self-assessment program of American Academy of Ophthalmology
  by: Taloni, Andrea, et al.
  Published: (2023)