Cargando…

Evaluation of the performance of GPT-3.5 and GPT-4 on the Polish Medical Final Examination

The study aimed to evaluate the performance of two Large Language Models (LLMs): ChatGPT (based on GPT-3.5) and GPT-4 with two temperature parameter values, on the Polish Medical Final Examination (MFE). The models were tested on three editions of the MFE from: Spring 2022, Autumn 2022, and Spring 2...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rosoł, Maciej, Gąsior, Jakub S., Łaba, Jonasz, Korzeniewski, Kacper, Młyńczak, Marcel
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665355/ https://www.ncbi.nlm.nih.gov/pubmed/37993519 http://dx.doi.org/10.1038/s41598-023-46995-z

Ejemplares similares

Performance of GPT-3.5 and GPT-4 on the Japanese Medical Licensing Examination: Comparison Study
por: Takagi, Soshi, et al.
Publicado: (2023)

Assessing the Performance of GPT-3.5 and GPT-4 on the 2023 Japanese Nursing Examination
por: Kaneda, Yudai, et al.
Publicado: (2023)

Comparison of ChatGPT–3.5, ChatGPT-4, and Orthopaedic Resident Performance on Orthopaedic Assessment Examinations
por: Massey, Patrick A., et al.
Publicado: (2023)

Improved Performance of ChatGPT-4 on the OKAP Examination: A Comparative Study with ChatGPT-3.5
por: Teebagy, Sean, et al.
Publicado: (2023)

Comparative performance of humans versus GPT-4.0 and GPT-3.5 in the self-assessment program of American Academy of Ophthalmology
por: Taloni, Andrea, et al.
Publicado: (2023)

Evaluating ChatGPT Performance on the Orthopaedic In-Training Examination
por: Kung, Justin E., et al.
Publicado: (2023)

Suicide Risk Assessments Through the Eyes of ChatGPT-3.5 Versus ChatGPT-4: Vignette Study
por: Levkovich, Inbar, et al.
Publicado: (2023)

Comparing ChatGPT and GPT-4 performance in USMLE soft skill assessments
por: Brin, Dana, et al.
Publicado: (2023)

Benchmarking large language models’ performances for myopia care: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, and Google Bard
por: Lim, Zhi Wei, et al.
Publicado: (2023)

Artificial Intelligence in Ophthalmology: A Comparative Analysis of GPT-3.5, GPT-4, and Human Expertise in Answering StatPearls Questions
por: Moshirfar, Majid, et al.
Publicado: (2023)

Performance of ChatGPT on the Peruvian National Licensing Medical Examination: Cross-Sectional Study
por: Flores-Cohaila, Javier A, et al.
Publicado: (2023)

Assessment of ChatGPT’s performance on neurology written board examination questions
por: Chen, Tse Chiang, et al.
Publicado: (2023)

ChatGPT, authorship, and medical publishing
por: Kleebayoon, Amnuay, et al.
Publicado: (2023)

Examining Real-World Medication Consultations and Drug-Herb Interactions: ChatGPT Performance Evaluation
por: Hsu, Hsing-Yu, et al.
Publicado: (2023)

ChatGPT performance in the medical specialty exam: An observational study
por: Oztermeli, Ayse Dilara, et al.
Publicado: (2023)

Will ChatGPT pass the Polish specialty exam in radiology and diagnostic imaging? Insights into strengths and limitations
por: Kufel, Jakub, et al.
Publicado: (2023)

ChatGPT/GPT-4: enabling a new era of surgical oncology
por: Cheng, Kunming, et al.
Publicado: (2023)

To ChatGPT or not to ChatGPT: the use of artificial intelligence in writing scientific papers
por: Marescotti, Manuela
Publicado: (2023)

The potential impact of ChatGPT/GPT-4 on surgery: will it topple the profession of surgeons?
por: Cheng, Kunming, et al.
Publicado: (2023)

Analyzing the Performance of ChatGPT About Osteoporosis
por: Cinar, Cigdem
Publicado: (2023)

ChatGPT for medical applications and urological science
por: Reis, Leonardo O.
Publicado: (2023)

ChatGPT for Future Medical and Dental Research
por: Fatani, Bader
Publicado: (2023)

Evaluating the limits of AI in medical specialisation: ChatGPT’s performance on the UK Neurology Specialty Certificate Examination
por: Giannos, Panagiotis
Publicado: (2023)

Performance of ChatGPT-4 in answering questions from the Brazilian National Examination for Medical Degree Revalidation
por: Gobira, Mauro, et al.
Publicado: (2023)

Evaluating the Sensitivity, Specificity, and Accuracy of ChatGPT-3.5, ChatGPT-4, Bing AI, and Bard Against Conventional Drug-Drug Interactions Clinical Tools
por: Al-Ashwal, Fahmi Y, et al.
Publicado: (2023)

Using cognitive psychology to understand GPT-3
por: Binz, Marcel, et al.
Publicado: (2023)

Performance of ChatGPT, human radiologists, and context-aware ChatGPT in identifying AO codes from radiology reports
por: Russe, Maximilian F., et al.
Publicado: (2023)

102. Assessing ChatGPT Performance in the Brazilian Infectious Disease Specialist Certification Examination
por: Chaves Fernandes, Alexandre, et al.
Publicado: (2023)

Artificial hallucination: GPT on LSD?
por: Beutel, Gernot, et al.
Publicado: (2023)

ChatGPT in Clinical Toxicology
por: Sabry Abdel-Messih, Mary, et al.
Publicado: (2023)

Authorship Policy and ChatGPT
por: Kleebayoon, Amnuay, et al.
Publicado: (2023)

ChatGPT and scientific paper
por: Kleebayoon, Amnuay, et al.
Publicado: (2023)

ChatGPT- Quo Vadis?
por: Kaliyadan, Feroze, et al.
Publicado: (2023)

ChatGPT and Environmental Research
por: Zhu, Jun-Jie, et al.
Publicado: (2023)

Examining the Threat of ChatGPT to the Validity of Short Answer Assessments in an Undergraduate Medical Program
por: Morjaria, Leo, et al.
Publicado: (2023)

Accuracy of ChatGPT on Medical Questions in the National Medical Licensing Examination in Japan: Evaluation Study
por: Yanagita, Yasutaka, et al.
Publicado: (2023)

Feature-based detection of automated language models: tackling GPT-2, GPT-3 and Grover
por: Fröhling, Leon, et al.
Publicado: (2021)

Artificial Intelligence in Intensive Care Medicine: Toward a ChatGPT/GPT-4 Way?
por: Lu, Yanqiu, et al.
Publicado: (2023)

The potential role of ChatGPT and artificial intelligence in anatomy education: a conversation with ChatGPT
por: Totlis, Trifon, et al.
Publicado: (2023)

ChatGPT, GPT-4, and Other Large Language Models: The Next Revolution for Clinical Microbiology?
por: Egli, Adrian
Publicado: (2023)

Cannot write session to /tmp/vufind_sessions/sess_4ictgdc91mn4lkujbfgc1nqoq0