
GPTZero Performance in Identifying Artificial Intelligence-Generated Medical Texts: A Preliminary Study


Bibliographic Details
Main Author: Habibzadeh, Farrokh
Format: Online Article Text
Language: English
Published: The Korean Academy of Medical Sciences 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519776/
https://www.ncbi.nlm.nih.gov/pubmed/37750374
http://dx.doi.org/10.3346/jkms.2023.38.e319
Description
Summary: BACKGROUND: With the emergence of chatbots that help authors with scientific writing, editors should have tools to identify artificial intelligence-generated texts. GPTZero is among the first websites to have sought media attention by claiming to differentiate machine-generated from human-written texts. METHODS: The performance of GPTZero was assessed using 20 text pieces generated by ChatGPT in response to arbitrary questions on various topics in medicine and 30 pieces chosen from previously published medical articles. RESULTS: GPTZero had a sensitivity of 0.65 (95% confidence interval, 0.41–0.85); specificity, 0.90 (0.73–0.98); accuracy, 0.80 (0.66–0.90); and positive and negative likelihood ratios, 6.5 (2.1–19.9) and 0.4 (0.2–0.7), respectively. CONCLUSION: GPTZero has a low false-positive rate (classifying a human-written text as machine-generated) and a high false-negative rate (classifying a machine-generated text as human-written).
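The point estimates in the RESULTS can be reproduced from a 2×2 table. The sketch below is illustrative only: the record does not list the cell counts, so the values 13, 7, 27 and 3 are an assumption inferred from the reported rates and sample sizes (0.65 × 20 machine-generated texts and 0.90 × 30 human-written texts). The function name `diagnostic_metrics` is hypothetical, and the confidence intervals are not recomputed here.

```python
def diagnostic_metrics(tp, fn, tn, fp):
    """Return point estimates of common diagnostic test metrics.

    tp: machine-generated texts flagged as machine-generated
    fn: machine-generated texts classified as human-written
    tn: human-written texts classified as human-written
    fp: human-written texts flagged as machine-generated
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    false_positive_rate = fp / (tn + fp)
    false_negative_rate = fn / (tp + fn)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
        # Positive likelihood ratio: sensitivity / (1 - specificity)
        "lr_positive": sensitivity / false_positive_rate,
        # Negative likelihood ratio: (1 - sensitivity) / specificity
        "lr_negative": false_negative_rate / specificity,
    }


# Assumed counts, inferred from the abstract (not stated in the record).
metrics = diagnostic_metrics(tp=13, fn=7, tn=27, fp=3)

for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Under these assumed counts, 7 of the 20 machine-generated texts are missed (a false-negative rate of 0.35) while 3 of the 30 human-written texts are wrongly flagged (a false-positive rate of 0.10), which is the asymmetry the CONCLUSION describes.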