Cargando…

Human-like problem-solving abilities in large language models using ChatGPT

BACKGROUNDS: The field of Artificial Intelligence (AI) has seen a major shift in recent years due to the development of new Machine Learning (ML) models such as Generative Pre-trained Transformer (GPT). GPT has achieved previously unheard-of levels of accuracy in most computerized language processin...

Descripción completa

Detalles Bibliográficos
Autores principales:	Orrù, Graziella, Piarulli, Andrea, Conversano, Ciro, Gemignani, Angelo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10244637/ https://www.ncbi.nlm.nih.gov/pubmed/37293238 http://dx.doi.org/10.3389/frai.2023.1199350

_version_	1785054684449341440
author	Orrù, Graziella Piarulli, Andrea Conversano, Ciro Gemignani, Angelo
author_facet	Orrù, Graziella Piarulli, Andrea Conversano, Ciro Gemignani, Angelo
author_sort	Orrù, Graziella
collection	PubMed
description	BACKGROUNDS: The field of Artificial Intelligence (AI) has seen a major shift in recent years due to the development of new Machine Learning (ML) models such as Generative Pre-trained Transformer (GPT). GPT has achieved previously unheard-of levels of accuracy in most computerized language processing tasks and their chat-based variations. AIM: The aim of this study was to investigate the problem-solving abilities of ChatGPT using two sets of verbal insight problems, with a known performance level established by a sample of human participants. MATERIALS AND METHODS: A total of 30 problems labeled as “practice problems” and “transfer problems” were administered to ChatGPT. ChatGPT's answers received a score of “0” for each incorrectly answered problem and a score of “1” for each correct response. The highest possible score for both the practice and transfer problems was 15 out of 15. The solution rate for each problem (based on a sample of 20 subjects) was used to assess and compare the performance of ChatGPT with that of human subjects. RESULTS: The study highlighted that ChatGPT can be trained in out-of-the-box thinking and demonstrated potential in solving verbal insight problems. The global performance of ChatGPT equalled the most probable outcome for the human sample in both practice problems and transfer problems as well as upon their combination. Additionally, ChatGPT answer combinations were among the 5% of most probable outcomes for the human sample both when considering practice problems and pooled problem sets. These findings demonstrate that ChatGPT performance on both set of problems was in line with the mean rate of success of human subjects, indicating that it performed reasonably well. CONCLUSIONS: The use of transformer architecture and self-attention in ChatGPT may have helped to prioritize inputs while predicting, contributing to its potential in verbal insight problem-solving. ChatGPT has shown potential in solving insight problems, thus highlighting the importance of incorporating AI into psychological research. However, it is acknowledged that there are still open challenges. Indeed, further research is required to fully understand AI's capabilities and limitations in verbal problem-solving.
format	Online Article Text
id	pubmed-10244637
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-102446372023-06-08 Human-like problem-solving abilities in large language models using ChatGPT Orrù, Graziella Piarulli, Andrea Conversano, Ciro Gemignani, Angelo Front Artif Intell Artificial Intelligence BACKGROUNDS: The field of Artificial Intelligence (AI) has seen a major shift in recent years due to the development of new Machine Learning (ML) models such as Generative Pre-trained Transformer (GPT). GPT has achieved previously unheard-of levels of accuracy in most computerized language processing tasks and their chat-based variations. AIM: The aim of this study was to investigate the problem-solving abilities of ChatGPT using two sets of verbal insight problems, with a known performance level established by a sample of human participants. MATERIALS AND METHODS: A total of 30 problems labeled as “practice problems” and “transfer problems” were administered to ChatGPT. ChatGPT's answers received a score of “0” for each incorrectly answered problem and a score of “1” for each correct response. The highest possible score for both the practice and transfer problems was 15 out of 15. The solution rate for each problem (based on a sample of 20 subjects) was used to assess and compare the performance of ChatGPT with that of human subjects. RESULTS: The study highlighted that ChatGPT can be trained in out-of-the-box thinking and demonstrated potential in solving verbal insight problems. The global performance of ChatGPT equalled the most probable outcome for the human sample in both practice problems and transfer problems as well as upon their combination. Additionally, ChatGPT answer combinations were among the 5% of most probable outcomes for the human sample both when considering practice problems and pooled problem sets. These findings demonstrate that ChatGPT performance on both set of problems was in line with the mean rate of success of human subjects, indicating that it performed reasonably well. CONCLUSIONS: The use of transformer architecture and self-attention in ChatGPT may have helped to prioritize inputs while predicting, contributing to its potential in verbal insight problem-solving. ChatGPT has shown potential in solving insight problems, thus highlighting the importance of incorporating AI into psychological research. However, it is acknowledged that there are still open challenges. Indeed, further research is required to fully understand AI's capabilities and limitations in verbal problem-solving. Frontiers Media S.A. 2023-05-24 /pmc/articles/PMC10244637/ /pubmed/37293238 http://dx.doi.org/10.3389/frai.2023.1199350 Text en Copyright © 2023 Orrù, Piarulli, Conversano and Gemignani. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Artificial Intelligence Orrù, Graziella Piarulli, Andrea Conversano, Ciro Gemignani, Angelo Human-like problem-solving abilities in large language models using ChatGPT
title	Human-like problem-solving abilities in large language models using ChatGPT
title_full	Human-like problem-solving abilities in large language models using ChatGPT
title_fullStr	Human-like problem-solving abilities in large language models using ChatGPT
title_full_unstemmed	Human-like problem-solving abilities in large language models using ChatGPT
title_short	Human-like problem-solving abilities in large language models using ChatGPT
title_sort	human-like problem-solving abilities in large language models using chatgpt
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10244637/ https://www.ncbi.nlm.nih.gov/pubmed/37293238 http://dx.doi.org/10.3389/frai.2023.1199350
work_keys_str_mv	AT orrugraziella humanlikeproblemsolvingabilitiesinlargelanguagemodelsusingchatgpt AT piarulliandrea humanlikeproblemsolvingabilitiesinlargelanguagemodelsusingchatgpt AT conversanociro humanlikeproblemsolvingabilitiesinlargelanguagemodelsusingchatgpt AT gemignaniangelo humanlikeproblemsolvingabilitiesinlargelanguagemodelsusingchatgpt

Human-like problem-solving abilities in large language models using ChatGPT

Ejemplares similares