Cargando…

The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review

BACKGROUND: The high demand for health care services and the growing capability of artificial intelligence have led to the development of conversational agents designed to support a variety of health-related activities, including behavior change, treatment support, health monitoring, training, triag...

Descripción completa

Detalles Bibliográficos
Autores principales: Milne-Ives, Madison, de Cock, Caroline, Lim, Ernest, Shehadeh, Melissa Harper, de Pennington, Nick, Mole, Guy, Normando, Eduardo, Meinert, Edward
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644372/
https://www.ncbi.nlm.nih.gov/pubmed/33090118
http://dx.doi.org/10.2196/20346
_version_ 1783606441424191488
author Milne-Ives, Madison
de Cock, Caroline
Lim, Ernest
Shehadeh, Melissa Harper
de Pennington, Nick
Mole, Guy
Normando, Eduardo
Meinert, Edward
author_facet Milne-Ives, Madison
de Cock, Caroline
Lim, Ernest
Shehadeh, Melissa Harper
de Pennington, Nick
Mole, Guy
Normando, Eduardo
Meinert, Edward
author_sort Milne-Ives, Madison
collection PubMed
description BACKGROUND: The high demand for health care services and the growing capability of artificial intelligence have led to the development of conversational agents designed to support a variety of health-related activities, including behavior change, treatment support, health monitoring, training, triage, and screening support. Automation of these tasks could free clinicians to focus on more complex work and increase the accessibility to health care services for the public. An overarching assessment of the acceptability, usability, and effectiveness of these agents in health care is needed to collate the evidence so that future development can target areas for improvement and potential for sustainable adoption. OBJECTIVE: This systematic review aims to assess the effectiveness and usability of conversational agents in health care and identify the elements that users like and dislike to inform future research and development of these agents. METHODS: PubMed, Medline (Ovid), EMBASE (Excerpta Medica dataBASE), CINAHL (Cumulative Index to Nursing and Allied Health Literature), Web of Science, and the Association for Computing Machinery Digital Library were systematically searched for articles published since 2008 that evaluated unconstrained natural language processing conversational agents used in health care. EndNote (version X9, Clarivate Analytics) reference management software was used for initial screening, and full-text screening was conducted by 1 reviewer. Data were extracted, and the risk of bias was assessed by one reviewer and validated by another. RESULTS: A total of 31 studies were selected and included a variety of conversational agents, including 14 chatbots (2 of which were voice chatbots), 6 embodied conversational agents (3 of which were interactive voice response calls, virtual patients, and speech recognition screening systems), 1 contextual question-answering agent, and 1 voice recognition triage system. Overall, the evidence reported was mostly positive or mixed. Usability and satisfaction performed well (27/30 and 26/31), and positive or mixed effectiveness was found in three-quarters of the studies (23/30). However, there were several limitations of the agents highlighted in specific qualitative feedback. CONCLUSIONS: The studies generally reported positive or mixed evidence for the effectiveness, usability, and satisfactoriness of the conversational agents investigated, but qualitative user perceptions were more mixed. The quality of many of the studies was limited, and improved study design and reporting are necessary to more accurately evaluate the usefulness of the agents in health care and identify key areas for improvement. Further research should also analyze the cost-effectiveness, privacy, and security of the agents. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR2-10.2196/16934
format Online
Article
Text
id pubmed-7644372
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-76443722020-11-16 The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review Milne-Ives, Madison de Cock, Caroline Lim, Ernest Shehadeh, Melissa Harper de Pennington, Nick Mole, Guy Normando, Eduardo Meinert, Edward J Med Internet Res Review BACKGROUND: The high demand for health care services and the growing capability of artificial intelligence have led to the development of conversational agents designed to support a variety of health-related activities, including behavior change, treatment support, health monitoring, training, triage, and screening support. Automation of these tasks could free clinicians to focus on more complex work and increase the accessibility to health care services for the public. An overarching assessment of the acceptability, usability, and effectiveness of these agents in health care is needed to collate the evidence so that future development can target areas for improvement and potential for sustainable adoption. OBJECTIVE: This systematic review aims to assess the effectiveness and usability of conversational agents in health care and identify the elements that users like and dislike to inform future research and development of these agents. METHODS: PubMed, Medline (Ovid), EMBASE (Excerpta Medica dataBASE), CINAHL (Cumulative Index to Nursing and Allied Health Literature), Web of Science, and the Association for Computing Machinery Digital Library were systematically searched for articles published since 2008 that evaluated unconstrained natural language processing conversational agents used in health care. EndNote (version X9, Clarivate Analytics) reference management software was used for initial screening, and full-text screening was conducted by 1 reviewer. Data were extracted, and the risk of bias was assessed by one reviewer and validated by another. RESULTS: A total of 31 studies were selected and included a variety of conversational agents, including 14 chatbots (2 of which were voice chatbots), 6 embodied conversational agents (3 of which were interactive voice response calls, virtual patients, and speech recognition screening systems), 1 contextual question-answering agent, and 1 voice recognition triage system. Overall, the evidence reported was mostly positive or mixed. Usability and satisfaction performed well (27/30 and 26/31), and positive or mixed effectiveness was found in three-quarters of the studies (23/30). However, there were several limitations of the agents highlighted in specific qualitative feedback. CONCLUSIONS: The studies generally reported positive or mixed evidence for the effectiveness, usability, and satisfactoriness of the conversational agents investigated, but qualitative user perceptions were more mixed. The quality of many of the studies was limited, and improved study design and reporting are necessary to more accurately evaluate the usefulness of the agents in health care and identify key areas for improvement. Further research should also analyze the cost-effectiveness, privacy, and security of the agents. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR2-10.2196/16934 JMIR Publications 2020-10-22 /pmc/articles/PMC7644372/ /pubmed/33090118 http://dx.doi.org/10.2196/20346 Text en ©Madison Milne-Ives, Caroline de Cock, Ernest Lim, Melissa Harper Shehadeh, Nick de Pennington, Guy Mole, Eduardo Normando, Edward Meinert. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 22.10.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Review
Milne-Ives, Madison
de Cock, Caroline
Lim, Ernest
Shehadeh, Melissa Harper
de Pennington, Nick
Mole, Guy
Normando, Eduardo
Meinert, Edward
The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review
title The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review
title_full The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review
title_fullStr The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review
title_full_unstemmed The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review
title_short The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review
title_sort effectiveness of artificial intelligence conversational agents in health care: systematic review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644372/
https://www.ncbi.nlm.nih.gov/pubmed/33090118
http://dx.doi.org/10.2196/20346
work_keys_str_mv AT milneivesmadison theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT decockcaroline theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT limernest theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT shehadehmelissaharper theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT depenningtonnick theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT moleguy theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT normandoeduardo theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT meinertedward theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT milneivesmadison effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT decockcaroline effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT limernest effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT shehadehmelissaharper effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT depenningtonnick effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT moleguy effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT normandoeduardo effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview
AT meinertedward effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview