Cargando…
The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review
BACKGROUND: The high demand for health care services and the growing capability of artificial intelligence have led to the development of conversational agents designed to support a variety of health-related activities, including behavior change, treatment support, health monitoring, training, triag...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644372/ https://www.ncbi.nlm.nih.gov/pubmed/33090118 http://dx.doi.org/10.2196/20346 |
_version_ | 1783606441424191488 |
---|---|
author | Milne-Ives, Madison de Cock, Caroline Lim, Ernest Shehadeh, Melissa Harper de Pennington, Nick Mole, Guy Normando, Eduardo Meinert, Edward |
author_facet | Milne-Ives, Madison de Cock, Caroline Lim, Ernest Shehadeh, Melissa Harper de Pennington, Nick Mole, Guy Normando, Eduardo Meinert, Edward |
author_sort | Milne-Ives, Madison |
collection | PubMed |
description | BACKGROUND: The high demand for health care services and the growing capability of artificial intelligence have led to the development of conversational agents designed to support a variety of health-related activities, including behavior change, treatment support, health monitoring, training, triage, and screening support. Automation of these tasks could free clinicians to focus on more complex work and increase the accessibility to health care services for the public. An overarching assessment of the acceptability, usability, and effectiveness of these agents in health care is needed to collate the evidence so that future development can target areas for improvement and potential for sustainable adoption. OBJECTIVE: This systematic review aims to assess the effectiveness and usability of conversational agents in health care and identify the elements that users like and dislike to inform future research and development of these agents. METHODS: PubMed, Medline (Ovid), EMBASE (Excerpta Medica dataBASE), CINAHL (Cumulative Index to Nursing and Allied Health Literature), Web of Science, and the Association for Computing Machinery Digital Library were systematically searched for articles published since 2008 that evaluated unconstrained natural language processing conversational agents used in health care. EndNote (version X9, Clarivate Analytics) reference management software was used for initial screening, and full-text screening was conducted by 1 reviewer. Data were extracted, and the risk of bias was assessed by one reviewer and validated by another. RESULTS: A total of 31 studies were selected and included a variety of conversational agents, including 14 chatbots (2 of which were voice chatbots), 6 embodied conversational agents (3 of which were interactive voice response calls, virtual patients, and speech recognition screening systems), 1 contextual question-answering agent, and 1 voice recognition triage system. Overall, the evidence reported was mostly positive or mixed. Usability and satisfaction performed well (27/30 and 26/31), and positive or mixed effectiveness was found in three-quarters of the studies (23/30). However, there were several limitations of the agents highlighted in specific qualitative feedback. CONCLUSIONS: The studies generally reported positive or mixed evidence for the effectiveness, usability, and satisfactoriness of the conversational agents investigated, but qualitative user perceptions were more mixed. The quality of many of the studies was limited, and improved study design and reporting are necessary to more accurately evaluate the usefulness of the agents in health care and identify key areas for improvement. Further research should also analyze the cost-effectiveness, privacy, and security of the agents. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR2-10.2196/16934 |
format | Online Article Text |
id | pubmed-7644372 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-76443722020-11-16 The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review Milne-Ives, Madison de Cock, Caroline Lim, Ernest Shehadeh, Melissa Harper de Pennington, Nick Mole, Guy Normando, Eduardo Meinert, Edward J Med Internet Res Review BACKGROUND: The high demand for health care services and the growing capability of artificial intelligence have led to the development of conversational agents designed to support a variety of health-related activities, including behavior change, treatment support, health monitoring, training, triage, and screening support. Automation of these tasks could free clinicians to focus on more complex work and increase the accessibility to health care services for the public. An overarching assessment of the acceptability, usability, and effectiveness of these agents in health care is needed to collate the evidence so that future development can target areas for improvement and potential for sustainable adoption. OBJECTIVE: This systematic review aims to assess the effectiveness and usability of conversational agents in health care and identify the elements that users like and dislike to inform future research and development of these agents. METHODS: PubMed, Medline (Ovid), EMBASE (Excerpta Medica dataBASE), CINAHL (Cumulative Index to Nursing and Allied Health Literature), Web of Science, and the Association for Computing Machinery Digital Library were systematically searched for articles published since 2008 that evaluated unconstrained natural language processing conversational agents used in health care. EndNote (version X9, Clarivate Analytics) reference management software was used for initial screening, and full-text screening was conducted by 1 reviewer. Data were extracted, and the risk of bias was assessed by one reviewer and validated by another. RESULTS: A total of 31 studies were selected and included a variety of conversational agents, including 14 chatbots (2 of which were voice chatbots), 6 embodied conversational agents (3 of which were interactive voice response calls, virtual patients, and speech recognition screening systems), 1 contextual question-answering agent, and 1 voice recognition triage system. Overall, the evidence reported was mostly positive or mixed. Usability and satisfaction performed well (27/30 and 26/31), and positive or mixed effectiveness was found in three-quarters of the studies (23/30). However, there were several limitations of the agents highlighted in specific qualitative feedback. CONCLUSIONS: The studies generally reported positive or mixed evidence for the effectiveness, usability, and satisfactoriness of the conversational agents investigated, but qualitative user perceptions were more mixed. The quality of many of the studies was limited, and improved study design and reporting are necessary to more accurately evaluate the usefulness of the agents in health care and identify key areas for improvement. Further research should also analyze the cost-effectiveness, privacy, and security of the agents. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR2-10.2196/16934 JMIR Publications 2020-10-22 /pmc/articles/PMC7644372/ /pubmed/33090118 http://dx.doi.org/10.2196/20346 Text en ©Madison Milne-Ives, Caroline de Cock, Ernest Lim, Melissa Harper Shehadeh, Nick de Pennington, Guy Mole, Eduardo Normando, Edward Meinert. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 22.10.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Review Milne-Ives, Madison de Cock, Caroline Lim, Ernest Shehadeh, Melissa Harper de Pennington, Nick Mole, Guy Normando, Eduardo Meinert, Edward The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review |
title | The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review |
title_full | The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review |
title_fullStr | The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review |
title_full_unstemmed | The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review |
title_short | The Effectiveness of Artificial Intelligence Conversational Agents in Health Care: Systematic Review |
title_sort | effectiveness of artificial intelligence conversational agents in health care: systematic review |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644372/ https://www.ncbi.nlm.nih.gov/pubmed/33090118 http://dx.doi.org/10.2196/20346 |
work_keys_str_mv | AT milneivesmadison theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT decockcaroline theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT limernest theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT shehadehmelissaharper theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT depenningtonnick theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT moleguy theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT normandoeduardo theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT meinertedward theeffectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT milneivesmadison effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT decockcaroline effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT limernest effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT shehadehmelissaharper effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT depenningtonnick effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT moleguy effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT normandoeduardo effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview AT meinertedward effectivenessofartificialintelligenceconversationalagentsinhealthcaresystematicreview |