Cargando…

A dataset of simulated patient-physician medical interviews with a focus on respiratory cases

Artificial Intelligence (AI) is playing a major role in medical education, diagnosis, and outbreak detection through Natural Language Processing (NLP), machine learning models and deep learning tools. However, in order to train AI to facilitate these medical fields, well-documented and accurate medi...

Descripción completa

Detalles Bibliográficos
Autores principales: Fareez, Faiha, Parikh, Tishya, Wavell, Christopher, Shahab, Saba, Chevalier, Meghan, Good, Scott, De Blasi, Isabella, Rhouma, Rafik, McMahon, Christopher, Lam, Jean-Paul, Lo, Thomas, Smith, Christopher W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9203765/
https://www.ncbi.nlm.nih.gov/pubmed/35710769
http://dx.doi.org/10.1038/s41597-022-01423-1
_version_ 1784728770410708992
author Fareez, Faiha
Parikh, Tishya
Wavell, Christopher
Shahab, Saba
Chevalier, Meghan
Good, Scott
De Blasi, Isabella
Rhouma, Rafik
McMahon, Christopher
Lam, Jean-Paul
Lo, Thomas
Smith, Christopher W.
author_facet Fareez, Faiha
Parikh, Tishya
Wavell, Christopher
Shahab, Saba
Chevalier, Meghan
Good, Scott
De Blasi, Isabella
Rhouma, Rafik
McMahon, Christopher
Lam, Jean-Paul
Lo, Thomas
Smith, Christopher W.
author_sort Fareez, Faiha
collection PubMed
description Artificial Intelligence (AI) is playing a major role in medical education, diagnosis, and outbreak detection through Natural Language Processing (NLP), machine learning models and deep learning tools. However, in order to train AI to facilitate these medical fields, well-documented and accurate medical conversations are needed. The dataset presented covers a series of medical conversations in the format of Objective Structured Clinical Examinations (OSCE), with a focus on respiratory cases in audio format and corresponding text documents. These cases were simulated, recorded, transcribed, and manually corrected with the underlying aim of providing a comprehensive set of medical conversation data to the academic and industry community. Potential applications include speech recognition detection for speech-to-text errors, training NLP models to extract symptoms, detecting diseases, or for educational purposes, including training an avatar to converse with healthcare professional students as a standardized patient during clinical examinations. The application opportunities for the presented dataset are vast, given that this calibre of data is difficult to access and costly to develop.
format Online
Article
Text
id pubmed-9203765
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92037652022-06-18 A dataset of simulated patient-physician medical interviews with a focus on respiratory cases Fareez, Faiha Parikh, Tishya Wavell, Christopher Shahab, Saba Chevalier, Meghan Good, Scott De Blasi, Isabella Rhouma, Rafik McMahon, Christopher Lam, Jean-Paul Lo, Thomas Smith, Christopher W. Sci Data Data Descriptor Artificial Intelligence (AI) is playing a major role in medical education, diagnosis, and outbreak detection through Natural Language Processing (NLP), machine learning models and deep learning tools. However, in order to train AI to facilitate these medical fields, well-documented and accurate medical conversations are needed. The dataset presented covers a series of medical conversations in the format of Objective Structured Clinical Examinations (OSCE), with a focus on respiratory cases in audio format and corresponding text documents. These cases were simulated, recorded, transcribed, and manually corrected with the underlying aim of providing a comprehensive set of medical conversation data to the academic and industry community. Potential applications include speech recognition detection for speech-to-text errors, training NLP models to extract symptoms, detecting diseases, or for educational purposes, including training an avatar to converse with healthcare professional students as a standardized patient during clinical examinations. The application opportunities for the presented dataset are vast, given that this calibre of data is difficult to access and costly to develop. Nature Publishing Group UK 2022-06-16 /pmc/articles/PMC9203765/ /pubmed/35710769 http://dx.doi.org/10.1038/s41597-022-01423-1 Text en © The Author(s) 2022, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Fareez, Faiha
Parikh, Tishya
Wavell, Christopher
Shahab, Saba
Chevalier, Meghan
Good, Scott
De Blasi, Isabella
Rhouma, Rafik
McMahon, Christopher
Lam, Jean-Paul
Lo, Thomas
Smith, Christopher W.
A dataset of simulated patient-physician medical interviews with a focus on respiratory cases
title A dataset of simulated patient-physician medical interviews with a focus on respiratory cases
title_full A dataset of simulated patient-physician medical interviews with a focus on respiratory cases
title_fullStr A dataset of simulated patient-physician medical interviews with a focus on respiratory cases
title_full_unstemmed A dataset of simulated patient-physician medical interviews with a focus on respiratory cases
title_short A dataset of simulated patient-physician medical interviews with a focus on respiratory cases
title_sort dataset of simulated patient-physician medical interviews with a focus on respiratory cases
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9203765/
https://www.ncbi.nlm.nih.gov/pubmed/35710769
http://dx.doi.org/10.1038/s41597-022-01423-1
work_keys_str_mv AT fareezfaiha adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT parikhtishya adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT wavellchristopher adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT shahabsaba adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT chevaliermeghan adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT goodscott adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT deblasiisabella adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT rhoumarafik adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT mcmahonchristopher adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT lamjeanpaul adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT lothomas adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT smithchristopherw adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT fareezfaiha datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT parikhtishya datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT wavellchristopher datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT shahabsaba datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT chevaliermeghan datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT goodscott datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT deblasiisabella datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT rhoumarafik datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT mcmahonchristopher datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT lamjeanpaul datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT lothomas datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases
AT smithchristopherw datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases