Cargando…
A dataset of simulated patient-physician medical interviews with a focus on respiratory cases
Artificial Intelligence (AI) is playing a major role in medical education, diagnosis, and outbreak detection through Natural Language Processing (NLP), machine learning models and deep learning tools. However, in order to train AI to facilitate these medical fields, well-documented and accurate medi...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9203765/ https://www.ncbi.nlm.nih.gov/pubmed/35710769 http://dx.doi.org/10.1038/s41597-022-01423-1 |
_version_ | 1784728770410708992 |
---|---|
author | Fareez, Faiha Parikh, Tishya Wavell, Christopher Shahab, Saba Chevalier, Meghan Good, Scott De Blasi, Isabella Rhouma, Rafik McMahon, Christopher Lam, Jean-Paul Lo, Thomas Smith, Christopher W. |
author_facet | Fareez, Faiha Parikh, Tishya Wavell, Christopher Shahab, Saba Chevalier, Meghan Good, Scott De Blasi, Isabella Rhouma, Rafik McMahon, Christopher Lam, Jean-Paul Lo, Thomas Smith, Christopher W. |
author_sort | Fareez, Faiha |
collection | PubMed |
description | Artificial Intelligence (AI) is playing a major role in medical education, diagnosis, and outbreak detection through Natural Language Processing (NLP), machine learning models and deep learning tools. However, in order to train AI to facilitate these medical fields, well-documented and accurate medical conversations are needed. The dataset presented covers a series of medical conversations in the format of Objective Structured Clinical Examinations (OSCE), with a focus on respiratory cases in audio format and corresponding text documents. These cases were simulated, recorded, transcribed, and manually corrected with the underlying aim of providing a comprehensive set of medical conversation data to the academic and industry community. Potential applications include speech recognition detection for speech-to-text errors, training NLP models to extract symptoms, detecting diseases, or for educational purposes, including training an avatar to converse with healthcare professional students as a standardized patient during clinical examinations. The application opportunities for the presented dataset are vast, given that this calibre of data is difficult to access and costly to develop. |
format | Online Article Text |
id | pubmed-9203765 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-92037652022-06-18 A dataset of simulated patient-physician medical interviews with a focus on respiratory cases Fareez, Faiha Parikh, Tishya Wavell, Christopher Shahab, Saba Chevalier, Meghan Good, Scott De Blasi, Isabella Rhouma, Rafik McMahon, Christopher Lam, Jean-Paul Lo, Thomas Smith, Christopher W. Sci Data Data Descriptor Artificial Intelligence (AI) is playing a major role in medical education, diagnosis, and outbreak detection through Natural Language Processing (NLP), machine learning models and deep learning tools. However, in order to train AI to facilitate these medical fields, well-documented and accurate medical conversations are needed. The dataset presented covers a series of medical conversations in the format of Objective Structured Clinical Examinations (OSCE), with a focus on respiratory cases in audio format and corresponding text documents. These cases were simulated, recorded, transcribed, and manually corrected with the underlying aim of providing a comprehensive set of medical conversation data to the academic and industry community. Potential applications include speech recognition detection for speech-to-text errors, training NLP models to extract symptoms, detecting diseases, or for educational purposes, including training an avatar to converse with healthcare professional students as a standardized patient during clinical examinations. The application opportunities for the presented dataset are vast, given that this calibre of data is difficult to access and costly to develop. Nature Publishing Group UK 2022-06-16 /pmc/articles/PMC9203765/ /pubmed/35710769 http://dx.doi.org/10.1038/s41597-022-01423-1 Text en © The Author(s) 2022, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Data Descriptor Fareez, Faiha Parikh, Tishya Wavell, Christopher Shahab, Saba Chevalier, Meghan Good, Scott De Blasi, Isabella Rhouma, Rafik McMahon, Christopher Lam, Jean-Paul Lo, Thomas Smith, Christopher W. A dataset of simulated patient-physician medical interviews with a focus on respiratory cases |
title | A dataset of simulated patient-physician medical interviews with a focus on respiratory cases |
title_full | A dataset of simulated patient-physician medical interviews with a focus on respiratory cases |
title_fullStr | A dataset of simulated patient-physician medical interviews with a focus on respiratory cases |
title_full_unstemmed | A dataset of simulated patient-physician medical interviews with a focus on respiratory cases |
title_short | A dataset of simulated patient-physician medical interviews with a focus on respiratory cases |
title_sort | dataset of simulated patient-physician medical interviews with a focus on respiratory cases |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9203765/ https://www.ncbi.nlm.nih.gov/pubmed/35710769 http://dx.doi.org/10.1038/s41597-022-01423-1 |
work_keys_str_mv | AT fareezfaiha adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT parikhtishya adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT wavellchristopher adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT shahabsaba adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT chevaliermeghan adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT goodscott adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT deblasiisabella adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT rhoumarafik adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT mcmahonchristopher adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT lamjeanpaul adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT lothomas adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT smithchristopherw adatasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT fareezfaiha datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT parikhtishya datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT wavellchristopher datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT shahabsaba datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT chevaliermeghan datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT goodscott datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT deblasiisabella datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT rhoumarafik datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT mcmahonchristopher datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT lamjeanpaul datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT lothomas datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases AT smithchristopherw datasetofsimulatedpatientphysicianmedicalinterviewswithafocusonrespiratorycases |