Cargando…

“Yes, but will it work for my patients?” Driving clinically relevant research with benchmark datasets

Benchmark datasets have a powerful normative influence: by determining how the real world is represented in data, they define which problems will first be solved by algorithms built using the datasets and, by extension, who these algorithms will work for. It is desirable for these datasets to serve...

Descripción completa

Detalles Bibliográficos
Autores principales: Panch, Trishan, Pollard, Tom J., Mattie, Heather, Lindemer, Emily, Keane, Pearse A., Celi, Leo Anthony
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7305156/
https://www.ncbi.nlm.nih.gov/pubmed/32577534
http://dx.doi.org/10.1038/s41746-020-0295-6
_version_ 1783548399740518400
author Panch, Trishan
Pollard, Tom J.
Mattie, Heather
Lindemer, Emily
Keane, Pearse A.
Celi, Leo Anthony
author_facet Panch, Trishan
Pollard, Tom J.
Mattie, Heather
Lindemer, Emily
Keane, Pearse A.
Celi, Leo Anthony
author_sort Panch, Trishan
collection PubMed
description Benchmark datasets have a powerful normative influence: by determining how the real world is represented in data, they define which problems will first be solved by algorithms built using the datasets and, by extension, who these algorithms will work for. It is desirable for these datasets to serve four functions: (1) enabling the creation of clinically relevant algorithms; (2) facilitating like-for-like comparison of algorithmic performance; (3) ensuring reproducibility of algorithms; (4) asserting a normative influence on the clinical domains and diversity of patients that will potentially benefit from technological advances. Without benchmark datasets that satisfy these functions, it is impossible to address two perennial concerns of clinicians experienced in computational research: “the data scientists just go where the data is rather than where the needs are,” and, “yes, but will this work for my patients?” If algorithms are to be developed and applied for the care of patients, then it is prudent for the research community to create benchmark datasets proactively, across specialties. As yet, best practice in this area has not been defined. Broadly speaking, efforts will include design of the dataset; compliance and contracting issues relating to the sharing of sensitive data; enabling access and reuse; and planning for translation of algorithms to the clinical environment. If a deliberate and systematic approach is not followed, not only will the considerable benefits of clinical algorithms fail to be realized, but the potential harms may be regressively incurred across existing gradients of social inequity.
format Online
Article
Text
id pubmed-7305156
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73051562020-06-22 “Yes, but will it work for my patients?” Driving clinically relevant research with benchmark datasets Panch, Trishan Pollard, Tom J. Mattie, Heather Lindemer, Emily Keane, Pearse A. Celi, Leo Anthony NPJ Digit Med Perspective Benchmark datasets have a powerful normative influence: by determining how the real world is represented in data, they define which problems will first be solved by algorithms built using the datasets and, by extension, who these algorithms will work for. It is desirable for these datasets to serve four functions: (1) enabling the creation of clinically relevant algorithms; (2) facilitating like-for-like comparison of algorithmic performance; (3) ensuring reproducibility of algorithms; (4) asserting a normative influence on the clinical domains and diversity of patients that will potentially benefit from technological advances. Without benchmark datasets that satisfy these functions, it is impossible to address two perennial concerns of clinicians experienced in computational research: “the data scientists just go where the data is rather than where the needs are,” and, “yes, but will this work for my patients?” If algorithms are to be developed and applied for the care of patients, then it is prudent for the research community to create benchmark datasets proactively, across specialties. As yet, best practice in this area has not been defined. Broadly speaking, efforts will include design of the dataset; compliance and contracting issues relating to the sharing of sensitive data; enabling access and reuse; and planning for translation of algorithms to the clinical environment. If a deliberate and systematic approach is not followed, not only will the considerable benefits of clinical algorithms fail to be realized, but the potential harms may be regressively incurred across existing gradients of social inequity. Nature Publishing Group UK 2020-06-19 /pmc/articles/PMC7305156/ /pubmed/32577534 http://dx.doi.org/10.1038/s41746-020-0295-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Perspective
Panch, Trishan
Pollard, Tom J.
Mattie, Heather
Lindemer, Emily
Keane, Pearse A.
Celi, Leo Anthony
“Yes, but will it work for my patients?” Driving clinically relevant research with benchmark datasets
title “Yes, but will it work for my patients?” Driving clinically relevant research with benchmark datasets
title_full “Yes, but will it work for my patients?” Driving clinically relevant research with benchmark datasets
title_fullStr “Yes, but will it work for my patients?” Driving clinically relevant research with benchmark datasets
title_full_unstemmed “Yes, but will it work for my patients?” Driving clinically relevant research with benchmark datasets
title_short “Yes, but will it work for my patients?” Driving clinically relevant research with benchmark datasets
title_sort “yes, but will it work for my patients?” driving clinically relevant research with benchmark datasets
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7305156/
https://www.ncbi.nlm.nih.gov/pubmed/32577534
http://dx.doi.org/10.1038/s41746-020-0295-6
work_keys_str_mv AT panchtrishan yesbutwillitworkformypatientsdrivingclinicallyrelevantresearchwithbenchmarkdatasets
AT pollardtomj yesbutwillitworkformypatientsdrivingclinicallyrelevantresearchwithbenchmarkdatasets
AT mattieheather yesbutwillitworkformypatientsdrivingclinicallyrelevantresearchwithbenchmarkdatasets
AT lindemeremily yesbutwillitworkformypatientsdrivingclinicallyrelevantresearchwithbenchmarkdatasets
AT keanepearsea yesbutwillitworkformypatientsdrivingclinicallyrelevantresearchwithbenchmarkdatasets
AT celileoanthony yesbutwillitworkformypatientsdrivingclinicallyrelevantresearchwithbenchmarkdatasets