Cargando…

Understanding Views Around the Creation of a Consented, Donated Databank of Clinical Free Text to Develop and Train Natural Language Processing Models for Research: Focus Group Interviews With Stakeholders

BACKGROUND: Information stored within electronic health records is often recorded as unstructured text. Special computerized natural language processing (NLP) tools are needed to process this text; however, complex governance arrangements make such data in the National Health Service hard to access,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fitzpatrick, Natalie K, Dobson, Richard, Roberts, Angus, Jones, Kerina, Shah, Anoop D, Nenadic, Goran, Ford, Elizabeth
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2023
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193205/ https://www.ncbi.nlm.nih.gov/pubmed/37133927 http://dx.doi.org/10.2196/45534

_version_	1785043790467170304
author	Fitzpatrick, Natalie K Dobson, Richard Roberts, Angus Jones, Kerina Shah, Anoop D Nenadic, Goran Ford, Elizabeth
author_facet	Fitzpatrick, Natalie K Dobson, Richard Roberts, Angus Jones, Kerina Shah, Anoop D Nenadic, Goran Ford, Elizabeth
author_sort	Fitzpatrick, Natalie K
collection	PubMed
description	BACKGROUND: Information stored within electronic health records is often recorded as unstructured text. Special computerized natural language processing (NLP) tools are needed to process this text; however, complex governance arrangements make such data in the National Health Service hard to access, and therefore, it is difficult to use for research in improving NLP methods. The creation of a donated databank of clinical free text could provide an important opportunity for researchers to develop NLP methods and tools and may circumvent delays in accessing the data needed to train the models. However, to date, there has been little or no engagement with stakeholders on the acceptability and design considerations of establishing a free-text databank for this purpose. OBJECTIVE: This study aimed to ascertain stakeholder views around the creation of a consented, donated databank of clinical free text to help create, train, and evaluate NLP for clinical research and to inform the potential next steps for adopting a partner-led approach to establish a national, funded databank of free text for use by the research community. METHODS: Web-based in-depth focus group interviews were conducted with 4 stakeholder groups (patients and members of the public, clinicians, information governance leads and research ethics members, and NLP researchers). RESULTS: All stakeholder groups were strongly in favor of the databank and saw great value in creating an environment where NLP tools can be tested and trained to improve their accuracy. Participants highlighted a range of complex issues for consideration as the databank is developed, including communicating the intended purpose, the approach to access and safeguarding the data, who should have access, and how to fund the databank. Participants recommended that a small-scale, gradual approach be adopted to start to gather donations and encouraged further engagement with stakeholders to develop a road map and set of standards for the databank. CONCLUSIONS: These findings provide a clear mandate to begin developing the databank and a framework for stakeholder expectations, which we would aim to meet with the databank delivery.
format	Online Article Text
id	pubmed-10193205
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-101932052023-05-19 Understanding Views Around the Creation of a Consented, Donated Databank of Clinical Free Text to Develop and Train Natural Language Processing Models for Research: Focus Group Interviews With Stakeholders Fitzpatrick, Natalie K Dobson, Richard Roberts, Angus Jones, Kerina Shah, Anoop D Nenadic, Goran Ford, Elizabeth JMIR Med Inform Original Paper BACKGROUND: Information stored within electronic health records is often recorded as unstructured text. Special computerized natural language processing (NLP) tools are needed to process this text; however, complex governance arrangements make such data in the National Health Service hard to access, and therefore, it is difficult to use for research in improving NLP methods. The creation of a donated databank of clinical free text could provide an important opportunity for researchers to develop NLP methods and tools and may circumvent delays in accessing the data needed to train the models. However, to date, there has been little or no engagement with stakeholders on the acceptability and design considerations of establishing a free-text databank for this purpose. OBJECTIVE: This study aimed to ascertain stakeholder views around the creation of a consented, donated databank of clinical free text to help create, train, and evaluate NLP for clinical research and to inform the potential next steps for adopting a partner-led approach to establish a national, funded databank of free text for use by the research community. METHODS: Web-based in-depth focus group interviews were conducted with 4 stakeholder groups (patients and members of the public, clinicians, information governance leads and research ethics members, and NLP researchers). RESULTS: All stakeholder groups were strongly in favor of the databank and saw great value in creating an environment where NLP tools can be tested and trained to improve their accuracy. Participants highlighted a range of complex issues for consideration as the databank is developed, including communicating the intended purpose, the approach to access and safeguarding the data, who should have access, and how to fund the databank. Participants recommended that a small-scale, gradual approach be adopted to start to gather donations and encouraged further engagement with stakeholders to develop a road map and set of standards for the databank. CONCLUSIONS: These findings provide a clear mandate to begin developing the databank and a framework for stakeholder expectations, which we would aim to meet with the databank delivery. JMIR Publications 2023-05-03 /pmc/articles/PMC10193205/ /pubmed/37133927 http://dx.doi.org/10.2196/45534 Text en ©Natalie K Fitzpatrick, Richard Dobson, Angus Roberts, Kerina Jones, Anoop D Shah, Goran Nenadic, Elizabeth Ford. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 03.05.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Fitzpatrick, Natalie K Dobson, Richard Roberts, Angus Jones, Kerina Shah, Anoop D Nenadic, Goran Ford, Elizabeth Understanding Views Around the Creation of a Consented, Donated Databank of Clinical Free Text to Develop and Train Natural Language Processing Models for Research: Focus Group Interviews With Stakeholders
title	Understanding Views Around the Creation of a Consented, Donated Databank of Clinical Free Text to Develop and Train Natural Language Processing Models for Research: Focus Group Interviews With Stakeholders
title_full	Understanding Views Around the Creation of a Consented, Donated Databank of Clinical Free Text to Develop and Train Natural Language Processing Models for Research: Focus Group Interviews With Stakeholders
title_fullStr	Understanding Views Around the Creation of a Consented, Donated Databank of Clinical Free Text to Develop and Train Natural Language Processing Models for Research: Focus Group Interviews With Stakeholders
title_full_unstemmed	Understanding Views Around the Creation of a Consented, Donated Databank of Clinical Free Text to Develop and Train Natural Language Processing Models for Research: Focus Group Interviews With Stakeholders
title_short	Understanding Views Around the Creation of a Consented, Donated Databank of Clinical Free Text to Develop and Train Natural Language Processing Models for Research: Focus Group Interviews With Stakeholders
title_sort	understanding views around the creation of a consented, donated databank of clinical free text to develop and train natural language processing models for research: focus group interviews with stakeholders
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193205/ https://www.ncbi.nlm.nih.gov/pubmed/37133927 http://dx.doi.org/10.2196/45534
work_keys_str_mv	AT fitzpatricknataliek understandingviewsaroundthecreationofaconsenteddonateddatabankofclinicalfreetexttodevelopandtrainnaturallanguageprocessingmodelsforresearchfocusgroupinterviewswithstakeholders AT dobsonrichard understandingviewsaroundthecreationofaconsenteddonateddatabankofclinicalfreetexttodevelopandtrainnaturallanguageprocessingmodelsforresearchfocusgroupinterviewswithstakeholders AT robertsangus understandingviewsaroundthecreationofaconsenteddonateddatabankofclinicalfreetexttodevelopandtrainnaturallanguageprocessingmodelsforresearchfocusgroupinterviewswithstakeholders AT joneskerina understandingviewsaroundthecreationofaconsenteddonateddatabankofclinicalfreetexttodevelopandtrainnaturallanguageprocessingmodelsforresearchfocusgroupinterviewswithstakeholders AT shahanoopd understandingviewsaroundthecreationofaconsenteddonateddatabankofclinicalfreetexttodevelopandtrainnaturallanguageprocessingmodelsforresearchfocusgroupinterviewswithstakeholders AT nenadicgoran understandingviewsaroundthecreationofaconsenteddonateddatabankofclinicalfreetexttodevelopandtrainnaturallanguageprocessingmodelsforresearchfocusgroupinterviewswithstakeholders AT fordelizabeth understandingviewsaroundthecreationofaconsenteddonateddatabankofclinicalfreetexttodevelopandtrainnaturallanguageprocessingmodelsforresearchfocusgroupinterviewswithstakeholders

Understanding Views Around the Creation of a Consented, Donated Databank of Clinical Free Text to Develop and Train Natural Language Processing Models for Research: Focus Group Interviews With Stakeholders

Ejemplares similares