Cargando…
A survey on clinical natural language processing in the United Kingdom from 2007 to 2022
Much of the knowledge and information needed for enabling high-quality clinical research is stored in free-text format. Natural language processing (NLP) has been used to extract information from these sources at scale for several decades. This paper aims to present a comprehensive review of clinica...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9770568/ https://www.ncbi.nlm.nih.gov/pubmed/36544046 http://dx.doi.org/10.1038/s41746-022-00730-6 |
_version_ | 1784854630647201792 |
---|---|
author | Wu, Honghan Wang, Minhong Wu, Jinge Francis, Farah Chang, Yun-Hsuan Shavick, Alex Dong, Hang Poon, Michael T. C. Fitzpatrick, Natalie Levine, Adam P. Slater, Luke T. Handy, Alex Karwath, Andreas Gkoutos, Georgios V. Chelala, Claude Shah, Anoop Dinesh Stewart, Robert Collier, Nigel Alex, Beatrice Whiteley, William Sudlow, Cathie Roberts, Angus Dobson, Richard J. B. |
author_facet | Wu, Honghan Wang, Minhong Wu, Jinge Francis, Farah Chang, Yun-Hsuan Shavick, Alex Dong, Hang Poon, Michael T. C. Fitzpatrick, Natalie Levine, Adam P. Slater, Luke T. Handy, Alex Karwath, Andreas Gkoutos, Georgios V. Chelala, Claude Shah, Anoop Dinesh Stewart, Robert Collier, Nigel Alex, Beatrice Whiteley, William Sudlow, Cathie Roberts, Angus Dobson, Richard J. B. |
author_sort | Wu, Honghan |
collection | PubMed |
description | Much of the knowledge and information needed for enabling high-quality clinical research is stored in free-text format. Natural language processing (NLP) has been used to extract information from these sources at scale for several decades. This paper aims to present a comprehensive review of clinical NLP for the past 15 years in the UK to identify the community, depict its evolution, analyse methodologies and applications, and identify the main barriers. We collect a dataset of clinical NLP projects (n = 94; £ = 41.97 m) funded by UK funders or the European Union’s funding programmes. Additionally, we extract details on 9 funders, 137 organisations, 139 persons and 431 research papers. Networks are created from timestamped data interlinking all entities, and network analysis is subsequently applied to generate insights. 431 publications are identified as part of a literature review, of which 107 are eligible for final analysis. Results show, not surprisingly, clinical NLP in the UK has increased substantially in the last 15 years: the total budget in the period of 2019–2022 was 80 times that of 2007–2010. However, the effort is required to deepen areas such as disease (sub-)phenotyping and broaden application domains. There is also a need to improve links between academia and industry and enable deployments in real-world settings for the realisation of clinical NLP’s great potential in care delivery. The major barriers include research and development access to hospital data, lack of capable computational resources in the right places, the scarcity of labelled data and barriers to sharing of pretrained models. |
format | Online Article Text |
id | pubmed-9770568 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-97705682022-12-22 A survey on clinical natural language processing in the United Kingdom from 2007 to 2022 Wu, Honghan Wang, Minhong Wu, Jinge Francis, Farah Chang, Yun-Hsuan Shavick, Alex Dong, Hang Poon, Michael T. C. Fitzpatrick, Natalie Levine, Adam P. Slater, Luke T. Handy, Alex Karwath, Andreas Gkoutos, Georgios V. Chelala, Claude Shah, Anoop Dinesh Stewart, Robert Collier, Nigel Alex, Beatrice Whiteley, William Sudlow, Cathie Roberts, Angus Dobson, Richard J. B. NPJ Digit Med Review Article Much of the knowledge and information needed for enabling high-quality clinical research is stored in free-text format. Natural language processing (NLP) has been used to extract information from these sources at scale for several decades. This paper aims to present a comprehensive review of clinical NLP for the past 15 years in the UK to identify the community, depict its evolution, analyse methodologies and applications, and identify the main barriers. We collect a dataset of clinical NLP projects (n = 94; £ = 41.97 m) funded by UK funders or the European Union’s funding programmes. Additionally, we extract details on 9 funders, 137 organisations, 139 persons and 431 research papers. Networks are created from timestamped data interlinking all entities, and network analysis is subsequently applied to generate insights. 431 publications are identified as part of a literature review, of which 107 are eligible for final analysis. Results show, not surprisingly, clinical NLP in the UK has increased substantially in the last 15 years: the total budget in the period of 2019–2022 was 80 times that of 2007–2010. However, the effort is required to deepen areas such as disease (sub-)phenotyping and broaden application domains. There is also a need to improve links between academia and industry and enable deployments in real-world settings for the realisation of clinical NLP’s great potential in care delivery. The major barriers include research and development access to hospital data, lack of capable computational resources in the right places, the scarcity of labelled data and barriers to sharing of pretrained models. Nature Publishing Group UK 2022-12-21 /pmc/articles/PMC9770568/ /pubmed/36544046 http://dx.doi.org/10.1038/s41746-022-00730-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Review Article Wu, Honghan Wang, Minhong Wu, Jinge Francis, Farah Chang, Yun-Hsuan Shavick, Alex Dong, Hang Poon, Michael T. C. Fitzpatrick, Natalie Levine, Adam P. Slater, Luke T. Handy, Alex Karwath, Andreas Gkoutos, Georgios V. Chelala, Claude Shah, Anoop Dinesh Stewart, Robert Collier, Nigel Alex, Beatrice Whiteley, William Sudlow, Cathie Roberts, Angus Dobson, Richard J. B. A survey on clinical natural language processing in the United Kingdom from 2007 to 2022 |
title | A survey on clinical natural language processing in the United Kingdom from 2007 to 2022 |
title_full | A survey on clinical natural language processing in the United Kingdom from 2007 to 2022 |
title_fullStr | A survey on clinical natural language processing in the United Kingdom from 2007 to 2022 |
title_full_unstemmed | A survey on clinical natural language processing in the United Kingdom from 2007 to 2022 |
title_short | A survey on clinical natural language processing in the United Kingdom from 2007 to 2022 |
title_sort | survey on clinical natural language processing in the united kingdom from 2007 to 2022 |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9770568/ https://www.ncbi.nlm.nih.gov/pubmed/36544046 http://dx.doi.org/10.1038/s41746-022-00730-6 |
work_keys_str_mv | AT wuhonghan asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT wangminhong asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT wujinge asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT francisfarah asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT changyunhsuan asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT shavickalex asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT donghang asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT poonmichaeltc asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT fitzpatricknatalie asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT levineadamp asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT slaterluket asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT handyalex asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT karwathandreas asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT gkoutosgeorgiosv asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT chelalaclaude asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT shahanoopdinesh asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT stewartrobert asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT colliernigel asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT alexbeatrice asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT whiteleywilliam asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT sudlowcathie asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT robertsangus asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT dobsonrichardjb asurveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT wuhonghan surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT wangminhong surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT wujinge surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT francisfarah surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT changyunhsuan surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT shavickalex surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT donghang surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT poonmichaeltc surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT fitzpatricknatalie surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT levineadamp surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT slaterluket surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT handyalex surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT karwathandreas surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT gkoutosgeorgiosv surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT chelalaclaude surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT shahanoopdinesh surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT stewartrobert surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT colliernigel surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT alexbeatrice surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT whiteleywilliam surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT sudlowcathie surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT robertsangus surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 AT dobsonrichardjb surveyonclinicalnaturallanguageprocessingintheunitedkingdomfrom2007to2022 |