Cargando…

Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks

Objectives:  To review the latest scientific challenges organized in clinical Natural Language Processing (NLP) by highlighting the tasks, the most effective methodologies used, the data, and the sharing strategies. Methods:  We harvested the literature by using Google Scholar and PubMed Central to...

Descripción completa

Detalles Bibliográficos
Autores principales: Filannino, Michele, Uzuner, Özlem
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Georg Thieme Verlag KG 2018
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6115235/
https://www.ncbi.nlm.nih.gov/pubmed/30157522
http://dx.doi.org/10.1055/s-0038-1667079
_version_ 1783351342653243392
author Filannino, Michele
Uzuner, Özlem
author_facet Filannino, Michele
Uzuner, Özlem
author_sort Filannino, Michele
collection PubMed
description Objectives:  To review the latest scientific challenges organized in clinical Natural Language Processing (NLP) by highlighting the tasks, the most effective methodologies used, the data, and the sharing strategies. Methods:  We harvested the literature by using Google Scholar and PubMed Central to retrieve all shared tasks organized since 2015 on clinical NLP problems on English data. Results:  We surveyed 17 shared tasks. We grouped the data into four types (synthetic, drug labels, social data, and clinical data) which are correlated with size and sensitivity. We found named entity recognition and classification to be the most common tasks. Most of the methods used to tackle the shared tasks have been data-driven. There is homogeneity in the methods used to tackle the named entity recognition tasks, while more diverse solutions are investigated for relation extraction, multi-class classification, and information retrieval problems. Conclusions:  There is a clear trend in using data-driven methods to tackle problems in clinical NLP. The availability of more and varied data from different institutions will undoubtedly lead to bigger advances in the field, for the benefit of healthcare as a whole.
format Online
Article
Text
id pubmed-6115235
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Georg Thieme Verlag KG
record_format MEDLINE/PubMed
spelling pubmed-61152352019-04-01 Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks Filannino, Michele Uzuner, Özlem Yearb Med Inform Objectives:  To review the latest scientific challenges organized in clinical Natural Language Processing (NLP) by highlighting the tasks, the most effective methodologies used, the data, and the sharing strategies. Methods:  We harvested the literature by using Google Scholar and PubMed Central to retrieve all shared tasks organized since 2015 on clinical NLP problems on English data. Results:  We surveyed 17 shared tasks. We grouped the data into four types (synthetic, drug labels, social data, and clinical data) which are correlated with size and sensitivity. We found named entity recognition and classification to be the most common tasks. Most of the methods used to tackle the shared tasks have been data-driven. There is homogeneity in the methods used to tackle the named entity recognition tasks, while more diverse solutions are investigated for relation extraction, multi-class classification, and information retrieval problems. Conclusions:  There is a clear trend in using data-driven methods to tackle problems in clinical NLP. The availability of more and varied data from different institutions will undoubtedly lead to bigger advances in the field, for the benefit of healthcare as a whole. Georg Thieme Verlag KG 2018-08 2018-08-29 /pmc/articles/PMC6115235/ /pubmed/30157522 http://dx.doi.org/10.1055/s-0038-1667079 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits unrestricted reproduction and distribution, for non-commercial purposes only; and use and reproduction, but not distribution, of adapted material for non-commercial purposes only, provided the original work is properly cited.
spellingShingle Filannino, Michele
Uzuner, Özlem
Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks
title Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks
title_full Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks
title_fullStr Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks
title_full_unstemmed Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks
title_short Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks
title_sort advancing the state of the art in clinical natural language processing through shared tasks
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6115235/
https://www.ncbi.nlm.nih.gov/pubmed/30157522
http://dx.doi.org/10.1055/s-0038-1667079
work_keys_str_mv AT filanninomichele advancingthestateoftheartinclinicalnaturallanguageprocessingthroughsharedtasks
AT uzunerozlem advancingthestateoftheartinclinicalnaturallanguageprocessingthroughsharedtasks