Cargando…
Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks
Objectives: To review the latest scientific challenges organized in clinical Natural Language Processing (NLP) by highlighting the tasks, the most effective methodologies used, the data, and the sharing strategies. Methods: We harvested the literature by using Google Scholar and PubMed Central to...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Georg Thieme Verlag KG
2018
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6115235/ https://www.ncbi.nlm.nih.gov/pubmed/30157522 http://dx.doi.org/10.1055/s-0038-1667079 |
_version_ | 1783351342653243392 |
---|---|
author | Filannino, Michele Uzuner, Özlem |
author_facet | Filannino, Michele Uzuner, Özlem |
author_sort | Filannino, Michele |
collection | PubMed |
description | Objectives: To review the latest scientific challenges organized in clinical Natural Language Processing (NLP) by highlighting the tasks, the most effective methodologies used, the data, and the sharing strategies. Methods: We harvested the literature by using Google Scholar and PubMed Central to retrieve all shared tasks organized since 2015 on clinical NLP problems on English data. Results: We surveyed 17 shared tasks. We grouped the data into four types (synthetic, drug labels, social data, and clinical data) which are correlated with size and sensitivity. We found named entity recognition and classification to be the most common tasks. Most of the methods used to tackle the shared tasks have been data-driven. There is homogeneity in the methods used to tackle the named entity recognition tasks, while more diverse solutions are investigated for relation extraction, multi-class classification, and information retrieval problems. Conclusions: There is a clear trend in using data-driven methods to tackle problems in clinical NLP. The availability of more and varied data from different institutions will undoubtedly lead to bigger advances in the field, for the benefit of healthcare as a whole. |
format | Online Article Text |
id | pubmed-6115235 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Georg Thieme Verlag KG |
record_format | MEDLINE/PubMed |
spelling | pubmed-61152352019-04-01 Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks Filannino, Michele Uzuner, Özlem Yearb Med Inform Objectives: To review the latest scientific challenges organized in clinical Natural Language Processing (NLP) by highlighting the tasks, the most effective methodologies used, the data, and the sharing strategies. Methods: We harvested the literature by using Google Scholar and PubMed Central to retrieve all shared tasks organized since 2015 on clinical NLP problems on English data. Results: We surveyed 17 shared tasks. We grouped the data into four types (synthetic, drug labels, social data, and clinical data) which are correlated with size and sensitivity. We found named entity recognition and classification to be the most common tasks. Most of the methods used to tackle the shared tasks have been data-driven. There is homogeneity in the methods used to tackle the named entity recognition tasks, while more diverse solutions are investigated for relation extraction, multi-class classification, and information retrieval problems. Conclusions: There is a clear trend in using data-driven methods to tackle problems in clinical NLP. The availability of more and varied data from different institutions will undoubtedly lead to bigger advances in the field, for the benefit of healthcare as a whole. Georg Thieme Verlag KG 2018-08 2018-08-29 /pmc/articles/PMC6115235/ /pubmed/30157522 http://dx.doi.org/10.1055/s-0038-1667079 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits unrestricted reproduction and distribution, for non-commercial purposes only; and use and reproduction, but not distribution, of adapted material for non-commercial purposes only, provided the original work is properly cited. |
spellingShingle | Filannino, Michele Uzuner, Özlem Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks |
title | Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks |
title_full | Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks |
title_fullStr | Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks |
title_full_unstemmed | Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks |
title_short | Advancing the State of the Art in Clinical Natural Language Processing through Shared Tasks |
title_sort | advancing the state of the art in clinical natural language processing through shared tasks |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6115235/ https://www.ncbi.nlm.nih.gov/pubmed/30157522 http://dx.doi.org/10.1055/s-0038-1667079 |
work_keys_str_mv | AT filanninomichele advancingthestateoftheartinclinicalnaturallanguageprocessingthroughsharedtasks AT uzunerozlem advancingthestateoftheartinclinicalnaturallanguageprocessingthroughsharedtasks |