Cargando…

Disorder recognition in clinical texts using multi-label structured SVM

BACKGROUND: Information extraction in clinical texts enables medical workers to find out problems of patients faster as well as makes intelligent diagnosis possible in the future. There has been a lot of work about disorder mention recognition in clinical narratives. But recognition of some more com...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Wutao, Ji, Donghong, Lu, Yanan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5282630/
https://www.ncbi.nlm.nih.gov/pubmed/28143488
http://dx.doi.org/10.1186/s12859-017-1476-4
_version_ 1782503360065175552
author Lin, Wutao
Ji, Donghong
Lu, Yanan
author_facet Lin, Wutao
Ji, Donghong
Lu, Yanan
author_sort Lin, Wutao
collection PubMed
description BACKGROUND: Information extraction in clinical texts enables medical workers to find out problems of patients faster as well as makes intelligent diagnosis possible in the future. There has been a lot of work about disorder mention recognition in clinical narratives. But recognition of some more complicated disorder mentions like overlapping ones is still an open issue. This paper proposes a multi-label structured Support Vector Machine (SVM) based method for disorder mention recognition. We present a multi-label scheme which could be used in complicated entity recognition tasks. RESULTS: We performed three sets of experiments to evaluate our model. Our best F(1)-Score on the 2013 Conference and Labs of the Evaluation Forum data set is 0.7343. There are six types of labels in our multi-label scheme, all of which are represented by 24-bit binary numbers. The binary digits of each label contain information about different disorder mentions. Our multi-label method can recognize not only disorder mentions in the form of contiguous or discontiguous words but also mentions whose spans overlap with each other. The experiments indicate that our multi-label structured SVM model outperforms the condition random field (CRF) model for this disorder mention recognition task. The experiments show that our multi-label scheme surpasses the baseline. Especially for overlapping disorder mentions, the F(1)-Score of our multi-label scheme is 0.1428 higher than the baseline BIOHD1234 scheme. CONCLUSIONS: This multi-label structured SVM based approach is demonstrated to work well with this disorder recognition task. The novel multi-label scheme we presented is superior to the baseline and it can be used in other models to solve various types of complicated entity recognition tasks as well.
format Online
Article
Text
id pubmed-5282630
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52826302017-02-03 Disorder recognition in clinical texts using multi-label structured SVM Lin, Wutao Ji, Donghong Lu, Yanan BMC Bioinformatics Methodology Article BACKGROUND: Information extraction in clinical texts enables medical workers to find out problems of patients faster as well as makes intelligent diagnosis possible in the future. There has been a lot of work about disorder mention recognition in clinical narratives. But recognition of some more complicated disorder mentions like overlapping ones is still an open issue. This paper proposes a multi-label structured Support Vector Machine (SVM) based method for disorder mention recognition. We present a multi-label scheme which could be used in complicated entity recognition tasks. RESULTS: We performed three sets of experiments to evaluate our model. Our best F(1)-Score on the 2013 Conference and Labs of the Evaluation Forum data set is 0.7343. There are six types of labels in our multi-label scheme, all of which are represented by 24-bit binary numbers. The binary digits of each label contain information about different disorder mentions. Our multi-label method can recognize not only disorder mentions in the form of contiguous or discontiguous words but also mentions whose spans overlap with each other. The experiments indicate that our multi-label structured SVM model outperforms the condition random field (CRF) model for this disorder mention recognition task. The experiments show that our multi-label scheme surpasses the baseline. Especially for overlapping disorder mentions, the F(1)-Score of our multi-label scheme is 0.1428 higher than the baseline BIOHD1234 scheme. CONCLUSIONS: This multi-label structured SVM based approach is demonstrated to work well with this disorder recognition task. The novel multi-label scheme we presented is superior to the baseline and it can be used in other models to solve various types of complicated entity recognition tasks as well. BioMed Central 2017-01-31 /pmc/articles/PMC5282630/ /pubmed/28143488 http://dx.doi.org/10.1186/s12859-017-1476-4 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Lin, Wutao
Ji, Donghong
Lu, Yanan
Disorder recognition in clinical texts using multi-label structured SVM
title Disorder recognition in clinical texts using multi-label structured SVM
title_full Disorder recognition in clinical texts using multi-label structured SVM
title_fullStr Disorder recognition in clinical texts using multi-label structured SVM
title_full_unstemmed Disorder recognition in clinical texts using multi-label structured SVM
title_short Disorder recognition in clinical texts using multi-label structured SVM
title_sort disorder recognition in clinical texts using multi-label structured svm
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5282630/
https://www.ncbi.nlm.nih.gov/pubmed/28143488
http://dx.doi.org/10.1186/s12859-017-1476-4
work_keys_str_mv AT linwutao disorderrecognitioninclinicaltextsusingmultilabelstructuredsvm
AT jidonghong disorderrecognitioninclinicaltextsusingmultilabelstructuredsvm
AT luyanan disorderrecognitioninclinicaltextsusingmultilabelstructuredsvm