Cargando…
Disorder recognition in clinical texts using multi-label structured SVM
BACKGROUND: Information extraction in clinical texts enables medical workers to find out problems of patients faster as well as makes intelligent diagnosis possible in the future. There has been a lot of work about disorder mention recognition in clinical narratives. But recognition of some more com...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5282630/ https://www.ncbi.nlm.nih.gov/pubmed/28143488 http://dx.doi.org/10.1186/s12859-017-1476-4 |
_version_ | 1782503360065175552 |
---|---|
author | Lin, Wutao Ji, Donghong Lu, Yanan |
author_facet | Lin, Wutao Ji, Donghong Lu, Yanan |
author_sort | Lin, Wutao |
collection | PubMed |
description | BACKGROUND: Information extraction in clinical texts enables medical workers to find out problems of patients faster as well as makes intelligent diagnosis possible in the future. There has been a lot of work about disorder mention recognition in clinical narratives. But recognition of some more complicated disorder mentions like overlapping ones is still an open issue. This paper proposes a multi-label structured Support Vector Machine (SVM) based method for disorder mention recognition. We present a multi-label scheme which could be used in complicated entity recognition tasks. RESULTS: We performed three sets of experiments to evaluate our model. Our best F(1)-Score on the 2013 Conference and Labs of the Evaluation Forum data set is 0.7343. There are six types of labels in our multi-label scheme, all of which are represented by 24-bit binary numbers. The binary digits of each label contain information about different disorder mentions. Our multi-label method can recognize not only disorder mentions in the form of contiguous or discontiguous words but also mentions whose spans overlap with each other. The experiments indicate that our multi-label structured SVM model outperforms the condition random field (CRF) model for this disorder mention recognition task. The experiments show that our multi-label scheme surpasses the baseline. Especially for overlapping disorder mentions, the F(1)-Score of our multi-label scheme is 0.1428 higher than the baseline BIOHD1234 scheme. CONCLUSIONS: This multi-label structured SVM based approach is demonstrated to work well with this disorder recognition task. The novel multi-label scheme we presented is superior to the baseline and it can be used in other models to solve various types of complicated entity recognition tasks as well. |
format | Online Article Text |
id | pubmed-5282630 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52826302017-02-03 Disorder recognition in clinical texts using multi-label structured SVM Lin, Wutao Ji, Donghong Lu, Yanan BMC Bioinformatics Methodology Article BACKGROUND: Information extraction in clinical texts enables medical workers to find out problems of patients faster as well as makes intelligent diagnosis possible in the future. There has been a lot of work about disorder mention recognition in clinical narratives. But recognition of some more complicated disorder mentions like overlapping ones is still an open issue. This paper proposes a multi-label structured Support Vector Machine (SVM) based method for disorder mention recognition. We present a multi-label scheme which could be used in complicated entity recognition tasks. RESULTS: We performed three sets of experiments to evaluate our model. Our best F(1)-Score on the 2013 Conference and Labs of the Evaluation Forum data set is 0.7343. There are six types of labels in our multi-label scheme, all of which are represented by 24-bit binary numbers. The binary digits of each label contain information about different disorder mentions. Our multi-label method can recognize not only disorder mentions in the form of contiguous or discontiguous words but also mentions whose spans overlap with each other. The experiments indicate that our multi-label structured SVM model outperforms the condition random field (CRF) model for this disorder mention recognition task. The experiments show that our multi-label scheme surpasses the baseline. Especially for overlapping disorder mentions, the F(1)-Score of our multi-label scheme is 0.1428 higher than the baseline BIOHD1234 scheme. CONCLUSIONS: This multi-label structured SVM based approach is demonstrated to work well with this disorder recognition task. The novel multi-label scheme we presented is superior to the baseline and it can be used in other models to solve various types of complicated entity recognition tasks as well. BioMed Central 2017-01-31 /pmc/articles/PMC5282630/ /pubmed/28143488 http://dx.doi.org/10.1186/s12859-017-1476-4 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Lin, Wutao Ji, Donghong Lu, Yanan Disorder recognition in clinical texts using multi-label structured SVM |
title | Disorder recognition in clinical texts using multi-label structured SVM |
title_full | Disorder recognition in clinical texts using multi-label structured SVM |
title_fullStr | Disorder recognition in clinical texts using multi-label structured SVM |
title_full_unstemmed | Disorder recognition in clinical texts using multi-label structured SVM |
title_short | Disorder recognition in clinical texts using multi-label structured SVM |
title_sort | disorder recognition in clinical texts using multi-label structured svm |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5282630/ https://www.ncbi.nlm.nih.gov/pubmed/28143488 http://dx.doi.org/10.1186/s12859-017-1476-4 |
work_keys_str_mv | AT linwutao disorderrecognitioninclinicaltextsusingmultilabelstructuredsvm AT jidonghong disorderrecognitioninclinicaltextsusingmultilabelstructuredsvm AT luyanan disorderrecognitioninclinicaltextsusingmultilabelstructuredsvm |