Cargando…
Leveraging text skeleton for de-identification of electronic medical records
BACKGROUND: De-identification is the first step to use these records for data processing or further medical investigations in electronic medical records. Consequently, a reliable automated de-identification system would be of high value. METHODS: In this paper, a method of combining text skeleton an...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5872383/ https://www.ncbi.nlm.nih.gov/pubmed/29589571 http://dx.doi.org/10.1186/s12911-018-0598-6 |
_version_ | 1783309824467927040 |
---|---|
author | Zhao, Yue-Shu Zhang, Kun-Li Ma, Hong-Chao Li, Kun |
author_facet | Zhao, Yue-Shu Zhang, Kun-Li Ma, Hong-Chao Li, Kun |
author_sort | Zhao, Yue-Shu |
collection | PubMed |
description | BACKGROUND: De-identification is the first step to use these records for data processing or further medical investigations in electronic medical records. Consequently, a reliable automated de-identification system would be of high value. METHODS: In this paper, a method of combining text skeleton and recurrent neural network is proposed to solve the problem of de-identification. Text skeleton is the general structure of a medical record, which can help neural networks to learn better. RESULTS: We evaluated our method on three datasets involving two English datasets from i2b2 de-identification challenge and a Chinese dataset we annotated. Empirical results show that the text skeleton based method we proposed can help the network to recognize protected health information. CONCLUSIONS: The comparison between our method and state-of-the-art frameworks indicates that our method achieves high performance on the problem of medical record de-identification. |
format | Online Article Text |
id | pubmed-5872383 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58723832018-04-02 Leveraging text skeleton for de-identification of electronic medical records Zhao, Yue-Shu Zhang, Kun-Li Ma, Hong-Chao Li, Kun BMC Med Inform Decis Mak Research BACKGROUND: De-identification is the first step to use these records for data processing or further medical investigations in electronic medical records. Consequently, a reliable automated de-identification system would be of high value. METHODS: In this paper, a method of combining text skeleton and recurrent neural network is proposed to solve the problem of de-identification. Text skeleton is the general structure of a medical record, which can help neural networks to learn better. RESULTS: We evaluated our method on three datasets involving two English datasets from i2b2 de-identification challenge and a Chinese dataset we annotated. Empirical results show that the text skeleton based method we proposed can help the network to recognize protected health information. CONCLUSIONS: The comparison between our method and state-of-the-art frameworks indicates that our method achieves high performance on the problem of medical record de-identification. BioMed Central 2018-03-22 /pmc/articles/PMC5872383/ /pubmed/29589571 http://dx.doi.org/10.1186/s12911-018-0598-6 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zhao, Yue-Shu Zhang, Kun-Li Ma, Hong-Chao Li, Kun Leveraging text skeleton for de-identification of electronic medical records |
title | Leveraging text skeleton for de-identification of electronic medical records |
title_full | Leveraging text skeleton for de-identification of electronic medical records |
title_fullStr | Leveraging text skeleton for de-identification of electronic medical records |
title_full_unstemmed | Leveraging text skeleton for de-identification of electronic medical records |
title_short | Leveraging text skeleton for de-identification of electronic medical records |
title_sort | leveraging text skeleton for de-identification of electronic medical records |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5872383/ https://www.ncbi.nlm.nih.gov/pubmed/29589571 http://dx.doi.org/10.1186/s12911-018-0598-6 |
work_keys_str_mv | AT zhaoyueshu leveragingtextskeletonfordeidentificationofelectronicmedicalrecords AT zhangkunli leveragingtextskeletonfordeidentificationofelectronicmedicalrecords AT mahongchao leveragingtextskeletonfordeidentificationofelectronicmedicalrecords AT likun leveragingtextskeletonfordeidentificationofelectronicmedicalrecords |