Cargando…

Leveraging text skeleton for de-identification of electronic medical records

BACKGROUND: De-identification is the first step to use these records for data processing or further medical investigations in electronic medical records. Consequently, a reliable automated de-identification system would be of high value. METHODS: In this paper, a method of combining text skeleton an...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Yue-Shu, Zhang, Kun-Li, Ma, Hong-Chao, Li, Kun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5872383/
https://www.ncbi.nlm.nih.gov/pubmed/29589571
http://dx.doi.org/10.1186/s12911-018-0598-6
Descripción
Sumario:BACKGROUND: De-identification is the first step to use these records for data processing or further medical investigations in electronic medical records. Consequently, a reliable automated de-identification system would be of high value. METHODS: In this paper, a method of combining text skeleton and recurrent neural network is proposed to solve the problem of de-identification. Text skeleton is the general structure of a medical record, which can help neural networks to learn better. RESULTS: We evaluated our method on three datasets involving two English datasets from i2b2 de-identification challenge and a Chinese dataset we annotated. Empirical results show that the text skeleton based method we proposed can help the network to recognize protected health information. CONCLUSIONS: The comparison between our method and state-of-the-art frameworks indicates that our method achieves high performance on the problem of medical record de-identification.