Cargando…

Comparison of Word Embeddings for Extraction from Medical Records

This paper is an extension of the work originally presented in the 16th International Conference on Wearable, Micro and Nano Technologies for Personalized Health. Despite using electronic medical records, free narrative text is still widely used for medical records. To make data from texts available...

Descripción completa

Detalles Bibliográficos
Autores principales: Dudchenko, Aleksei, Kopanitsa, Georgy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6888408/
https://www.ncbi.nlm.nih.gov/pubmed/31717300
http://dx.doi.org/10.3390/ijerph16224360
_version_ 1783475223891279872
author Dudchenko, Aleksei
Kopanitsa, Georgy
author_facet Dudchenko, Aleksei
Kopanitsa, Georgy
author_sort Dudchenko, Aleksei
collection PubMed
description This paper is an extension of the work originally presented in the 16th International Conference on Wearable, Micro and Nano Technologies for Personalized Health. Despite using electronic medical records, free narrative text is still widely used for medical records. To make data from texts available for decision support systems, supervised machine learning algorithms might be successfully applied. In this work, we developed and compared a prototype of a medical data extraction system based on different artificial neural network architectures to process free medical texts in the Russian language. Three classifiers were applied to extract entities from snippets of text. Multi-layer perceptron (MLP) and convolutional neural network (CNN) classifiers showed similar results to all three embedding models. MLP exceeded convolutional network on pipelines that used the embedding model trained on medical records with preliminary lemmatization. Nevertheless, the highest F-score was achieved by CNN. CNN slightly exceeded MLP when the biggest word2vec model was applied (F-score 0.9763).
format Online
Article
Text
id pubmed-6888408
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-68884082019-12-09 Comparison of Word Embeddings for Extraction from Medical Records Dudchenko, Aleksei Kopanitsa, Georgy Int J Environ Res Public Health Article This paper is an extension of the work originally presented in the 16th International Conference on Wearable, Micro and Nano Technologies for Personalized Health. Despite using electronic medical records, free narrative text is still widely used for medical records. To make data from texts available for decision support systems, supervised machine learning algorithms might be successfully applied. In this work, we developed and compared a prototype of a medical data extraction system based on different artificial neural network architectures to process free medical texts in the Russian language. Three classifiers were applied to extract entities from snippets of text. Multi-layer perceptron (MLP) and convolutional neural network (CNN) classifiers showed similar results to all three embedding models. MLP exceeded convolutional network on pipelines that used the embedding model trained on medical records with preliminary lemmatization. Nevertheless, the highest F-score was achieved by CNN. CNN slightly exceeded MLP when the biggest word2vec model was applied (F-score 0.9763). MDPI 2019-11-08 2019-11 /pmc/articles/PMC6888408/ /pubmed/31717300 http://dx.doi.org/10.3390/ijerph16224360 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Dudchenko, Aleksei
Kopanitsa, Georgy
Comparison of Word Embeddings for Extraction from Medical Records
title Comparison of Word Embeddings for Extraction from Medical Records
title_full Comparison of Word Embeddings for Extraction from Medical Records
title_fullStr Comparison of Word Embeddings for Extraction from Medical Records
title_full_unstemmed Comparison of Word Embeddings for Extraction from Medical Records
title_short Comparison of Word Embeddings for Extraction from Medical Records
title_sort comparison of word embeddings for extraction from medical records
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6888408/
https://www.ncbi.nlm.nih.gov/pubmed/31717300
http://dx.doi.org/10.3390/ijerph16224360
work_keys_str_mv AT dudchenkoaleksei comparisonofwordembeddingsforextractionfrommedicalrecords
AT kopanitsageorgy comparisonofwordembeddingsforextractionfrommedicalrecords