Cargando…

Data Processing and Text Mining Technologies on Electronic Medical Records: A Review

Currently, medical institutes generally use EMR to record patient's condition, including diagnostic information, procedures performed, and treatment results. EMR has been recognized as a valuable resource for large-scale analysis. However, EMR has the characteristics of diversity, incompletenes...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Wencheng, Cai, Zhiping, Li, Yangyang, Liu, Fang, Fang, Shengqun, Wang, Guoyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5911323/
https://www.ncbi.nlm.nih.gov/pubmed/29849998
http://dx.doi.org/10.1155/2018/4302425
_version_ 1783316192372457472
author Sun, Wencheng
Cai, Zhiping
Li, Yangyang
Liu, Fang
Fang, Shengqun
Wang, Guoyan
author_facet Sun, Wencheng
Cai, Zhiping
Li, Yangyang
Liu, Fang
Fang, Shengqun
Wang, Guoyan
author_sort Sun, Wencheng
collection PubMed
description Currently, medical institutes generally use EMR to record patient's condition, including diagnostic information, procedures performed, and treatment results. EMR has been recognized as a valuable resource for large-scale analysis. However, EMR has the characteristics of diversity, incompleteness, redundancy, and privacy, which make it difficult to carry out data mining and analysis directly. Therefore, it is necessary to preprocess the source data in order to improve data quality and improve the data mining results. Different types of data require different processing technologies. Most structured data commonly needs classic preprocessing technologies, including data cleansing, data integration, data transformation, and data reduction. For semistructured or unstructured data, such as medical text, containing more health information, it requires more complex and challenging processing methods. The task of information extraction for medical texts mainly includes NER (named-entity recognition) and RE (relation extraction). This paper focuses on the process of EMR processing and emphatically analyzes the key techniques. In addition, we make an in-depth study on the applications developed based on text mining together with the open challenges and research issues for future work.
format Online
Article
Text
id pubmed-5911323
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-59113232018-05-30 Data Processing and Text Mining Technologies on Electronic Medical Records: A Review Sun, Wencheng Cai, Zhiping Li, Yangyang Liu, Fang Fang, Shengqun Wang, Guoyan J Healthc Eng Review Article Currently, medical institutes generally use EMR to record patient's condition, including diagnostic information, procedures performed, and treatment results. EMR has been recognized as a valuable resource for large-scale analysis. However, EMR has the characteristics of diversity, incompleteness, redundancy, and privacy, which make it difficult to carry out data mining and analysis directly. Therefore, it is necessary to preprocess the source data in order to improve data quality and improve the data mining results. Different types of data require different processing technologies. Most structured data commonly needs classic preprocessing technologies, including data cleansing, data integration, data transformation, and data reduction. For semistructured or unstructured data, such as medical text, containing more health information, it requires more complex and challenging processing methods. The task of information extraction for medical texts mainly includes NER (named-entity recognition) and RE (relation extraction). This paper focuses on the process of EMR processing and emphatically analyzes the key techniques. In addition, we make an in-depth study on the applications developed based on text mining together with the open challenges and research issues for future work. Hindawi 2018-04-08 /pmc/articles/PMC5911323/ /pubmed/29849998 http://dx.doi.org/10.1155/2018/4302425 Text en Copyright © 2018 Wencheng Sun et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review Article
Sun, Wencheng
Cai, Zhiping
Li, Yangyang
Liu, Fang
Fang, Shengqun
Wang, Guoyan
Data Processing and Text Mining Technologies on Electronic Medical Records: A Review
title Data Processing and Text Mining Technologies on Electronic Medical Records: A Review
title_full Data Processing and Text Mining Technologies on Electronic Medical Records: A Review
title_fullStr Data Processing and Text Mining Technologies on Electronic Medical Records: A Review
title_full_unstemmed Data Processing and Text Mining Technologies on Electronic Medical Records: A Review
title_short Data Processing and Text Mining Technologies on Electronic Medical Records: A Review
title_sort data processing and text mining technologies on electronic medical records: a review
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5911323/
https://www.ncbi.nlm.nih.gov/pubmed/29849998
http://dx.doi.org/10.1155/2018/4302425
work_keys_str_mv AT sunwencheng dataprocessingandtextminingtechnologiesonelectronicmedicalrecordsareview
AT caizhiping dataprocessingandtextminingtechnologiesonelectronicmedicalrecordsareview
AT liyangyang dataprocessingandtextminingtechnologiesonelectronicmedicalrecordsareview
AT liufang dataprocessingandtextminingtechnologiesonelectronicmedicalrecordsareview
AT fangshengqun dataprocessingandtextminingtechnologiesonelectronicmedicalrecordsareview
AT wangguoyan dataprocessingandtextminingtechnologiesonelectronicmedicalrecordsareview