Cargando…
Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease
Interest in research involving health-medical information analysis based on artificial intelligence, especially for deep learning techniques, has recently been increasing. Most of the research in this field has been focused on searching for new knowledge for predicting and diagnosing disease by reve...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6121271/ https://www.ncbi.nlm.nih.gov/pubmed/30111710 http://dx.doi.org/10.3390/ijerph15081750 |
_version_ | 1783352428640337920 |
---|---|
author | Kim, Seonho Kim, Jungjoon Chun, Hong-Woo |
author_facet | Kim, Seonho Kim, Jungjoon Chun, Hong-Woo |
author_sort | Kim, Seonho |
collection | PubMed |
description | Interest in research involving health-medical information analysis based on artificial intelligence, especially for deep learning techniques, has recently been increasing. Most of the research in this field has been focused on searching for new knowledge for predicting and diagnosing disease by revealing the relation between disease and various information features of data. These features are extracted by analyzing various clinical pathology data, such as EHR (electronic health records), and academic literature using the techniques of data analysis, natural language processing, etc. However, still needed are more research and interest in applying the latest advanced artificial intelligence-based data analysis technique to bio-signal data, which are continuous physiological records, such as EEG (electroencephalography) and ECG (electrocardiogram). Unlike the other types of data, applying deep learning to bio-signal data, which is in the form of time series of real numbers, has many issues that need to be resolved in preprocessing, learning, and analysis. Such issues include leaving feature selection, learning parts that are black boxes, difficulties in recognizing and identifying effective features, high computational complexities, etc. In this paper, to solve these issues, we provide an encoding-based Wave2vec time series classifier model, which combines signal-processing and deep learning-based natural language processing techniques. To demonstrate its advantages, we provide the results of three experiments conducted with EEG data of the University of California Irvine, which are a real-world benchmark bio-signal dataset. After converting the bio-signals (in the form of waves), which are a real number time series, into a sequence of symbols or a sequence of wavelet patterns that are converted into symbols, through encoding, the proposed model vectorizes the symbols by learning the sequence using deep learning-based natural language processing. The models of each class can be constructed through learning from the vectorized wavelet patterns and training data. The implemented models can be used for prediction and diagnosis of diseases by classifying the new data. The proposed method enhanced data readability and intuition of feature selection and learning processes by converting the time series of real number data into sequences of symbols. In addition, it facilitates intuitive and easy recognition, and identification of influential patterns. Furthermore, real-time large-capacity data analysis is facilitated, which is essential in the development of real-time analysis diagnosis systems, by drastically reducing the complexity of calculation without deterioration of analysis performance by data simplification through the encoding process. |
format | Online Article Text |
id | pubmed-6121271 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-61212712018-09-07 Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease Kim, Seonho Kim, Jungjoon Chun, Hong-Woo Int J Environ Res Public Health Article Interest in research involving health-medical information analysis based on artificial intelligence, especially for deep learning techniques, has recently been increasing. Most of the research in this field has been focused on searching for new knowledge for predicting and diagnosing disease by revealing the relation between disease and various information features of data. These features are extracted by analyzing various clinical pathology data, such as EHR (electronic health records), and academic literature using the techniques of data analysis, natural language processing, etc. However, still needed are more research and interest in applying the latest advanced artificial intelligence-based data analysis technique to bio-signal data, which are continuous physiological records, such as EEG (electroencephalography) and ECG (electrocardiogram). Unlike the other types of data, applying deep learning to bio-signal data, which is in the form of time series of real numbers, has many issues that need to be resolved in preprocessing, learning, and analysis. Such issues include leaving feature selection, learning parts that are black boxes, difficulties in recognizing and identifying effective features, high computational complexities, etc. In this paper, to solve these issues, we provide an encoding-based Wave2vec time series classifier model, which combines signal-processing and deep learning-based natural language processing techniques. To demonstrate its advantages, we provide the results of three experiments conducted with EEG data of the University of California Irvine, which are a real-world benchmark bio-signal dataset. After converting the bio-signals (in the form of waves), which are a real number time series, into a sequence of symbols or a sequence of wavelet patterns that are converted into symbols, through encoding, the proposed model vectorizes the symbols by learning the sequence using deep learning-based natural language processing. The models of each class can be constructed through learning from the vectorized wavelet patterns and training data. The implemented models can be used for prediction and diagnosis of diseases by classifying the new data. The proposed method enhanced data readability and intuition of feature selection and learning processes by converting the time series of real number data into sequences of symbols. In addition, it facilitates intuitive and easy recognition, and identification of influential patterns. Furthermore, real-time large-capacity data analysis is facilitated, which is essential in the development of real-time analysis diagnosis systems, by drastically reducing the complexity of calculation without deterioration of analysis performance by data simplification through the encoding process. MDPI 2018-08-15 2018-08 /pmc/articles/PMC6121271/ /pubmed/30111710 http://dx.doi.org/10.3390/ijerph15081750 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kim, Seonho Kim, Jungjoon Chun, Hong-Woo Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease |
title | Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease |
title_full | Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease |
title_fullStr | Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease |
title_full_unstemmed | Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease |
title_short | Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease |
title_sort | wave2vec: vectorizing electroencephalography bio-signal for prediction of brain disease |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6121271/ https://www.ncbi.nlm.nih.gov/pubmed/30111710 http://dx.doi.org/10.3390/ijerph15081750 |
work_keys_str_mv | AT kimseonho wave2vecvectorizingelectroencephalographybiosignalforpredictionofbraindisease AT kimjungjoon wave2vecvectorizingelectroencephalographybiosignalforpredictionofbraindisease AT chunhongwoo wave2vecvectorizingelectroencephalographybiosignalforpredictionofbraindisease |