Cargando…

Machine Learning and Data Mining Methods in Diabetes Research

The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in bio...

Descripción completa

Detalles Bibliográficos
Autores principales: Kavakiotis, Ioannis, Tsave, Olga, Salifoglou, Athanasios, Maglaveras, Nicos, Vlahavas, Ioannis, Chouvarda, Ioanna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5257026/
https://www.ncbi.nlm.nih.gov/pubmed/28138367
http://dx.doi.org/10.1016/j.csbj.2016.12.005
_version_ 1782498797914423296
author Kavakiotis, Ioannis
Tsave, Olga
Salifoglou, Athanasios
Maglaveras, Nicos
Vlahavas, Ioannis
Chouvarda, Ioanna
author_facet Kavakiotis, Ioannis
Tsave, Olga
Salifoglou, Athanasios
Maglaveras, Nicos
Vlahavas, Ioannis
Chouvarda, Ioanna
author_sort Kavakiotis, Ioannis
collection PubMed
description The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM.
format Online
Article
Text
id pubmed-5257026
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-52570262017-01-30 Machine Learning and Data Mining Methods in Diabetes Research Kavakiotis, Ioannis Tsave, Olga Salifoglou, Athanasios Maglaveras, Nicos Vlahavas, Ioannis Chouvarda, Ioanna Comput Struct Biotechnol J Review Article The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM. Research Network of Computational and Structural Biotechnology 2017-01-08 /pmc/articles/PMC5257026/ /pubmed/28138367 http://dx.doi.org/10.1016/j.csbj.2016.12.005 Text en © 2017 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review Article
Kavakiotis, Ioannis
Tsave, Olga
Salifoglou, Athanasios
Maglaveras, Nicos
Vlahavas, Ioannis
Chouvarda, Ioanna
Machine Learning and Data Mining Methods in Diabetes Research
title Machine Learning and Data Mining Methods in Diabetes Research
title_full Machine Learning and Data Mining Methods in Diabetes Research
title_fullStr Machine Learning and Data Mining Methods in Diabetes Research
title_full_unstemmed Machine Learning and Data Mining Methods in Diabetes Research
title_short Machine Learning and Data Mining Methods in Diabetes Research
title_sort machine learning and data mining methods in diabetes research
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5257026/
https://www.ncbi.nlm.nih.gov/pubmed/28138367
http://dx.doi.org/10.1016/j.csbj.2016.12.005
work_keys_str_mv AT kavakiotisioannis machinelearninganddataminingmethodsindiabetesresearch
AT tsaveolga machinelearninganddataminingmethodsindiabetesresearch
AT salifoglouathanasios machinelearninganddataminingmethodsindiabetesresearch
AT maglaverasnicos machinelearninganddataminingmethodsindiabetesresearch
AT vlahavasioannis machinelearninganddataminingmethodsindiabetesresearch
AT chouvardaioanna machinelearninganddataminingmethodsindiabetesresearch