Cargando…
Machine Learning and Data Mining Methods in Diabetes Research
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in bio...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5257026/ https://www.ncbi.nlm.nih.gov/pubmed/28138367 http://dx.doi.org/10.1016/j.csbj.2016.12.005 |
_version_ | 1782498797914423296 |
---|---|
author | Kavakiotis, Ioannis Tsave, Olga Salifoglou, Athanasios Maglaveras, Nicos Vlahavas, Ioannis Chouvarda, Ioanna |
author_facet | Kavakiotis, Ioannis Tsave, Olga Salifoglou, Athanasios Maglaveras, Nicos Vlahavas, Ioannis Chouvarda, Ioanna |
author_sort | Kavakiotis, Ioannis |
collection | PubMed |
description | The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM. |
format | Online Article Text |
id | pubmed-5257026 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-52570262017-01-30 Machine Learning and Data Mining Methods in Diabetes Research Kavakiotis, Ioannis Tsave, Olga Salifoglou, Athanasios Maglaveras, Nicos Vlahavas, Ioannis Chouvarda, Ioanna Comput Struct Biotechnol J Review Article The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM. Research Network of Computational and Structural Biotechnology 2017-01-08 /pmc/articles/PMC5257026/ /pubmed/28138367 http://dx.doi.org/10.1016/j.csbj.2016.12.005 Text en © 2017 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Review Article Kavakiotis, Ioannis Tsave, Olga Salifoglou, Athanasios Maglaveras, Nicos Vlahavas, Ioannis Chouvarda, Ioanna Machine Learning and Data Mining Methods in Diabetes Research |
title | Machine Learning and Data Mining Methods in Diabetes Research |
title_full | Machine Learning and Data Mining Methods in Diabetes Research |
title_fullStr | Machine Learning and Data Mining Methods in Diabetes Research |
title_full_unstemmed | Machine Learning and Data Mining Methods in Diabetes Research |
title_short | Machine Learning and Data Mining Methods in Diabetes Research |
title_sort | machine learning and data mining methods in diabetes research |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5257026/ https://www.ncbi.nlm.nih.gov/pubmed/28138367 http://dx.doi.org/10.1016/j.csbj.2016.12.005 |
work_keys_str_mv | AT kavakiotisioannis machinelearninganddataminingmethodsindiabetesresearch AT tsaveolga machinelearninganddataminingmethodsindiabetesresearch AT salifoglouathanasios machinelearninganddataminingmethodsindiabetesresearch AT maglaverasnicos machinelearninganddataminingmethodsindiabetesresearch AT vlahavasioannis machinelearninganddataminingmethodsindiabetesresearch AT chouvardaioanna machinelearninganddataminingmethodsindiabetesresearch |