Cargando…

Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine

In this article we introduce modern statistical machine learning and bioinformatics approaches that have been used in learning statistical relationships from big data in medicine and behavioral science that typically include clinical, genomic (and proteomic) and environmental variables. Every year,...

Descripción completa

Detalles Bibliográficos
Autores principales: Yoo, Changwon, Ramirez, Luis, Liuzzi, Juan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Continence Society 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4076480/
https://www.ncbi.nlm.nih.gov/pubmed/24987556
http://dx.doi.org/10.5213/inj.2014.18.2.50
_version_ 1782323489602011136
author Yoo, Changwon
Ramirez, Luis
Liuzzi, Juan
author_facet Yoo, Changwon
Ramirez, Luis
Liuzzi, Juan
author_sort Yoo, Changwon
collection PubMed
description In this article we introduce modern statistical machine learning and bioinformatics approaches that have been used in learning statistical relationships from big data in medicine and behavioral science that typically include clinical, genomic (and proteomic) and environmental variables. Every year, data collected from biomedical and behavioral science is getting larger and more complicated. Thus, in medicine, we also need to be aware of this trend and understand the statistical tools that are available to analyze these datasets. Many statistical analyses that are aimed to analyze such big datasets have been introduced recently. However, given many different types of clinical, genomic, and environmental data, it is rather uncommon to see statistical methods that combine knowledge resulting from those different data types. To this extent, we will introduce big data in terms of clinical data, single nucleotide polymorphism and gene expression studies and their interactions with environment. In this article, we will introduce the concept of well-known regression analyses such as linear and logistic regressions that has been widely used in clinical data analyses and modern statistical models such as Bayesian networks that has been introduced to analyze more complicated data. Also we will discuss how to represent the interaction among clinical, genomic, and environmental data in using modern statistical models. We conclude this article with a promising modern statistical method called Bayesian networks that is suitable in analyzing big data sets that consists with different type of large data from clinical, genomic, and environmental data. Such statistical model form big data will provide us with more comprehensive understanding of human physiology and disease.
format Online
Article
Text
id pubmed-4076480
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Korean Continence Society
record_format MEDLINE/PubMed
spelling pubmed-40764802014-07-01 Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine Yoo, Changwon Ramirez, Luis Liuzzi, Juan Int Neurourol J Review Article In this article we introduce modern statistical machine learning and bioinformatics approaches that have been used in learning statistical relationships from big data in medicine and behavioral science that typically include clinical, genomic (and proteomic) and environmental variables. Every year, data collected from biomedical and behavioral science is getting larger and more complicated. Thus, in medicine, we also need to be aware of this trend and understand the statistical tools that are available to analyze these datasets. Many statistical analyses that are aimed to analyze such big datasets have been introduced recently. However, given many different types of clinical, genomic, and environmental data, it is rather uncommon to see statistical methods that combine knowledge resulting from those different data types. To this extent, we will introduce big data in terms of clinical data, single nucleotide polymorphism and gene expression studies and their interactions with environment. In this article, we will introduce the concept of well-known regression analyses such as linear and logistic regressions that has been widely used in clinical data analyses and modern statistical models such as Bayesian networks that has been introduced to analyze more complicated data. Also we will discuss how to represent the interaction among clinical, genomic, and environmental data in using modern statistical models. We conclude this article with a promising modern statistical method called Bayesian networks that is suitable in analyzing big data sets that consists with different type of large data from clinical, genomic, and environmental data. Such statistical model form big data will provide us with more comprehensive understanding of human physiology and disease. Korean Continence Society 2014-06 2014-06-26 /pmc/articles/PMC4076480/ /pubmed/24987556 http://dx.doi.org/10.5213/inj.2014.18.2.50 Text en Copyright © 2014 Korean Continence Society http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review Article
Yoo, Changwon
Ramirez, Luis
Liuzzi, Juan
Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine
title Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine
title_full Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine
title_fullStr Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine
title_full_unstemmed Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine
title_short Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine
title_sort big data analysis using modern statistical and machine learning methods in medicine
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4076480/
https://www.ncbi.nlm.nih.gov/pubmed/24987556
http://dx.doi.org/10.5213/inj.2014.18.2.50
work_keys_str_mv AT yoochangwon bigdataanalysisusingmodernstatisticalandmachinelearningmethodsinmedicine
AT ramirezluis bigdataanalysisusingmodernstatisticalandmachinelearningmethodsinmedicine
AT liuzzijuan bigdataanalysisusingmodernstatisticalandmachinelearningmethodsinmedicine