Cargando…

Big Data, Data Science, and Causal Inference: A Primer for Clinicians

Clinicians handle a growing amount of clinical, biometric, and biomarker data. In this “big data” era, there is an emerging faith that the answer to all clinical and scientific questions reside in “big data” and that data will transform medicine into precision medicine. However, data by themselves a...

Descripción completa

Detalles Bibliográficos
Autores principales: Raita, Yoshihiko, Camargo, Carlos A., Liang, Liming, Hasegawa, Kohei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8290071/
https://www.ncbi.nlm.nih.gov/pubmed/34295910
http://dx.doi.org/10.3389/fmed.2021.678047
_version_ 1783724417457586176
author Raita, Yoshihiko
Camargo, Carlos A.
Liang, Liming
Hasegawa, Kohei
author_facet Raita, Yoshihiko
Camargo, Carlos A.
Liang, Liming
Hasegawa, Kohei
author_sort Raita, Yoshihiko
collection PubMed
description Clinicians handle a growing amount of clinical, biometric, and biomarker data. In this “big data” era, there is an emerging faith that the answer to all clinical and scientific questions reside in “big data” and that data will transform medicine into precision medicine. However, data by themselves are useless. It is the algorithms encoding causal reasoning and domain (e.g., clinical and biological) knowledge that prove transformative. The recent introduction of (health) data science presents an opportunity to re-think this data-centric view. For example, while precision medicine seeks to provide the right prevention and treatment strategy to the right patients at the right time, its realization cannot be achieved by algorithms that operate exclusively in data-driven prediction modes, as do most machine learning algorithms. Better understanding of data science and its tasks is vital to interpret findings and translate new discoveries into clinical practice. In this review, we first discuss the principles and major tasks of data science by organizing it into three defining tasks: (1) association and prediction, (2) intervention, and (3) counterfactual causal inference. Second, we review commonly-used data science tools with examples in the medical literature. Lastly, we outline current challenges and future directions in the fields of medicine, elaborating on how data science can enhance clinical effectiveness and inform medical practice. As machine learning algorithms become ubiquitous tools to handle quantitatively “big data,” their integration with causal reasoning and domain knowledge is instrumental to qualitatively transform medicine, which will, in turn, improve health outcomes of patients.
format Online
Article
Text
id pubmed-8290071
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82900712021-07-21 Big Data, Data Science, and Causal Inference: A Primer for Clinicians Raita, Yoshihiko Camargo, Carlos A. Liang, Liming Hasegawa, Kohei Front Med (Lausanne) Medicine Clinicians handle a growing amount of clinical, biometric, and biomarker data. In this “big data” era, there is an emerging faith that the answer to all clinical and scientific questions reside in “big data” and that data will transform medicine into precision medicine. However, data by themselves are useless. It is the algorithms encoding causal reasoning and domain (e.g., clinical and biological) knowledge that prove transformative. The recent introduction of (health) data science presents an opportunity to re-think this data-centric view. For example, while precision medicine seeks to provide the right prevention and treatment strategy to the right patients at the right time, its realization cannot be achieved by algorithms that operate exclusively in data-driven prediction modes, as do most machine learning algorithms. Better understanding of data science and its tasks is vital to interpret findings and translate new discoveries into clinical practice. In this review, we first discuss the principles and major tasks of data science by organizing it into three defining tasks: (1) association and prediction, (2) intervention, and (3) counterfactual causal inference. Second, we review commonly-used data science tools with examples in the medical literature. Lastly, we outline current challenges and future directions in the fields of medicine, elaborating on how data science can enhance clinical effectiveness and inform medical practice. As machine learning algorithms become ubiquitous tools to handle quantitatively “big data,” their integration with causal reasoning and domain knowledge is instrumental to qualitatively transform medicine, which will, in turn, improve health outcomes of patients. Frontiers Media S.A. 2021-07-06 /pmc/articles/PMC8290071/ /pubmed/34295910 http://dx.doi.org/10.3389/fmed.2021.678047 Text en Copyright © 2021 Raita, Camargo, Liang and Hasegawa. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Raita, Yoshihiko
Camargo, Carlos A.
Liang, Liming
Hasegawa, Kohei
Big Data, Data Science, and Causal Inference: A Primer for Clinicians
title Big Data, Data Science, and Causal Inference: A Primer for Clinicians
title_full Big Data, Data Science, and Causal Inference: A Primer for Clinicians
title_fullStr Big Data, Data Science, and Causal Inference: A Primer for Clinicians
title_full_unstemmed Big Data, Data Science, and Causal Inference: A Primer for Clinicians
title_short Big Data, Data Science, and Causal Inference: A Primer for Clinicians
title_sort big data, data science, and causal inference: a primer for clinicians
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8290071/
https://www.ncbi.nlm.nih.gov/pubmed/34295910
http://dx.doi.org/10.3389/fmed.2021.678047
work_keys_str_mv AT raitayoshihiko bigdatadatascienceandcausalinferenceaprimerforclinicians
AT camargocarlosa bigdatadatascienceandcausalinferenceaprimerforclinicians
AT liangliming bigdatadatascienceandcausalinferenceaprimerforclinicians
AT hasegawakohei bigdatadatascienceandcausalinferenceaprimerforclinicians