Cargando…

The benefits and pitfalls of machine learning for biomarker discovery

Prospects for the discovery of robust and reproducible biomarkers have improved considerably with the development of sensitive omics platforms that can enable measurement of biological molecules at an unprecedented scale. With technical barriers to success lowering, the challenge is now moving into...

Descripción completa

Detalles Bibliográficos
Autores principales: Ng, Sandra, Masarone, Sara, Watson, David, Barnes, Michael R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558383/
https://www.ncbi.nlm.nih.gov/pubmed/37498390
http://dx.doi.org/10.1007/s00441-023-03816-z
_version_ 1785117263318220800
author Ng, Sandra
Masarone, Sara
Watson, David
Barnes, Michael R.
author_facet Ng, Sandra
Masarone, Sara
Watson, David
Barnes, Michael R.
author_sort Ng, Sandra
collection PubMed
description Prospects for the discovery of robust and reproducible biomarkers have improved considerably with the development of sensitive omics platforms that can enable measurement of biological molecules at an unprecedented scale. With technical barriers to success lowering, the challenge is now moving into the analytical domain. Genome-wide discovery presents a problem of scale and multiple testing as standard statistical methods struggle to distinguish signal from noise in increasingly complex biological systems. Machine learning and AI methods are good at finding answers in large datasets, but they have a tendency to overfit solutions. It may be possible to find a local answer or mechanism in a specific patient sample or small group of samples, but this may not generalise to wider patient populations due to the high likelihood of false discovery. The rise of explainable AI offers to improve the opportunity for true discovery by providing explanations for predictions that can be explored mechanistically before proceeding to costly and time-consuming validation studies. This review aims to introduce some of the basic concepts of machine learning and AI for biomarker discovery with a focus on post hoc explanation of predictions. To illustrate this, we consider how explainable AI has already been used successfully, and we explore a case study that applies AI to biomarker discovery in rheumatoid arthritis, demonstrating the accessibility of tools for AI and machine learning. We use this to illustrate and discuss some of the potential challenges and solutions that may enable AI to critically interrogate disease and response mechanisms.
format Online
Article
Text
id pubmed-10558383
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-105583832023-10-08 The benefits and pitfalls of machine learning for biomarker discovery Ng, Sandra Masarone, Sara Watson, David Barnes, Michael R. Cell Tissue Res Review Prospects for the discovery of robust and reproducible biomarkers have improved considerably with the development of sensitive omics platforms that can enable measurement of biological molecules at an unprecedented scale. With technical barriers to success lowering, the challenge is now moving into the analytical domain. Genome-wide discovery presents a problem of scale and multiple testing as standard statistical methods struggle to distinguish signal from noise in increasingly complex biological systems. Machine learning and AI methods are good at finding answers in large datasets, but they have a tendency to overfit solutions. It may be possible to find a local answer or mechanism in a specific patient sample or small group of samples, but this may not generalise to wider patient populations due to the high likelihood of false discovery. The rise of explainable AI offers to improve the opportunity for true discovery by providing explanations for predictions that can be explored mechanistically before proceeding to costly and time-consuming validation studies. This review aims to introduce some of the basic concepts of machine learning and AI for biomarker discovery with a focus on post hoc explanation of predictions. To illustrate this, we consider how explainable AI has already been used successfully, and we explore a case study that applies AI to biomarker discovery in rheumatoid arthritis, demonstrating the accessibility of tools for AI and machine learning. We use this to illustrate and discuss some of the potential challenges and solutions that may enable AI to critically interrogate disease and response mechanisms. Springer Berlin Heidelberg 2023-07-27 2023 /pmc/articles/PMC10558383/ /pubmed/37498390 http://dx.doi.org/10.1007/s00441-023-03816-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Review
Ng, Sandra
Masarone, Sara
Watson, David
Barnes, Michael R.
The benefits and pitfalls of machine learning for biomarker discovery
title The benefits and pitfalls of machine learning for biomarker discovery
title_full The benefits and pitfalls of machine learning for biomarker discovery
title_fullStr The benefits and pitfalls of machine learning for biomarker discovery
title_full_unstemmed The benefits and pitfalls of machine learning for biomarker discovery
title_short The benefits and pitfalls of machine learning for biomarker discovery
title_sort benefits and pitfalls of machine learning for biomarker discovery
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558383/
https://www.ncbi.nlm.nih.gov/pubmed/37498390
http://dx.doi.org/10.1007/s00441-023-03816-z
work_keys_str_mv AT ngsandra thebenefitsandpitfallsofmachinelearningforbiomarkerdiscovery
AT masaronesara thebenefitsandpitfallsofmachinelearningforbiomarkerdiscovery
AT watsondavid thebenefitsandpitfallsofmachinelearningforbiomarkerdiscovery
AT barnesmichaelr thebenefitsandpitfallsofmachinelearningforbiomarkerdiscovery
AT ngsandra benefitsandpitfallsofmachinelearningforbiomarkerdiscovery
AT masaronesara benefitsandpitfallsofmachinelearningforbiomarkerdiscovery
AT watsondavid benefitsandpitfallsofmachinelearningforbiomarkerdiscovery
AT barnesmichaelr benefitsandpitfallsofmachinelearningforbiomarkerdiscovery