Cargando…
The benefits and pitfalls of machine learning for biomarker discovery
Prospects for the discovery of robust and reproducible biomarkers have improved considerably with the development of sensitive omics platforms that can enable measurement of biological molecules at an unprecedented scale. With technical barriers to success lowering, the challenge is now moving into...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558383/ https://www.ncbi.nlm.nih.gov/pubmed/37498390 http://dx.doi.org/10.1007/s00441-023-03816-z |
_version_ | 1785117263318220800 |
---|---|
author | Ng, Sandra Masarone, Sara Watson, David Barnes, Michael R. |
author_facet | Ng, Sandra Masarone, Sara Watson, David Barnes, Michael R. |
author_sort | Ng, Sandra |
collection | PubMed |
description | Prospects for the discovery of robust and reproducible biomarkers have improved considerably with the development of sensitive omics platforms that can enable measurement of biological molecules at an unprecedented scale. With technical barriers to success lowering, the challenge is now moving into the analytical domain. Genome-wide discovery presents a problem of scale and multiple testing as standard statistical methods struggle to distinguish signal from noise in increasingly complex biological systems. Machine learning and AI methods are good at finding answers in large datasets, but they have a tendency to overfit solutions. It may be possible to find a local answer or mechanism in a specific patient sample or small group of samples, but this may not generalise to wider patient populations due to the high likelihood of false discovery. The rise of explainable AI offers to improve the opportunity for true discovery by providing explanations for predictions that can be explored mechanistically before proceeding to costly and time-consuming validation studies. This review aims to introduce some of the basic concepts of machine learning and AI for biomarker discovery with a focus on post hoc explanation of predictions. To illustrate this, we consider how explainable AI has already been used successfully, and we explore a case study that applies AI to biomarker discovery in rheumatoid arthritis, demonstrating the accessibility of tools for AI and machine learning. We use this to illustrate and discuss some of the potential challenges and solutions that may enable AI to critically interrogate disease and response mechanisms. |
format | Online Article Text |
id | pubmed-10558383 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-105583832023-10-08 The benefits and pitfalls of machine learning for biomarker discovery Ng, Sandra Masarone, Sara Watson, David Barnes, Michael R. Cell Tissue Res Review Prospects for the discovery of robust and reproducible biomarkers have improved considerably with the development of sensitive omics platforms that can enable measurement of biological molecules at an unprecedented scale. With technical barriers to success lowering, the challenge is now moving into the analytical domain. Genome-wide discovery presents a problem of scale and multiple testing as standard statistical methods struggle to distinguish signal from noise in increasingly complex biological systems. Machine learning and AI methods are good at finding answers in large datasets, but they have a tendency to overfit solutions. It may be possible to find a local answer or mechanism in a specific patient sample or small group of samples, but this may not generalise to wider patient populations due to the high likelihood of false discovery. The rise of explainable AI offers to improve the opportunity for true discovery by providing explanations for predictions that can be explored mechanistically before proceeding to costly and time-consuming validation studies. This review aims to introduce some of the basic concepts of machine learning and AI for biomarker discovery with a focus on post hoc explanation of predictions. To illustrate this, we consider how explainable AI has already been used successfully, and we explore a case study that applies AI to biomarker discovery in rheumatoid arthritis, demonstrating the accessibility of tools for AI and machine learning. We use this to illustrate and discuss some of the potential challenges and solutions that may enable AI to critically interrogate disease and response mechanisms. Springer Berlin Heidelberg 2023-07-27 2023 /pmc/articles/PMC10558383/ /pubmed/37498390 http://dx.doi.org/10.1007/s00441-023-03816-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Review Ng, Sandra Masarone, Sara Watson, David Barnes, Michael R. The benefits and pitfalls of machine learning for biomarker discovery |
title | The benefits and pitfalls of machine learning for biomarker discovery |
title_full | The benefits and pitfalls of machine learning for biomarker discovery |
title_fullStr | The benefits and pitfalls of machine learning for biomarker discovery |
title_full_unstemmed | The benefits and pitfalls of machine learning for biomarker discovery |
title_short | The benefits and pitfalls of machine learning for biomarker discovery |
title_sort | benefits and pitfalls of machine learning for biomarker discovery |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10558383/ https://www.ncbi.nlm.nih.gov/pubmed/37498390 http://dx.doi.org/10.1007/s00441-023-03816-z |
work_keys_str_mv | AT ngsandra thebenefitsandpitfallsofmachinelearningforbiomarkerdiscovery AT masaronesara thebenefitsandpitfallsofmachinelearningforbiomarkerdiscovery AT watsondavid thebenefitsandpitfallsofmachinelearningforbiomarkerdiscovery AT barnesmichaelr thebenefitsandpitfallsofmachinelearningforbiomarkerdiscovery AT ngsandra benefitsandpitfallsofmachinelearningforbiomarkerdiscovery AT masaronesara benefitsandpitfallsofmachinelearningforbiomarkerdiscovery AT watsondavid benefitsandpitfallsofmachinelearningforbiomarkerdiscovery AT barnesmichaelr benefitsandpitfallsofmachinelearningforbiomarkerdiscovery |