Cargando…

The use of machine learning in rare diseases: a scoping review

BACKGROUND: Emerging machine learning technologies are beginning to transform medicine and healthcare and could also improve the diagnosis and treatment of rare diseases. Currently, there are no systematic reviews that investigate, from a general perspective, how machine learning is used in a rare d...

Descripción completa

Detalles Bibliográficos
Autores principales: Schaefer, Julia, Lehne, Moritz, Schepers, Josef, Prasser, Fabian, Thun, Sylvia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7285453/
https://www.ncbi.nlm.nih.gov/pubmed/32517778
http://dx.doi.org/10.1186/s13023-020-01424-6
_version_ 1783544701128802304
author Schaefer, Julia
Lehne, Moritz
Schepers, Josef
Prasser, Fabian
Thun, Sylvia
author_facet Schaefer, Julia
Lehne, Moritz
Schepers, Josef
Prasser, Fabian
Thun, Sylvia
author_sort Schaefer, Julia
collection PubMed
description BACKGROUND: Emerging machine learning technologies are beginning to transform medicine and healthcare and could also improve the diagnosis and treatment of rare diseases. Currently, there are no systematic reviews that investigate, from a general perspective, how machine learning is used in a rare disease context. This scoping review aims to address this gap and explores the use of machine learning in rare diseases, investigating, for example, in which rare diseases machine learning is applied, which types of algorithms and input data are used or which medical applications (e.g., diagnosis, prognosis or treatment) are studied. METHODS: Using a complex search string including generic search terms and 381 individual disease names, studies from the past 10 years (2010–2019) that applied machine learning in a rare disease context were identified on PubMed. To systematically map the research activity, eligible studies were categorized along different dimensions (e.g., rare disease group, type of algorithm, input data), and the number of studies within these categories was analyzed. RESULTS: Two hundred eleven studies from 32 countries investigating 74 different rare diseases were identified. Diseases with a higher prevalence appeared more often in the studies than diseases with a lower prevalence. Moreover, some rare disease groups were investigated more frequently than to be expected (e.g., rare neurologic diseases and rare systemic or rheumatologic diseases), others less frequently (e.g., rare inborn errors of metabolism and rare skin diseases). Ensemble methods (36.0%), support vector machines (32.2%) and artificial neural networks (31.8%) were the algorithms most commonly applied in the studies. Only a small proportion of studies evaluated their algorithms on an external data set (11.8%) or against a human expert (2.4%). As input data, images (32.2%), demographic data (27.0%) and “omics” data (26.5%) were used most frequently. Most studies used machine learning for diagnosis (40.8%) or prognosis (38.4%) whereas studies aiming to improve treatment were relatively scarce (4.7%). Patient numbers in the studies were small, typically ranging from 20 to 99 (35.5%). CONCLUSION: Our review provides an overview of the use of machine learning in rare diseases. Mapping the current research activity, it can guide future work and help to facilitate the successful application of machine learning in rare diseases.
format Online
Article
Text
id pubmed-7285453
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72854532020-06-10 The use of machine learning in rare diseases: a scoping review Schaefer, Julia Lehne, Moritz Schepers, Josef Prasser, Fabian Thun, Sylvia Orphanet J Rare Dis Review BACKGROUND: Emerging machine learning technologies are beginning to transform medicine and healthcare and could also improve the diagnosis and treatment of rare diseases. Currently, there are no systematic reviews that investigate, from a general perspective, how machine learning is used in a rare disease context. This scoping review aims to address this gap and explores the use of machine learning in rare diseases, investigating, for example, in which rare diseases machine learning is applied, which types of algorithms and input data are used or which medical applications (e.g., diagnosis, prognosis or treatment) are studied. METHODS: Using a complex search string including generic search terms and 381 individual disease names, studies from the past 10 years (2010–2019) that applied machine learning in a rare disease context were identified on PubMed. To systematically map the research activity, eligible studies were categorized along different dimensions (e.g., rare disease group, type of algorithm, input data), and the number of studies within these categories was analyzed. RESULTS: Two hundred eleven studies from 32 countries investigating 74 different rare diseases were identified. Diseases with a higher prevalence appeared more often in the studies than diseases with a lower prevalence. Moreover, some rare disease groups were investigated more frequently than to be expected (e.g., rare neurologic diseases and rare systemic or rheumatologic diseases), others less frequently (e.g., rare inborn errors of metabolism and rare skin diseases). Ensemble methods (36.0%), support vector machines (32.2%) and artificial neural networks (31.8%) were the algorithms most commonly applied in the studies. Only a small proportion of studies evaluated their algorithms on an external data set (11.8%) or against a human expert (2.4%). As input data, images (32.2%), demographic data (27.0%) and “omics” data (26.5%) were used most frequently. Most studies used machine learning for diagnosis (40.8%) or prognosis (38.4%) whereas studies aiming to improve treatment were relatively scarce (4.7%). Patient numbers in the studies were small, typically ranging from 20 to 99 (35.5%). CONCLUSION: Our review provides an overview of the use of machine learning in rare diseases. Mapping the current research activity, it can guide future work and help to facilitate the successful application of machine learning in rare diseases. BioMed Central 2020-06-09 /pmc/articles/PMC7285453/ /pubmed/32517778 http://dx.doi.org/10.1186/s13023-020-01424-6 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Review
Schaefer, Julia
Lehne, Moritz
Schepers, Josef
Prasser, Fabian
Thun, Sylvia
The use of machine learning in rare diseases: a scoping review
title The use of machine learning in rare diseases: a scoping review
title_full The use of machine learning in rare diseases: a scoping review
title_fullStr The use of machine learning in rare diseases: a scoping review
title_full_unstemmed The use of machine learning in rare diseases: a scoping review
title_short The use of machine learning in rare diseases: a scoping review
title_sort use of machine learning in rare diseases: a scoping review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7285453/
https://www.ncbi.nlm.nih.gov/pubmed/32517778
http://dx.doi.org/10.1186/s13023-020-01424-6
work_keys_str_mv AT schaeferjulia theuseofmachinelearninginrarediseasesascopingreview
AT lehnemoritz theuseofmachinelearninginrarediseasesascopingreview
AT schepersjosef theuseofmachinelearninginrarediseasesascopingreview
AT prasserfabian theuseofmachinelearninginrarediseasesascopingreview
AT thunsylvia theuseofmachinelearninginrarediseasesascopingreview
AT schaeferjulia useofmachinelearninginrarediseasesascopingreview
AT lehnemoritz useofmachinelearninginrarediseasesascopingreview
AT schepersjosef useofmachinelearninginrarediseasesascopingreview
AT prasserfabian useofmachinelearninginrarediseasesascopingreview
AT thunsylvia useofmachinelearninginrarediseasesascopingreview