Cargando…

Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches

The complex, unpredictable nature of pathogen occurrence has required substantial efforts to accurately predict infectious diseases (IDs). With rising popularity of Machine Learning (ML) and Deep Learning (DL) techniques combined with their unique ability to uncover connections between large amounts...

Descripción completa

Detalles Bibliográficos
Autores principales: Keshavamurthy, Ravikiran, Dixon, Samuel, Pazdernik, Karl T., Charles, Lauren E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9582566/
https://www.ncbi.nlm.nih.gov/pubmed/36277100
http://dx.doi.org/10.1016/j.onehlt.2022.100439
_version_ 1784812868345004032
author Keshavamurthy, Ravikiran
Dixon, Samuel
Pazdernik, Karl T.
Charles, Lauren E.
author_facet Keshavamurthy, Ravikiran
Dixon, Samuel
Pazdernik, Karl T.
Charles, Lauren E.
author_sort Keshavamurthy, Ravikiran
collection PubMed
description The complex, unpredictable nature of pathogen occurrence has required substantial efforts to accurately predict infectious diseases (IDs). With rising popularity of Machine Learning (ML) and Deep Learning (DL) techniques combined with their unique ability to uncover connections between large amounts of diverse data, we conducted a PRISMA systematic review to investigate advances in ID prediction for human and animal diseases using ML and DL. This review included the type of IDs modeled, ML and DL techniques utilized, geographical distribution, prediction tasks performed, input features utilized, spatial and temporal scales, error metrics used, computational efficiency, uncertainty quantification, and missing data handling methods. Among 237 relevant articles published between January 2001 and May 2021, highly contagious diseases in humans were most often represented, including COVID-19 (37.1%), influenza/influenza-like illnesses (9.3%), dengue (8.9%), and malaria (5.1%). Out of 37 diseases identified, 51.4% were zoonotic, 37.8% were human-only, and 8.1% were animal-only, with only 1.6% economically significant, non-zoonotic livestock diseases. Despite the number of zoonoses, 86.5% of articles modeled humans whereas only a few articles (5.1%) contained more than one host species. Eastern Asia (32.5%), North America (17.7%), and Southern Asia (13.1%) were the most represented locations. Frequent approaches included tree-based ML (38.4%) and feed-forward neural networks (26.6%). Articles predicted temporal incidence (66.7%), disease risk (38.0%), and/or spatial movement (31.2%). Less than 10% of studies addressed uncertainty quantification, computational efficiency, and missing data, which are essential to operational use and deployment. This study highlights trends and gaps in ML and DL for ID prediction, providing guidelines for future works to better support biopreparedness and response. To fully utilize ML and DL for improved ID forecasting, models should include the full disease ecology in a One-Health context, important food and agricultural diseases, underrepresented hotspots, and important metrics required for operational deployment.
format Online
Article
Text
id pubmed-9582566
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-95825662022-10-21 Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches Keshavamurthy, Ravikiran Dixon, Samuel Pazdernik, Karl T. Charles, Lauren E. One Health Review Paper The complex, unpredictable nature of pathogen occurrence has required substantial efforts to accurately predict infectious diseases (IDs). With rising popularity of Machine Learning (ML) and Deep Learning (DL) techniques combined with their unique ability to uncover connections between large amounts of diverse data, we conducted a PRISMA systematic review to investigate advances in ID prediction for human and animal diseases using ML and DL. This review included the type of IDs modeled, ML and DL techniques utilized, geographical distribution, prediction tasks performed, input features utilized, spatial and temporal scales, error metrics used, computational efficiency, uncertainty quantification, and missing data handling methods. Among 237 relevant articles published between January 2001 and May 2021, highly contagious diseases in humans were most often represented, including COVID-19 (37.1%), influenza/influenza-like illnesses (9.3%), dengue (8.9%), and malaria (5.1%). Out of 37 diseases identified, 51.4% were zoonotic, 37.8% were human-only, and 8.1% were animal-only, with only 1.6% economically significant, non-zoonotic livestock diseases. Despite the number of zoonoses, 86.5% of articles modeled humans whereas only a few articles (5.1%) contained more than one host species. Eastern Asia (32.5%), North America (17.7%), and Southern Asia (13.1%) were the most represented locations. Frequent approaches included tree-based ML (38.4%) and feed-forward neural networks (26.6%). Articles predicted temporal incidence (66.7%), disease risk (38.0%), and/or spatial movement (31.2%). Less than 10% of studies addressed uncertainty quantification, computational efficiency, and missing data, which are essential to operational use and deployment. This study highlights trends and gaps in ML and DL for ID prediction, providing guidelines for future works to better support biopreparedness and response. To fully utilize ML and DL for improved ID forecasting, models should include the full disease ecology in a One-Health context, important food and agricultural diseases, underrepresented hotspots, and important metrics required for operational deployment. Elsevier 2022-10-01 /pmc/articles/PMC9582566/ /pubmed/36277100 http://dx.doi.org/10.1016/j.onehlt.2022.100439 Text en © 2022 Battelle Memorial Institute https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Review Paper
Keshavamurthy, Ravikiran
Dixon, Samuel
Pazdernik, Karl T.
Charles, Lauren E.
Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches
title Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches
title_full Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches
title_fullStr Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches
title_full_unstemmed Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches
title_short Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches
title_sort predicting infectious disease for biopreparedness and response: a systematic review of machine learning and deep learning approaches
topic Review Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9582566/
https://www.ncbi.nlm.nih.gov/pubmed/36277100
http://dx.doi.org/10.1016/j.onehlt.2022.100439
work_keys_str_mv AT keshavamurthyravikiran predictinginfectiousdiseaseforbiopreparednessandresponseasystematicreviewofmachinelearninganddeeplearningapproaches
AT dixonsamuel predictinginfectiousdiseaseforbiopreparednessandresponseasystematicreviewofmachinelearninganddeeplearningapproaches
AT pazdernikkarlt predictinginfectiousdiseaseforbiopreparednessandresponseasystematicreviewofmachinelearninganddeeplearningapproaches
AT charleslaurene predictinginfectiousdiseaseforbiopreparednessandresponseasystematicreviewofmachinelearninganddeeplearningapproaches