Cargando…
Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches
The complex, unpredictable nature of pathogen occurrence has required substantial efforts to accurately predict infectious diseases (IDs). With rising popularity of Machine Learning (ML) and Deep Learning (DL) techniques combined with their unique ability to uncover connections between large amounts...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9582566/ https://www.ncbi.nlm.nih.gov/pubmed/36277100 http://dx.doi.org/10.1016/j.onehlt.2022.100439 |
_version_ | 1784812868345004032 |
---|---|
author | Keshavamurthy, Ravikiran Dixon, Samuel Pazdernik, Karl T. Charles, Lauren E. |
author_facet | Keshavamurthy, Ravikiran Dixon, Samuel Pazdernik, Karl T. Charles, Lauren E. |
author_sort | Keshavamurthy, Ravikiran |
collection | PubMed |
description | The complex, unpredictable nature of pathogen occurrence has required substantial efforts to accurately predict infectious diseases (IDs). With rising popularity of Machine Learning (ML) and Deep Learning (DL) techniques combined with their unique ability to uncover connections between large amounts of diverse data, we conducted a PRISMA systematic review to investigate advances in ID prediction for human and animal diseases using ML and DL. This review included the type of IDs modeled, ML and DL techniques utilized, geographical distribution, prediction tasks performed, input features utilized, spatial and temporal scales, error metrics used, computational efficiency, uncertainty quantification, and missing data handling methods. Among 237 relevant articles published between January 2001 and May 2021, highly contagious diseases in humans were most often represented, including COVID-19 (37.1%), influenza/influenza-like illnesses (9.3%), dengue (8.9%), and malaria (5.1%). Out of 37 diseases identified, 51.4% were zoonotic, 37.8% were human-only, and 8.1% were animal-only, with only 1.6% economically significant, non-zoonotic livestock diseases. Despite the number of zoonoses, 86.5% of articles modeled humans whereas only a few articles (5.1%) contained more than one host species. Eastern Asia (32.5%), North America (17.7%), and Southern Asia (13.1%) were the most represented locations. Frequent approaches included tree-based ML (38.4%) and feed-forward neural networks (26.6%). Articles predicted temporal incidence (66.7%), disease risk (38.0%), and/or spatial movement (31.2%). Less than 10% of studies addressed uncertainty quantification, computational efficiency, and missing data, which are essential to operational use and deployment. This study highlights trends and gaps in ML and DL for ID prediction, providing guidelines for future works to better support biopreparedness and response. To fully utilize ML and DL for improved ID forecasting, models should include the full disease ecology in a One-Health context, important food and agricultural diseases, underrepresented hotspots, and important metrics required for operational deployment. |
format | Online Article Text |
id | pubmed-9582566 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-95825662022-10-21 Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches Keshavamurthy, Ravikiran Dixon, Samuel Pazdernik, Karl T. Charles, Lauren E. One Health Review Paper The complex, unpredictable nature of pathogen occurrence has required substantial efforts to accurately predict infectious diseases (IDs). With rising popularity of Machine Learning (ML) and Deep Learning (DL) techniques combined with their unique ability to uncover connections between large amounts of diverse data, we conducted a PRISMA systematic review to investigate advances in ID prediction for human and animal diseases using ML and DL. This review included the type of IDs modeled, ML and DL techniques utilized, geographical distribution, prediction tasks performed, input features utilized, spatial and temporal scales, error metrics used, computational efficiency, uncertainty quantification, and missing data handling methods. Among 237 relevant articles published between January 2001 and May 2021, highly contagious diseases in humans were most often represented, including COVID-19 (37.1%), influenza/influenza-like illnesses (9.3%), dengue (8.9%), and malaria (5.1%). Out of 37 diseases identified, 51.4% were zoonotic, 37.8% were human-only, and 8.1% were animal-only, with only 1.6% economically significant, non-zoonotic livestock diseases. Despite the number of zoonoses, 86.5% of articles modeled humans whereas only a few articles (5.1%) contained more than one host species. Eastern Asia (32.5%), North America (17.7%), and Southern Asia (13.1%) were the most represented locations. Frequent approaches included tree-based ML (38.4%) and feed-forward neural networks (26.6%). Articles predicted temporal incidence (66.7%), disease risk (38.0%), and/or spatial movement (31.2%). Less than 10% of studies addressed uncertainty quantification, computational efficiency, and missing data, which are essential to operational use and deployment. This study highlights trends and gaps in ML and DL for ID prediction, providing guidelines for future works to better support biopreparedness and response. To fully utilize ML and DL for improved ID forecasting, models should include the full disease ecology in a One-Health context, important food and agricultural diseases, underrepresented hotspots, and important metrics required for operational deployment. Elsevier 2022-10-01 /pmc/articles/PMC9582566/ /pubmed/36277100 http://dx.doi.org/10.1016/j.onehlt.2022.100439 Text en © 2022 Battelle Memorial Institute https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Review Paper Keshavamurthy, Ravikiran Dixon, Samuel Pazdernik, Karl T. Charles, Lauren E. Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches |
title | Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches |
title_full | Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches |
title_fullStr | Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches |
title_full_unstemmed | Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches |
title_short | Predicting infectious disease for biopreparedness and response: A systematic review of machine learning and deep learning approaches |
title_sort | predicting infectious disease for biopreparedness and response: a systematic review of machine learning and deep learning approaches |
topic | Review Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9582566/ https://www.ncbi.nlm.nih.gov/pubmed/36277100 http://dx.doi.org/10.1016/j.onehlt.2022.100439 |
work_keys_str_mv | AT keshavamurthyravikiran predictinginfectiousdiseaseforbiopreparednessandresponseasystematicreviewofmachinelearninganddeeplearningapproaches AT dixonsamuel predictinginfectiousdiseaseforbiopreparednessandresponseasystematicreviewofmachinelearninganddeeplearningapproaches AT pazdernikkarlt predictinginfectiousdiseaseforbiopreparednessandresponseasystematicreviewofmachinelearninganddeeplearningapproaches AT charleslaurene predictinginfectiousdiseaseforbiopreparednessandresponseasystematicreviewofmachinelearninganddeeplearningapproaches |