Cargando…

Algorithmic discovery of dynamic models from infectious disease data

Theoretical models are typically developed through a deductive process where a researcher formulates a system of dynamic equations from hypothesized mechanisms. Recent advances in algorithmic methods can discover dynamic models inductively–directly from data. Most previous research has tested these...

Descripción completa

Detalles Bibliográficos
Autores principales: Horrocks, Jonathan, Bauch, Chris T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7184751/
https://www.ncbi.nlm.nih.gov/pubmed/32341374
http://dx.doi.org/10.1038/s41598-020-63877-w
_version_ 1783526646996795392
author Horrocks, Jonathan
Bauch, Chris T.
author_facet Horrocks, Jonathan
Bauch, Chris T.
author_sort Horrocks, Jonathan
collection PubMed
description Theoretical models are typically developed through a deductive process where a researcher formulates a system of dynamic equations from hypothesized mechanisms. Recent advances in algorithmic methods can discover dynamic models inductively–directly from data. Most previous research has tested these methods by rediscovering models from synthetic data generated by the already known model. Here we apply Sparse Identification of Nonlinear Dynamics (SINDy) to discover mechanistic equations for disease dynamics from case notification data for measles, chickenpox, and rubella. The discovered models provide a good qualitative fit to the observed dynamics for all three diseases, However, the SINDy chickenpox model appears to overfit the empirical data, and recovering qualitatively correct rubella dynamics requires using power spectral density in the goodness-of-fit criterion. When SINDy uses a library of second-order functions, the discovered models tend to include mass action incidence and a seasonally varying transmission rate–a common feature of existing epidemiological models for childhood infectious diseases. We also find that the SINDy measles model is capable of out-of-sample prediction of a dynamical regime shift in measles case notification data. These results demonstrate the potential for algorithmic model discovery to enrich scientific understanding by providing a complementary approach to developing theoretical models.
format Online
Article
Text
id pubmed-7184751
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-71847512020-05-04 Algorithmic discovery of dynamic models from infectious disease data Horrocks, Jonathan Bauch, Chris T. Sci Rep Article Theoretical models are typically developed through a deductive process where a researcher formulates a system of dynamic equations from hypothesized mechanisms. Recent advances in algorithmic methods can discover dynamic models inductively–directly from data. Most previous research has tested these methods by rediscovering models from synthetic data generated by the already known model. Here we apply Sparse Identification of Nonlinear Dynamics (SINDy) to discover mechanistic equations for disease dynamics from case notification data for measles, chickenpox, and rubella. The discovered models provide a good qualitative fit to the observed dynamics for all three diseases, However, the SINDy chickenpox model appears to overfit the empirical data, and recovering qualitatively correct rubella dynamics requires using power spectral density in the goodness-of-fit criterion. When SINDy uses a library of second-order functions, the discovered models tend to include mass action incidence and a seasonally varying transmission rate–a common feature of existing epidemiological models for childhood infectious diseases. We also find that the SINDy measles model is capable of out-of-sample prediction of a dynamical regime shift in measles case notification data. These results demonstrate the potential for algorithmic model discovery to enrich scientific understanding by providing a complementary approach to developing theoretical models. Nature Publishing Group UK 2020-04-27 /pmc/articles/PMC7184751/ /pubmed/32341374 http://dx.doi.org/10.1038/s41598-020-63877-w Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Horrocks, Jonathan
Bauch, Chris T.
Algorithmic discovery of dynamic models from infectious disease data
title Algorithmic discovery of dynamic models from infectious disease data
title_full Algorithmic discovery of dynamic models from infectious disease data
title_fullStr Algorithmic discovery of dynamic models from infectious disease data
title_full_unstemmed Algorithmic discovery of dynamic models from infectious disease data
title_short Algorithmic discovery of dynamic models from infectious disease data
title_sort algorithmic discovery of dynamic models from infectious disease data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7184751/
https://www.ncbi.nlm.nih.gov/pubmed/32341374
http://dx.doi.org/10.1038/s41598-020-63877-w
work_keys_str_mv AT horrocksjonathan algorithmicdiscoveryofdynamicmodelsfrominfectiousdiseasedata
AT bauchchrist algorithmicdiscoveryofdynamicmodelsfrominfectiousdiseasedata