Cargando…

The non-linear impact of data handling on network diffusion models

Many computational models rely on real-world data, and the steps required in moving from data collection, to data preparation, to model calibration, and input are becoming increasingly complex. Errors in data can lead to errors in model output that might invalidate conclusions in extreme cases. Whil...

Descripción completa

Detalles Bibliográficos
Autores principales: Nevin, James, Lees, Michael, Groth, Paul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8672192/
https://www.ncbi.nlm.nih.gov/pubmed/34950910
http://dx.doi.org/10.1016/j.patter.2021.100397
_version_ 1784615308430934016
author Nevin, James
Lees, Michael
Groth, Paul
author_facet Nevin, James
Lees, Michael
Groth, Paul
author_sort Nevin, James
collection PubMed
description Many computational models rely on real-world data, and the steps required in moving from data collection, to data preparation, to model calibration, and input are becoming increasingly complex. Errors in data can lead to errors in model output that might invalidate conclusions in extreme cases. While the challenge of errors in data collection have been analyzed in the literature, here we highlight the importance of data handling in the modeling and simulation process, and how particular data handling errors can lead to errors in model output. We develop a framework for assessing the impact of potential data errors for models of spreading processes on networks, a broad class of models that capture many important real-world phenomena (e.g., epidemics, rumor spread, etc.). We focus on the susceptible-infected-removed (SIR) and Threshold models and examine how systematic errors in data handling impact the predicted spread of a virus (or information). Our results demonstrate that data handling errors can have significant impact on model conclusions especially in critical regions of a system.
format Online
Article
Text
id pubmed-8672192
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-86721922021-12-22 The non-linear impact of data handling on network diffusion models Nevin, James Lees, Michael Groth, Paul Patterns (N Y) Article Many computational models rely on real-world data, and the steps required in moving from data collection, to data preparation, to model calibration, and input are becoming increasingly complex. Errors in data can lead to errors in model output that might invalidate conclusions in extreme cases. While the challenge of errors in data collection have been analyzed in the literature, here we highlight the importance of data handling in the modeling and simulation process, and how particular data handling errors can lead to errors in model output. We develop a framework for assessing the impact of potential data errors for models of spreading processes on networks, a broad class of models that capture many important real-world phenomena (e.g., epidemics, rumor spread, etc.). We focus on the susceptible-infected-removed (SIR) and Threshold models and examine how systematic errors in data handling impact the predicted spread of a virus (or information). Our results demonstrate that data handling errors can have significant impact on model conclusions especially in critical regions of a system. Elsevier 2021-11-26 /pmc/articles/PMC8672192/ /pubmed/34950910 http://dx.doi.org/10.1016/j.patter.2021.100397 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Nevin, James
Lees, Michael
Groth, Paul
The non-linear impact of data handling on network diffusion models
title The non-linear impact of data handling on network diffusion models
title_full The non-linear impact of data handling on network diffusion models
title_fullStr The non-linear impact of data handling on network diffusion models
title_full_unstemmed The non-linear impact of data handling on network diffusion models
title_short The non-linear impact of data handling on network diffusion models
title_sort non-linear impact of data handling on network diffusion models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8672192/
https://www.ncbi.nlm.nih.gov/pubmed/34950910
http://dx.doi.org/10.1016/j.patter.2021.100397
work_keys_str_mv AT nevinjames thenonlinearimpactofdatahandlingonnetworkdiffusionmodels
AT leesmichael thenonlinearimpactofdatahandlingonnetworkdiffusionmodels
AT grothpaul thenonlinearimpactofdatahandlingonnetworkdiffusionmodels
AT nevinjames nonlinearimpactofdatahandlingonnetworkdiffusionmodels
AT leesmichael nonlinearimpactofdatahandlingonnetworkdiffusionmodels
AT grothpaul nonlinearimpactofdatahandlingonnetworkdiffusionmodels