Cargando…

Unified real-time environmental-epidemiological data for multiscale modeling of the COVID-19 pandemic

An impressive number of COVID-19 data catalogs exist. However, none are fully optimized for data science applications. Inconsistent naming and data conventions, uneven quality control, and lack of alignment between disease data and potential predictors pose barriers to robust modeling and analysis....

Descripción completa

Detalles Bibliográficos
Autores principales: Badr, Hamada S., Zaitchik, Benjamin F., Kerr, Gaige H., Nguyen, Nhat-Lan H., Chen, Yen-Ting, Hinson, Patrick, Colston, Josh M., Kosek, Margaret N., Dong, Ensheng, Du, Hongru, Marshall, Maximilian, Nixon, Kristen, Mohegh, Arash, Goldberg, Daniel L., Anenberg, Susan C., Gardner, Lauren M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10245354/
https://www.ncbi.nlm.nih.gov/pubmed/37286690
http://dx.doi.org/10.1038/s41597-023-02276-y
Descripción
Sumario:An impressive number of COVID-19 data catalogs exist. However, none are fully optimized for data science applications. Inconsistent naming and data conventions, uneven quality control, and lack of alignment between disease data and potential predictors pose barriers to robust modeling and analysis. To address this gap, we generated a unified dataset that integrates and implements quality checks of the data from numerous leading sources of COVID-19 epidemiological and environmental data. We use a globally consistent hierarchy of administrative units to facilitate analysis within and across countries. The dataset applies this unified hierarchy to align COVID-19 epidemiological data with a number of other data types relevant to understanding and predicting COVID-19 risk, including hydrometeorological data, air quality, information on COVID-19 control policies, vaccine data, and key demographic characteristics.