Cargando…

Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State

In disease modeling, a key statistical problem is the estimation of lower and upper tail probabilities of health events from given data sets of small size and limited range. Assuming such constraints, we describe a computational framework for the systematic fusion of observations from multiple sourc...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xuze, Pyne, Saumyadipta, Kedem, Benjamin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8226468/
https://www.ncbi.nlm.nih.gov/pubmed/34072055
http://dx.doi.org/10.3390/e23060675
_version_ 1783712294008520704
author Zhang, Xuze
Pyne, Saumyadipta
Kedem, Benjamin
author_facet Zhang, Xuze
Pyne, Saumyadipta
Kedem, Benjamin
author_sort Zhang, Xuze
collection PubMed
description In disease modeling, a key statistical problem is the estimation of lower and upper tail probabilities of health events from given data sets of small size and limited range. Assuming such constraints, we describe a computational framework for the systematic fusion of observations from multiple sources to compute tail probabilities that could not be obtained otherwise due to a lack of lower or upper tail data. The estimation of multivariate lower and upper tail probabilities from a given small reference data set that lacks complete information about such tail data is addressed in terms of pertussis case count data. Fusion of data from multiple sources in conjunction with the density ratio model is used to give probability estimates that are non-obtainable from the empirical distribution. Based on a density ratio model with variable tilts, we first present a univariate fit and, subsequently, improve it with a multivariate extension. In the multivariate analysis, we selected the best model in terms of the Akaike Information Criterion (AIC). Regional prediction, in Washington state, of the number of pertussis cases is approached by providing joint probabilities using fused data from several relatively small samples following the selected density ratio model. The model is validated by a graphical goodness-of-fit plot comparing the estimated reference distribution obtained from the fused data with that of the empirical distribution obtained from the reference sample only.
format Online
Article
Text
id pubmed-8226468
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-82264682021-06-26 Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State Zhang, Xuze Pyne, Saumyadipta Kedem, Benjamin Entropy (Basel) Article In disease modeling, a key statistical problem is the estimation of lower and upper tail probabilities of health events from given data sets of small size and limited range. Assuming such constraints, we describe a computational framework for the systematic fusion of observations from multiple sources to compute tail probabilities that could not be obtained otherwise due to a lack of lower or upper tail data. The estimation of multivariate lower and upper tail probabilities from a given small reference data set that lacks complete information about such tail data is addressed in terms of pertussis case count data. Fusion of data from multiple sources in conjunction with the density ratio model is used to give probability estimates that are non-obtainable from the empirical distribution. Based on a density ratio model with variable tilts, we first present a univariate fit and, subsequently, improve it with a multivariate extension. In the multivariate analysis, we selected the best model in terms of the Akaike Information Criterion (AIC). Regional prediction, in Washington state, of the number of pertussis cases is approached by providing joint probabilities using fused data from several relatively small samples following the selected density ratio model. The model is validated by a graphical goodness-of-fit plot comparing the estimated reference distribution obtained from the fused data with that of the empirical distribution obtained from the reference sample only. MDPI 2021-05-27 /pmc/articles/PMC8226468/ /pubmed/34072055 http://dx.doi.org/10.3390/e23060675 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Xuze
Pyne, Saumyadipta
Kedem, Benjamin
Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State
title Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State
title_full Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State
title_fullStr Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State
title_full_unstemmed Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State
title_short Multivariate Tail Probabilities: Predicting Regional Pertussis Cases in Washington State
title_sort multivariate tail probabilities: predicting regional pertussis cases in washington state
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8226468/
https://www.ncbi.nlm.nih.gov/pubmed/34072055
http://dx.doi.org/10.3390/e23060675
work_keys_str_mv AT zhangxuze multivariatetailprobabilitiespredictingregionalpertussiscasesinwashingtonstate
AT pynesaumyadipta multivariatetailprobabilitiespredictingregionalpertussiscasesinwashingtonstate
AT kedembenjamin multivariatetailprobabilitiespredictingregionalpertussiscasesinwashingtonstate