Cargando…

Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance

PURPOSE: The purpose of this study is to compare two libraries dedicated to the Markov chain Monte Carlo method: pystan and numpyro. In the comparison, we mainly focused on the agreement of estimated latent parameters and the performance of sampling using the Markov chain Monte Carlo method in Bayes...

Descripción completa

Detalles Bibliográficos
Autores principales: Nishio, Mizuho, Ota, Eiji, Matsuo, Hidetoshi, Matsunaga, Takaaki, Miyazaki, Aki, Murakami, Takamichi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588711/
https://www.ncbi.nlm.nih.gov/pubmed/37869462
http://dx.doi.org/10.7717/peerj-cs.1620
_version_ 1785123637792079872
author Nishio, Mizuho
Ota, Eiji
Matsuo, Hidetoshi
Matsunaga, Takaaki
Miyazaki, Aki
Murakami, Takamichi
author_facet Nishio, Mizuho
Ota, Eiji
Matsuo, Hidetoshi
Matsunaga, Takaaki
Miyazaki, Aki
Murakami, Takamichi
author_sort Nishio, Mizuho
collection PubMed
description PURPOSE: The purpose of this study is to compare two libraries dedicated to the Markov chain Monte Carlo method: pystan and numpyro. In the comparison, we mainly focused on the agreement of estimated latent parameters and the performance of sampling using the Markov chain Monte Carlo method in Bayesian item response theory (IRT). MATERIALS AND METHODS: Bayesian 1PL-IRT and 2PL-IRT were implemented with pystan and numpyro. Then, the Bayesian 1PL-IRT and 2PL-IRT were applied to two types of medical data obtained from a published article. The same prior distributions of latent parameters were used in both pystan and numpyro. Estimation results of latent parameters of 1PL-IRT and 2PL-IRT were compared between pystan and numpyro. Additionally, the computational cost of the Markov chain Monte Carlo method was compared between the two libraries. To evaluate the computational cost of IRT models, simulation data were generated from the medical data and numpyro. RESULTS: For all the combinations of IRT types (1PL-IRT or 2PL-IRT) and medical data types, the mean and standard deviation of the estimated latent parameters were in good agreement between pystan and numpyro. In most cases, the sampling time using the Markov chain Monte Carlo method was shorter in numpyro than that in pystan. When the large-sized simulation data were used, numpyro with a graphics processing unit was useful for reducing the sampling time. CONCLUSION: Numpyro and pystan were useful for applying the Bayesian 1PL-IRT and 2PL-IRT. Our results show that the two libraries yielded similar estimation result and that regarding to sampling time, the fastest libraries differed based on the dataset size.
format Online
Article
Text
id pubmed-10588711
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-105887112023-10-21 Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance Nishio, Mizuho Ota, Eiji Matsuo, Hidetoshi Matsunaga, Takaaki Miyazaki, Aki Murakami, Takamichi PeerJ Comput Sci Bioinformatics PURPOSE: The purpose of this study is to compare two libraries dedicated to the Markov chain Monte Carlo method: pystan and numpyro. In the comparison, we mainly focused on the agreement of estimated latent parameters and the performance of sampling using the Markov chain Monte Carlo method in Bayesian item response theory (IRT). MATERIALS AND METHODS: Bayesian 1PL-IRT and 2PL-IRT were implemented with pystan and numpyro. Then, the Bayesian 1PL-IRT and 2PL-IRT were applied to two types of medical data obtained from a published article. The same prior distributions of latent parameters were used in both pystan and numpyro. Estimation results of latent parameters of 1PL-IRT and 2PL-IRT were compared between pystan and numpyro. Additionally, the computational cost of the Markov chain Monte Carlo method was compared between the two libraries. To evaluate the computational cost of IRT models, simulation data were generated from the medical data and numpyro. RESULTS: For all the combinations of IRT types (1PL-IRT or 2PL-IRT) and medical data types, the mean and standard deviation of the estimated latent parameters were in good agreement between pystan and numpyro. In most cases, the sampling time using the Markov chain Monte Carlo method was shorter in numpyro than that in pystan. When the large-sized simulation data were used, numpyro with a graphics processing unit was useful for reducing the sampling time. CONCLUSION: Numpyro and pystan were useful for applying the Bayesian 1PL-IRT and 2PL-IRT. Our results show that the two libraries yielded similar estimation result and that regarding to sampling time, the fastest libraries differed based on the dataset size. PeerJ Inc. 2023-10-05 /pmc/articles/PMC10588711/ /pubmed/37869462 http://dx.doi.org/10.7717/peerj-cs.1620 Text en ©2023 Nishio et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Nishio, Mizuho
Ota, Eiji
Matsuo, Hidetoshi
Matsunaga, Takaaki
Miyazaki, Aki
Murakami, Takamichi
Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance
title Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance
title_full Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance
title_fullStr Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance
title_full_unstemmed Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance
title_short Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance
title_sort comparison between pystan and numpyro in bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588711/
https://www.ncbi.nlm.nih.gov/pubmed/37869462
http://dx.doi.org/10.7717/peerj-cs.1620
work_keys_str_mv AT nishiomizuho comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance
AT otaeiji comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance
AT matsuohidetoshi comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance
AT matsunagatakaaki comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance
AT miyazakiaki comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance
AT murakamitakamichi comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance