Cargando…
Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance
PURPOSE: The purpose of this study is to compare two libraries dedicated to the Markov chain Monte Carlo method: pystan and numpyro. In the comparison, we mainly focused on the agreement of estimated latent parameters and the performance of sampling using the Markov chain Monte Carlo method in Bayes...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588711/ https://www.ncbi.nlm.nih.gov/pubmed/37869462 http://dx.doi.org/10.7717/peerj-cs.1620 |
_version_ | 1785123637792079872 |
---|---|
author | Nishio, Mizuho Ota, Eiji Matsuo, Hidetoshi Matsunaga, Takaaki Miyazaki, Aki Murakami, Takamichi |
author_facet | Nishio, Mizuho Ota, Eiji Matsuo, Hidetoshi Matsunaga, Takaaki Miyazaki, Aki Murakami, Takamichi |
author_sort | Nishio, Mizuho |
collection | PubMed |
description | PURPOSE: The purpose of this study is to compare two libraries dedicated to the Markov chain Monte Carlo method: pystan and numpyro. In the comparison, we mainly focused on the agreement of estimated latent parameters and the performance of sampling using the Markov chain Monte Carlo method in Bayesian item response theory (IRT). MATERIALS AND METHODS: Bayesian 1PL-IRT and 2PL-IRT were implemented with pystan and numpyro. Then, the Bayesian 1PL-IRT and 2PL-IRT were applied to two types of medical data obtained from a published article. The same prior distributions of latent parameters were used in both pystan and numpyro. Estimation results of latent parameters of 1PL-IRT and 2PL-IRT were compared between pystan and numpyro. Additionally, the computational cost of the Markov chain Monte Carlo method was compared between the two libraries. To evaluate the computational cost of IRT models, simulation data were generated from the medical data and numpyro. RESULTS: For all the combinations of IRT types (1PL-IRT or 2PL-IRT) and medical data types, the mean and standard deviation of the estimated latent parameters were in good agreement between pystan and numpyro. In most cases, the sampling time using the Markov chain Monte Carlo method was shorter in numpyro than that in pystan. When the large-sized simulation data were used, numpyro with a graphics processing unit was useful for reducing the sampling time. CONCLUSION: Numpyro and pystan were useful for applying the Bayesian 1PL-IRT and 2PL-IRT. Our results show that the two libraries yielded similar estimation result and that regarding to sampling time, the fastest libraries differed based on the dataset size. |
format | Online Article Text |
id | pubmed-10588711 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105887112023-10-21 Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance Nishio, Mizuho Ota, Eiji Matsuo, Hidetoshi Matsunaga, Takaaki Miyazaki, Aki Murakami, Takamichi PeerJ Comput Sci Bioinformatics PURPOSE: The purpose of this study is to compare two libraries dedicated to the Markov chain Monte Carlo method: pystan and numpyro. In the comparison, we mainly focused on the agreement of estimated latent parameters and the performance of sampling using the Markov chain Monte Carlo method in Bayesian item response theory (IRT). MATERIALS AND METHODS: Bayesian 1PL-IRT and 2PL-IRT were implemented with pystan and numpyro. Then, the Bayesian 1PL-IRT and 2PL-IRT were applied to two types of medical data obtained from a published article. The same prior distributions of latent parameters were used in both pystan and numpyro. Estimation results of latent parameters of 1PL-IRT and 2PL-IRT were compared between pystan and numpyro. Additionally, the computational cost of the Markov chain Monte Carlo method was compared between the two libraries. To evaluate the computational cost of IRT models, simulation data were generated from the medical data and numpyro. RESULTS: For all the combinations of IRT types (1PL-IRT or 2PL-IRT) and medical data types, the mean and standard deviation of the estimated latent parameters were in good agreement between pystan and numpyro. In most cases, the sampling time using the Markov chain Monte Carlo method was shorter in numpyro than that in pystan. When the large-sized simulation data were used, numpyro with a graphics processing unit was useful for reducing the sampling time. CONCLUSION: Numpyro and pystan were useful for applying the Bayesian 1PL-IRT and 2PL-IRT. Our results show that the two libraries yielded similar estimation result and that regarding to sampling time, the fastest libraries differed based on the dataset size. PeerJ Inc. 2023-10-05 /pmc/articles/PMC10588711/ /pubmed/37869462 http://dx.doi.org/10.7717/peerj-cs.1620 Text en ©2023 Nishio et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Nishio, Mizuho Ota, Eiji Matsuo, Hidetoshi Matsunaga, Takaaki Miyazaki, Aki Murakami, Takamichi Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance |
title | Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance |
title_full | Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance |
title_fullStr | Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance |
title_full_unstemmed | Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance |
title_short | Comparison between pystan and numpyro in Bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance |
title_sort | comparison between pystan and numpyro in bayesian item response theory: evaluation of agreement of estimated latent parameters and sampling performance |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588711/ https://www.ncbi.nlm.nih.gov/pubmed/37869462 http://dx.doi.org/10.7717/peerj-cs.1620 |
work_keys_str_mv | AT nishiomizuho comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance AT otaeiji comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance AT matsuohidetoshi comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance AT matsunagatakaaki comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance AT miyazakiaki comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance AT murakamitakamichi comparisonbetweenpystanandnumpyroinbayesianitemresponsetheoryevaluationofagreementofestimatedlatentparametersandsamplingperformance |