Cargando…
On the Use of Entropy to Improve Model Selection Criteria
The most widely used forms of model selection criteria, the Bayesian Information Criterion (BIC) and the Akaike Information Criterion (AIC), are expressed in terms of synthetic indicators of the residual distribution: the variance and the mean-squared error of the residuals respectively. In many app...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514877/ https://www.ncbi.nlm.nih.gov/pubmed/33267107 http://dx.doi.org/10.3390/e21040394 |
_version_ | 1783586689833238528 |
---|---|
author | Murari, Andrea Peluso, Emmanuele Cianfrani, Francesco Gaudio, Pasquale Lungaroni, Michele |
author_facet | Murari, Andrea Peluso, Emmanuele Cianfrani, Francesco Gaudio, Pasquale Lungaroni, Michele |
author_sort | Murari, Andrea |
collection | PubMed |
description | The most widely used forms of model selection criteria, the Bayesian Information Criterion (BIC) and the Akaike Information Criterion (AIC), are expressed in terms of synthetic indicators of the residual distribution: the variance and the mean-squared error of the residuals respectively. In many applications in science, the noise affecting the data can be expected to have a Gaussian distribution. Therefore, at the same level of variance and mean-squared error, models, whose residuals are more uniformly distributed, should be favoured. The degree of uniformity of the residuals can be quantified by the Shannon entropy. Including the Shannon entropy in the BIC and AIC expressions improves significantly these criteria. The better performances have been demonstrated empirically with a series of simulations for various classes of functions and for different levels and statistics of the noise. In presence of outliers, a better treatment of the errors, using the Geodesic Distance, has proved essential. |
format | Online Article Text |
id | pubmed-7514877 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75148772020-11-09 On the Use of Entropy to Improve Model Selection Criteria Murari, Andrea Peluso, Emmanuele Cianfrani, Francesco Gaudio, Pasquale Lungaroni, Michele Entropy (Basel) Article The most widely used forms of model selection criteria, the Bayesian Information Criterion (BIC) and the Akaike Information Criterion (AIC), are expressed in terms of synthetic indicators of the residual distribution: the variance and the mean-squared error of the residuals respectively. In many applications in science, the noise affecting the data can be expected to have a Gaussian distribution. Therefore, at the same level of variance and mean-squared error, models, whose residuals are more uniformly distributed, should be favoured. The degree of uniformity of the residuals can be quantified by the Shannon entropy. Including the Shannon entropy in the BIC and AIC expressions improves significantly these criteria. The better performances have been demonstrated empirically with a series of simulations for various classes of functions and for different levels and statistics of the noise. In presence of outliers, a better treatment of the errors, using the Geodesic Distance, has proved essential. MDPI 2019-04-12 /pmc/articles/PMC7514877/ /pubmed/33267107 http://dx.doi.org/10.3390/e21040394 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Murari, Andrea Peluso, Emmanuele Cianfrani, Francesco Gaudio, Pasquale Lungaroni, Michele On the Use of Entropy to Improve Model Selection Criteria |
title | On the Use of Entropy to Improve Model Selection Criteria |
title_full | On the Use of Entropy to Improve Model Selection Criteria |
title_fullStr | On the Use of Entropy to Improve Model Selection Criteria |
title_full_unstemmed | On the Use of Entropy to Improve Model Selection Criteria |
title_short | On the Use of Entropy to Improve Model Selection Criteria |
title_sort | on the use of entropy to improve model selection criteria |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514877/ https://www.ncbi.nlm.nih.gov/pubmed/33267107 http://dx.doi.org/10.3390/e21040394 |
work_keys_str_mv | AT murariandrea ontheuseofentropytoimprovemodelselectioncriteria AT pelusoemmanuele ontheuseofentropytoimprovemodelselectioncriteria AT cianfranifrancesco ontheuseofentropytoimprovemodelselectioncriteria AT gaudiopasquale ontheuseofentropytoimprovemodelselectioncriteria AT lungaronimichele ontheuseofentropytoimprovemodelselectioncriteria |