Cargando…

On the Use of Entropy to Improve Model Selection Criteria

The most widely used forms of model selection criteria, the Bayesian Information Criterion (BIC) and the Akaike Information Criterion (AIC), are expressed in terms of synthetic indicators of the residual distribution: the variance and the mean-squared error of the residuals respectively. In many app...

Descripción completa

Detalles Bibliográficos
Autores principales: Murari, Andrea, Peluso, Emmanuele, Cianfrani, Francesco, Gaudio, Pasquale, Lungaroni, Michele
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514877/
https://www.ncbi.nlm.nih.gov/pubmed/33267107
http://dx.doi.org/10.3390/e21040394
_version_ 1783586689833238528
author Murari, Andrea
Peluso, Emmanuele
Cianfrani, Francesco
Gaudio, Pasquale
Lungaroni, Michele
author_facet Murari, Andrea
Peluso, Emmanuele
Cianfrani, Francesco
Gaudio, Pasquale
Lungaroni, Michele
author_sort Murari, Andrea
collection PubMed
description The most widely used forms of model selection criteria, the Bayesian Information Criterion (BIC) and the Akaike Information Criterion (AIC), are expressed in terms of synthetic indicators of the residual distribution: the variance and the mean-squared error of the residuals respectively. In many applications in science, the noise affecting the data can be expected to have a Gaussian distribution. Therefore, at the same level of variance and mean-squared error, models, whose residuals are more uniformly distributed, should be favoured. The degree of uniformity of the residuals can be quantified by the Shannon entropy. Including the Shannon entropy in the BIC and AIC expressions improves significantly these criteria. The better performances have been demonstrated empirically with a series of simulations for various classes of functions and for different levels and statistics of the noise. In presence of outliers, a better treatment of the errors, using the Geodesic Distance, has proved essential.
format Online
Article
Text
id pubmed-7514877
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75148772020-11-09 On the Use of Entropy to Improve Model Selection Criteria Murari, Andrea Peluso, Emmanuele Cianfrani, Francesco Gaudio, Pasquale Lungaroni, Michele Entropy (Basel) Article The most widely used forms of model selection criteria, the Bayesian Information Criterion (BIC) and the Akaike Information Criterion (AIC), are expressed in terms of synthetic indicators of the residual distribution: the variance and the mean-squared error of the residuals respectively. In many applications in science, the noise affecting the data can be expected to have a Gaussian distribution. Therefore, at the same level of variance and mean-squared error, models, whose residuals are more uniformly distributed, should be favoured. The degree of uniformity of the residuals can be quantified by the Shannon entropy. Including the Shannon entropy in the BIC and AIC expressions improves significantly these criteria. The better performances have been demonstrated empirically with a series of simulations for various classes of functions and for different levels and statistics of the noise. In presence of outliers, a better treatment of the errors, using the Geodesic Distance, has proved essential. MDPI 2019-04-12 /pmc/articles/PMC7514877/ /pubmed/33267107 http://dx.doi.org/10.3390/e21040394 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Murari, Andrea
Peluso, Emmanuele
Cianfrani, Francesco
Gaudio, Pasquale
Lungaroni, Michele
On the Use of Entropy to Improve Model Selection Criteria
title On the Use of Entropy to Improve Model Selection Criteria
title_full On the Use of Entropy to Improve Model Selection Criteria
title_fullStr On the Use of Entropy to Improve Model Selection Criteria
title_full_unstemmed On the Use of Entropy to Improve Model Selection Criteria
title_short On the Use of Entropy to Improve Model Selection Criteria
title_sort on the use of entropy to improve model selection criteria
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514877/
https://www.ncbi.nlm.nih.gov/pubmed/33267107
http://dx.doi.org/10.3390/e21040394
work_keys_str_mv AT murariandrea ontheuseofentropytoimprovemodelselectioncriteria
AT pelusoemmanuele ontheuseofentropytoimprovemodelselectioncriteria
AT cianfranifrancesco ontheuseofentropytoimprovemodelselectioncriteria
AT gaudiopasquale ontheuseofentropytoimprovemodelselectioncriteria
AT lungaronimichele ontheuseofentropytoimprovemodelselectioncriteria