Cargando…

Variable selection using a smooth information criterion for distributional regression models

Modern variable selection procedures make use of penalization methods to execute simultaneous model selection and estimation. A popular method is the least absolute shrinkage and selection operator, the use of which requires selecting the value of a tuning parameter. This parameter is typically tune...

Descripción completa

Detalles Bibliográficos
Autores principales: O’Neill, Meadhbh, Burke, Kevin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10121547/
https://www.ncbi.nlm.nih.gov/pubmed/37155560
http://dx.doi.org/10.1007/s11222-023-10204-8
_version_ 1785029394360696832
author O’Neill, Meadhbh
Burke, Kevin
author_facet O’Neill, Meadhbh
Burke, Kevin
author_sort O’Neill, Meadhbh
collection PubMed
description Modern variable selection procedures make use of penalization methods to execute simultaneous model selection and estimation. A popular method is the least absolute shrinkage and selection operator, the use of which requires selecting the value of a tuning parameter. This parameter is typically tuned by minimizing the cross-validation error or Bayesian information criterion, but this can be computationally intensive as it involves fitting an array of different models and selecting the best one. In contrast with this standard approach, we have developed a procedure based on the so-called “smooth IC” (SIC) in which the tuning parameter is automatically selected in one step. We also extend this model selection procedure to the distributional regression framework, which is more flexible than classical regression modelling. Distributional regression, also known as multiparameter regression, introduces flexibility by taking account of the effect of covariates through multiple distributional parameters simultaneously, e.g., mean and variance. These models are useful in the context of normal linear regression when the process under study exhibits heteroscedastic behaviour. Reformulating the distributional regression estimation problem in terms of penalized likelihood enables us to take advantage of the close relationship between model selection criteria and penalization. Utilizing the SIC is computationally advantageous, as it obviates the issue of having to choose multiple tuning parameters. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11222-023-10204-8.
format Online
Article
Text
id pubmed-10121547
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-101215472023-04-23 Variable selection using a smooth information criterion for distributional regression models O’Neill, Meadhbh Burke, Kevin Stat Comput Original Paper Modern variable selection procedures make use of penalization methods to execute simultaneous model selection and estimation. A popular method is the least absolute shrinkage and selection operator, the use of which requires selecting the value of a tuning parameter. This parameter is typically tuned by minimizing the cross-validation error or Bayesian information criterion, but this can be computationally intensive as it involves fitting an array of different models and selecting the best one. In contrast with this standard approach, we have developed a procedure based on the so-called “smooth IC” (SIC) in which the tuning parameter is automatically selected in one step. We also extend this model selection procedure to the distributional regression framework, which is more flexible than classical regression modelling. Distributional regression, also known as multiparameter regression, introduces flexibility by taking account of the effect of covariates through multiple distributional parameters simultaneously, e.g., mean and variance. These models are useful in the context of normal linear regression when the process under study exhibits heteroscedastic behaviour. Reformulating the distributional regression estimation problem in terms of penalized likelihood enables us to take advantage of the close relationship between model selection criteria and penalization. Utilizing the SIC is computationally advantageous, as it obviates the issue of having to choose multiple tuning parameters. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11222-023-10204-8. Springer US 2023-04-21 2023 /pmc/articles/PMC10121547/ /pubmed/37155560 http://dx.doi.org/10.1007/s11222-023-10204-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Paper
O’Neill, Meadhbh
Burke, Kevin
Variable selection using a smooth information criterion for distributional regression models
title Variable selection using a smooth information criterion for distributional regression models
title_full Variable selection using a smooth information criterion for distributional regression models
title_fullStr Variable selection using a smooth information criterion for distributional regression models
title_full_unstemmed Variable selection using a smooth information criterion for distributional regression models
title_short Variable selection using a smooth information criterion for distributional regression models
title_sort variable selection using a smooth information criterion for distributional regression models
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10121547/
https://www.ncbi.nlm.nih.gov/pubmed/37155560
http://dx.doi.org/10.1007/s11222-023-10204-8
work_keys_str_mv AT oneillmeadhbh variableselectionusingasmoothinformationcriterionfordistributionalregressionmodels
AT burkekevin variableselectionusingasmoothinformationcriterionfordistributionalregressionmodels