Cargando…

Lossless Transformations and Excess Risk Bounds in Statistical Inference

We study the excess minimum risk in statistical inference, defined as the difference between the minimum expected loss when estimating a random variable from an observed feature vector and the minimum expected loss when estimating the same random variable from a transformation (statistic) of the fea...

Descripción completa

Detalles Bibliográficos
Autores principales: Györfi, László, Linder, Tamás, Walk, Harro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10606681/
https://www.ncbi.nlm.nih.gov/pubmed/37895515
http://dx.doi.org/10.3390/e25101394
_version_ 1785127374103248896
author Györfi, László
Linder, Tamás
Walk, Harro
author_facet Györfi, László
Linder, Tamás
Walk, Harro
author_sort Györfi, László
collection PubMed
description We study the excess minimum risk in statistical inference, defined as the difference between the minimum expected loss when estimating a random variable from an observed feature vector and the minimum expected loss when estimating the same random variable from a transformation (statistic) of the feature vector. After characterizing lossless transformations, i.e., transformations for which the excess risk is zero for all loss functions, we construct a partitioning test statistic for the hypothesis that a given transformation is lossless, and we show that for i.i.d. data the test is strongly consistent. More generally, we develop information-theoretic upper bounds on the excess risk that uniformly hold over fairly general classes of loss functions. Based on these bounds, we introduce the notion of a [Formula: see text]-lossless transformation and give sufficient conditions for a given transformation to be universally [Formula: see text]-lossless. Applications to classification, nonparametric regression, portfolio strategies, information bottlenecks, and deep learning are also surveyed.
format Online
Article
Text
id pubmed-10606681
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106066812023-10-28 Lossless Transformations and Excess Risk Bounds in Statistical Inference Györfi, László Linder, Tamás Walk, Harro Entropy (Basel) Article We study the excess minimum risk in statistical inference, defined as the difference between the minimum expected loss when estimating a random variable from an observed feature vector and the minimum expected loss when estimating the same random variable from a transformation (statistic) of the feature vector. After characterizing lossless transformations, i.e., transformations for which the excess risk is zero for all loss functions, we construct a partitioning test statistic for the hypothesis that a given transformation is lossless, and we show that for i.i.d. data the test is strongly consistent. More generally, we develop information-theoretic upper bounds on the excess risk that uniformly hold over fairly general classes of loss functions. Based on these bounds, we introduce the notion of a [Formula: see text]-lossless transformation and give sufficient conditions for a given transformation to be universally [Formula: see text]-lossless. Applications to classification, nonparametric regression, portfolio strategies, information bottlenecks, and deep learning are also surveyed. MDPI 2023-09-28 /pmc/articles/PMC10606681/ /pubmed/37895515 http://dx.doi.org/10.3390/e25101394 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Györfi, László
Linder, Tamás
Walk, Harro
Lossless Transformations and Excess Risk Bounds in Statistical Inference
title Lossless Transformations and Excess Risk Bounds in Statistical Inference
title_full Lossless Transformations and Excess Risk Bounds in Statistical Inference
title_fullStr Lossless Transformations and Excess Risk Bounds in Statistical Inference
title_full_unstemmed Lossless Transformations and Excess Risk Bounds in Statistical Inference
title_short Lossless Transformations and Excess Risk Bounds in Statistical Inference
title_sort lossless transformations and excess risk bounds in statistical inference
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10606681/
https://www.ncbi.nlm.nih.gov/pubmed/37895515
http://dx.doi.org/10.3390/e25101394
work_keys_str_mv AT gyorfilaszlo losslesstransformationsandexcessriskboundsinstatisticalinference
AT lindertamas losslesstransformationsandexcessriskboundsinstatisticalinference
AT walkharro losslesstransformationsandexcessriskboundsinstatisticalinference