Cargando…

On the Impossibility of Learning the Missing Mass

This paper shows that one cannot learn the probability of rare events without imposing further structural assumptions. The event of interest is that of obtaining an outcome outside the coverage of an i.i.d. sample from a discrete distribution. The probability of this event is referred to as the “mis...

Descripción completa

Detalles Bibliográficos
Autores principales: Mossel, Elchanan, Ohannessian, Mesrob I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514132/
https://www.ncbi.nlm.nih.gov/pubmed/33266744
http://dx.doi.org/10.3390/e21010028
_version_ 1783586517472509952
author Mossel, Elchanan
Ohannessian, Mesrob I.
author_facet Mossel, Elchanan
Ohannessian, Mesrob I.
author_sort Mossel, Elchanan
collection PubMed
description This paper shows that one cannot learn the probability of rare events without imposing further structural assumptions. The event of interest is that of obtaining an outcome outside the coverage of an i.i.d. sample from a discrete distribution. The probability of this event is referred to as the “missing mass”. The impossibility result can then be stated as: the missing mass is not distribution-free learnable in relative error. The proof is semi-constructive and relies on a coupling argument using a dithered geometric distribution. Via a reduction, this impossibility also extends to both discrete and continuous tail estimation. These results formalize the folklore that in order to predict rare events without restrictive modeling, one necessarily needs distributions with “heavy tails”.
format Online
Article
Text
id pubmed-7514132
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75141322020-11-09 On the Impossibility of Learning the Missing Mass Mossel, Elchanan Ohannessian, Mesrob I. Entropy (Basel) Article This paper shows that one cannot learn the probability of rare events without imposing further structural assumptions. The event of interest is that of obtaining an outcome outside the coverage of an i.i.d. sample from a discrete distribution. The probability of this event is referred to as the “missing mass”. The impossibility result can then be stated as: the missing mass is not distribution-free learnable in relative error. The proof is semi-constructive and relies on a coupling argument using a dithered geometric distribution. Via a reduction, this impossibility also extends to both discrete and continuous tail estimation. These results formalize the folklore that in order to predict rare events without restrictive modeling, one necessarily needs distributions with “heavy tails”. MDPI 2019-01-02 /pmc/articles/PMC7514132/ /pubmed/33266744 http://dx.doi.org/10.3390/e21010028 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mossel, Elchanan
Ohannessian, Mesrob I.
On the Impossibility of Learning the Missing Mass
title On the Impossibility of Learning the Missing Mass
title_full On the Impossibility of Learning the Missing Mass
title_fullStr On the Impossibility of Learning the Missing Mass
title_full_unstemmed On the Impossibility of Learning the Missing Mass
title_short On the Impossibility of Learning the Missing Mass
title_sort on the impossibility of learning the missing mass
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514132/
https://www.ncbi.nlm.nih.gov/pubmed/33266744
http://dx.doi.org/10.3390/e21010028
work_keys_str_mv AT mosselelchanan ontheimpossibilityoflearningthemissingmass
AT ohannessianmesrobi ontheimpossibilityoflearningthemissingmass