Cargando…
On the Impossibility of Learning the Missing Mass
This paper shows that one cannot learn the probability of rare events without imposing further structural assumptions. The event of interest is that of obtaining an outcome outside the coverage of an i.i.d. sample from a discrete distribution. The probability of this event is referred to as the “mis...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514132/ https://www.ncbi.nlm.nih.gov/pubmed/33266744 http://dx.doi.org/10.3390/e21010028 |
_version_ | 1783586517472509952 |
---|---|
author | Mossel, Elchanan Ohannessian, Mesrob I. |
author_facet | Mossel, Elchanan Ohannessian, Mesrob I. |
author_sort | Mossel, Elchanan |
collection | PubMed |
description | This paper shows that one cannot learn the probability of rare events without imposing further structural assumptions. The event of interest is that of obtaining an outcome outside the coverage of an i.i.d. sample from a discrete distribution. The probability of this event is referred to as the “missing mass”. The impossibility result can then be stated as: the missing mass is not distribution-free learnable in relative error. The proof is semi-constructive and relies on a coupling argument using a dithered geometric distribution. Via a reduction, this impossibility also extends to both discrete and continuous tail estimation. These results formalize the folklore that in order to predict rare events without restrictive modeling, one necessarily needs distributions with “heavy tails”. |
format | Online Article Text |
id | pubmed-7514132 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75141322020-11-09 On the Impossibility of Learning the Missing Mass Mossel, Elchanan Ohannessian, Mesrob I. Entropy (Basel) Article This paper shows that one cannot learn the probability of rare events without imposing further structural assumptions. The event of interest is that of obtaining an outcome outside the coverage of an i.i.d. sample from a discrete distribution. The probability of this event is referred to as the “missing mass”. The impossibility result can then be stated as: the missing mass is not distribution-free learnable in relative error. The proof is semi-constructive and relies on a coupling argument using a dithered geometric distribution. Via a reduction, this impossibility also extends to both discrete and continuous tail estimation. These results formalize the folklore that in order to predict rare events without restrictive modeling, one necessarily needs distributions with “heavy tails”. MDPI 2019-01-02 /pmc/articles/PMC7514132/ /pubmed/33266744 http://dx.doi.org/10.3390/e21010028 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Mossel, Elchanan Ohannessian, Mesrob I. On the Impossibility of Learning the Missing Mass |
title | On the Impossibility of Learning the Missing Mass |
title_full | On the Impossibility of Learning the Missing Mass |
title_fullStr | On the Impossibility of Learning the Missing Mass |
title_full_unstemmed | On the Impossibility of Learning the Missing Mass |
title_short | On the Impossibility of Learning the Missing Mass |
title_sort | on the impossibility of learning the missing mass |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514132/ https://www.ncbi.nlm.nih.gov/pubmed/33266744 http://dx.doi.org/10.3390/e21010028 |
work_keys_str_mv | AT mosselelchanan ontheimpossibilityoflearningthemissingmass AT ohannessianmesrobi ontheimpossibilityoflearningthemissingmass |