Cargando…

Universal coding and order identification by model selection methods

The purpose of these notes is to highlight the far-reaching connections between Information Theory and Statistics. Universal coding and adaptive compression are indeed closely related to statistical inference concerning processes and using maximum likelihood or Bayesian methods. The book is divided...

Descripción completa

Detalles Bibliográficos
Autor principal: Gassiat, Élisabeth
Lenguaje:eng
Publicado: Springer 2018
Materias:
Acceso en línea:https://dx.doi.org/10.1007/978-3-319-96262-7
http://cds.cern.ch/record/2633927
_version_ 1780959663701884928
author Gassiat, Élisabeth
author_facet Gassiat, Élisabeth
author_sort Gassiat, Élisabeth
collection CERN
description The purpose of these notes is to highlight the far-reaching connections between Information Theory and Statistics. Universal coding and adaptive compression are indeed closely related to statistical inference concerning processes and using maximum likelihood or Bayesian methods. The book is divided into four chapters, the first of which introduces readers to lossless coding, provides an intrinsic lower bound on the codeword length in terms of Shannon’s entropy, and presents some coding methods that can achieve this lower bound, provided the source distribution is known. In turn, Chapter 2 addresses universal coding on finite alphabets, and seeks to find coding procedures that can achieve the optimal compression rate, regardless of the source distribution. It also quantifies the speed of convergence of the compression rate to the source entropy rate. These powerful results do not extend to infinite alphabets. In Chapter 3, it is shown that there are no universal codes over the class of stationary ergodic sources over a countable alphabet. This negative result prompts at least two different approaches: the introduction of smaller sub-classes of sources known as envelope classes, over which adaptive coding may be feasible, and the redefinition of the performance criterion by focusing on compressing the message pattern. Finally, Chapter 4 deals with the question of order identification in statistics. This question belongs to the class of model selection problems and arises in various practical situations in which the goal is to identify an integer characterizing the model: the length of dependency for a Markov chain, number of hidden states for a hidden Markov chain, and number of populations for a population mixture. The coding ideas and techniques developed in previous chapters allow us to obtain new results in this area. This book is accessible to anyone with a graduate level in Mathematics, and will appeal to information theoreticians and mathematical statisticians alike. Except for Chapter 4, all proofs are detailed and all tools needed to understand the text are reviewed.
id cern-2633927
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2018
publisher Springer
record_format invenio
spelling cern-26339272021-04-21T18:44:56Zdoi:10.1007/978-3-319-96262-7http://cds.cern.ch/record/2633927engGassiat, ÉlisabethUniversal coding and order identification by model selection methodsMathematical Physics and MathematicsThe purpose of these notes is to highlight the far-reaching connections between Information Theory and Statistics. Universal coding and adaptive compression are indeed closely related to statistical inference concerning processes and using maximum likelihood or Bayesian methods. The book is divided into four chapters, the first of which introduces readers to lossless coding, provides an intrinsic lower bound on the codeword length in terms of Shannon’s entropy, and presents some coding methods that can achieve this lower bound, provided the source distribution is known. In turn, Chapter 2 addresses universal coding on finite alphabets, and seeks to find coding procedures that can achieve the optimal compression rate, regardless of the source distribution. It also quantifies the speed of convergence of the compression rate to the source entropy rate. These powerful results do not extend to infinite alphabets. In Chapter 3, it is shown that there are no universal codes over the class of stationary ergodic sources over a countable alphabet. This negative result prompts at least two different approaches: the introduction of smaller sub-classes of sources known as envelope classes, over which adaptive coding may be feasible, and the redefinition of the performance criterion by focusing on compressing the message pattern. Finally, Chapter 4 deals with the question of order identification in statistics. This question belongs to the class of model selection problems and arises in various practical situations in which the goal is to identify an integer characterizing the model: the length of dependency for a Markov chain, number of hidden states for a hidden Markov chain, and number of populations for a population mixture. The coding ideas and techniques developed in previous chapters allow us to obtain new results in this area. This book is accessible to anyone with a graduate level in Mathematics, and will appeal to information theoreticians and mathematical statisticians alike. Except for Chapter 4, all proofs are detailed and all tools needed to understand the text are reviewed.Springeroai:cds.cern.ch:26339272018
spellingShingle Mathematical Physics and Mathematics
Gassiat, Élisabeth
Universal coding and order identification by model selection methods
title Universal coding and order identification by model selection methods
title_full Universal coding and order identification by model selection methods
title_fullStr Universal coding and order identification by model selection methods
title_full_unstemmed Universal coding and order identification by model selection methods
title_short Universal coding and order identification by model selection methods
title_sort universal coding and order identification by model selection methods
topic Mathematical Physics and Mathematics
url https://dx.doi.org/10.1007/978-3-319-96262-7
http://cds.cern.ch/record/2633927
work_keys_str_mv AT gassiatelisabeth universalcodingandorderidentificationbymodelselectionmethods