Cargando…
A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge
We present a hypothetical argument against finite-state processes in statistical language modeling that is based on semantics rather than syntax. In this theoretical model, we suppose that the semantic properties of texts in a natural language could be approximately captured by a recently introduced...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8465033/ https://www.ncbi.nlm.nih.gov/pubmed/34573773 http://dx.doi.org/10.3390/e23091148 |
_version_ | 1784572767813763072 |
---|---|
author | Dębowski, Łukasz |
author_facet | Dębowski, Łukasz |
author_sort | Dębowski, Łukasz |
collection | PubMed |
description | We present a hypothetical argument against finite-state processes in statistical language modeling that is based on semantics rather than syntax. In this theoretical model, we suppose that the semantic properties of texts in a natural language could be approximately captured by a recently introduced concept of a perigraphic process. Perigraphic processes are a class of stochastic processes that satisfy a Zipf-law accumulation of a subset of factual knowledge, which is time-independent, compressed, and effectively inferrable from the process. We show that the classes of finite-state processes and of perigraphic processes are disjoint, and we present a new simple example of perigraphic processes over a finite alphabet called Oracle processes. The disjointness result makes use of the Hilberg condition, i.e., the almost sure power-law growth of algorithmic mutual information. Using a strongly consistent estimator of the number of hidden states, we show that finite-state processes do not satisfy the Hilberg condition whereas Oracle processes satisfy the Hilberg condition via the data-processing inequality. We discuss the relevance of these mathematical results for theoretical and computational linguistics. |
format | Online Article Text |
id | pubmed-8465033 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84650332021-09-27 A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge Dębowski, Łukasz Entropy (Basel) Article We present a hypothetical argument against finite-state processes in statistical language modeling that is based on semantics rather than syntax. In this theoretical model, we suppose that the semantic properties of texts in a natural language could be approximately captured by a recently introduced concept of a perigraphic process. Perigraphic processes are a class of stochastic processes that satisfy a Zipf-law accumulation of a subset of factual knowledge, which is time-independent, compressed, and effectively inferrable from the process. We show that the classes of finite-state processes and of perigraphic processes are disjoint, and we present a new simple example of perigraphic processes over a finite alphabet called Oracle processes. The disjointness result makes use of the Hilberg condition, i.e., the almost sure power-law growth of algorithmic mutual information. Using a strongly consistent estimator of the number of hidden states, we show that finite-state processes do not satisfy the Hilberg condition whereas Oracle processes satisfy the Hilberg condition via the data-processing inequality. We discuss the relevance of these mathematical results for theoretical and computational linguistics. MDPI 2021-09-01 /pmc/articles/PMC8465033/ /pubmed/34573773 http://dx.doi.org/10.3390/e23091148 Text en © 2021 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Dębowski, Łukasz A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge |
title | A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge |
title_full | A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge |
title_fullStr | A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge |
title_full_unstemmed | A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge |
title_short | A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge |
title_sort | refutation of finite-state language models through zipf’s law for factual knowledge |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8465033/ https://www.ncbi.nlm.nih.gov/pubmed/34573773 http://dx.doi.org/10.3390/e23091148 |
work_keys_str_mv | AT debowskiłukasz arefutationoffinitestatelanguagemodelsthroughzipfslawforfactualknowledge AT debowskiłukasz refutationoffinitestatelanguagemodelsthroughzipfslawforfactualknowledge |