Cargando…

A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge

We present a hypothetical argument against finite-state processes in statistical language modeling that is based on semantics rather than syntax. In this theoretical model, we suppose that the semantic properties of texts in a natural language could be approximately captured by a recently introduced...

Descripción completa

Detalles Bibliográficos
Autor principal: Dębowski, Łukasz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8465033/
https://www.ncbi.nlm.nih.gov/pubmed/34573773
http://dx.doi.org/10.3390/e23091148
_version_ 1784572767813763072
author Dębowski, Łukasz
author_facet Dębowski, Łukasz
author_sort Dębowski, Łukasz
collection PubMed
description We present a hypothetical argument against finite-state processes in statistical language modeling that is based on semantics rather than syntax. In this theoretical model, we suppose that the semantic properties of texts in a natural language could be approximately captured by a recently introduced concept of a perigraphic process. Perigraphic processes are a class of stochastic processes that satisfy a Zipf-law accumulation of a subset of factual knowledge, which is time-independent, compressed, and effectively inferrable from the process. We show that the classes of finite-state processes and of perigraphic processes are disjoint, and we present a new simple example of perigraphic processes over a finite alphabet called Oracle processes. The disjointness result makes use of the Hilberg condition, i.e., the almost sure power-law growth of algorithmic mutual information. Using a strongly consistent estimator of the number of hidden states, we show that finite-state processes do not satisfy the Hilberg condition whereas Oracle processes satisfy the Hilberg condition via the data-processing inequality. We discuss the relevance of these mathematical results for theoretical and computational linguistics.
format Online
Article
Text
id pubmed-8465033
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84650332021-09-27 A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge Dębowski, Łukasz Entropy (Basel) Article We present a hypothetical argument against finite-state processes in statistical language modeling that is based on semantics rather than syntax. In this theoretical model, we suppose that the semantic properties of texts in a natural language could be approximately captured by a recently introduced concept of a perigraphic process. Perigraphic processes are a class of stochastic processes that satisfy a Zipf-law accumulation of a subset of factual knowledge, which is time-independent, compressed, and effectively inferrable from the process. We show that the classes of finite-state processes and of perigraphic processes are disjoint, and we present a new simple example of perigraphic processes over a finite alphabet called Oracle processes. The disjointness result makes use of the Hilberg condition, i.e., the almost sure power-law growth of algorithmic mutual information. Using a strongly consistent estimator of the number of hidden states, we show that finite-state processes do not satisfy the Hilberg condition whereas Oracle processes satisfy the Hilberg condition via the data-processing inequality. We discuss the relevance of these mathematical results for theoretical and computational linguistics. MDPI 2021-09-01 /pmc/articles/PMC8465033/ /pubmed/34573773 http://dx.doi.org/10.3390/e23091148 Text en © 2021 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Dębowski, Łukasz
A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge
title A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge
title_full A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge
title_fullStr A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge
title_full_unstemmed A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge
title_short A Refutation of Finite-State Language Models through Zipf’s Law for Factual Knowledge
title_sort refutation of finite-state language models through zipf’s law for factual knowledge
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8465033/
https://www.ncbi.nlm.nih.gov/pubmed/34573773
http://dx.doi.org/10.3390/e23091148
work_keys_str_mv AT debowskiłukasz arefutationoffinitestatelanguagemodelsthroughzipfslawforfactualknowledge
AT debowskiłukasz refutationoffinitestatelanguagemodelsthroughzipfslawforfactualknowledge