A Standardized Project Gutenberg Corpus for Statistical Analysis of Natural Language and Quantitative Linguistics

The use of Project Gutenberg (PG) as a text corpus has been extremely popular in statistical analysis of language for more than 25 years. However, in contrast to other major linguistic datasets of similar importance, no consensual full version of PG exists to date. In fact, most PG studies so far ei...

Bibliographic Details
Main authors: Gerlach, Martin; Font-Clos, Francesc
Format: Online Article Text
Language: English
Published: MDPI 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516435/
https://www.ncbi.nlm.nih.gov/pubmed/33285901
http://dx.doi.org/10.3390/e22010126

Similar items