Cargando…

A non-parametric significance test to compare corpora

Classical null hypothesis significance tests are not appropriate in corpus linguistics, because the randomness assumption underlying these testing procedures is not fulfilled. Nevertheless, there are numerous scenarios where it would be beneficial to have some kind of test in order to judge the rele...

Descripción completa

Detalles Bibliográficos
Autor principal: Koplenig, Alexander
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752893/
https://www.ncbi.nlm.nih.gov/pubmed/31536558
http://dx.doi.org/10.1371/journal.pone.0222703
_version_ 1783452806852640768
author Koplenig, Alexander
author_facet Koplenig, Alexander
author_sort Koplenig, Alexander
collection PubMed
description Classical null hypothesis significance tests are not appropriate in corpus linguistics, because the randomness assumption underlying these testing procedures is not fulfilled. Nevertheless, there are numerous scenarios where it would be beneficial to have some kind of test in order to judge the relevance of a result (e.g. a difference between two corpora) by answering the question whether the attribute of interest is pronounced enough to warrant the conclusion that it is substantial and not due to chance. In this paper, I outline such a test.
format Online
Article
Text
id pubmed-6752893
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-67528932019-09-27 A non-parametric significance test to compare corpora Koplenig, Alexander PLoS One Research Article Classical null hypothesis significance tests are not appropriate in corpus linguistics, because the randomness assumption underlying these testing procedures is not fulfilled. Nevertheless, there are numerous scenarios where it would be beneficial to have some kind of test in order to judge the relevance of a result (e.g. a difference between two corpora) by answering the question whether the attribute of interest is pronounced enough to warrant the conclusion that it is substantial and not due to chance. In this paper, I outline such a test. Public Library of Science 2019-09-19 /pmc/articles/PMC6752893/ /pubmed/31536558 http://dx.doi.org/10.1371/journal.pone.0222703 Text en © 2019 Alexander Koplenig http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Koplenig, Alexander
A non-parametric significance test to compare corpora
title A non-parametric significance test to compare corpora
title_full A non-parametric significance test to compare corpora
title_fullStr A non-parametric significance test to compare corpora
title_full_unstemmed A non-parametric significance test to compare corpora
title_short A non-parametric significance test to compare corpora
title_sort non-parametric significance test to compare corpora
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752893/
https://www.ncbi.nlm.nih.gov/pubmed/31536558
http://dx.doi.org/10.1371/journal.pone.0222703
work_keys_str_mv AT koplenigalexander anonparametricsignificancetesttocomparecorpora
AT koplenigalexander nonparametricsignificancetesttocomparecorpora