Cargando…
Evaluation of JATSdecoder as an automated text extraction tool for statistical results in scientific reports
The extraction of statistical results in scientific reports is beneficial for checking studies on plausibility and reliability. The R package JATSdecoder supports the application of text mining approaches to scientific reports. Its function get.stats() extracts all reported statistical results from...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484375/ https://www.ncbi.nlm.nih.gov/pubmed/34593888 http://dx.doi.org/10.1038/s41598-021-98782-3 |
_version_ | 1784577306714439680 |
---|---|
author | Böschen, Ingmar |
author_facet | Böschen, Ingmar |
author_sort | Böschen, Ingmar |
collection | PubMed |
description | The extraction of statistical results in scientific reports is beneficial for checking studies on plausibility and reliability. The R package JATSdecoder supports the application of text mining approaches to scientific reports. Its function get.stats() extracts all reported statistical results from text and recomputes p values for most standard test results. The output can be reduced to results with checkable or computable p values only. In this article, get.stats()’s ability to extract, recompute and check statistical results is compared to that of statcheck, which is an already established tool. A manually coded data set, containing the number of statistically significant results in 49 articles, serves as an initial indicator for get.stats()’s and statcheck’s differing detection rates for statistical results. Further 13,531 PDF files by 10 mayor psychological journals, 18,744 XML documents by Frontiers of Psychology and 23,730 articles related to psychological research and published by PLoS One are scanned for statistical results with both algorithms. get.stats() almost replicates the manually extracted number of significant results in 49 PDF articles. get.stats() outperforms the statcheck functions in identifying statistical results in every included journal and input format. Furthermore, the raw results extracted by get.stats() increase statcheck’s detection rate. JATSdecoder’s function get.stats() is a highly general and reliable tool to extract statistical results from text. It copes with a wide range of textual representations of statistical standard results and recomputes p values for two- and one-sided tests. It facilitates manual and automated checks on consistency and completeness of the reported results within a manuscript. |
format | Online Article Text |
id | pubmed-8484375 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-84843752021-10-04 Evaluation of JATSdecoder as an automated text extraction tool for statistical results in scientific reports Böschen, Ingmar Sci Rep Article The extraction of statistical results in scientific reports is beneficial for checking studies on plausibility and reliability. The R package JATSdecoder supports the application of text mining approaches to scientific reports. Its function get.stats() extracts all reported statistical results from text and recomputes p values for most standard test results. The output can be reduced to results with checkable or computable p values only. In this article, get.stats()’s ability to extract, recompute and check statistical results is compared to that of statcheck, which is an already established tool. A manually coded data set, containing the number of statistically significant results in 49 articles, serves as an initial indicator for get.stats()’s and statcheck’s differing detection rates for statistical results. Further 13,531 PDF files by 10 mayor psychological journals, 18,744 XML documents by Frontiers of Psychology and 23,730 articles related to psychological research and published by PLoS One are scanned for statistical results with both algorithms. get.stats() almost replicates the manually extracted number of significant results in 49 PDF articles. get.stats() outperforms the statcheck functions in identifying statistical results in every included journal and input format. Furthermore, the raw results extracted by get.stats() increase statcheck’s detection rate. JATSdecoder’s function get.stats() is a highly general and reliable tool to extract statistical results from text. It copes with a wide range of textual representations of statistical standard results and recomputes p values for two- and one-sided tests. It facilitates manual and automated checks on consistency and completeness of the reported results within a manuscript. Nature Publishing Group UK 2021-09-30 /pmc/articles/PMC8484375/ /pubmed/34593888 http://dx.doi.org/10.1038/s41598-021-98782-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Böschen, Ingmar Evaluation of JATSdecoder as an automated text extraction tool for statistical results in scientific reports |
title | Evaluation of JATSdecoder as an automated text extraction tool for statistical results in scientific reports |
title_full | Evaluation of JATSdecoder as an automated text extraction tool for statistical results in scientific reports |
title_fullStr | Evaluation of JATSdecoder as an automated text extraction tool for statistical results in scientific reports |
title_full_unstemmed | Evaluation of JATSdecoder as an automated text extraction tool for statistical results in scientific reports |
title_short | Evaluation of JATSdecoder as an automated text extraction tool for statistical results in scientific reports |
title_sort | evaluation of jatsdecoder as an automated text extraction tool for statistical results in scientific reports |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8484375/ https://www.ncbi.nlm.nih.gov/pubmed/34593888 http://dx.doi.org/10.1038/s41598-021-98782-3 |
work_keys_str_mv | AT boscheningmar evaluationofjatsdecoderasanautomatedtextextractiontoolforstatisticalresultsinscientificreports |