Evaluation of the extraction of methodological study characteristics with JATSdecoder

This paper introduces and evaluates the study.character module from the JATSdecoder package which extracts several key methodological study characteristics from NISO-JATS coded scientific articles. study.character splits the text into sections and applies its heuristic-driven extraction procedures t...

Bibliographic Details
Main Author: Böschen, Ingmar
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9813005/
https://www.ncbi.nlm.nih.gov/pubmed/36599903
http://dx.doi.org/10.1038/s41598-022-27085-y
_version_ 1784863838994169856
author Böschen, Ingmar
collection PubMed
description This paper introduces and evaluates the study.character module from the JATSdecoder package, which extracts several key methodological study characteristics from NISO-JATS coded scientific articles. study.character splits the text into sections and applies its heuristic-driven extraction procedures to the text of the method and result sections. When used individually, study.character’s functions can also be applied to any textual input. An externally coded data set of 288 PDF articles serves as an indicator of study.character’s capabilities in extracting the number of sub-studies reported per article, the statistical methods applied, and the software solutions used. Its precision in extracting the reported α-level, power, correction procedures for multiple testing, use of interactions, definition of outliers, and mentions of statistical assumptions is evaluated by comparison with a manually curated data set of the same collection of articles. Sensitivity, specificity, and accuracy measures are reported for each of the evaluated functions. study.character reliably extracts the methodological study characteristics targeted here from psychological research articles. Most extractions have very low false positive rates and high accuracy ([Formula: see text]). Most non-detections are due to PDF-specific conversion errors and complex text structures that are not yet manageable. study.character can be applied to large text resources in order to examine methodological trends over time, by journal and/or by topic. It also enables a new way of identifying study sets for meta-analyses and systematic reviews.
format Online
Article
Text
id pubmed-9813005
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-9813005 2023-01-06 Evaluation of the extraction of methodological study characteristics with JATSdecoder Böschen, Ingmar Sci Rep Article
Nature Publishing Group UK 2023-01-04 /pmc/articles/PMC9813005/ /pubmed/36599903 http://dx.doi.org/10.1038/s41598-022-27085-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/ Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Böschen, Ingmar
Evaluation of the extraction of methodological study characteristics with JATSdecoder
title Evaluation of the extraction of methodological study characteristics with JATSdecoder
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9813005/
https://www.ncbi.nlm.nih.gov/pubmed/36599903
http://dx.doi.org/10.1038/s41598-022-27085-y
work_keys_str_mv AT boscheningmar evaluationoftheextractionofmethodologicalstudycharacteristicswithjatsdecoder