Cargando…
Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database
Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populate...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6836711/ https://www.ncbi.nlm.nih.gov/pubmed/31697361 http://dx.doi.org/10.1093/database/baz109 |
_version_ | 1783466957103693824 |
---|---|
author | Norman, Christopher R Gargon, Elizabeth Leeflang, Mariska M G Névéol, Aurélie Williamson, Paula R |
author_facet | Norman, Christopher R Gargon, Elizabeth Leeflang, Mariska M G Névéol, Aurélie Williamson, Paula R |
author_sort | Norman, Christopher R |
collection | PubMed |
description | Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. In order to reduce the workload and facilitate more timely updates we are evaluating machine learning methods to reduce the number of references needed to screen. In this study we have evaluated a machine learning approach based on logistic regression to automatically rank the candidate articles. Data from the original systematic review and its four first review updates were used to train the model and evaluate performance. We estimated that using automatic screening would yield a workload reduction of at least 75% while keeping the number of missed references around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update. |
format | Online Article Text |
id | pubmed-6836711 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-68367112019-11-13 Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database Norman, Christopher R Gargon, Elizabeth Leeflang, Mariska M G Névéol, Aurélie Williamson, Paula R Database (Oxford) Original Article Curated databases of scientific literature play an important role in helping researchers find relevant literature, but populating such databases is a labour intensive and time-consuming process. One such database is the freely accessible Comet Core Outcome Set database, which was originally populated using manual screening in an annually updated systematic review. In order to reduce the workload and facilitate more timely updates we are evaluating machine learning methods to reduce the number of references needed to screen. In this study we have evaluated a machine learning approach based on logistic regression to automatically rank the candidate articles. Data from the original systematic review and its four first review updates were used to train the model and evaluate performance. We estimated that using automatic screening would yield a workload reduction of at least 75% while keeping the number of missed references around 2%. We judged this to be an acceptable trade-off for this systematic review, and the method is now being used for the next round of the Comet database update. Oxford University Press 2019-11-07 /pmc/articles/PMC6836711/ /pubmed/31697361 http://dx.doi.org/10.1093/database/baz109 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Norman, Christopher R Gargon, Elizabeth Leeflang, Mariska M G Névéol, Aurélie Williamson, Paula R Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database |
title | Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database |
title_full | Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database |
title_fullStr | Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database |
title_full_unstemmed | Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database |
title_short | Evaluation of an automatic article selection method for timelier updates of the Comet Core Outcome Set database |
title_sort | evaluation of an automatic article selection method for timelier updates of the comet core outcome set database |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6836711/ https://www.ncbi.nlm.nih.gov/pubmed/31697361 http://dx.doi.org/10.1093/database/baz109 |
work_keys_str_mv | AT normanchristopherr evaluationofanautomaticarticleselectionmethodfortimelierupdatesofthecometcoreoutcomesetdatabase AT gargonelizabeth evaluationofanautomaticarticleselectionmethodfortimelierupdatesofthecometcoreoutcomesetdatabase AT leeflangmariskamg evaluationofanautomaticarticleselectionmethodfortimelierupdatesofthecometcoreoutcomesetdatabase AT neveolaurelie evaluationofanautomaticarticleselectionmethodfortimelierupdatesofthecometcoreoutcomesetdatabase AT williamsonpaular evaluationofanautomaticarticleselectionmethodfortimelierupdatesofthecometcoreoutcomesetdatabase |