The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews

BACKGROUND: Researchers performing high-quality systematic reviews search across multiple databases to identify relevant evidence. However, the same publication is often retrieved from several databases. Identifying and removing such duplicates (“deduplication”) can be extremely time-consuming, but...

Bibliographic Details
Main authors: Hair, Kaitlyn; Bahor, Zsanett; Macleod, Malcolm; Liao, Jing; Sena, Emily S.
Format: Online Article Text
Language: English
Published: BioMed Central 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483700/
https://www.ncbi.nlm.nih.gov/pubmed/37674179
http://dx.doi.org/10.1186/s12915-023-01686-z
_version_ 1785102437348016128
author Hair, Kaitlyn
Bahor, Zsanett
Macleod, Malcolm
Liao, Jing
Sena, Emily S.
author_facet Hair, Kaitlyn
Bahor, Zsanett
Macleod, Malcolm
Liao, Jing
Sena, Emily S.
author_sort Hair, Kaitlyn
collection PubMed
description BACKGROUND: Researchers performing high-quality systematic reviews search across multiple databases to identify relevant evidence. However, the same publication is often retrieved from several databases. Identifying and removing such duplicates (“deduplication”) can be extremely time-consuming, but failure to remove these citations can lead to the wrongful inclusion of duplicate data. Many existing tools are not sensitive enough, lack interoperability with other tools, are not freely accessible, or are difficult to use without programming knowledge. Here, we report the performance of our Automated Systematic Search Deduplicator (ASySD), a novel tool to perform automated deduplication of systematic searches for biomedical reviews.
METHODS: We evaluated ASySD’s performance on 5 unseen biomedical systematic search datasets of various sizes (1845–79,880 citations). We compared the performance of ASySD with EndNote’s automated deduplication option and with the Systematic Review Assistant Deduplication Module (SRA-DM).
RESULTS: ASySD identified more duplicates than either SRA-DM or EndNote, with a sensitivity in different datasets of 0.95 to 0.99. The false-positive rate was comparable to human performance, with a specificity of > 0.99. The tool took less than 1 h to identify and remove duplicates within each dataset.
CONCLUSIONS: For duplicate removal in biomedical systematic reviews, ASySD is a highly sensitive, reliable, and time-saving tool. It is open source and freely available online as both an R package and a user-friendly web application.
SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-023-01686-z.
format Online
Article
Text
id pubmed-10483700
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-104837002023-09-08 The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews Hair, Kaitlyn Bahor, Zsanett Macleod, Malcolm Liao, Jing Sena, Emily S. BMC Biol Methodology Article BACKGROUND: Researchers performing high-quality systematic reviews search across multiple databases to identify relevant evidence. However, the same publication is often retrieved from several databases. Identifying and removing such duplicates (“deduplication”) can be extremely time-consuming, but failure to remove these citations can lead to the wrongful inclusion of duplicate data. Many existing tools are not sensitive enough, lack interoperability with other tools, are not freely accessible, or are difficult to use without programming knowledge. Here, we report the performance of our Automated Systematic Search Deduplicator (ASySD), a novel tool to perform automated deduplication of systematic searches for biomedical reviews. METHODS: We evaluated ASySD’s performance on 5 unseen biomedical systematic search datasets of various sizes (1845–79,880 citations). We compared the performance of ASySD with EndNote’s automated deduplication option and with the Systematic Review Assistant Deduplication Module (SRA-DM). RESULTS: ASySD identified more duplicates than either SRA-DM or EndNote, with a sensitivity in different datasets of 0.95 to 0.99. The false-positive rate was comparable to human performance, with a specificity of > 0.99. The tool took less than 1 h to identify and remove duplicates within each dataset. CONCLUSIONS: For duplicate removal in biomedical systematic reviews, ASySD is a highly sensitive, reliable, and time-saving tool. It is open source and freely available online as both an R package and a user-friendly web application. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-023-01686-z. 
BioMed Central 2023-09-07 /pmc/articles/PMC10483700/ /pubmed/37674179 http://dx.doi.org/10.1186/s12915-023-01686-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/ Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Hair, Kaitlyn
Bahor, Zsanett
Macleod, Malcolm
Liao, Jing
Sena, Emily S.
The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews
title The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews
title_full The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews
title_fullStr The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews
title_full_unstemmed The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews
title_short The Automated Systematic Search Deduplicator (ASySD): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews
title_sort automated systematic search deduplicator (asysd): a rapid, open-source, interoperable tool to remove duplicate citations in biomedical systematic reviews
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10483700/
https://www.ncbi.nlm.nih.gov/pubmed/37674179
http://dx.doi.org/10.1186/s12915-023-01686-z