Cargando…

An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature

The COVID-19 pandemic has resulted in a tremendous need for access to the latest scientific information, leading to both corpora for COVID-19 literature and search engines to query such data. While most search engine research is performed in academia with rigorous evaluation, major commercial compan...

Descripción completa

Detalles Bibliográficos
Autores principales: Soni, Sarvesh, Roberts, Kirk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7717324/
https://www.ncbi.nlm.nih.gov/pubmed/33197268
http://dx.doi.org/10.1093/jamia/ocaa271
_version_ 1783619304567078912
author Soni, Sarvesh
Roberts, Kirk
author_facet Soni, Sarvesh
Roberts, Kirk
author_sort Soni, Sarvesh
collection PubMed
description The COVID-19 pandemic has resulted in a tremendous need for access to the latest scientific information, leading to both corpora for COVID-19 literature and search engines to query such data. While most search engine research is performed in academia with rigorous evaluation, major commercial companies dominate the web search market. Thus, it is expected that commercial pandemic-specific search engines will gain much higher traction than academic alternatives, leading to questions about the empirical performance of these tools. This paper seeks to empirically evaluate two commercial search engines for COVID-19 (Google and Amazon) in comparison with academic prototypes evaluated in the TREC-COVID task. We performed several steps to reduce bias in the manual judgments to ensure a fair comparison of all systems. We find the commercial search engines sizably underperformed those evaluated under TREC-COVID. This has implications for trust in popular health search engines and developing biomedical search engines for future health crises.
format Online
Article
Text
id pubmed-7717324
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77173242020-12-09 An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature Soni, Sarvesh Roberts, Kirk J Am Med Inform Assoc Case Report The COVID-19 pandemic has resulted in a tremendous need for access to the latest scientific information, leading to both corpora for COVID-19 literature and search engines to query such data. While most search engine research is performed in academia with rigorous evaluation, major commercial companies dominate the web search market. Thus, it is expected that commercial pandemic-specific search engines will gain much higher traction than academic alternatives, leading to questions about the empirical performance of these tools. This paper seeks to empirically evaluate two commercial search engines for COVID-19 (Google and Amazon) in comparison with academic prototypes evaluated in the TREC-COVID task. We performed several steps to reduce bias in the manual judgments to ensure a fair comparison of all systems. We find the commercial search engines sizably underperformed those evaluated under TREC-COVID. This has implications for trust in popular health search engines and developing biomedical search engines for future health crises. Oxford University Press 2020-11-17 /pmc/articles/PMC7717324/ /pubmed/33197268 http://dx.doi.org/10.1093/jamia/ocaa271 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)
spellingShingle Case Report
Soni, Sarvesh
Roberts, Kirk
An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature
title An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature
title_full An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature
title_fullStr An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature
title_full_unstemmed An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature
title_short An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature
title_sort evaluation of two commercial deep learning-based information retrieval systems for covid-19 literature
topic Case Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7717324/
https://www.ncbi.nlm.nih.gov/pubmed/33197268
http://dx.doi.org/10.1093/jamia/ocaa271
work_keys_str_mv AT sonisarvesh anevaluationoftwocommercialdeeplearningbasedinformationretrievalsystemsforcovid19literature
AT robertskirk anevaluationoftwocommercialdeeplearningbasedinformationretrievalsystemsforcovid19literature
AT sonisarvesh evaluationoftwocommercialdeeplearningbasedinformationretrievalsystemsforcovid19literature
AT robertskirk evaluationoftwocommercialdeeplearningbasedinformationretrievalsystemsforcovid19literature