Cargando…
Enhancing access to the Bibliome: the TREC 2004 Genomics Track
BACKGROUND: The goal of the TREC Genomics Track is to improve information retrieval in the area of genomics by creating test collections that will allow researchers to improve and better understand failures of their systems. The 2004 track included an ad hoc retrieval task, simulating use of a searc...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1440302/ https://www.ncbi.nlm.nih.gov/pubmed/16722581 http://dx.doi.org/10.1186/1747-5333-1-3 |
_version_ | 1782127315920093184 |
---|---|
author | Hersh, William R Bhupatiraju, Ravi Teja Ross, Laura Roberts, Phoebe Cohen, Aaron M Kraemer, Dale F |
author_facet | Hersh, William R Bhupatiraju, Ravi Teja Ross, Laura Roberts, Phoebe Cohen, Aaron M Kraemer, Dale F |
author_sort | Hersh, William R |
collection | PubMed |
description | BACKGROUND: The goal of the TREC Genomics Track is to improve information retrieval in the area of genomics by creating test collections that will allow researchers to improve and better understand failures of their systems. The 2004 track included an ad hoc retrieval task, simulating use of a search engine to obtain documents about biomedical topics. This paper describes the Genomics Track of the Text Retrieval Conference (TREC) 2004, a forum for evaluation of IR research systems, where retrieval in the genomics domain has recently begun to be assessed. RESULTS: A total of 27 research groups submitted 47 different runs. The most effective runs, as measured by the primary evaluation measure of mean average precision (MAP), used a combination of domain-specific and general techniques. The best MAP obtained by any run was 0.4075. Techniques that expanded queries with gene name lists as well as words from related articles had the best efficacy. However, many runs performed more poorly than a simple baseline run, indicating that careful selection of system features is essential. CONCLUSION: Various approaches to ad hoc retrieval provide a diversity of efficacy. The TREC Genomics Track and its test collection resources provide tools that allow improvement in information retrieval systems. |
format | Text |
id | pubmed-1440302 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-14403022006-04-19 Enhancing access to the Bibliome: the TREC 2004 Genomics Track Hersh, William R Bhupatiraju, Ravi Teja Ross, Laura Roberts, Phoebe Cohen, Aaron M Kraemer, Dale F J Biomed Discov Collab Research BACKGROUND: The goal of the TREC Genomics Track is to improve information retrieval in the area of genomics by creating test collections that will allow researchers to improve and better understand failures of their systems. The 2004 track included an ad hoc retrieval task, simulating use of a search engine to obtain documents about biomedical topics. This paper describes the Genomics Track of the Text Retrieval Conference (TREC) 2004, a forum for evaluation of IR research systems, where retrieval in the genomics domain has recently begun to be assessed. RESULTS: A total of 27 research groups submitted 47 different runs. The most effective runs, as measured by the primary evaluation measure of mean average precision (MAP), used a combination of domain-specific and general techniques. The best MAP obtained by any run was 0.4075. Techniques that expanded queries with gene name lists as well as words from related articles had the best efficacy. However, many runs performed more poorly than a simple baseline run, indicating that careful selection of system features is essential. CONCLUSION: Various approaches to ad hoc retrieval provide a diversity of efficacy. The TREC Genomics Track and its test collection resources provide tools that allow improvement in information retrieval systems. BioMed Central 2006-03-13 /pmc/articles/PMC1440302/ /pubmed/16722581 http://dx.doi.org/10.1186/1747-5333-1-3 Text en Copyright © 2006 Hersh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Hersh, William R Bhupatiraju, Ravi Teja Ross, Laura Roberts, Phoebe Cohen, Aaron M Kraemer, Dale F Enhancing access to the Bibliome: the TREC 2004 Genomics Track |
title | Enhancing access to the Bibliome: the TREC 2004 Genomics Track |
title_full | Enhancing access to the Bibliome: the TREC 2004 Genomics Track |
title_fullStr | Enhancing access to the Bibliome: the TREC 2004 Genomics Track |
title_full_unstemmed | Enhancing access to the Bibliome: the TREC 2004 Genomics Track |
title_short | Enhancing access to the Bibliome: the TREC 2004 Genomics Track |
title_sort | enhancing access to the bibliome: the trec 2004 genomics track |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1440302/ https://www.ncbi.nlm.nih.gov/pubmed/16722581 http://dx.doi.org/10.1186/1747-5333-1-3 |
work_keys_str_mv | AT hershwilliamr enhancingaccesstothebibliomethetrec2004genomicstrack AT bhupatirajuraviteja enhancingaccesstothebibliomethetrec2004genomicstrack AT rosslaura enhancingaccesstothebibliomethetrec2004genomicstrack AT robertsphoebe enhancingaccesstothebibliomethetrec2004genomicstrack AT cohenaaronm enhancingaccesstothebibliomethetrec2004genomicstrack AT kraemerdalef enhancingaccesstothebibliomethetrec2004genomicstrack |