Cargando…
A Latent Model for Ad Hoc Table Retrieval
The ad hoc table retrieval task is concerned with satisfying a query with a ranked list of tables. While there are strong baselines in the literature that exploit learning to rank and semantic matching techniques, there are still a set of hard queries that are difficult for these baseline methods to...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148042/ http://dx.doi.org/10.1007/978-3-030-45442-5_11 |
_version_ | 1783520518317539328 |
---|---|
author | Bagheri, Ebrahim Al-Obeidat, Feras |
author_facet | Bagheri, Ebrahim Al-Obeidat, Feras |
author_sort | Bagheri, Ebrahim |
collection | PubMed |
description | The ad hoc table retrieval task is concerned with satisfying a query with a ranked list of tables. While there are strong baselines in the literature that exploit learning to rank and semantic matching techniques, there are still a set of hard queries that are difficult for these baseline methods to address. We find that such hard queries are those whose constituting tokens (i.e., terms or entities) are not fully or partially observed in the relevant tables. We focus on proposing a latent factor model to address such hard queries. Our proposed model factorizes the token-table co-occurrence matrix into two low dimensional latent factor matrices that can be used for measuring table and query similarity even if no shared tokens exist between them. We find that the variation of our proposed model that considers keywords provides statistically significant improvement over three strong baselines in terms of NDCG and ERR. |
format | Online Article Text |
id | pubmed-7148042 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-71480422020-04-13 A Latent Model for Ad Hoc Table Retrieval Bagheri, Ebrahim Al-Obeidat, Feras Advances in Information Retrieval Article The ad hoc table retrieval task is concerned with satisfying a query with a ranked list of tables. While there are strong baselines in the literature that exploit learning to rank and semantic matching techniques, there are still a set of hard queries that are difficult for these baseline methods to address. We find that such hard queries are those whose constituting tokens (i.e., terms or entities) are not fully or partially observed in the relevant tables. We focus on proposing a latent factor model to address such hard queries. Our proposed model factorizes the token-table co-occurrence matrix into two low dimensional latent factor matrices that can be used for measuring table and query similarity even if no shared tokens exist between them. We find that the variation of our proposed model that considers keywords provides statistically significant improvement over three strong baselines in terms of NDCG and ERR. 2020-03-24 /pmc/articles/PMC7148042/ http://dx.doi.org/10.1007/978-3-030-45442-5_11 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Bagheri, Ebrahim Al-Obeidat, Feras A Latent Model for Ad Hoc Table Retrieval |
title | A Latent Model for Ad Hoc Table Retrieval |
title_full | A Latent Model for Ad Hoc Table Retrieval |
title_fullStr | A Latent Model for Ad Hoc Table Retrieval |
title_full_unstemmed | A Latent Model for Ad Hoc Table Retrieval |
title_short | A Latent Model for Ad Hoc Table Retrieval |
title_sort | latent model for ad hoc table retrieval |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148042/ http://dx.doi.org/10.1007/978-3-030-45442-5_11 |
work_keys_str_mv | AT bagheriebrahim alatentmodelforadhoctableretrieval AT alobeidatferas alatentmodelforadhoctableretrieval AT bagheriebrahim latentmodelforadhoctableretrieval AT alobeidatferas latentmodelforadhoctableretrieval |