Cargando…

Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval

Lexical gap in cQA search, resulted by the variability of languages, has been recognized as an important and widespread phenomenon. To address the problem, this paper presents a question reformulation scheme to enhance the question retrieval model by fully exploring the intelligence of paraphrase in...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yu, Zhang, Wei-Nan, Lu, Ke, Ji, Rongrong, Wang, Fanglin, Liu, Ting
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3689745/
https://www.ncbi.nlm.nih.gov/pubmed/23805178
http://dx.doi.org/10.1371/journal.pone.0064601
_version_ 1782274302339448832
author Zhang, Yu
Zhang, Wei-Nan
Lu, Ke
Ji, Rongrong
Wang, Fanglin
Liu, Ting
author_facet Zhang, Yu
Zhang, Wei-Nan
Lu, Ke
Ji, Rongrong
Wang, Fanglin
Liu, Ting
author_sort Zhang, Yu
collection PubMed
description Lexical gap in cQA search, resulted by the variability of languages, has been recognized as an important and widespread phenomenon. To address the problem, this paper presents a question reformulation scheme to enhance the question retrieval model by fully exploring the intelligence of paraphrase in phrase-level. It compensates for the existing paraphrasing research in a suitable granularity, which either falls into fine-grained lexical-level or coarse-grained sentence-level. Given a question in natural language, our scheme first detects the involved key-phrases by jointly integrating the corpus-dependent knowledge and question-aware cues. Next, it automatically extracts the paraphrases for each identified key-phrase utilizing multiple online translation engines, and then selects the most relevant reformulations from a large group of question rewrites, which is formed by full permutation and combination of the generated paraphrases. Extensive evaluations on a real world data set demonstrate that our model is able to characterize the complex questions and achieves promising performance as compared to the state-of-the-art methods.
format Online
Article
Text
id pubmed-3689745
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36897452013-06-26 Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval Zhang, Yu Zhang, Wei-Nan Lu, Ke Ji, Rongrong Wang, Fanglin Liu, Ting PLoS One Research Article Lexical gap in cQA search, resulted by the variability of languages, has been recognized as an important and widespread phenomenon. To address the problem, this paper presents a question reformulation scheme to enhance the question retrieval model by fully exploring the intelligence of paraphrase in phrase-level. It compensates for the existing paraphrasing research in a suitable granularity, which either falls into fine-grained lexical-level or coarse-grained sentence-level. Given a question in natural language, our scheme first detects the involved key-phrases by jointly integrating the corpus-dependent knowledge and question-aware cues. Next, it automatically extracts the paraphrases for each identified key-phrase utilizing multiple online translation engines, and then selects the most relevant reformulations from a large group of question rewrites, which is formed by full permutation and combination of the generated paraphrases. Extensive evaluations on a real world data set demonstrate that our model is able to characterize the complex questions and achieves promising performance as compared to the state-of-the-art methods. Public Library of Science 2013-06-21 /pmc/articles/PMC3689745/ /pubmed/23805178 http://dx.doi.org/10.1371/journal.pone.0064601 Text en © 2013 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhang, Yu
Zhang, Wei-Nan
Lu, Ke
Ji, Rongrong
Wang, Fanglin
Liu, Ting
Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval
title Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval
title_full Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval
title_fullStr Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval
title_full_unstemmed Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval
title_short Phrasal Paraphrase Based Question Reformulation for Archived Question Retrieval
title_sort phrasal paraphrase based question reformulation for archived question retrieval
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3689745/
https://www.ncbi.nlm.nih.gov/pubmed/23805178
http://dx.doi.org/10.1371/journal.pone.0064601
work_keys_str_mv AT zhangyu phrasalparaphrasebasedquestionreformulationforarchivedquestionretrieval
AT zhangweinan phrasalparaphrasebasedquestionreformulationforarchivedquestionretrieval
AT luke phrasalparaphrasebasedquestionreformulationforarchivedquestionretrieval
AT jirongrong phrasalparaphrasebasedquestionreformulationforarchivedquestionretrieval
AT wangfanglin phrasalparaphrasebasedquestionreformulationforarchivedquestionretrieval
AT liuting phrasalparaphrasebasedquestionreformulationforarchivedquestionretrieval