Cargando…

Refining Automatically Extracted Knowledge Bases Using Crowdsourcing

Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Chunhua, Zhao, Pengpeng, Sheng, Victor S., Xian, Xuefeng, Wu, Jian, Cui, Zhiming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5446892/
https://www.ncbi.nlm.nih.gov/pubmed/28588611
http://dx.doi.org/10.1155/2017/4092135
_version_ 1783239191641980928
author Li, Chunhua
Zhao, Pengpeng
Sheng, Victor S.
Xian, Xuefeng
Wu, Jian
Cui, Zhiming
author_facet Li, Chunhua
Zhao, Pengpeng
Sheng, Victor S.
Xian, Xuefeng
Wu, Jian
Cui, Zhiming
author_sort Li, Chunhua
collection PubMed
description Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing to improve the quality of automatically extracted knowledge bases. As human labelling is costly, an important research challenge is how we can use limited human resources to maximize the quality improvement for a knowledge base. To address this problem, we first introduce a concept of semantic constraints that can be used to detect potential errors and do inference among candidate facts. Then, based on semantic constraints, we propose rank-based and graph-based algorithms for crowdsourced knowledge refining, which judiciously select the most beneficial candidate facts to conduct crowdsourcing and prune unnecessary questions. Our experiments show that our method improves the quality of knowledge bases significantly and outperforms state-of-the-art automatic methods under a reasonable crowdsourcing cost.
format Online
Article
Text
id pubmed-5446892
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-54468922017-06-06 Refining Automatically Extracted Knowledge Bases Using Crowdsourcing Li, Chunhua Zhao, Pengpeng Sheng, Victor S. Xian, Xuefeng Wu, Jian Cui, Zhiming Comput Intell Neurosci Research Article Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing to improve the quality of automatically extracted knowledge bases. As human labelling is costly, an important research challenge is how we can use limited human resources to maximize the quality improvement for a knowledge base. To address this problem, we first introduce a concept of semantic constraints that can be used to detect potential errors and do inference among candidate facts. Then, based on semantic constraints, we propose rank-based and graph-based algorithms for crowdsourced knowledge refining, which judiciously select the most beneficial candidate facts to conduct crowdsourcing and prune unnecessary questions. Our experiments show that our method improves the quality of knowledge bases significantly and outperforms state-of-the-art automatic methods under a reasonable crowdsourcing cost. Hindawi 2017 2017-05-14 /pmc/articles/PMC5446892/ /pubmed/28588611 http://dx.doi.org/10.1155/2017/4092135 Text en Copyright © 2017 Chunhua Li et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Li, Chunhua
Zhao, Pengpeng
Sheng, Victor S.
Xian, Xuefeng
Wu, Jian
Cui, Zhiming
Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_full Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_fullStr Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_full_unstemmed Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_short Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_sort refining automatically extracted knowledge bases using crowdsourcing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5446892/
https://www.ncbi.nlm.nih.gov/pubmed/28588611
http://dx.doi.org/10.1155/2017/4092135
work_keys_str_mv AT lichunhua refiningautomaticallyextractedknowledgebasesusingcrowdsourcing
AT zhaopengpeng refiningautomaticallyextractedknowledgebasesusingcrowdsourcing
AT shengvictors refiningautomaticallyextractedknowledgebasesusingcrowdsourcing
AT xianxuefeng refiningautomaticallyextractedknowledgebasesusingcrowdsourcing
AT wujian refiningautomaticallyextractedknowledgebasesusingcrowdsourcing
AT cuizhiming refiningautomaticallyextractedknowledgebasesusingcrowdsourcing