Refining Automatically Extracted Knowledge Bases Using Crowdsourcing

Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing...

Bibliographic Details
Main Authors:	Li, Chunhua, Zhao, Pengpeng, Sheng, Victor S., Xian, Xuefeng, Wu, Jian, Cui, Zhiming
Format:	Online Article Text
Language:	English
Published:	Hindawi 2017
Subjects:	Research Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5446892/ https://www.ncbi.nlm.nih.gov/pubmed/28588611 http://dx.doi.org/10.1155/2017/4092135

_version_	1783239191641980928
author	Li, Chunhua Zhao, Pengpeng Sheng, Victor S. Xian, Xuefeng Wu, Jian Cui, Zhiming
author_facet	Li, Chunhua Zhao, Pengpeng Sheng, Victor S. Xian, Xuefeng Wu, Jian Cui, Zhiming
author_sort	Li, Chunhua
collection	PubMed
description	Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing to improve the quality of automatically extracted knowledge bases. As human labelling is costly, an important research challenge is how we can use limited human resources to maximize the quality improvement for a knowledge base. To address this problem, we first introduce a concept of semantic constraints that can be used to detect potential errors and do inference among candidate facts. Then, based on semantic constraints, we propose rank-based and graph-based algorithms for crowdsourced knowledge refining, which judiciously select the most beneficial candidate facts to conduct crowdsourcing and prune unnecessary questions. Our experiments show that our method improves the quality of knowledge bases significantly and outperforms state-of-the-art automatic methods under a reasonable crowdsourcing cost.
format	Online Article Text
id	pubmed-5446892
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-54468922017-06-06 Refining Automatically Extracted Knowledge Bases Using Crowdsourcing Li, Chunhua Zhao, Pengpeng Sheng, Victor S. Xian, Xuefeng Wu, Jian Cui, Zhiming Comput Intell Neurosci Research Article Machine-constructed knowledge bases often contain noisy and inaccurate facts. There exists significant work in developing automated algorithms for knowledge base refinement. Automated approaches improve the quality of knowledge bases but are far from perfect. In this paper, we leverage crowdsourcing to improve the quality of automatically extracted knowledge bases. As human labelling is costly, an important research challenge is how we can use limited human resources to maximize the quality improvement for a knowledge base. To address this problem, we first introduce a concept of semantic constraints that can be used to detect potential errors and do inference among candidate facts. Then, based on semantic constraints, we propose rank-based and graph-based algorithms for crowdsourced knowledge refining, which judiciously select the most beneficial candidate facts to conduct crowdsourcing and prune unnecessary questions. Our experiments show that our method improves the quality of knowledge bases significantly and outperforms state-of-the-art automatic methods under a reasonable crowdsourcing cost. Hindawi 2017 2017-05-14 /pmc/articles/PMC5446892/ /pubmed/28588611 http://dx.doi.org/10.1155/2017/4092135 Text en Copyright © 2017 Chunhua Li et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Li, Chunhua Zhao, Pengpeng Sheng, Victor S. Xian, Xuefeng Wu, Jian Cui, Zhiming Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title	Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_full	Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_fullStr	Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_full_unstemmed	Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_short	Refining Automatically Extracted Knowledge Bases Using Crowdsourcing
title_sort	refining automatically extracted knowledge bases using crowdsourcing
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5446892/ https://www.ncbi.nlm.nih.gov/pubmed/28588611 http://dx.doi.org/10.1155/2017/4092135
work_keys_str_mv	AT lichunhua refiningautomaticallyextractedknowledgebasesusingcrowdsourcing AT zhaopengpeng refiningautomaticallyextractedknowledgebasesusingcrowdsourcing AT shengvictors refiningautomaticallyextractedknowledgebasesusingcrowdsourcing AT xianxuefeng refiningautomaticallyextractedknowledgebasesusingcrowdsourcing AT wujian refiningautomaticallyextractedknowledgebasesusingcrowdsourcing AT cuizhiming refiningautomaticallyextractedknowledgebasesusingcrowdsourcing
