Cargando…

Towards automated check-worthy sentence detection using Gated Recurrent Unit

People are exposed to a lot of information daily, which is a mix of facts, opinions, and false claims. The rate at which information is created and spread has necessitated an automated fact-checking mechanism. In this work, we focus on the first step of the fact-checking system, which is to identify...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jha, Ria, Motwani, Ena, Singhal, Nivedita, Kaushal, Rishabh
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer London 2023
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9916500/ https://www.ncbi.nlm.nih.gov/pubmed/36816595 http://dx.doi.org/10.1007/s00521-023-08300-x

_version_	1784886141175988224
author	Jha, Ria Motwani, Ena Singhal, Nivedita Kaushal, Rishabh
author_facet	Jha, Ria Motwani, Ena Singhal, Nivedita Kaushal, Rishabh
author_sort	Jha, Ria
collection	PubMed
description	People are exposed to a lot of information daily, which is a mix of facts, opinions, and false claims. The rate at which information is created and spread has necessitated an automated fact-checking mechanism. In this work, we focus on the first step of the fact-checking system, which is to identify whether a given sentence is factual. We propose a glove embedding-based gated recurrent unit pipeline for check-worthy sentence detection, referred to as G2CW framework. It detects whether a given sentence has check-worthy content in it or not; furthermore, if it has check-worthy content, whether it is important or not, from a fact-checking perspective. We evaluate our proposed framework on two datasets: a standard ClaimBuster dataset commonly used by the research community for this problem and a self-curated IndianClaim dataset. Our G2CW framework outperforms prior work with 0.92 as F1-score. Furthermore, our G2CW framework, when trained on the ClaimBuster dataset, performs the best on the IndianClaims dataset.
format	Online Article Text
id	pubmed-9916500
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Springer London
record_format	MEDLINE/PubMed
spelling	pubmed-99165002023-02-13 Towards automated check-worthy sentence detection using Gated Recurrent Unit Jha, Ria Motwani, Ena Singhal, Nivedita Kaushal, Rishabh Neural Comput Appl Original Article People are exposed to a lot of information daily, which is a mix of facts, opinions, and false claims. The rate at which information is created and spread has necessitated an automated fact-checking mechanism. In this work, we focus on the first step of the fact-checking system, which is to identify whether a given sentence is factual. We propose a glove embedding-based gated recurrent unit pipeline for check-worthy sentence detection, referred to as G2CW framework. It detects whether a given sentence has check-worthy content in it or not; furthermore, if it has check-worthy content, whether it is important or not, from a fact-checking perspective. We evaluate our proposed framework on two datasets: a standard ClaimBuster dataset commonly used by the research community for this problem and a self-curated IndianClaim dataset. Our G2CW framework outperforms prior work with 0.92 as F1-score. Furthermore, our G2CW framework, when trained on the ClaimBuster dataset, performs the best on the IndianClaims dataset. Springer London 2023-02-10 2023 /pmc/articles/PMC9916500/ /pubmed/36816595 http://dx.doi.org/10.1007/s00521-023-08300-x Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Original Article Jha, Ria Motwani, Ena Singhal, Nivedita Kaushal, Rishabh Towards automated check-worthy sentence detection using Gated Recurrent Unit
title	Towards automated check-worthy sentence detection using Gated Recurrent Unit
title_full	Towards automated check-worthy sentence detection using Gated Recurrent Unit
title_fullStr	Towards automated check-worthy sentence detection using Gated Recurrent Unit
title_full_unstemmed	Towards automated check-worthy sentence detection using Gated Recurrent Unit
title_short	Towards automated check-worthy sentence detection using Gated Recurrent Unit
title_sort	towards automated check-worthy sentence detection using gated recurrent unit
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9916500/ https://www.ncbi.nlm.nih.gov/pubmed/36816595 http://dx.doi.org/10.1007/s00521-023-08300-x
work_keys_str_mv	AT jharia towardsautomatedcheckworthysentencedetectionusinggatedrecurrentunit AT motwaniena towardsautomatedcheckworthysentencedetectionusinggatedrecurrentunit AT singhalnivedita towardsautomatedcheckworthysentencedetectionusinggatedrecurrentunit AT kaushalrishabh towardsautomatedcheckworthysentencedetectionusinggatedrecurrentunit

Towards automated check-worthy sentence detection using Gated Recurrent Unit

Ejemplares similares