Cargando…
Food safety news events classification via a hierarchical transformer model
In light of the significance of regulatory authorities and the rising demand for information disclosure, a vast amount of information on food safety news reports is readily accessible on the Internet. The extraction of such information for precise classification and provision of appropriate safety a...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10345358/ https://www.ncbi.nlm.nih.gov/pubmed/37456013 http://dx.doi.org/10.1016/j.heliyon.2023.e17806 |
_version_ | 1785073070706262016 |
---|---|
author | Xiong, Shufeng Tian, Wenjie Batra, Vishwash Fan, Xiaobo Xi, Lei Liu, Hebing Liu, Liangliang |
author_facet | Xiong, Shufeng Tian, Wenjie Batra, Vishwash Fan, Xiaobo Xi, Lei Liu, Hebing Liu, Liangliang |
author_sort | Xiong, Shufeng |
collection | PubMed |
description | In light of the significance of regulatory authorities and the rising demand for information disclosure, a vast amount of information on food safety news reports is readily accessible on the Internet. The extraction of such information for precise classification and provision of appropriate safety alerts based on their respective categories has emerged as a challenging problem for academic research. Given that most food safety-related events in news reports comprise lengthy text, the pre-trained language models currently employed for text analysis are generally limited in their capability to handle long documents. This paper proposes a long-text classification model utilising hierarchical Transformers. We categorise information in long documents into two distinct types: (1) multiple text chunks meeting the length constraint and (2) essential sentences within long documents, such as headings, paragraph start and end sentences, etc. Initially, our proposed model utilises the text chunks as input to the BERT model. Then, it concatenates the output of the BERT model with the important sentences from the document and use them as input to the Transformer model for feature transformation. Finally, we utilise a classifier for food safety news classification. We conducted several comparative experiments with various commonly used text classification models on a dataset constructed from publicly available information on food regulatory websites. Our proposed method outperforms existing methods, establishing itself as the leading approach in terms of performance. |
format | Online Article Text |
id | pubmed-10345358 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-103453582023-07-15 Food safety news events classification via a hierarchical transformer model Xiong, Shufeng Tian, Wenjie Batra, Vishwash Fan, Xiaobo Xi, Lei Liu, Hebing Liu, Liangliang Heliyon Research Article In light of the significance of regulatory authorities and the rising demand for information disclosure, a vast amount of information on food safety news reports is readily accessible on the Internet. The extraction of such information for precise classification and provision of appropriate safety alerts based on their respective categories has emerged as a challenging problem for academic research. Given that most food safety-related events in news reports comprise lengthy text, the pre-trained language models currently employed for text analysis are generally limited in their capability to handle long documents. This paper proposes a long-text classification model utilising hierarchical Transformers. We categorise information in long documents into two distinct types: (1) multiple text chunks meeting the length constraint and (2) essential sentences within long documents, such as headings, paragraph start and end sentences, etc. Initially, our proposed model utilises the text chunks as input to the BERT model. Then, it concatenates the output of the BERT model with the important sentences from the document and use them as input to the Transformer model for feature transformation. Finally, we utilise a classifier for food safety news classification. We conducted several comparative experiments with various commonly used text classification models on a dataset constructed from publicly available information on food regulatory websites. Our proposed method outperforms existing methods, establishing itself as the leading approach in terms of performance. Elsevier 2023-06-30 /pmc/articles/PMC10345358/ /pubmed/37456013 http://dx.doi.org/10.1016/j.heliyon.2023.e17806 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Xiong, Shufeng Tian, Wenjie Batra, Vishwash Fan, Xiaobo Xi, Lei Liu, Hebing Liu, Liangliang Food safety news events classification via a hierarchical transformer model |
title | Food safety news events classification via a hierarchical transformer model |
title_full | Food safety news events classification via a hierarchical transformer model |
title_fullStr | Food safety news events classification via a hierarchical transformer model |
title_full_unstemmed | Food safety news events classification via a hierarchical transformer model |
title_short | Food safety news events classification via a hierarchical transformer model |
title_sort | food safety news events classification via a hierarchical transformer model |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10345358/ https://www.ncbi.nlm.nih.gov/pubmed/37456013 http://dx.doi.org/10.1016/j.heliyon.2023.e17806 |
work_keys_str_mv | AT xiongshufeng foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT tianwenjie foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT batravishwash foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT fanxiaobo foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT xilei foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT liuhebing foodsafetynewseventsclassificationviaahierarchicaltransformermodel AT liuliangliang foodsafetynewseventsclassificationviaahierarchicaltransformermodel |