Cargando…

A fine-grained recognition technique for identifying Chinese food images

As a crucial area of research in the field of computer vision, food recognition technology has become a core technology in many food-related fields, such as unmanned restaurants and food nutrition analysis, which are closely related to our healthy lives. Obtaining accurate classification results is...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Shuo, Wang, Yangang, Gong, Jianhong, Li, Xiang, Li, Shangxuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10661202/
https://www.ncbi.nlm.nih.gov/pubmed/38027727
http://dx.doi.org/10.1016/j.heliyon.2023.e21565
_version_ 1785137921461846016
author Feng, Shuo
Wang, Yangang
Gong, Jianhong
Li, Xiang
Li, Shangxuan
author_facet Feng, Shuo
Wang, Yangang
Gong, Jianhong
Li, Xiang
Li, Shangxuan
author_sort Feng, Shuo
collection PubMed
description As a crucial area of research in the field of computer vision, food recognition technology has become a core technology in many food-related fields, such as unmanned restaurants and food nutrition analysis, which are closely related to our healthy lives. Obtaining accurate classification results is the most important task in food recognition. Food classification is a fine-grained recognition process, which involves extracting features from a group of objects with similar appearances and accurately classifying them into different categories. In a such usage environment, the network is required to not only overview the overall image, but also capture the subtle details within it. In addition, since Chinese food images have unique texture features, the model needs to extract texture information from the image. However, existing CNN methods have not focused on and processed this information. To classify food as accurately as possible, this paper introduces the Laplace pyramid into the convolution layer and proposes a bilinear network that can perceive image texture features and multi-scale features (LMB-Net). The proposed model was evaluated on a public dataset, and the results demonstrate that LMB-Net achieves state-of-the-art classification performance.
format Online
Article
Text
id pubmed-10661202
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-106612022023-10-31 A fine-grained recognition technique for identifying Chinese food images Feng, Shuo Wang, Yangang Gong, Jianhong Li, Xiang Li, Shangxuan Heliyon Research Article As a crucial area of research in the field of computer vision, food recognition technology has become a core technology in many food-related fields, such as unmanned restaurants and food nutrition analysis, which are closely related to our healthy lives. Obtaining accurate classification results is the most important task in food recognition. Food classification is a fine-grained recognition process, which involves extracting features from a group of objects with similar appearances and accurately classifying them into different categories. In a such usage environment, the network is required to not only overview the overall image, but also capture the subtle details within it. In addition, since Chinese food images have unique texture features, the model needs to extract texture information from the image. However, existing CNN methods have not focused on and processed this information. To classify food as accurately as possible, this paper introduces the Laplace pyramid into the convolution layer and proposes a bilinear network that can perceive image texture features and multi-scale features (LMB-Net). The proposed model was evaluated on a public dataset, and the results demonstrate that LMB-Net achieves state-of-the-art classification performance. Elsevier 2023-10-31 /pmc/articles/PMC10661202/ /pubmed/38027727 http://dx.doi.org/10.1016/j.heliyon.2023.e21565 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Feng, Shuo
Wang, Yangang
Gong, Jianhong
Li, Xiang
Li, Shangxuan
A fine-grained recognition technique for identifying Chinese food images
title A fine-grained recognition technique for identifying Chinese food images
title_full A fine-grained recognition technique for identifying Chinese food images
title_fullStr A fine-grained recognition technique for identifying Chinese food images
title_full_unstemmed A fine-grained recognition technique for identifying Chinese food images
title_short A fine-grained recognition technique for identifying Chinese food images
title_sort fine-grained recognition technique for identifying chinese food images
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10661202/
https://www.ncbi.nlm.nih.gov/pubmed/38027727
http://dx.doi.org/10.1016/j.heliyon.2023.e21565
work_keys_str_mv AT fengshuo afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages
AT wangyangang afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages
AT gongjianhong afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages
AT lixiang afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages
AT lishangxuan afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages
AT fengshuo finegrainedrecognitiontechniqueforidentifyingchinesefoodimages
AT wangyangang finegrainedrecognitiontechniqueforidentifyingchinesefoodimages
AT gongjianhong finegrainedrecognitiontechniqueforidentifyingchinesefoodimages
AT lixiang finegrainedrecognitiontechniqueforidentifyingchinesefoodimages
AT lishangxuan finegrainedrecognitiontechniqueforidentifyingchinesefoodimages