Cargando…
A fine-grained recognition technique for identifying Chinese food images
As a crucial area of research in the field of computer vision, food recognition technology has become a core technology in many food-related fields, such as unmanned restaurants and food nutrition analysis, which are closely related to our healthy lives. Obtaining accurate classification results is...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10661202/ https://www.ncbi.nlm.nih.gov/pubmed/38027727 http://dx.doi.org/10.1016/j.heliyon.2023.e21565 |
_version_ | 1785137921461846016 |
---|---|
author | Feng, Shuo Wang, Yangang Gong, Jianhong Li, Xiang Li, Shangxuan |
author_facet | Feng, Shuo Wang, Yangang Gong, Jianhong Li, Xiang Li, Shangxuan |
author_sort | Feng, Shuo |
collection | PubMed |
description | As a crucial area of research in the field of computer vision, food recognition technology has become a core technology in many food-related fields, such as unmanned restaurants and food nutrition analysis, which are closely related to our healthy lives. Obtaining accurate classification results is the most important task in food recognition. Food classification is a fine-grained recognition process, which involves extracting features from a group of objects with similar appearances and accurately classifying them into different categories. In a such usage environment, the network is required to not only overview the overall image, but also capture the subtle details within it. In addition, since Chinese food images have unique texture features, the model needs to extract texture information from the image. However, existing CNN methods have not focused on and processed this information. To classify food as accurately as possible, this paper introduces the Laplace pyramid into the convolution layer and proposes a bilinear network that can perceive image texture features and multi-scale features (LMB-Net). The proposed model was evaluated on a public dataset, and the results demonstrate that LMB-Net achieves state-of-the-art classification performance. |
format | Online Article Text |
id | pubmed-10661202 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-106612022023-10-31 A fine-grained recognition technique for identifying Chinese food images Feng, Shuo Wang, Yangang Gong, Jianhong Li, Xiang Li, Shangxuan Heliyon Research Article As a crucial area of research in the field of computer vision, food recognition technology has become a core technology in many food-related fields, such as unmanned restaurants and food nutrition analysis, which are closely related to our healthy lives. Obtaining accurate classification results is the most important task in food recognition. Food classification is a fine-grained recognition process, which involves extracting features from a group of objects with similar appearances and accurately classifying them into different categories. In a such usage environment, the network is required to not only overview the overall image, but also capture the subtle details within it. In addition, since Chinese food images have unique texture features, the model needs to extract texture information from the image. However, existing CNN methods have not focused on and processed this information. To classify food as accurately as possible, this paper introduces the Laplace pyramid into the convolution layer and proposes a bilinear network that can perceive image texture features and multi-scale features (LMB-Net). The proposed model was evaluated on a public dataset, and the results demonstrate that LMB-Net achieves state-of-the-art classification performance. Elsevier 2023-10-31 /pmc/articles/PMC10661202/ /pubmed/38027727 http://dx.doi.org/10.1016/j.heliyon.2023.e21565 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Feng, Shuo Wang, Yangang Gong, Jianhong Li, Xiang Li, Shangxuan A fine-grained recognition technique for identifying Chinese food images |
title | A fine-grained recognition technique for identifying Chinese food images |
title_full | A fine-grained recognition technique for identifying Chinese food images |
title_fullStr | A fine-grained recognition technique for identifying Chinese food images |
title_full_unstemmed | A fine-grained recognition technique for identifying Chinese food images |
title_short | A fine-grained recognition technique for identifying Chinese food images |
title_sort | fine-grained recognition technique for identifying chinese food images |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10661202/ https://www.ncbi.nlm.nih.gov/pubmed/38027727 http://dx.doi.org/10.1016/j.heliyon.2023.e21565 |
work_keys_str_mv | AT fengshuo afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages AT wangyangang afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages AT gongjianhong afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages AT lixiang afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages AT lishangxuan afinegrainedrecognitiontechniqueforidentifyingchinesefoodimages AT fengshuo finegrainedrecognitiontechniqueforidentifyingchinesefoodimages AT wangyangang finegrainedrecognitiontechniqueforidentifyingchinesefoodimages AT gongjianhong finegrainedrecognitiontechniqueforidentifyingchinesefoodimages AT lixiang finegrainedrecognitiontechniqueforidentifyingchinesefoodimages AT lishangxuan finegrainedrecognitiontechniqueforidentifyingchinesefoodimages |