Cargando…

Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data

Sensor-based food intake monitoring has become one of the fastest-growing fields in dietary assessment. Researchers are exploring imaging-sensor-based food detection, food recognition, and food portion size estimation. A major problem that is still being tackled in this field is the segmentation of...

Descripción completa

Detalles Bibliográficos
Autores principales: Raju, Viprav B., Imtiaz, Masudul H., Sazonov, Edward
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9860575/
https://www.ncbi.nlm.nih.gov/pubmed/36679357
http://dx.doi.org/10.3390/s23020560
_version_ 1784874617404391424
author Raju, Viprav B.
Imtiaz, Masudul H.
Sazonov, Edward
author_facet Raju, Viprav B.
Imtiaz, Masudul H.
Sazonov, Edward
author_sort Raju, Viprav B.
collection PubMed
description Sensor-based food intake monitoring has become one of the fastest-growing fields in dietary assessment. Researchers are exploring imaging-sensor-based food detection, food recognition, and food portion size estimation. A major problem that is still being tackled in this field is the segmentation of regions of food when multiple food items are present, mainly when similar-looking foods (similar in color and/or texture) are present. Food image segmentation is a relatively under-explored area compared with other fields. This paper proposes a novel approach to food imaging consisting of two imaging sensors: color (Red–Green–Blue) and thermal. Furthermore, we propose a multi-modal four-Dimensional (RGB-T) image segmentation using a k-means clustering algorithm to segment regions of similar-looking food items in multiple combinations of hot, cold, and warm (at room temperature) foods. Six food combinations of two food items each were used to capture RGB and thermal image data. RGB and thermal data were superimposed to form a combined RGB-T image and three sets of data (RGB, thermal, and RGB-T) were tested. A bootstrapped optimization of within-cluster sum of squares (WSS) was employed to determine the optimal number of clusters for each case. The combined RGB-T data achieved better results compared with RGB and thermal data, used individually. The mean ± standard deviation (std. dev.) of the F1 score for RGB-T data was 0.87 ± 0.1 compared with 0.66 ± 0.13 and 0.64 ± 0.39, for RGB and Thermal data, respectively.
format Online
Article
Text
id pubmed-9860575
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-98605752023-01-22 Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data Raju, Viprav B. Imtiaz, Masudul H. Sazonov, Edward Sensors (Basel) Article Sensor-based food intake monitoring has become one of the fastest-growing fields in dietary assessment. Researchers are exploring imaging-sensor-based food detection, food recognition, and food portion size estimation. A major problem that is still being tackled in this field is the segmentation of regions of food when multiple food items are present, mainly when similar-looking foods (similar in color and/or texture) are present. Food image segmentation is a relatively under-explored area compared with other fields. This paper proposes a novel approach to food imaging consisting of two imaging sensors: color (Red–Green–Blue) and thermal. Furthermore, we propose a multi-modal four-Dimensional (RGB-T) image segmentation using a k-means clustering algorithm to segment regions of similar-looking food items in multiple combinations of hot, cold, and warm (at room temperature) foods. Six food combinations of two food items each were used to capture RGB and thermal image data. RGB and thermal data were superimposed to form a combined RGB-T image and three sets of data (RGB, thermal, and RGB-T) were tested. A bootstrapped optimization of within-cluster sum of squares (WSS) was employed to determine the optimal number of clusters for each case. The combined RGB-T data achieved better results compared with RGB and thermal data, used individually. The mean ± standard deviation (std. dev.) of the F1 score for RGB-T data was 0.87 ± 0.1 compared with 0.66 ± 0.13 and 0.64 ± 0.39, for RGB and Thermal data, respectively. MDPI 2023-01-04 /pmc/articles/PMC9860575/ /pubmed/36679357 http://dx.doi.org/10.3390/s23020560 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Raju, Viprav B.
Imtiaz, Masudul H.
Sazonov, Edward
Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data
title Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data
title_full Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data
title_fullStr Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data
title_full_unstemmed Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data
title_short Food Image Segmentation Using Multi-Modal Imaging Sensors with Color and Thermal Data
title_sort food image segmentation using multi-modal imaging sensors with color and thermal data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9860575/
https://www.ncbi.nlm.nih.gov/pubmed/36679357
http://dx.doi.org/10.3390/s23020560
work_keys_str_mv AT rajuvipravb foodimagesegmentationusingmultimodalimagingsensorswithcolorandthermaldata
AT imtiazmasudulh foodimagesegmentationusingmultimodalimagingsensorswithcolorandthermaldata
AT sazonovedward foodimagesegmentationusingmultimodalimagingsensorswithcolorandthermaldata