Cargando…
Spectroscopy Approaches for Food Safety Applications: Improving Data Efficiency Using Active Learning and Semi-supervised Learning
The past decade witnessed rapid development in the measurement and monitoring technologies for food science. Among these technologies, spectroscopy has been widely used for the analysis of food quality, safety, and nutritional properties. Due to the complexity of food systems and the lack of compreh...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9257238/ https://www.ncbi.nlm.nih.gov/pubmed/35814488 http://dx.doi.org/10.3389/frai.2022.863261 |
_version_ | 1784741300459798528 |
---|---|
author | Zhang, Huanle Wisuthiphaet, Nicharee Cui, Hemiao Nitin, Nitin Liu, Xin Zhao, Qing |
author_facet | Zhang, Huanle Wisuthiphaet, Nicharee Cui, Hemiao Nitin, Nitin Liu, Xin Zhao, Qing |
author_sort | Zhang, Huanle |
collection | PubMed |
description | The past decade witnessed rapid development in the measurement and monitoring technologies for food science. Among these technologies, spectroscopy has been widely used for the analysis of food quality, safety, and nutritional properties. Due to the complexity of food systems and the lack of comprehensive predictive models, rapid and simple measurements to predict complex properties in food systems are largely missing. Machine Learning (ML) has shown great potential to improve the classification and prediction of these properties. However, the barriers to collecting large datasets for ML applications still persists. In this paper, we explore different approaches of data annotation and model training to improve data efficiency for ML applications. Specifically, we leverage Active Learning (AL) and Semi-Supervised Learning (SSL) and investigate four approaches: baseline passive learning, AL, SSL, and a hybrid of AL and SSL. To evaluate these approaches, we collect two spectroscopy datasets: predicting plasma dosage and detecting foodborne pathogen. Our experimental results show that, compared to the de facto passive learning approach, advanced approaches (AL, SSL, and the hybrid) can greatly reduce the number of labeled samples, with some cases decreasing the number of labeled samples by more than half. |
format | Online Article Text |
id | pubmed-9257238 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92572382022-07-07 Spectroscopy Approaches for Food Safety Applications: Improving Data Efficiency Using Active Learning and Semi-supervised Learning Zhang, Huanle Wisuthiphaet, Nicharee Cui, Hemiao Nitin, Nitin Liu, Xin Zhao, Qing Front Artif Intell Artificial Intelligence The past decade witnessed rapid development in the measurement and monitoring technologies for food science. Among these technologies, spectroscopy has been widely used for the analysis of food quality, safety, and nutritional properties. Due to the complexity of food systems and the lack of comprehensive predictive models, rapid and simple measurements to predict complex properties in food systems are largely missing. Machine Learning (ML) has shown great potential to improve the classification and prediction of these properties. However, the barriers to collecting large datasets for ML applications still persists. In this paper, we explore different approaches of data annotation and model training to improve data efficiency for ML applications. Specifically, we leverage Active Learning (AL) and Semi-Supervised Learning (SSL) and investigate four approaches: baseline passive learning, AL, SSL, and a hybrid of AL and SSL. To evaluate these approaches, we collect two spectroscopy datasets: predicting plasma dosage and detecting foodborne pathogen. Our experimental results show that, compared to the de facto passive learning approach, advanced approaches (AL, SSL, and the hybrid) can greatly reduce the number of labeled samples, with some cases decreasing the number of labeled samples by more than half. Frontiers Media S.A. 2022-06-22 /pmc/articles/PMC9257238/ /pubmed/35814488 http://dx.doi.org/10.3389/frai.2022.863261 Text en Copyright © 2022 Zhang, Wisuthiphaet, Cui, Nitin, Liu and Zhao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Zhang, Huanle Wisuthiphaet, Nicharee Cui, Hemiao Nitin, Nitin Liu, Xin Zhao, Qing Spectroscopy Approaches for Food Safety Applications: Improving Data Efficiency Using Active Learning and Semi-supervised Learning |
title | Spectroscopy Approaches for Food Safety Applications: Improving Data Efficiency Using Active Learning and Semi-supervised Learning |
title_full | Spectroscopy Approaches for Food Safety Applications: Improving Data Efficiency Using Active Learning and Semi-supervised Learning |
title_fullStr | Spectroscopy Approaches for Food Safety Applications: Improving Data Efficiency Using Active Learning and Semi-supervised Learning |
title_full_unstemmed | Spectroscopy Approaches for Food Safety Applications: Improving Data Efficiency Using Active Learning and Semi-supervised Learning |
title_short | Spectroscopy Approaches for Food Safety Applications: Improving Data Efficiency Using Active Learning and Semi-supervised Learning |
title_sort | spectroscopy approaches for food safety applications: improving data efficiency using active learning and semi-supervised learning |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9257238/ https://www.ncbi.nlm.nih.gov/pubmed/35814488 http://dx.doi.org/10.3389/frai.2022.863261 |
work_keys_str_mv | AT zhanghuanle spectroscopyapproachesforfoodsafetyapplicationsimprovingdataefficiencyusingactivelearningandsemisupervisedlearning AT wisuthiphaetnicharee spectroscopyapproachesforfoodsafetyapplicationsimprovingdataefficiencyusingactivelearningandsemisupervisedlearning AT cuihemiao spectroscopyapproachesforfoodsafetyapplicationsimprovingdataefficiencyusingactivelearningandsemisupervisedlearning AT nitinnitin spectroscopyapproachesforfoodsafetyapplicationsimprovingdataefficiencyusingactivelearningandsemisupervisedlearning AT liuxin spectroscopyapproachesforfoodsafetyapplicationsimprovingdataefficiencyusingactivelearningandsemisupervisedlearning AT zhaoqing spectroscopyapproachesforfoodsafetyapplicationsimprovingdataefficiencyusingactivelearningandsemisupervisedlearning |