Cargando…
To Estimate Performance of Artificial Neural Network Model Based on Terahertz Spectrum: Gelatin Identification as an Example
It is a necessity to determine significant food or traditional Chinese medicine (TCM) with low cost, which is more likely to achieve high accurate identification by THz-TDS. In this study, feedforward neural networks based on terahertz spectra are employed to predict the animal origin of gelatins, w...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9330513/ https://www.ncbi.nlm.nih.gov/pubmed/35911115 http://dx.doi.org/10.3389/fnut.2022.925717 |
_version_ | 1784758179953901568 |
---|---|
author | Li, Yizhang Liu, Lingyu Wang, Zhongmin Chang, Tianying Li, Ke Xu, Wenqing Wu, Yong Yang, Hua Jiang, Daoli |
author_facet | Li, Yizhang Liu, Lingyu Wang, Zhongmin Chang, Tianying Li, Ke Xu, Wenqing Wu, Yong Yang, Hua Jiang, Daoli |
author_sort | Li, Yizhang |
collection | PubMed |
description | It is a necessity to determine significant food or traditional Chinese medicine (TCM) with low cost, which is more likely to achieve high accurate identification by THz-TDS. In this study, feedforward neural networks based on terahertz spectra are employed to predict the animal origin of gelatins, whose adaption to the mission is examined by parallel models built by random sample partition and initialization. It is found that the generalization performance of feedforward ANNs in original data is not satisfactory although prediction on trained samples can be accurate. A multivariate scattering correction is conducted to enhance prediction accuracy, and 20 additional models verify the effectiveness of such dispose. A special partition of total dataset is conducted based on statistics of parallel models, whose influence on ANN performance is investigated with another 20 models. The performance of the models is unsatisfactory because of notable differences in training and test sets according to principal component analysis. By comparing the distribution of the first two principal components before and after multivariate scattering correction, we found that the reciprocal of the minimum number of line segments required for error-free classification in 2-D feature space can be viewed as an index to describe linear separability of data. The rise of proposed linear separability would have a lower requirement for harsh parameter tuning of ANN models and tolerate random initialization. The difference in principal components of samples between a training set and a data set determines whether partition is acceptable or whether a model would have generality. A rapid way to estimate the performance of an ANN before sufficient tuning on a classification mission is to compare differences between groups and differences within groups. Given that a representative peak missing curve is discussed in this article, an analysis based on gelatin THz spectra may be helpful for studies on some other feature-less species. |
format | Online Article Text |
id | pubmed-9330513 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93305132022-07-29 To Estimate Performance of Artificial Neural Network Model Based on Terahertz Spectrum: Gelatin Identification as an Example Li, Yizhang Liu, Lingyu Wang, Zhongmin Chang, Tianying Li, Ke Xu, Wenqing Wu, Yong Yang, Hua Jiang, Daoli Front Nutr Nutrition It is a necessity to determine significant food or traditional Chinese medicine (TCM) with low cost, which is more likely to achieve high accurate identification by THz-TDS. In this study, feedforward neural networks based on terahertz spectra are employed to predict the animal origin of gelatins, whose adaption to the mission is examined by parallel models built by random sample partition and initialization. It is found that the generalization performance of feedforward ANNs in original data is not satisfactory although prediction on trained samples can be accurate. A multivariate scattering correction is conducted to enhance prediction accuracy, and 20 additional models verify the effectiveness of such dispose. A special partition of total dataset is conducted based on statistics of parallel models, whose influence on ANN performance is investigated with another 20 models. The performance of the models is unsatisfactory because of notable differences in training and test sets according to principal component analysis. By comparing the distribution of the first two principal components before and after multivariate scattering correction, we found that the reciprocal of the minimum number of line segments required for error-free classification in 2-D feature space can be viewed as an index to describe linear separability of data. The rise of proposed linear separability would have a lower requirement for harsh parameter tuning of ANN models and tolerate random initialization. The difference in principal components of samples between a training set and a data set determines whether partition is acceptable or whether a model would have generality. A rapid way to estimate the performance of an ANN before sufficient tuning on a classification mission is to compare differences between groups and differences within groups. Given that a representative peak missing curve is discussed in this article, an analysis based on gelatin THz spectra may be helpful for studies on some other feature-less species. Frontiers Media S.A. 2022-07-14 /pmc/articles/PMC9330513/ /pubmed/35911115 http://dx.doi.org/10.3389/fnut.2022.925717 Text en Copyright © 2022 Li, Liu, Wang, Chang, Li, Xu, Wu, Yang and Jiang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Nutrition Li, Yizhang Liu, Lingyu Wang, Zhongmin Chang, Tianying Li, Ke Xu, Wenqing Wu, Yong Yang, Hua Jiang, Daoli To Estimate Performance of Artificial Neural Network Model Based on Terahertz Spectrum: Gelatin Identification as an Example |
title | To Estimate Performance of Artificial Neural Network Model Based on Terahertz Spectrum: Gelatin Identification as an Example |
title_full | To Estimate Performance of Artificial Neural Network Model Based on Terahertz Spectrum: Gelatin Identification as an Example |
title_fullStr | To Estimate Performance of Artificial Neural Network Model Based on Terahertz Spectrum: Gelatin Identification as an Example |
title_full_unstemmed | To Estimate Performance of Artificial Neural Network Model Based on Terahertz Spectrum: Gelatin Identification as an Example |
title_short | To Estimate Performance of Artificial Neural Network Model Based on Terahertz Spectrum: Gelatin Identification as an Example |
title_sort | to estimate performance of artificial neural network model based on terahertz spectrum: gelatin identification as an example |
topic | Nutrition |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9330513/ https://www.ncbi.nlm.nih.gov/pubmed/35911115 http://dx.doi.org/10.3389/fnut.2022.925717 |
work_keys_str_mv | AT liyizhang toestimateperformanceofartificialneuralnetworkmodelbasedonterahertzspectrumgelatinidentificationasanexample AT liulingyu toestimateperformanceofartificialneuralnetworkmodelbasedonterahertzspectrumgelatinidentificationasanexample AT wangzhongmin toestimateperformanceofartificialneuralnetworkmodelbasedonterahertzspectrumgelatinidentificationasanexample AT changtianying toestimateperformanceofartificialneuralnetworkmodelbasedonterahertzspectrumgelatinidentificationasanexample AT like toestimateperformanceofartificialneuralnetworkmodelbasedonterahertzspectrumgelatinidentificationasanexample AT xuwenqing toestimateperformanceofartificialneuralnetworkmodelbasedonterahertzspectrumgelatinidentificationasanexample AT wuyong toestimateperformanceofartificialneuralnetworkmodelbasedonterahertzspectrumgelatinidentificationasanexample AT yanghua toestimateperformanceofartificialneuralnetworkmodelbasedonterahertzspectrumgelatinidentificationasanexample AT jiangdaoli toestimateperformanceofartificialneuralnetworkmodelbasedonterahertzspectrumgelatinidentificationasanexample |