Cargando…

Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects

BACKGROUND: Optical sensing solutions are being developed and adopted to classify a wide range of biological objects, including crop seeds. Performance assessment of optical classification models remains both a priority and a challenge. METHODS: As training data, we acquired hyperspectral imaging da...

Descripción completa

Detalles Bibliográficos
Autores principales: Nansen, Christian, Imtiaz, Mohammad S., Mesgaran, Mohsen B., Lee, Hyoseok
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9164469/
https://www.ncbi.nlm.nih.gov/pubmed/35658997
http://dx.doi.org/10.1186/s13007-022-00912-z
_version_ 1784720141290831872
author Nansen, Christian
Imtiaz, Mohammad S.
Mesgaran, Mohsen B.
Lee, Hyoseok
author_facet Nansen, Christian
Imtiaz, Mohammad S.
Mesgaran, Mohsen B.
Lee, Hyoseok
author_sort Nansen, Christian
collection PubMed
description BACKGROUND: Optical sensing solutions are being developed and adopted to classify a wide range of biological objects, including crop seeds. Performance assessment of optical classification models remains both a priority and a challenge. METHODS: As training data, we acquired hyperspectral imaging data from 3646 individual tomato seeds (germination yes/no) from two tomato varieties. We performed three experimental data manipulations: (1) Object assignment error: effect of individual object in the training data being assigned to the wrong class. (2) Spectral repeatability: effect of introducing known ranges (0–10%) of stochastic noise to individual reflectance values. (3) Size of training data set: effect of reducing numbers of observations in training data. Effects of each of these experimental data manipulations were characterized and quantified based on classifications with two functions [linear discriminant analysis (LDA) and support vector machine (SVM)]. RESULTS: For both classification functions, accuracy decreased linearly in response to introduction of object assignment error and to experimental reduction of spectral repeatability. We also demonstrated that experimental reduction of training data by 20% had negligible effect on classification accuracy. LDA and SVM classification algorithms were applied to independent validation seed samples. LDA-based classifications predicted seed germination with RMSE = 10.56 (variety 1) and 26.15 (variety 2), and SVM-based classifications predicted seed germination with RMSE = 10.44 (variety 1) and 12.58 (variety 2). CONCLUSION: We believe this study represents the first, in which optical seed classification included both a thorough performance evaluation of two separate classification functions based on experimental data manipulations, and application of classification models to validation seed samples not included in training data. Proposed experimental data manipulations are discussed in broader contexts and general relevance, and they are suggested as methods for in-depth performance assessments of optical classification models.
format Online
Article
Text
id pubmed-9164469
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-91644692022-06-05 Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects Nansen, Christian Imtiaz, Mohammad S. Mesgaran, Mohsen B. Lee, Hyoseok Plant Methods Research BACKGROUND: Optical sensing solutions are being developed and adopted to classify a wide range of biological objects, including crop seeds. Performance assessment of optical classification models remains both a priority and a challenge. METHODS: As training data, we acquired hyperspectral imaging data from 3646 individual tomato seeds (germination yes/no) from two tomato varieties. We performed three experimental data manipulations: (1) Object assignment error: effect of individual object in the training data being assigned to the wrong class. (2) Spectral repeatability: effect of introducing known ranges (0–10%) of stochastic noise to individual reflectance values. (3) Size of training data set: effect of reducing numbers of observations in training data. Effects of each of these experimental data manipulations were characterized and quantified based on classifications with two functions [linear discriminant analysis (LDA) and support vector machine (SVM)]. RESULTS: For both classification functions, accuracy decreased linearly in response to introduction of object assignment error and to experimental reduction of spectral repeatability. We also demonstrated that experimental reduction of training data by 20% had negligible effect on classification accuracy. LDA and SVM classification algorithms were applied to independent validation seed samples. LDA-based classifications predicted seed germination with RMSE = 10.56 (variety 1) and 26.15 (variety 2), and SVM-based classifications predicted seed germination with RMSE = 10.44 (variety 1) and 12.58 (variety 2). CONCLUSION: We believe this study represents the first, in which optical seed classification included both a thorough performance evaluation of two separate classification functions based on experimental data manipulations, and application of classification models to validation seed samples not included in training data. Proposed experimental data manipulations are discussed in broader contexts and general relevance, and they are suggested as methods for in-depth performance assessments of optical classification models. BioMed Central 2022-06-03 /pmc/articles/PMC9164469/ /pubmed/35658997 http://dx.doi.org/10.1186/s13007-022-00912-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Nansen, Christian
Imtiaz, Mohammad S.
Mesgaran, Mohsen B.
Lee, Hyoseok
Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects
title Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects
title_full Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects
title_fullStr Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects
title_full_unstemmed Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects
title_short Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects
title_sort experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9164469/
https://www.ncbi.nlm.nih.gov/pubmed/35658997
http://dx.doi.org/10.1186/s13007-022-00912-z
work_keys_str_mv AT nansenchristian experimentaldatamanipulationstoassessperformanceofhyperspectralclassificationmodelsofcropseedsandotherobjects
AT imtiazmohammads experimentaldatamanipulationstoassessperformanceofhyperspectralclassificationmodelsofcropseedsandotherobjects
AT mesgaranmohsenb experimentaldatamanipulationstoassessperformanceofhyperspectralclassificationmodelsofcropseedsandotherobjects
AT leehyoseok experimentaldatamanipulationstoassessperformanceofhyperspectralclassificationmodelsofcropseedsandotherobjects