Cargando…
Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects
BACKGROUND: Optical sensing solutions are being developed and adopted to classify a wide range of biological objects, including crop seeds. Performance assessment of optical classification models remains both a priority and a challenge. METHODS: As training data, we acquired hyperspectral imaging da...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9164469/ https://www.ncbi.nlm.nih.gov/pubmed/35658997 http://dx.doi.org/10.1186/s13007-022-00912-z |
_version_ | 1784720141290831872 |
---|---|
author | Nansen, Christian Imtiaz, Mohammad S. Mesgaran, Mohsen B. Lee, Hyoseok |
author_facet | Nansen, Christian Imtiaz, Mohammad S. Mesgaran, Mohsen B. Lee, Hyoseok |
author_sort | Nansen, Christian |
collection | PubMed |
description | BACKGROUND: Optical sensing solutions are being developed and adopted to classify a wide range of biological objects, including crop seeds. Performance assessment of optical classification models remains both a priority and a challenge. METHODS: As training data, we acquired hyperspectral imaging data from 3646 individual tomato seeds (germination yes/no) from two tomato varieties. We performed three experimental data manipulations: (1) Object assignment error: effect of individual object in the training data being assigned to the wrong class. (2) Spectral repeatability: effect of introducing known ranges (0–10%) of stochastic noise to individual reflectance values. (3) Size of training data set: effect of reducing numbers of observations in training data. Effects of each of these experimental data manipulations were characterized and quantified based on classifications with two functions [linear discriminant analysis (LDA) and support vector machine (SVM)]. RESULTS: For both classification functions, accuracy decreased linearly in response to introduction of object assignment error and to experimental reduction of spectral repeatability. We also demonstrated that experimental reduction of training data by 20% had negligible effect on classification accuracy. LDA and SVM classification algorithms were applied to independent validation seed samples. LDA-based classifications predicted seed germination with RMSE = 10.56 (variety 1) and 26.15 (variety 2), and SVM-based classifications predicted seed germination with RMSE = 10.44 (variety 1) and 12.58 (variety 2). CONCLUSION: We believe this study represents the first, in which optical seed classification included both a thorough performance evaluation of two separate classification functions based on experimental data manipulations, and application of classification models to validation seed samples not included in training data. Proposed experimental data manipulations are discussed in broader contexts and general relevance, and they are suggested as methods for in-depth performance assessments of optical classification models. |
format | Online Article Text |
id | pubmed-9164469 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-91644692022-06-05 Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects Nansen, Christian Imtiaz, Mohammad S. Mesgaran, Mohsen B. Lee, Hyoseok Plant Methods Research BACKGROUND: Optical sensing solutions are being developed and adopted to classify a wide range of biological objects, including crop seeds. Performance assessment of optical classification models remains both a priority and a challenge. METHODS: As training data, we acquired hyperspectral imaging data from 3646 individual tomato seeds (germination yes/no) from two tomato varieties. We performed three experimental data manipulations: (1) Object assignment error: effect of individual object in the training data being assigned to the wrong class. (2) Spectral repeatability: effect of introducing known ranges (0–10%) of stochastic noise to individual reflectance values. (3) Size of training data set: effect of reducing numbers of observations in training data. Effects of each of these experimental data manipulations were characterized and quantified based on classifications with two functions [linear discriminant analysis (LDA) and support vector machine (SVM)]. RESULTS: For both classification functions, accuracy decreased linearly in response to introduction of object assignment error and to experimental reduction of spectral repeatability. We also demonstrated that experimental reduction of training data by 20% had negligible effect on classification accuracy. LDA and SVM classification algorithms were applied to independent validation seed samples. LDA-based classifications predicted seed germination with RMSE = 10.56 (variety 1) and 26.15 (variety 2), and SVM-based classifications predicted seed germination with RMSE = 10.44 (variety 1) and 12.58 (variety 2). CONCLUSION: We believe this study represents the first, in which optical seed classification included both a thorough performance evaluation of two separate classification functions based on experimental data manipulations, and application of classification models to validation seed samples not included in training data. Proposed experimental data manipulations are discussed in broader contexts and general relevance, and they are suggested as methods for in-depth performance assessments of optical classification models. BioMed Central 2022-06-03 /pmc/articles/PMC9164469/ /pubmed/35658997 http://dx.doi.org/10.1186/s13007-022-00912-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Nansen, Christian Imtiaz, Mohammad S. Mesgaran, Mohsen B. Lee, Hyoseok Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects |
title | Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects |
title_full | Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects |
title_fullStr | Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects |
title_full_unstemmed | Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects |
title_short | Experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects |
title_sort | experimental data manipulations to assess performance of hyperspectral classification models of crop seeds and other objects |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9164469/ https://www.ncbi.nlm.nih.gov/pubmed/35658997 http://dx.doi.org/10.1186/s13007-022-00912-z |
work_keys_str_mv | AT nansenchristian experimentaldatamanipulationstoassessperformanceofhyperspectralclassificationmodelsofcropseedsandotherobjects AT imtiazmohammads experimentaldatamanipulationstoassessperformanceofhyperspectralclassificationmodelsofcropseedsandotherobjects AT mesgaranmohsenb experimentaldatamanipulationstoassessperformanceofhyperspectralclassificationmodelsofcropseedsandotherobjects AT leehyoseok experimentaldatamanipulationstoassessperformanceofhyperspectralclassificationmodelsofcropseedsandotherobjects |