Cargando…
Assessment of clustering techniques to support the analyses of soybean seed vigor
Soy is the main product of Brazilian agriculture and the fourth most cultivated bean globally. Since soy cultivation tends to increase and due to this large market, the guarantee of product quality is an indispensable factor for enterprises to stay competitive. Industries perform vigor tests to acqu...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10456222/ https://www.ncbi.nlm.nih.gov/pubmed/37624819 http://dx.doi.org/10.1371/journal.pone.0285566 |
_version_ | 1785096644316889088 |
---|---|
author | de Oliveira, Eduardo R. Bugatti, Pedro H. Saito, Priscila T. M. |
author_facet | de Oliveira, Eduardo R. Bugatti, Pedro H. Saito, Priscila T. M. |
author_sort | de Oliveira, Eduardo R. |
collection | PubMed |
description | Soy is the main product of Brazilian agriculture and the fourth most cultivated bean globally. Since soy cultivation tends to increase and due to this large market, the guarantee of product quality is an indispensable factor for enterprises to stay competitive. Industries perform vigor tests to acquire information and evaluate the quality of soy planting. The tetrazolium test, for example, provides information about moisture damage, bedbugs, or mechanical damage. However, the verification of the damage reason and its severity are done by an analyst, one by one. Since this is massive and exhausting work, it is susceptible to mistakes. Proposals involving different supervised learning approaches, including active learning strategies, have already been used, and have brought significant results. Therefore, this paper analyzes the performance of non-supervised techniques for classifying soybeans. An extensive experimental evaluation was performed, considering (9) different clustering algorithms (partitional, hierarchical, and density-based) applied to 5 image datasets of soybean seeds submitted to the tetrazolium test, including different damages and/or their levels. To describe those images, we considered 18 extractors of traditional features. We also considered four metrics (accuracy, FOWLKES, DAVIES, and CALINSKI) and two-dimensionality reduction techniques (principal component analysis and t-distributed stochastic neighbor embedding) for validation. Results show that this paper presents essential contributions since it makes it possible to identify descriptors and clustering algorithms that shall be used as preprocessing in other learning processes, accelerating and improving the classification process of key agricultural problems. |
format | Online Article Text |
id | pubmed-10456222 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-104562222023-08-26 Assessment of clustering techniques to support the analyses of soybean seed vigor de Oliveira, Eduardo R. Bugatti, Pedro H. Saito, Priscila T. M. PLoS One Research Article Soy is the main product of Brazilian agriculture and the fourth most cultivated bean globally. Since soy cultivation tends to increase and due to this large market, the guarantee of product quality is an indispensable factor for enterprises to stay competitive. Industries perform vigor tests to acquire information and evaluate the quality of soy planting. The tetrazolium test, for example, provides information about moisture damage, bedbugs, or mechanical damage. However, the verification of the damage reason and its severity are done by an analyst, one by one. Since this is massive and exhausting work, it is susceptible to mistakes. Proposals involving different supervised learning approaches, including active learning strategies, have already been used, and have brought significant results. Therefore, this paper analyzes the performance of non-supervised techniques for classifying soybeans. An extensive experimental evaluation was performed, considering (9) different clustering algorithms (partitional, hierarchical, and density-based) applied to 5 image datasets of soybean seeds submitted to the tetrazolium test, including different damages and/or their levels. To describe those images, we considered 18 extractors of traditional features. We also considered four metrics (accuracy, FOWLKES, DAVIES, and CALINSKI) and two-dimensionality reduction techniques (principal component analysis and t-distributed stochastic neighbor embedding) for validation. Results show that this paper presents essential contributions since it makes it possible to identify descriptors and clustering algorithms that shall be used as preprocessing in other learning processes, accelerating and improving the classification process of key agricultural problems. Public Library of Science 2023-08-25 /pmc/articles/PMC10456222/ /pubmed/37624819 http://dx.doi.org/10.1371/journal.pone.0285566 Text en © 2023 de Oliveira et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article de Oliveira, Eduardo R. Bugatti, Pedro H. Saito, Priscila T. M. Assessment of clustering techniques to support the analyses of soybean seed vigor |
title | Assessment of clustering techniques to support the analyses of soybean seed vigor |
title_full | Assessment of clustering techniques to support the analyses of soybean seed vigor |
title_fullStr | Assessment of clustering techniques to support the analyses of soybean seed vigor |
title_full_unstemmed | Assessment of clustering techniques to support the analyses of soybean seed vigor |
title_short | Assessment of clustering techniques to support the analyses of soybean seed vigor |
title_sort | assessment of clustering techniques to support the analyses of soybean seed vigor |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10456222/ https://www.ncbi.nlm.nih.gov/pubmed/37624819 http://dx.doi.org/10.1371/journal.pone.0285566 |
work_keys_str_mv | AT deoliveiraeduardor assessmentofclusteringtechniquestosupporttheanalysesofsoybeanseedvigor AT bugattipedroh assessmentofclusteringtechniquestosupporttheanalysesofsoybeanseedvigor AT saitopriscilatm assessmentofclusteringtechniquestosupporttheanalysesofsoybeanseedvigor |