Cargando…

Assessment of clustering techniques to support the analyses of soybean seed vigor

Soy is the main product of Brazilian agriculture and the fourth most cultivated bean globally. Since soy cultivation tends to increase and due to this large market, the guarantee of product quality is an indispensable factor for enterprises to stay competitive. Industries perform vigor tests to acqu...

Descripción completa

Detalles Bibliográficos
Autores principales: de Oliveira, Eduardo R., Bugatti, Pedro H., Saito, Priscila T. M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10456222/
https://www.ncbi.nlm.nih.gov/pubmed/37624819
http://dx.doi.org/10.1371/journal.pone.0285566
_version_ 1785096644316889088
author de Oliveira, Eduardo R.
Bugatti, Pedro H.
Saito, Priscila T. M.
author_facet de Oliveira, Eduardo R.
Bugatti, Pedro H.
Saito, Priscila T. M.
author_sort de Oliveira, Eduardo R.
collection PubMed
description Soy is the main product of Brazilian agriculture and the fourth most cultivated bean globally. Since soy cultivation tends to increase and due to this large market, the guarantee of product quality is an indispensable factor for enterprises to stay competitive. Industries perform vigor tests to acquire information and evaluate the quality of soy planting. The tetrazolium test, for example, provides information about moisture damage, bedbugs, or mechanical damage. However, the verification of the damage reason and its severity are done by an analyst, one by one. Since this is massive and exhausting work, it is susceptible to mistakes. Proposals involving different supervised learning approaches, including active learning strategies, have already been used, and have brought significant results. Therefore, this paper analyzes the performance of non-supervised techniques for classifying soybeans. An extensive experimental evaluation was performed, considering (9) different clustering algorithms (partitional, hierarchical, and density-based) applied to 5 image datasets of soybean seeds submitted to the tetrazolium test, including different damages and/or their levels. To describe those images, we considered 18 extractors of traditional features. We also considered four metrics (accuracy, FOWLKES, DAVIES, and CALINSKI) and two-dimensionality reduction techniques (principal component analysis and t-distributed stochastic neighbor embedding) for validation. Results show that this paper presents essential contributions since it makes it possible to identify descriptors and clustering algorithms that shall be used as preprocessing in other learning processes, accelerating and improving the classification process of key agricultural problems.
format Online
Article
Text
id pubmed-10456222
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-104562222023-08-26 Assessment of clustering techniques to support the analyses of soybean seed vigor de Oliveira, Eduardo R. Bugatti, Pedro H. Saito, Priscila T. M. PLoS One Research Article Soy is the main product of Brazilian agriculture and the fourth most cultivated bean globally. Since soy cultivation tends to increase and due to this large market, the guarantee of product quality is an indispensable factor for enterprises to stay competitive. Industries perform vigor tests to acquire information and evaluate the quality of soy planting. The tetrazolium test, for example, provides information about moisture damage, bedbugs, or mechanical damage. However, the verification of the damage reason and its severity are done by an analyst, one by one. Since this is massive and exhausting work, it is susceptible to mistakes. Proposals involving different supervised learning approaches, including active learning strategies, have already been used, and have brought significant results. Therefore, this paper analyzes the performance of non-supervised techniques for classifying soybeans. An extensive experimental evaluation was performed, considering (9) different clustering algorithms (partitional, hierarchical, and density-based) applied to 5 image datasets of soybean seeds submitted to the tetrazolium test, including different damages and/or their levels. To describe those images, we considered 18 extractors of traditional features. We also considered four metrics (accuracy, FOWLKES, DAVIES, and CALINSKI) and two-dimensionality reduction techniques (principal component analysis and t-distributed stochastic neighbor embedding) for validation. Results show that this paper presents essential contributions since it makes it possible to identify descriptors and clustering algorithms that shall be used as preprocessing in other learning processes, accelerating and improving the classification process of key agricultural problems. Public Library of Science 2023-08-25 /pmc/articles/PMC10456222/ /pubmed/37624819 http://dx.doi.org/10.1371/journal.pone.0285566 Text en © 2023 de Oliveira et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
de Oliveira, Eduardo R.
Bugatti, Pedro H.
Saito, Priscila T. M.
Assessment of clustering techniques to support the analyses of soybean seed vigor
title Assessment of clustering techniques to support the analyses of soybean seed vigor
title_full Assessment of clustering techniques to support the analyses of soybean seed vigor
title_fullStr Assessment of clustering techniques to support the analyses of soybean seed vigor
title_full_unstemmed Assessment of clustering techniques to support the analyses of soybean seed vigor
title_short Assessment of clustering techniques to support the analyses of soybean seed vigor
title_sort assessment of clustering techniques to support the analyses of soybean seed vigor
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10456222/
https://www.ncbi.nlm.nih.gov/pubmed/37624819
http://dx.doi.org/10.1371/journal.pone.0285566
work_keys_str_mv AT deoliveiraeduardor assessmentofclusteringtechniquestosupporttheanalysesofsoybeanseedvigor
AT bugattipedroh assessmentofclusteringtechniquestosupporttheanalysesofsoybeanseedvigor
AT saitopriscilatm assessmentofclusteringtechniquestosupporttheanalysesofsoybeanseedvigor