Cargando…

Multiclass classification of environmental chemical stimuli from unbalanced plant electrophysiological data

Plant electrophysiological response contains useful signature of its environment and health which can be utilized using suitable statistical analysis for developing an inverse model to classify the stimulus applied to the plant. In this paper, we have presented a statistical analysis pipeline to tac...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhadra, Nivedita, Chatterjee, Shre Kumar, Das, Saptarshi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10159166/
https://www.ncbi.nlm.nih.gov/pubmed/37141215
http://dx.doi.org/10.1371/journal.pone.0285321
_version_ 1785037076680409088
author Bhadra, Nivedita
Chatterjee, Shre Kumar
Das, Saptarshi
author_facet Bhadra, Nivedita
Chatterjee, Shre Kumar
Das, Saptarshi
author_sort Bhadra, Nivedita
collection PubMed
description Plant electrophysiological response contains useful signature of its environment and health which can be utilized using suitable statistical analysis for developing an inverse model to classify the stimulus applied to the plant. In this paper, we have presented a statistical analysis pipeline to tackle a multiclass environmental stimuli classification problem with unbalanced plant electrophysiological data. The objective here is to classify three different environmental chemical stimuli, using fifteen statistical features, extracted from the plant electrical signals and compare the performance of eight different classification algorithms. A comparison using reduced dimensional projection of the high dimensional features via principal component analysis (PCA) has also been presented. Since the experimental data is highly unbalanced due to varying length of the experiments, we employ a random under-sampling approach for the two majority classes to create an ensemble of confusion matrices to compare the classification performances. Along with this, three other multi-classification performance metrics commonly used for unbalanced data viz. balanced accuracy, F(1)-score and Matthews correlation coefficient have also been analyzed. From the stacked confusion matrices and the derived performance metrics, we choose the best feature-classifier setting in terms of the classification performances carried out in the original high dimensional vs. the reduced feature space, for this highly unbalanced multiclass problem of plant signal classification due to different chemical stress. Difference in the classification performances in the high vs. reduced dimensions are also quantified using the multivariate analysis of variance (MANOVA) hypothesis testing. Our findings have potential real-world applications in precision agriculture for exploring multiclass classification problems with highly unbalanced datasets, employing a combination of existing machine learning algorithms. This work also advances existing studies on environmental pollution level monitoring using plant electrophysiological data.
format Online
Article
Text
id pubmed-10159166
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-101591662023-05-05 Multiclass classification of environmental chemical stimuli from unbalanced plant electrophysiological data Bhadra, Nivedita Chatterjee, Shre Kumar Das, Saptarshi PLoS One Research Article Plant electrophysiological response contains useful signature of its environment and health which can be utilized using suitable statistical analysis for developing an inverse model to classify the stimulus applied to the plant. In this paper, we have presented a statistical analysis pipeline to tackle a multiclass environmental stimuli classification problem with unbalanced plant electrophysiological data. The objective here is to classify three different environmental chemical stimuli, using fifteen statistical features, extracted from the plant electrical signals and compare the performance of eight different classification algorithms. A comparison using reduced dimensional projection of the high dimensional features via principal component analysis (PCA) has also been presented. Since the experimental data is highly unbalanced due to varying length of the experiments, we employ a random under-sampling approach for the two majority classes to create an ensemble of confusion matrices to compare the classification performances. Along with this, three other multi-classification performance metrics commonly used for unbalanced data viz. balanced accuracy, F(1)-score and Matthews correlation coefficient have also been analyzed. From the stacked confusion matrices and the derived performance metrics, we choose the best feature-classifier setting in terms of the classification performances carried out in the original high dimensional vs. the reduced feature space, for this highly unbalanced multiclass problem of plant signal classification due to different chemical stress. Difference in the classification performances in the high vs. reduced dimensions are also quantified using the multivariate analysis of variance (MANOVA) hypothesis testing. Our findings have potential real-world applications in precision agriculture for exploring multiclass classification problems with highly unbalanced datasets, employing a combination of existing machine learning algorithms. This work also advances existing studies on environmental pollution level monitoring using plant electrophysiological data. Public Library of Science 2023-05-04 /pmc/articles/PMC10159166/ /pubmed/37141215 http://dx.doi.org/10.1371/journal.pone.0285321 Text en © 2023 Bhadra et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bhadra, Nivedita
Chatterjee, Shre Kumar
Das, Saptarshi
Multiclass classification of environmental chemical stimuli from unbalanced plant electrophysiological data
title Multiclass classification of environmental chemical stimuli from unbalanced plant electrophysiological data
title_full Multiclass classification of environmental chemical stimuli from unbalanced plant electrophysiological data
title_fullStr Multiclass classification of environmental chemical stimuli from unbalanced plant electrophysiological data
title_full_unstemmed Multiclass classification of environmental chemical stimuli from unbalanced plant electrophysiological data
title_short Multiclass classification of environmental chemical stimuli from unbalanced plant electrophysiological data
title_sort multiclass classification of environmental chemical stimuli from unbalanced plant electrophysiological data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10159166/
https://www.ncbi.nlm.nih.gov/pubmed/37141215
http://dx.doi.org/10.1371/journal.pone.0285321
work_keys_str_mv AT bhadranivedita multiclassclassificationofenvironmentalchemicalstimulifromunbalancedplantelectrophysiologicaldata
AT chatterjeeshrekumar multiclassclassificationofenvironmentalchemicalstimulifromunbalancedplantelectrophysiologicaldata
AT dassaptarshi multiclassclassificationofenvironmentalchemicalstimulifromunbalancedplantelectrophysiologicaldata