
Identification of Relevant Phytochemical Constituents for Characterization and Authentication of Tomatoes by General Linear Model Linked to Automatic Interaction Detection (GLM-AID) and Artificial Neural Network Models (ANNs)

There are a large number of tomato cultivars with a wide range of morphological, chemical, nutritional and sensorial characteristics. Many factors are known to affect the nutrient content of tomato cultivars. A complete understanding of the effect of these factors would require an exhaustive experim...


Bibliographic Details
Main Authors: Hernández Suárez, Marcos; Astray Dopazo, Gonzalo; Larios López, Dina; Espinosa, Francisco
Format: Online Article Text
Language: English
Published: Public Library of Science 2015
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4467870/
https://www.ncbi.nlm.nih.gov/pubmed/26075889
http://dx.doi.org/10.1371/journal.pone.0128566
author Hernández Suárez, Marcos
Astray Dopazo, Gonzalo
Larios López, Dina
Espinosa, Francisco
author_sort Hernández Suárez, Marcos
collection PubMed
description There are a large number of tomato cultivars with a wide range of morphological, chemical, nutritional and sensorial characteristics. Many factors are known to affect the nutrient content of tomato cultivars. A complete understanding of the effect of these factors would require an exhaustive experimental design, a multidisciplinary scientific approach and a suitable statistical method. Some multivariate analytical techniques such as Principal Component Analysis (PCA) or Factor Analysis (FA) have been widely applied to search for patterns in the behaviour of a data set and to reduce its dimensionality by means of a new set of uncorrelated latent variables. However, in some cases it is not useful to replace the original variables with these latent variables. In this study, the Automatic Interaction Detection (AID) algorithm and Artificial Neural Network (ANN) models were applied as an alternative to PCA, FA and other multivariate analytical techniques in order to identify the relevant phytochemical constituents for characterization and authentication of tomatoes. To test the feasibility of the AID algorithm and ANN models for this purpose, both methods were applied to a data set of twenty-five chemical parameters analysed in 167 tomato samples from Tenerife (Spain). Each tomato sample was defined by three factors: cultivar, agricultural practice and harvest date. The tree structure of the General Linear Model linked to AID (GLM-AID) was organized into 3 levels according to the number of factors. p-Coumaric acid was the compound that allowed the tomato samples to be distinguished according to the day of harvest. More than one chemical parameter was necessary to distinguish among different agricultural practices and among the tomato cultivars. Several ANN models, with 25 and 10 input variables, were developed for the prediction of cultivar, agricultural practice and harvest date. Finally, the models with 10 input variables were chosen, with goodness of fit between 44 and 100%. The lowest fits were for the cultivar classification; this low percentage suggests that other kinds of chemical parameters should be used to identify tomato cultivars.
format Online
Article
Text
id pubmed-4467870
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-4467870 2015-06-25 Identification of Relevant Phytochemical Constituents for Characterization and Authentication of Tomatoes by General Linear Model Linked to Automatic Interaction Detection (GLM-AID) and Artificial Neural Network Models (ANNs) Hernández Suárez, Marcos Astray Dopazo, Gonzalo Larios López, Dina Espinosa, Francisco PLoS One Research Article Public Library of Science 2015-06-15 /pmc/articles/PMC4467870/ /pubmed/26075889 http://dx.doi.org/10.1371/journal.pone.0128566 Text en © 2015 Hernández Suárez et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title Identification of Relevant Phytochemical Constituents for Characterization and Authentication of Tomatoes by General Linear Model Linked to Automatic Interaction Detection (GLM-AID) and Artificial Neural Network Models (ANNs)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4467870/
https://www.ncbi.nlm.nih.gov/pubmed/26075889
http://dx.doi.org/10.1371/journal.pone.0128566