Cargando…

Hyperspectral Leaf Reflectance as Proxy for Photosynthetic Capacities: An Ensemble Approach Based on Multiple Machine Learning Algorithms

Global agriculture production is challenged by increasing demands from rising population and a changing climate, which may be alleviated through development of genetically improved crop cultivars. Research into increasing photosynthetic energy conversion efficiency has proposed many strategies to im...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Peng, Meacham-Hensold, Katherine, Guan, Kaiyu, Bernacchi, Carl J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6556518/
https://www.ncbi.nlm.nih.gov/pubmed/31214235
http://dx.doi.org/10.3389/fpls.2019.00730
_version_ 1783425341921951744
author Fu, Peng
Meacham-Hensold, Katherine
Guan, Kaiyu
Bernacchi, Carl J.
author_facet Fu, Peng
Meacham-Hensold, Katherine
Guan, Kaiyu
Bernacchi, Carl J.
author_sort Fu, Peng
collection PubMed
description Global agriculture production is challenged by increasing demands from rising population and a changing climate, which may be alleviated through development of genetically improved crop cultivars. Research into increasing photosynthetic energy conversion efficiency has proposed many strategies to improve production but have yet to yield real-world solutions, largely because of a phenotyping bottleneck. Partial least squares regression (PLSR) is a statistical technique that is increasingly used to relate hyperspectral reflectance to key photosynthetic capacities associated with carbon uptake (maximum carboxylation rate of Rubisco, V(c,max)) and conversion of light energy (maximum electron transport rate supporting RuBP regeneration, J(max)) to alleviate this bottleneck. However, its performance varies significantly across different plant species, regions, and growth environments. Thus, to cope with the heterogeneous performances of PLSR, this study aims to develop a new approach to estimate photosynthetic capacities. A framework was developed that combines six machine learning algorithms, including artificial neural network (ANN), support vector machine (SVM), least absolute shrinkage and selection operator (LASSO), random forest (RF), Gaussian process (GP), and PLSR to optimize high-throughput analysis of the two photosynthetic variables. Six tobacco genotypes, including both transgenic and wild-type lines, with a range of photosynthetic capacities were used to test the framework. Leaf reflectance spectra were measured from 400 to 2500 nm using a high-spectral-resolution spectroradiometer. Corresponding photosynthesis vs. intercellular CO(2) concentration response curves were measured for each leaf using a leaf gas-exchange system. Results suggested that the mean R(2) value of the six regression techniques for predicting V(c,max) (J(max)) ranged from 0.60 (0.45) to 0.65 (0.56) with the mean RMSE value varying from 47.1 (40.1) to 54.0 (44.7) μmol m(-2) s(-1). Regression stacking for V(c,max) (J(max)) performed better than the individual regression techniques with increases in R(2) of 0.1 (0.08) and decreases in RMSE by 4.1 (6.6) μmol m(-2) s(-1), equal to 8% (15%) reduction in RMSE. Better predictive performance of the regression stacking is likely attributed to the varying coefficients (or weights) in the level-2 model (the LASSO model) and the diverse ability of each individual regression technique to utilize spectral information for the best modeling performance. Further refinements can be made to apply this stacked regression technique to other plant phenotypic traits.
format Online
Article
Text
id pubmed-6556518
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-65565182019-06-18 Hyperspectral Leaf Reflectance as Proxy for Photosynthetic Capacities: An Ensemble Approach Based on Multiple Machine Learning Algorithms Fu, Peng Meacham-Hensold, Katherine Guan, Kaiyu Bernacchi, Carl J. Front Plant Sci Plant Science Global agriculture production is challenged by increasing demands from rising population and a changing climate, which may be alleviated through development of genetically improved crop cultivars. Research into increasing photosynthetic energy conversion efficiency has proposed many strategies to improve production but have yet to yield real-world solutions, largely because of a phenotyping bottleneck. Partial least squares regression (PLSR) is a statistical technique that is increasingly used to relate hyperspectral reflectance to key photosynthetic capacities associated with carbon uptake (maximum carboxylation rate of Rubisco, V(c,max)) and conversion of light energy (maximum electron transport rate supporting RuBP regeneration, J(max)) to alleviate this bottleneck. However, its performance varies significantly across different plant species, regions, and growth environments. Thus, to cope with the heterogeneous performances of PLSR, this study aims to develop a new approach to estimate photosynthetic capacities. A framework was developed that combines six machine learning algorithms, including artificial neural network (ANN), support vector machine (SVM), least absolute shrinkage and selection operator (LASSO), random forest (RF), Gaussian process (GP), and PLSR to optimize high-throughput analysis of the two photosynthetic variables. Six tobacco genotypes, including both transgenic and wild-type lines, with a range of photosynthetic capacities were used to test the framework. Leaf reflectance spectra were measured from 400 to 2500 nm using a high-spectral-resolution spectroradiometer. Corresponding photosynthesis vs. intercellular CO(2) concentration response curves were measured for each leaf using a leaf gas-exchange system. Results suggested that the mean R(2) value of the six regression techniques for predicting V(c,max) (J(max)) ranged from 0.60 (0.45) to 0.65 (0.56) with the mean RMSE value varying from 47.1 (40.1) to 54.0 (44.7) μmol m(-2) s(-1). Regression stacking for V(c,max) (J(max)) performed better than the individual regression techniques with increases in R(2) of 0.1 (0.08) and decreases in RMSE by 4.1 (6.6) μmol m(-2) s(-1), equal to 8% (15%) reduction in RMSE. Better predictive performance of the regression stacking is likely attributed to the varying coefficients (or weights) in the level-2 model (the LASSO model) and the diverse ability of each individual regression technique to utilize spectral information for the best modeling performance. Further refinements can be made to apply this stacked regression technique to other plant phenotypic traits. Frontiers Media S.A. 2019-06-03 /pmc/articles/PMC6556518/ /pubmed/31214235 http://dx.doi.org/10.3389/fpls.2019.00730 Text en Copyright © 2019 Fu, Meacham-Hensold, Guan and Bernacchi. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Fu, Peng
Meacham-Hensold, Katherine
Guan, Kaiyu
Bernacchi, Carl J.
Hyperspectral Leaf Reflectance as Proxy for Photosynthetic Capacities: An Ensemble Approach Based on Multiple Machine Learning Algorithms
title Hyperspectral Leaf Reflectance as Proxy for Photosynthetic Capacities: An Ensemble Approach Based on Multiple Machine Learning Algorithms
title_full Hyperspectral Leaf Reflectance as Proxy for Photosynthetic Capacities: An Ensemble Approach Based on Multiple Machine Learning Algorithms
title_fullStr Hyperspectral Leaf Reflectance as Proxy for Photosynthetic Capacities: An Ensemble Approach Based on Multiple Machine Learning Algorithms
title_full_unstemmed Hyperspectral Leaf Reflectance as Proxy for Photosynthetic Capacities: An Ensemble Approach Based on Multiple Machine Learning Algorithms
title_short Hyperspectral Leaf Reflectance as Proxy for Photosynthetic Capacities: An Ensemble Approach Based on Multiple Machine Learning Algorithms
title_sort hyperspectral leaf reflectance as proxy for photosynthetic capacities: an ensemble approach based on multiple machine learning algorithms
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6556518/
https://www.ncbi.nlm.nih.gov/pubmed/31214235
http://dx.doi.org/10.3389/fpls.2019.00730
work_keys_str_mv AT fupeng hyperspectralleafreflectanceasproxyforphotosyntheticcapacitiesanensembleapproachbasedonmultiplemachinelearningalgorithms
AT meachamhensoldkatherine hyperspectralleafreflectanceasproxyforphotosyntheticcapacitiesanensembleapproachbasedonmultiplemachinelearningalgorithms
AT guankaiyu hyperspectralleafreflectanceasproxyforphotosyntheticcapacitiesanensembleapproachbasedonmultiplemachinelearningalgorithms
AT bernacchicarlj hyperspectralleafreflectanceasproxyforphotosyntheticcapacitiesanensembleapproachbasedonmultiplemachinelearningalgorithms