Cargando…

Forest-Scale Phenotyping: Productivity Characterisation Through Machine Learning

Advances in remote sensing combined with the emergence of sophisticated methods for large-scale data analytics from the field of data science provide new methods to model complex interactions in biological systems. Using a data-driven philosophy, insights from experts are used to corroborate the res...

Descripción completa

Detalles Bibliográficos
Autores principales: Bombrun, Maxime, Dash, Jonathan P., Pont, David, Watt, Michael S., Pearse, Grant D., Dungey, Heidi S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7068454/
https://www.ncbi.nlm.nih.gov/pubmed/32210980
http://dx.doi.org/10.3389/fpls.2020.00099
_version_ 1783505583641460736
author Bombrun, Maxime
Dash, Jonathan P.
Pont, David
Watt, Michael S.
Pearse, Grant D.
Dungey, Heidi S.
author_facet Bombrun, Maxime
Dash, Jonathan P.
Pont, David
Watt, Michael S.
Pearse, Grant D.
Dungey, Heidi S.
author_sort Bombrun, Maxime
collection PubMed
description Advances in remote sensing combined with the emergence of sophisticated methods for large-scale data analytics from the field of data science provide new methods to model complex interactions in biological systems. Using a data-driven philosophy, insights from experts are used to corroborate the results generated through analytical models instead of leading the model design. Following such an approach, this study outlines the development and implementation of a whole-of-forest phenotyping system that incorporates spatial estimates of productivity across a large plantation forest. In large-scale plantation forestry, improving the productivity and consistency of future forests is an important but challenging goal due to the multiple interactions between biotic and abiotic factors, the long breeding cycle, and the high variability of growing conditions. Forest phenotypic expression is highly affected by the interaction of environmental conditions and forest management but the understanding of this complex dynamics is incomplete. In this study, we collected an extensive set of 2.7 million observations composed of 62 variables describing climate, forest management, tree genetics, and fine-scale terrain information extracted from environmental surfaces, management records, and remotely sensed data. Using three machine learning methods, we compared models of forest productivity and evaluate the gain and Shapley values for interpreting the influence of categorical variables on the power of these methods to predict forest productivity at a landscape level. The most accurate model identified that the most important drivers of productivity were, in order of importance, genetics, environmental conditions, leaf area index, topology, and soil properties, thus describing the complex interactions of the forest. This approach demonstrates that new methods in remote sensing and data science enable powerful, landscape-level understanding of forest productivity. The phenotyping method developed here can be used to identify superior and inferior genotypes and estimate a productivity index for individual site. This approach can improve tree breeding and deployment of the right genetics to the right site in order to increase the overall productivity across planted forests.
format Online
Article
Text
id pubmed-7068454
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-70684542020-03-24 Forest-Scale Phenotyping: Productivity Characterisation Through Machine Learning Bombrun, Maxime Dash, Jonathan P. Pont, David Watt, Michael S. Pearse, Grant D. Dungey, Heidi S. Front Plant Sci Plant Science Advances in remote sensing combined with the emergence of sophisticated methods for large-scale data analytics from the field of data science provide new methods to model complex interactions in biological systems. Using a data-driven philosophy, insights from experts are used to corroborate the results generated through analytical models instead of leading the model design. Following such an approach, this study outlines the development and implementation of a whole-of-forest phenotyping system that incorporates spatial estimates of productivity across a large plantation forest. In large-scale plantation forestry, improving the productivity and consistency of future forests is an important but challenging goal due to the multiple interactions between biotic and abiotic factors, the long breeding cycle, and the high variability of growing conditions. Forest phenotypic expression is highly affected by the interaction of environmental conditions and forest management but the understanding of this complex dynamics is incomplete. In this study, we collected an extensive set of 2.7 million observations composed of 62 variables describing climate, forest management, tree genetics, and fine-scale terrain information extracted from environmental surfaces, management records, and remotely sensed data. Using three machine learning methods, we compared models of forest productivity and evaluate the gain and Shapley values for interpreting the influence of categorical variables on the power of these methods to predict forest productivity at a landscape level. The most accurate model identified that the most important drivers of productivity were, in order of importance, genetics, environmental conditions, leaf area index, topology, and soil properties, thus describing the complex interactions of the forest. This approach demonstrates that new methods in remote sensing and data science enable powerful, landscape-level understanding of forest productivity. The phenotyping method developed here can be used to identify superior and inferior genotypes and estimate a productivity index for individual site. This approach can improve tree breeding and deployment of the right genetics to the right site in order to increase the overall productivity across planted forests. Frontiers Media S.A. 2020-03-06 /pmc/articles/PMC7068454/ /pubmed/32210980 http://dx.doi.org/10.3389/fpls.2020.00099 Text en Copyright © 2020 Bombrun, Dash, Pont, Watt, Pearse and Dungey http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Bombrun, Maxime
Dash, Jonathan P.
Pont, David
Watt, Michael S.
Pearse, Grant D.
Dungey, Heidi S.
Forest-Scale Phenotyping: Productivity Characterisation Through Machine Learning
title Forest-Scale Phenotyping: Productivity Characterisation Through Machine Learning
title_full Forest-Scale Phenotyping: Productivity Characterisation Through Machine Learning
title_fullStr Forest-Scale Phenotyping: Productivity Characterisation Through Machine Learning
title_full_unstemmed Forest-Scale Phenotyping: Productivity Characterisation Through Machine Learning
title_short Forest-Scale Phenotyping: Productivity Characterisation Through Machine Learning
title_sort forest-scale phenotyping: productivity characterisation through machine learning
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7068454/
https://www.ncbi.nlm.nih.gov/pubmed/32210980
http://dx.doi.org/10.3389/fpls.2020.00099
work_keys_str_mv AT bombrunmaxime forestscalephenotypingproductivitycharacterisationthroughmachinelearning
AT dashjonathanp forestscalephenotypingproductivitycharacterisationthroughmachinelearning
AT pontdavid forestscalephenotypingproductivitycharacterisationthroughmachinelearning
AT wattmichaels forestscalephenotypingproductivitycharacterisationthroughmachinelearning
AT pearsegrantd forestscalephenotypingproductivitycharacterisationthroughmachinelearning
AT dungeyheidis forestscalephenotypingproductivitycharacterisationthroughmachinelearning