Cargando…

A novel transfer learning framework for sorghum biomass prediction using UAV-based remote sensing data and genetic markers

Yield for biofuel crops is measured in terms of biomass, so measurements throughout the growing season are crucial in breeding programs, yet traditionally time- and labor-consuming since they involve destructive sampling. Modern remote sensing platforms, such as unmanned aerial vehicles (UAVs), can...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Taojun, Crawford, Melba M., Tuinstra, Mitchell R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126475/
https://www.ncbi.nlm.nih.gov/pubmed/37113602
http://dx.doi.org/10.3389/fpls.2023.1138479
_version_ 1785030255453405184
author Wang, Taojun
Crawford, Melba M.
Tuinstra, Mitchell R.
author_facet Wang, Taojun
Crawford, Melba M.
Tuinstra, Mitchell R.
author_sort Wang, Taojun
collection PubMed
description Yield for biofuel crops is measured in terms of biomass, so measurements throughout the growing season are crucial in breeding programs, yet traditionally time- and labor-consuming since they involve destructive sampling. Modern remote sensing platforms, such as unmanned aerial vehicles (UAVs), can carry multiple sensors and collect numerous phenotypic traits with efficient, non-invasive field surveys. However, modeling the complex relationships between the observed phenotypic traits and biomass remains a challenging task, as the ground reference data are very limited for each genotype in the breeding experiment. In this study, a Long Short-Term Memory (LSTM) based Recurrent Neural Network (RNN) model is proposed for sorghum biomass prediction. The architecture is designed to exploit the time series remote sensing and weather data, as well as static genotypic information. As a large number of features have been derived from the remote sensing data, feature importance analysis is conducted to identify and remove redundant features. A strategy to extract representative information from high-dimensional genetic markers is proposed. To enhance generalization and minimize the need for ground reference data, transfer learning strategies are proposed for selecting the most informative training samples from the target domain. Consequently, a pre-trained model can be refined with limited training samples. Field experiments were conducted over a sorghum breeding trial planted in multiple years with more than 600 testcross hybrids. The results show that the proposed LSTM-based RNN model can achieve high accuracies for single year prediction. Further, with the proposed transfer learning strategies, a pre-trained model can be refined with limited training samples from the target domain and predict biomass with an accuracy comparable to that from a trained-from-scratch model for both multiple experiments within a given year and across multiple years.
format Online
Article
Text
id pubmed-10126475
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-101264752023-04-26 A novel transfer learning framework for sorghum biomass prediction using UAV-based remote sensing data and genetic markers Wang, Taojun Crawford, Melba M. Tuinstra, Mitchell R. Front Plant Sci Plant Science Yield for biofuel crops is measured in terms of biomass, so measurements throughout the growing season are crucial in breeding programs, yet traditionally time- and labor-consuming since they involve destructive sampling. Modern remote sensing platforms, such as unmanned aerial vehicles (UAVs), can carry multiple sensors and collect numerous phenotypic traits with efficient, non-invasive field surveys. However, modeling the complex relationships between the observed phenotypic traits and biomass remains a challenging task, as the ground reference data are very limited for each genotype in the breeding experiment. In this study, a Long Short-Term Memory (LSTM) based Recurrent Neural Network (RNN) model is proposed for sorghum biomass prediction. The architecture is designed to exploit the time series remote sensing and weather data, as well as static genotypic information. As a large number of features have been derived from the remote sensing data, feature importance analysis is conducted to identify and remove redundant features. A strategy to extract representative information from high-dimensional genetic markers is proposed. To enhance generalization and minimize the need for ground reference data, transfer learning strategies are proposed for selecting the most informative training samples from the target domain. Consequently, a pre-trained model can be refined with limited training samples. Field experiments were conducted over a sorghum breeding trial planted in multiple years with more than 600 testcross hybrids. The results show that the proposed LSTM-based RNN model can achieve high accuracies for single year prediction. Further, with the proposed transfer learning strategies, a pre-trained model can be refined with limited training samples from the target domain and predict biomass with an accuracy comparable to that from a trained-from-scratch model for both multiple experiments within a given year and across multiple years. Frontiers Media S.A. 2023-04-11 /pmc/articles/PMC10126475/ /pubmed/37113602 http://dx.doi.org/10.3389/fpls.2023.1138479 Text en Copyright © 2023 Wang, Crawford and Tuinstra https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Wang, Taojun
Crawford, Melba M.
Tuinstra, Mitchell R.
A novel transfer learning framework for sorghum biomass prediction using UAV-based remote sensing data and genetic markers
title A novel transfer learning framework for sorghum biomass prediction using UAV-based remote sensing data and genetic markers
title_full A novel transfer learning framework for sorghum biomass prediction using UAV-based remote sensing data and genetic markers
title_fullStr A novel transfer learning framework for sorghum biomass prediction using UAV-based remote sensing data and genetic markers
title_full_unstemmed A novel transfer learning framework for sorghum biomass prediction using UAV-based remote sensing data and genetic markers
title_short A novel transfer learning framework for sorghum biomass prediction using UAV-based remote sensing data and genetic markers
title_sort novel transfer learning framework for sorghum biomass prediction using uav-based remote sensing data and genetic markers
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10126475/
https://www.ncbi.nlm.nih.gov/pubmed/37113602
http://dx.doi.org/10.3389/fpls.2023.1138479
work_keys_str_mv AT wangtaojun anoveltransferlearningframeworkforsorghumbiomasspredictionusinguavbasedremotesensingdataandgeneticmarkers
AT crawfordmelbam anoveltransferlearningframeworkforsorghumbiomasspredictionusinguavbasedremotesensingdataandgeneticmarkers
AT tuinstramitchellr anoveltransferlearningframeworkforsorghumbiomasspredictionusinguavbasedremotesensingdataandgeneticmarkers
AT wangtaojun noveltransferlearningframeworkforsorghumbiomasspredictionusinguavbasedremotesensingdataandgeneticmarkers
AT crawfordmelbam noveltransferlearningframeworkforsorghumbiomasspredictionusinguavbasedremotesensingdataandgeneticmarkers
AT tuinstramitchellr noveltransferlearningframeworkforsorghumbiomasspredictionusinguavbasedremotesensingdataandgeneticmarkers