Cargando…

Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models

In recent years, deep learning models have become the standard for agricultural computer vision. Such models are typically fine-tuned to agricultural tasks using model weights that were originally fit to more general, non-agricultural datasets. This lack of agriculture-specific fine-tuning potential...

Descripción completa

Detalles Bibliográficos
Autores principales:	Joshi, Amogh, Guevara, Dario, Earles, Mason
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	AAAS 2023
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10482164/ https://www.ncbi.nlm.nih.gov/pubmed/37680999 http://dx.doi.org/10.34133/plantphenomics.0084

_version_	1785102124219105280
author	Joshi, Amogh Guevara, Dario Earles, Mason
author_facet	Joshi, Amogh Guevara, Dario Earles, Mason
author_sort	Joshi, Amogh
collection	PubMed
description	In recent years, deep learning models have become the standard for agricultural computer vision. Such models are typically fine-tuned to agricultural tasks using model weights that were originally fit to more general, non-agricultural datasets. This lack of agriculture-specific fine-tuning potentially increases training time and resource use, and decreases model performance, leading to an overall decrease in data efficiency. To overcome this limitation, we collect a wide range of existing public datasets for 3 distinct tasks, standardize them, and construct standard training and evaluation pipelines, providing us with a set of benchmarks and pretrained models. We then conduct a number of experiments using methods that are commonly used in deep learning tasks but unexplored in their domain-specific applications for agriculture. Our experiments guide us in developing a number of approaches to improve data efficiency when training agricultural deep learning models, without large-scale modifications to existing pipelines. Our results demonstrate that even slight training modifications, such as using agricultural pretrained model weights, or adopting specific spatial augmentations into data processing pipelines, can considerably boost model performance and result in shorter convergence time, saving training resources. Furthermore, we find that even models trained on low-quality annotations can produce comparable levels of performance to their high-quality equivalents, suggesting that datasets with poor annotations can still be used for training, expanding the pool of currently available datasets. Our methods are broadly applicable throughout agricultural deep learning and present high potential for substantial data efficiency improvements.
format	Online Article Text
id	pubmed-10482164
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	AAAS
record_format	MEDLINE/PubMed
spelling	pubmed-104821642023-09-07 Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models Joshi, Amogh Guevara, Dario Earles, Mason Plant Phenomics Research Article In recent years, deep learning models have become the standard for agricultural computer vision. Such models are typically fine-tuned to agricultural tasks using model weights that were originally fit to more general, non-agricultural datasets. This lack of agriculture-specific fine-tuning potentially increases training time and resource use, and decreases model performance, leading to an overall decrease in data efficiency. To overcome this limitation, we collect a wide range of existing public datasets for 3 distinct tasks, standardize them, and construct standard training and evaluation pipelines, providing us with a set of benchmarks and pretrained models. We then conduct a number of experiments using methods that are commonly used in deep learning tasks but unexplored in their domain-specific applications for agriculture. Our experiments guide us in developing a number of approaches to improve data efficiency when training agricultural deep learning models, without large-scale modifications to existing pipelines. Our results demonstrate that even slight training modifications, such as using agricultural pretrained model weights, or adopting specific spatial augmentations into data processing pipelines, can considerably boost model performance and result in shorter convergence time, saving training resources. Furthermore, we find that even models trained on low-quality annotations can produce comparable levels of performance to their high-quality equivalents, suggesting that datasets with poor annotations can still be used for training, expanding the pool of currently available datasets. Our methods are broadly applicable throughout agricultural deep learning and present high potential for substantial data efficiency improvements. AAAS 2023-09-06 /pmc/articles/PMC10482164/ /pubmed/37680999 http://dx.doi.org/10.34133/plantphenomics.0084 Text en Copyright © 2023 Amogh Joshi et al. https://creativecommons.org/licenses/by/4.0/Exclusive licensee Nanjing Agricultural University. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution License 4.0 (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Research Article Joshi, Amogh Guevara, Dario Earles, Mason Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models
title	Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models
title_full	Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models
title_fullStr	Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models
title_full_unstemmed	Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models
title_short	Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models
title_sort	standardizing and centralizing datasets for efficient training of agricultural deep learning models
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10482164/ https://www.ncbi.nlm.nih.gov/pubmed/37680999 http://dx.doi.org/10.34133/plantphenomics.0084
work_keys_str_mv	AT joshiamogh standardizingandcentralizingdatasetsforefficienttrainingofagriculturaldeeplearningmodels AT guevaradario standardizingandcentralizingdatasetsforefficienttrainingofagriculturaldeeplearningmodels AT earlesmason standardizingandcentralizingdatasetsforefficienttrainingofagriculturaldeeplearningmodels

Standardizing and Centralizing Datasets for Efficient Training of Agricultural Deep Learning Models

Ejemplares similares