Cargando…

Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree

Image/video processing for fruit detection in the tree using hard-coded feature extraction algorithms has shown high accuracy on fruit detection during recent years. While accurate, these approaches even with high-end hardware are still computationally intensive and too slow for real-time systems. T...

Descripción completa

Detalles Bibliográficos
Autores principales: Bresilla, Kushtrim, Perulli, Giulio Demetrio, Boini, Alexandra, Morandi, Brunella, Corelli Grappadelli, Luca, Manfrini, Luigi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6537632/
https://www.ncbi.nlm.nih.gov/pubmed/31178875
http://dx.doi.org/10.3389/fpls.2019.00611
_version_ 1783422051565961216
author Bresilla, Kushtrim
Perulli, Giulio Demetrio
Boini, Alexandra
Morandi, Brunella
Corelli Grappadelli, Luca
Manfrini, Luigi
author_facet Bresilla, Kushtrim
Perulli, Giulio Demetrio
Boini, Alexandra
Morandi, Brunella
Corelli Grappadelli, Luca
Manfrini, Luigi
author_sort Bresilla, Kushtrim
collection PubMed
description Image/video processing for fruit detection in the tree using hard-coded feature extraction algorithms has shown high accuracy on fruit detection during recent years. While accurate, these approaches even with high-end hardware are still computationally intensive and too slow for real-time systems. This paper details the use of deep convolution neural networks architecture based on single-stage detectors. Using deep-learning techniques eliminates the need for hard-code specific features for specific fruit shapes, color and/or other attributes. This architecture takes the input image and divides into AxA grid, where A is a configurable hyper-parameter that defines the fineness of the grid. To each grid cell an image detection and localization algorithm is applied. Each of those cells is responsible to predict bounding boxes and confidence score for fruit (apple and pear in the case of this study) detected in that cell. We want this confidence score to be high if a fruit exists in a cell, otherwise to be zero, if no fruit is in the cell. More than 100 images of apple and pear trees were taken. Each tree image with approximately 50 fruits, that at the end resulted on more than 5000 images of apple and pear fruits each. Labeling images for training consisted on manually specifying the bounding boxes for fruits, where (x, y) are the center coordinates of the box and (w, h) are width and height. This architecture showed an accuracy of more than 90% fruit detection. Based on correlation between number of visible fruits, detected fruits on one frame and the real number of fruits on one tree, a model was created to accommodate this error rate. Processing speed is higher than 20 FPS which is fast enough for any grasping/harvesting robotic arm or other real-time applications. HIGHLIGHTS: Using new convolutional deep learning techniques based on single-shot detectors to detect and count fruits (apple and pear) within the tree canopy.
format Online
Article
Text
id pubmed-6537632
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-65376322019-06-07 Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree Bresilla, Kushtrim Perulli, Giulio Demetrio Boini, Alexandra Morandi, Brunella Corelli Grappadelli, Luca Manfrini, Luigi Front Plant Sci Plant Science Image/video processing for fruit detection in the tree using hard-coded feature extraction algorithms has shown high accuracy on fruit detection during recent years. While accurate, these approaches even with high-end hardware are still computationally intensive and too slow for real-time systems. This paper details the use of deep convolution neural networks architecture based on single-stage detectors. Using deep-learning techniques eliminates the need for hard-code specific features for specific fruit shapes, color and/or other attributes. This architecture takes the input image and divides into AxA grid, where A is a configurable hyper-parameter that defines the fineness of the grid. To each grid cell an image detection and localization algorithm is applied. Each of those cells is responsible to predict bounding boxes and confidence score for fruit (apple and pear in the case of this study) detected in that cell. We want this confidence score to be high if a fruit exists in a cell, otherwise to be zero, if no fruit is in the cell. More than 100 images of apple and pear trees were taken. Each tree image with approximately 50 fruits, that at the end resulted on more than 5000 images of apple and pear fruits each. Labeling images for training consisted on manually specifying the bounding boxes for fruits, where (x, y) are the center coordinates of the box and (w, h) are width and height. This architecture showed an accuracy of more than 90% fruit detection. Based on correlation between number of visible fruits, detected fruits on one frame and the real number of fruits on one tree, a model was created to accommodate this error rate. Processing speed is higher than 20 FPS which is fast enough for any grasping/harvesting robotic arm or other real-time applications. HIGHLIGHTS: Using new convolutional deep learning techniques based on single-shot detectors to detect and count fruits (apple and pear) within the tree canopy. Frontiers Media S.A. 2019-05-21 /pmc/articles/PMC6537632/ /pubmed/31178875 http://dx.doi.org/10.3389/fpls.2019.00611 Text en Copyright © 2019 Bresilla, Perulli, Boini, Morandi, Corelli Grappadelli and Manfrini. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Bresilla, Kushtrim
Perulli, Giulio Demetrio
Boini, Alexandra
Morandi, Brunella
Corelli Grappadelli, Luca
Manfrini, Luigi
Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree
title Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree
title_full Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree
title_fullStr Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree
title_full_unstemmed Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree
title_short Single-Shot Convolution Neural Networks for Real-Time Fruit Detection Within the Tree
title_sort single-shot convolution neural networks for real-time fruit detection within the tree
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6537632/
https://www.ncbi.nlm.nih.gov/pubmed/31178875
http://dx.doi.org/10.3389/fpls.2019.00611
work_keys_str_mv AT bresillakushtrim singleshotconvolutionneuralnetworksforrealtimefruitdetectionwithinthetree
AT perulligiuliodemetrio singleshotconvolutionneuralnetworksforrealtimefruitdetectionwithinthetree
AT boinialexandra singleshotconvolutionneuralnetworksforrealtimefruitdetectionwithinthetree
AT morandibrunella singleshotconvolutionneuralnetworksforrealtimefruitdetectionwithinthetree
AT corelligrappadelliluca singleshotconvolutionneuralnetworksforrealtimefruitdetectionwithinthetree
AT manfriniluigi singleshotconvolutionneuralnetworksforrealtimefruitdetectionwithinthetree