Cargando…

WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture

Smart farming (SF) applications rely on robust and accurate computer vision systems. An important computer vision task in agriculture is semantic segmentation, which aims to classify each pixel of an image and can be used for selective weed removal. State-of-the-art implementations use convolutional...

Descripción completa

Detalles Bibliográficos
Autores principales: Kitzler, Florian, Barta, Norbert, Neugschwandtner, Reinhard W., Gronauer, Andreas, Motsch, Viktoria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007111/
https://www.ncbi.nlm.nih.gov/pubmed/36904917
http://dx.doi.org/10.3390/s23052713
_version_ 1784905437665034240
author Kitzler, Florian
Barta, Norbert
Neugschwandtner, Reinhard W.
Gronauer, Andreas
Motsch, Viktoria
author_facet Kitzler, Florian
Barta, Norbert
Neugschwandtner, Reinhard W.
Gronauer, Andreas
Motsch, Viktoria
author_sort Kitzler, Florian
collection PubMed
description Smart farming (SF) applications rely on robust and accurate computer vision systems. An important computer vision task in agriculture is semantic segmentation, which aims to classify each pixel of an image and can be used for selective weed removal. State-of-the-art implementations use convolutional neural networks (CNN) that are trained on large image datasets. In agriculture, publicly available RGB image datasets are scarce and often lack detailed ground-truth information. In contrast to agriculture, other research areas feature RGB-D datasets that combine color (RGB) with additional distance (D) information. Such results show that including distance as an additional modality can improve model performance further. Therefore, we introduce WE3DS as the first RGB-D image dataset for multi-class plant species semantic segmentation in crop farming. It contains 2568 RGB-D images (color image and distance map) and corresponding hand-annotated ground-truth masks. Images were taken under natural light conditions using an RGB-D sensor consisting of two RGB cameras in a stereo setup. Further, we provide a benchmark for RGB-D semantic segmentation on the WE3DS dataset and compare it with a solely RGB-based model. Our trained models achieve up to 70.7% mean Intersection over Union (mIoU) for discriminating between soil, seven crop species, and ten weed species. Finally, our work confirms the finding that additional distance information improves segmentation quality.
format Online
Article
Text
id pubmed-10007111
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100071112023-03-12 WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture Kitzler, Florian Barta, Norbert Neugschwandtner, Reinhard W. Gronauer, Andreas Motsch, Viktoria Sensors (Basel) Article Smart farming (SF) applications rely on robust and accurate computer vision systems. An important computer vision task in agriculture is semantic segmentation, which aims to classify each pixel of an image and can be used for selective weed removal. State-of-the-art implementations use convolutional neural networks (CNN) that are trained on large image datasets. In agriculture, publicly available RGB image datasets are scarce and often lack detailed ground-truth information. In contrast to agriculture, other research areas feature RGB-D datasets that combine color (RGB) with additional distance (D) information. Such results show that including distance as an additional modality can improve model performance further. Therefore, we introduce WE3DS as the first RGB-D image dataset for multi-class plant species semantic segmentation in crop farming. It contains 2568 RGB-D images (color image and distance map) and corresponding hand-annotated ground-truth masks. Images were taken under natural light conditions using an RGB-D sensor consisting of two RGB cameras in a stereo setup. Further, we provide a benchmark for RGB-D semantic segmentation on the WE3DS dataset and compare it with a solely RGB-based model. Our trained models achieve up to 70.7% mean Intersection over Union (mIoU) for discriminating between soil, seven crop species, and ten weed species. Finally, our work confirms the finding that additional distance information improves segmentation quality. MDPI 2023-03-01 /pmc/articles/PMC10007111/ /pubmed/36904917 http://dx.doi.org/10.3390/s23052713 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kitzler, Florian
Barta, Norbert
Neugschwandtner, Reinhard W.
Gronauer, Andreas
Motsch, Viktoria
WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture
title WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture
title_full WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture
title_fullStr WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture
title_full_unstemmed WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture
title_short WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture
title_sort we3ds: an rgb-d image dataset for semantic segmentation in agriculture
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007111/
https://www.ncbi.nlm.nih.gov/pubmed/36904917
http://dx.doi.org/10.3390/s23052713
work_keys_str_mv AT kitzlerflorian we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture
AT bartanorbert we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture
AT neugschwandtnerreinhardw we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture
AT gronauerandreas we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture
AT motschviktoria we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture