Cargando…
WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture
Smart farming (SF) applications rely on robust and accurate computer vision systems. An important computer vision task in agriculture is semantic segmentation, which aims to classify each pixel of an image and can be used for selective weed removal. State-of-the-art implementations use convolutional...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007111/ https://www.ncbi.nlm.nih.gov/pubmed/36904917 http://dx.doi.org/10.3390/s23052713 |
_version_ | 1784905437665034240 |
---|---|
author | Kitzler, Florian Barta, Norbert Neugschwandtner, Reinhard W. Gronauer, Andreas Motsch, Viktoria |
author_facet | Kitzler, Florian Barta, Norbert Neugschwandtner, Reinhard W. Gronauer, Andreas Motsch, Viktoria |
author_sort | Kitzler, Florian |
collection | PubMed |
description | Smart farming (SF) applications rely on robust and accurate computer vision systems. An important computer vision task in agriculture is semantic segmentation, which aims to classify each pixel of an image and can be used for selective weed removal. State-of-the-art implementations use convolutional neural networks (CNN) that are trained on large image datasets. In agriculture, publicly available RGB image datasets are scarce and often lack detailed ground-truth information. In contrast to agriculture, other research areas feature RGB-D datasets that combine color (RGB) with additional distance (D) information. Such results show that including distance as an additional modality can improve model performance further. Therefore, we introduce WE3DS as the first RGB-D image dataset for multi-class plant species semantic segmentation in crop farming. It contains 2568 RGB-D images (color image and distance map) and corresponding hand-annotated ground-truth masks. Images were taken under natural light conditions using an RGB-D sensor consisting of two RGB cameras in a stereo setup. Further, we provide a benchmark for RGB-D semantic segmentation on the WE3DS dataset and compare it with a solely RGB-based model. Our trained models achieve up to 70.7% mean Intersection over Union (mIoU) for discriminating between soil, seven crop species, and ten weed species. Finally, our work confirms the finding that additional distance information improves segmentation quality. |
format | Online Article Text |
id | pubmed-10007111 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100071112023-03-12 WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture Kitzler, Florian Barta, Norbert Neugschwandtner, Reinhard W. Gronauer, Andreas Motsch, Viktoria Sensors (Basel) Article Smart farming (SF) applications rely on robust and accurate computer vision systems. An important computer vision task in agriculture is semantic segmentation, which aims to classify each pixel of an image and can be used for selective weed removal. State-of-the-art implementations use convolutional neural networks (CNN) that are trained on large image datasets. In agriculture, publicly available RGB image datasets are scarce and often lack detailed ground-truth information. In contrast to agriculture, other research areas feature RGB-D datasets that combine color (RGB) with additional distance (D) information. Such results show that including distance as an additional modality can improve model performance further. Therefore, we introduce WE3DS as the first RGB-D image dataset for multi-class plant species semantic segmentation in crop farming. It contains 2568 RGB-D images (color image and distance map) and corresponding hand-annotated ground-truth masks. Images were taken under natural light conditions using an RGB-D sensor consisting of two RGB cameras in a stereo setup. Further, we provide a benchmark for RGB-D semantic segmentation on the WE3DS dataset and compare it with a solely RGB-based model. Our trained models achieve up to 70.7% mean Intersection over Union (mIoU) for discriminating between soil, seven crop species, and ten weed species. Finally, our work confirms the finding that additional distance information improves segmentation quality. MDPI 2023-03-01 /pmc/articles/PMC10007111/ /pubmed/36904917 http://dx.doi.org/10.3390/s23052713 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kitzler, Florian Barta, Norbert Neugschwandtner, Reinhard W. Gronauer, Andreas Motsch, Viktoria WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture |
title | WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture |
title_full | WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture |
title_fullStr | WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture |
title_full_unstemmed | WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture |
title_short | WE3DS: An RGB-D Image Dataset for Semantic Segmentation in Agriculture |
title_sort | we3ds: an rgb-d image dataset for semantic segmentation in agriculture |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007111/ https://www.ncbi.nlm.nih.gov/pubmed/36904917 http://dx.doi.org/10.3390/s23052713 |
work_keys_str_mv | AT kitzlerflorian we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture AT bartanorbert we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture AT neugschwandtnerreinhardw we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture AT gronauerandreas we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture AT motschviktoria we3dsanrgbdimagedatasetforsemanticsegmentationinagriculture |