Cargando…

Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data

One of the biggest challenges of training deep neural network is the need for massive data annotation. To train the neural network for object detection, millions of annotated training images are required. However, currently, there are no large-scale thermal image datasets that could be used to train...

Descripción completa

Detalles Bibliográficos
Autores principales: Ligocki, Adam, Jelinek, Ales, Zalud, Ludek, Rahtu, Esa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7926581/
https://www.ncbi.nlm.nih.gov/pubmed/33672344
http://dx.doi.org/10.3390/s21041552
_version_ 1783659499789221888
author Ligocki, Adam
Jelinek, Ales
Zalud, Ludek
Rahtu, Esa
author_facet Ligocki, Adam
Jelinek, Ales
Zalud, Ludek
Rahtu, Esa
author_sort Ligocki, Adam
collection PubMed
description One of the biggest challenges of training deep neural network is the need for massive data annotation. To train the neural network for object detection, millions of annotated training images are required. However, currently, there are no large-scale thermal image datasets that could be used to train the state of the art neural networks, while voluminous RGB image datasets are available. This paper presents a method that allows to create hundreds of thousands of annotated thermal images using the RGB pre-trained object detector. A dataset created in this way can be used to train object detectors with improved performance. The main gain of this work is the novel method for fully automatic thermal image labeling. The proposed system uses the RGB camera, thermal camera, 3D LiDAR, and the pre-trained neural network that detects objects in the RGB domain. Using this setup, it is possible to run the fully automated process that annotates the thermal images and creates the automatically annotated thermal training dataset. As the result, we created a dataset containing hundreds of thousands of annotated objects. This approach allows to train deep learning models with similar performance as the common human-annotation-based methods do. This paper also proposes several improvements to fine-tune the results with minimal human intervention. Finally, the evaluation of the proposed solution shows that the method gives significantly better results than training the neural network with standard small-scale hand-annotated thermal image datasets.
format Online
Article
Text
id pubmed-7926581
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79265812021-03-04 Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data Ligocki, Adam Jelinek, Ales Zalud, Ludek Rahtu, Esa Sensors (Basel) Article One of the biggest challenges of training deep neural network is the need for massive data annotation. To train the neural network for object detection, millions of annotated training images are required. However, currently, there are no large-scale thermal image datasets that could be used to train the state of the art neural networks, while voluminous RGB image datasets are available. This paper presents a method that allows to create hundreds of thousands of annotated thermal images using the RGB pre-trained object detector. A dataset created in this way can be used to train object detectors with improved performance. The main gain of this work is the novel method for fully automatic thermal image labeling. The proposed system uses the RGB camera, thermal camera, 3D LiDAR, and the pre-trained neural network that detects objects in the RGB domain. Using this setup, it is possible to run the fully automated process that annotates the thermal images and creates the automatically annotated thermal training dataset. As the result, we created a dataset containing hundreds of thousands of annotated objects. This approach allows to train deep learning models with similar performance as the common human-annotation-based methods do. This paper also proposes several improvements to fine-tune the results with minimal human intervention. Finally, the evaluation of the proposed solution shows that the method gives significantly better results than training the neural network with standard small-scale hand-annotated thermal image datasets. MDPI 2021-02-23 /pmc/articles/PMC7926581/ /pubmed/33672344 http://dx.doi.org/10.3390/s21041552 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ligocki, Adam
Jelinek, Ales
Zalud, Ludek
Rahtu, Esa
Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data
title Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data
title_full Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data
title_fullStr Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data
title_full_unstemmed Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data
title_short Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data
title_sort fully automated dcnn-based thermal images annotation using neural network pretrained on rgb data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7926581/
https://www.ncbi.nlm.nih.gov/pubmed/33672344
http://dx.doi.org/10.3390/s21041552
work_keys_str_mv AT ligockiadam fullyautomateddcnnbasedthermalimagesannotationusingneuralnetworkpretrainedonrgbdata
AT jelinekales fullyautomateddcnnbasedthermalimagesannotationusingneuralnetworkpretrainedonrgbdata
AT zaludludek fullyautomateddcnnbasedthermalimagesannotationusingneuralnetworkpretrainedonrgbdata
AT rahtuesa fullyautomateddcnnbasedthermalimagesannotationusingneuralnetworkpretrainedonrgbdata