Cargando…
Multilabel Video Classification Model of Navigation Mark's Lights Based on Deep Learning
At night, buoys and other navigation marks disappear to be replaced by fixed or flashing lights. Navigation marks are seen as a set of lights in various colors rather than their familiar outline. Deciphering that the meaning of the lights is a burden to navigators, it is also a new challenging resea...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8601801/ https://www.ncbi.nlm.nih.gov/pubmed/34804148 http://dx.doi.org/10.1155/2021/6794202 |
_version_ | 1784601427397574656 |
---|---|
author | Han, Xu Pan, Mingyang Ge, Haipeng Li, Shaoxi Hu, Jingfeng Zhao, Lining Li, Yu |
author_facet | Han, Xu Pan, Mingyang Ge, Haipeng Li, Shaoxi Hu, Jingfeng Zhao, Lining Li, Yu |
author_sort | Han, Xu |
collection | PubMed |
description | At night, buoys and other navigation marks disappear to be replaced by fixed or flashing lights. Navigation marks are seen as a set of lights in various colors rather than their familiar outline. Deciphering that the meaning of the lights is a burden to navigators, it is also a new challenging research direction of intelligent sensing of navigation environment. The study studied initiatively the intelligent recognition of lights on navigation marks at night based on multilabel video classification methods. To capture effectively the characteristics of navigation mark's lights, including both color and flashing phase, three different multilabel classification models based on binary relevance, label power set, and adapted algorithm were investigated and compared. According to the experiment's results performed on a data set with 8000 minutes video, the model based on binary relevance, named NMLNet, has highest accuracy about 99.23% to classify 9 types of navigation mark's lights. It also has the fastest computation speed with least network parameters. In the NMLNet, there are two branches for the classifications of color and flashing, respectively, and for the flashing classification, an improved MobileNet-v2 was used to capture the brightness characteristic of lights in each video frame, and an LSTM is used to capture the temporal dynamics of lights. Aiming to run on mobile devices on vessel, the MobileNet-v2 was used as backbone, and with the improvement of spatial attention mechanism, it achieved the accuracy near Resnet-50 while keeping its high speed. |
format | Online Article Text |
id | pubmed-8601801 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-86018012021-11-19 Multilabel Video Classification Model of Navigation Mark's Lights Based on Deep Learning Han, Xu Pan, Mingyang Ge, Haipeng Li, Shaoxi Hu, Jingfeng Zhao, Lining Li, Yu Comput Intell Neurosci Research Article At night, buoys and other navigation marks disappear to be replaced by fixed or flashing lights. Navigation marks are seen as a set of lights in various colors rather than their familiar outline. Deciphering that the meaning of the lights is a burden to navigators, it is also a new challenging research direction of intelligent sensing of navigation environment. The study studied initiatively the intelligent recognition of lights on navigation marks at night based on multilabel video classification methods. To capture effectively the characteristics of navigation mark's lights, including both color and flashing phase, three different multilabel classification models based on binary relevance, label power set, and adapted algorithm were investigated and compared. According to the experiment's results performed on a data set with 8000 minutes video, the model based on binary relevance, named NMLNet, has highest accuracy about 99.23% to classify 9 types of navigation mark's lights. It also has the fastest computation speed with least network parameters. In the NMLNet, there are two branches for the classifications of color and flashing, respectively, and for the flashing classification, an improved MobileNet-v2 was used to capture the brightness characteristic of lights in each video frame, and an LSTM is used to capture the temporal dynamics of lights. Aiming to run on mobile devices on vessel, the MobileNet-v2 was used as backbone, and with the improvement of spatial attention mechanism, it achieved the accuracy near Resnet-50 while keeping its high speed. Hindawi 2021-11-11 /pmc/articles/PMC8601801/ /pubmed/34804148 http://dx.doi.org/10.1155/2021/6794202 Text en Copyright © 2021 Xu Han et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Han, Xu Pan, Mingyang Ge, Haipeng Li, Shaoxi Hu, Jingfeng Zhao, Lining Li, Yu Multilabel Video Classification Model of Navigation Mark's Lights Based on Deep Learning |
title | Multilabel Video Classification Model of Navigation Mark's Lights Based on Deep Learning |
title_full | Multilabel Video Classification Model of Navigation Mark's Lights Based on Deep Learning |
title_fullStr | Multilabel Video Classification Model of Navigation Mark's Lights Based on Deep Learning |
title_full_unstemmed | Multilabel Video Classification Model of Navigation Mark's Lights Based on Deep Learning |
title_short | Multilabel Video Classification Model of Navigation Mark's Lights Based on Deep Learning |
title_sort | multilabel video classification model of navigation mark's lights based on deep learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8601801/ https://www.ncbi.nlm.nih.gov/pubmed/34804148 http://dx.doi.org/10.1155/2021/6794202 |
work_keys_str_mv | AT hanxu multilabelvideoclassificationmodelofnavigationmarkslightsbasedondeeplearning AT panmingyang multilabelvideoclassificationmodelofnavigationmarkslightsbasedondeeplearning AT gehaipeng multilabelvideoclassificationmodelofnavigationmarkslightsbasedondeeplearning AT lishaoxi multilabelvideoclassificationmodelofnavigationmarkslightsbasedondeeplearning AT hujingfeng multilabelvideoclassificationmodelofnavigationmarkslightsbasedondeeplearning AT zhaolining multilabelvideoclassificationmodelofnavigationmarkslightsbasedondeeplearning AT liyu multilabelvideoclassificationmodelofnavigationmarkslightsbasedondeeplearning |