Cargando…
Novel Method of Semantic Segmentation Applicable to Augmented Reality
This paper proposes a novel method of semantic segmentation, consisting of modified dilated residual network, atrous pyramid pooling module, and backpropagation, that is applicable to augmented reality (AR). In the proposed method, the modified dilated residual network extracts a feature map from th...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146136/ https://www.ncbi.nlm.nih.gov/pubmed/32245002 http://dx.doi.org/10.3390/s20061737 |
_version_ | 1783520130309816320 |
---|---|
author | Ko, Tae-young Lee, Seung-ho |
author_facet | Ko, Tae-young Lee, Seung-ho |
author_sort | Ko, Tae-young |
collection | PubMed |
description | This paper proposes a novel method of semantic segmentation, consisting of modified dilated residual network, atrous pyramid pooling module, and backpropagation, that is applicable to augmented reality (AR). In the proposed method, the modified dilated residual network extracts a feature map from the original images and maintains spatial information. The atrous pyramid pooling module places convolutions in parallel and layers feature maps in a pyramid shape to extract objects occupying small areas in the image; these are converted into one channel using a 1 × 1 convolution. Backpropagation compares the semantic segmentation obtained through convolution from the final feature map with the ground truth provided by a database. Losses can be reduced by applying backpropagation to the modified dilated residual network to change the weighting. The proposed method was compared with other methods on the Cityscapes and PASCAL VOC 2012 databases. The proposed method achieved accuracies of 82.8 and 89.8 mean intersection over union (mIOU) and frame rates of 61 and 64.3 frames per second (fps) for the Cityscapes and PASCAL VOC 2012 databases, respectively. These results prove the applicability of the proposed method for implementing natural AR applications at actual speeds because the frame rate is greater than 60 fps. |
format | Online Article Text |
id | pubmed-7146136 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-71461362020-04-15 Novel Method of Semantic Segmentation Applicable to Augmented Reality Ko, Tae-young Lee, Seung-ho Sensors (Basel) Article This paper proposes a novel method of semantic segmentation, consisting of modified dilated residual network, atrous pyramid pooling module, and backpropagation, that is applicable to augmented reality (AR). In the proposed method, the modified dilated residual network extracts a feature map from the original images and maintains spatial information. The atrous pyramid pooling module places convolutions in parallel and layers feature maps in a pyramid shape to extract objects occupying small areas in the image; these are converted into one channel using a 1 × 1 convolution. Backpropagation compares the semantic segmentation obtained through convolution from the final feature map with the ground truth provided by a database. Losses can be reduced by applying backpropagation to the modified dilated residual network to change the weighting. The proposed method was compared with other methods on the Cityscapes and PASCAL VOC 2012 databases. The proposed method achieved accuracies of 82.8 and 89.8 mean intersection over union (mIOU) and frame rates of 61 and 64.3 frames per second (fps) for the Cityscapes and PASCAL VOC 2012 databases, respectively. These results prove the applicability of the proposed method for implementing natural AR applications at actual speeds because the frame rate is greater than 60 fps. MDPI 2020-03-20 /pmc/articles/PMC7146136/ /pubmed/32245002 http://dx.doi.org/10.3390/s20061737 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Ko, Tae-young Lee, Seung-ho Novel Method of Semantic Segmentation Applicable to Augmented Reality |
title | Novel Method of Semantic Segmentation Applicable to Augmented Reality |
title_full | Novel Method of Semantic Segmentation Applicable to Augmented Reality |
title_fullStr | Novel Method of Semantic Segmentation Applicable to Augmented Reality |
title_full_unstemmed | Novel Method of Semantic Segmentation Applicable to Augmented Reality |
title_short | Novel Method of Semantic Segmentation Applicable to Augmented Reality |
title_sort | novel method of semantic segmentation applicable to augmented reality |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7146136/ https://www.ncbi.nlm.nih.gov/pubmed/32245002 http://dx.doi.org/10.3390/s20061737 |
work_keys_str_mv | AT kotaeyoung novelmethodofsemanticsegmentationapplicabletoaugmentedreality AT leeseungho novelmethodofsemanticsegmentationapplicabletoaugmentedreality |