Cargando…

Fast Panoptic Segmentation with Soft Attention Embeddings

Panoptic segmentation provides a rich 2D environment representation by unifying semantic and instance segmentation. Most current state-of-the-art panoptic segmentation methods are built upon two-stage detectors and are not suitable for real-time applications, such as automated driving, due to their...

Descripción completa

Detalles Bibliográficos
Autores principales:	Petrovai, Andra, Nedevschi, Sergiu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837929/ https://www.ncbi.nlm.nih.gov/pubmed/35161529 http://dx.doi.org/10.3390/s22030783

_version_	1784649999896805376
author	Petrovai, Andra Nedevschi, Sergiu
author_facet	Petrovai, Andra Nedevschi, Sergiu
author_sort	Petrovai, Andra
collection	PubMed
description	Panoptic segmentation provides a rich 2D environment representation by unifying semantic and instance segmentation. Most current state-of-the-art panoptic segmentation methods are built upon two-stage detectors and are not suitable for real-time applications, such as automated driving, due to their high computational complexity. In this work, we introduce a novel, fast and accurate single-stage panoptic segmentation network that employs a shared feature extraction backbone and three network heads for object detection, semantic segmentation, instance-level attention masks. Guided by object detections, our new panoptic segmentation head learns instance specific soft attention masks based on spatial embeddings. The semantic masks for stuff classes and soft instance masks for things classes are pixel-wise coherent and can be easily integrated in a panoptic output. The training and inference pipelines are simplified and no post-processing of the panoptic output is necessary. Benefiting from fast inference speed, the network can be deployed in automated vehicles or robotic applications. We perform extensive experiments on COCO and Cityscapes datasets and obtain competitive results in both accuracy and time. On the Cityscapes dataset we achieve 59.7 panoptic quality with an inference speed of more than 10 FPS on high resolution 1024 × 2048 images.
format	Online Article Text
id	pubmed-8837929
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-88379292022-02-13 Fast Panoptic Segmentation with Soft Attention Embeddings Petrovai, Andra Nedevschi, Sergiu Sensors (Basel) Article Panoptic segmentation provides a rich 2D environment representation by unifying semantic and instance segmentation. Most current state-of-the-art panoptic segmentation methods are built upon two-stage detectors and are not suitable for real-time applications, such as automated driving, due to their high computational complexity. In this work, we introduce a novel, fast and accurate single-stage panoptic segmentation network that employs a shared feature extraction backbone and three network heads for object detection, semantic segmentation, instance-level attention masks. Guided by object detections, our new panoptic segmentation head learns instance specific soft attention masks based on spatial embeddings. The semantic masks for stuff classes and soft instance masks for things classes are pixel-wise coherent and can be easily integrated in a panoptic output. The training and inference pipelines are simplified and no post-processing of the panoptic output is necessary. Benefiting from fast inference speed, the network can be deployed in automated vehicles or robotic applications. We perform extensive experiments on COCO and Cityscapes datasets and obtain competitive results in both accuracy and time. On the Cityscapes dataset we achieve 59.7 panoptic quality with an inference speed of more than 10 FPS on high resolution 1024 × 2048 images. MDPI 2022-01-20 /pmc/articles/PMC8837929/ /pubmed/35161529 http://dx.doi.org/10.3390/s22030783 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Petrovai, Andra Nedevschi, Sergiu Fast Panoptic Segmentation with Soft Attention Embeddings
title	Fast Panoptic Segmentation with Soft Attention Embeddings
title_full	Fast Panoptic Segmentation with Soft Attention Embeddings
title_fullStr	Fast Panoptic Segmentation with Soft Attention Embeddings
title_full_unstemmed	Fast Panoptic Segmentation with Soft Attention Embeddings
title_short	Fast Panoptic Segmentation with Soft Attention Embeddings
title_sort	fast panoptic segmentation with soft attention embeddings
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837929/ https://www.ncbi.nlm.nih.gov/pubmed/35161529 http://dx.doi.org/10.3390/s22030783
work_keys_str_mv	AT petrovaiandra fastpanopticsegmentationwithsoftattentionembeddings AT nedevschisergiu fastpanopticsegmentationwithsoftattentionembeddings

Fast Panoptic Segmentation with Soft Attention Embeddings

Ejemplares similares