Cargando…

Fast Panoptic Segmentation with Soft Attention Embeddings

Panoptic segmentation provides a rich 2D environment representation by unifying semantic and instance segmentation. Most current state-of-the-art panoptic segmentation methods are built upon two-stage detectors and are not suitable for real-time applications, such as automated driving, due to their...

Descripción completa

Detalles Bibliográficos
Autores principales: Petrovai, Andra, Nedevschi, Sergiu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837929/
https://www.ncbi.nlm.nih.gov/pubmed/35161529
http://dx.doi.org/10.3390/s22030783
_version_ 1784649999896805376
author Petrovai, Andra
Nedevschi, Sergiu
author_facet Petrovai, Andra
Nedevschi, Sergiu
author_sort Petrovai, Andra
collection PubMed
description Panoptic segmentation provides a rich 2D environment representation by unifying semantic and instance segmentation. Most current state-of-the-art panoptic segmentation methods are built upon two-stage detectors and are not suitable for real-time applications, such as automated driving, due to their high computational complexity. In this work, we introduce a novel, fast and accurate single-stage panoptic segmentation network that employs a shared feature extraction backbone and three network heads for object detection, semantic segmentation, instance-level attention masks. Guided by object detections, our new panoptic segmentation head learns instance specific soft attention masks based on spatial embeddings. The semantic masks for stuff classes and soft instance masks for things classes are pixel-wise coherent and can be easily integrated in a panoptic output. The training and inference pipelines are simplified and no post-processing of the panoptic output is necessary. Benefiting from fast inference speed, the network can be deployed in automated vehicles or robotic applications. We perform extensive experiments on COCO and Cityscapes datasets and obtain competitive results in both accuracy and time. On the Cityscapes dataset we achieve 59.7 panoptic quality with an inference speed of more than 10 FPS on high resolution 1024 × 2048 images.
format Online
Article
Text
id pubmed-8837929
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88379292022-02-13 Fast Panoptic Segmentation with Soft Attention Embeddings Petrovai, Andra Nedevschi, Sergiu Sensors (Basel) Article Panoptic segmentation provides a rich 2D environment representation by unifying semantic and instance segmentation. Most current state-of-the-art panoptic segmentation methods are built upon two-stage detectors and are not suitable for real-time applications, such as automated driving, due to their high computational complexity. In this work, we introduce a novel, fast and accurate single-stage panoptic segmentation network that employs a shared feature extraction backbone and three network heads for object detection, semantic segmentation, instance-level attention masks. Guided by object detections, our new panoptic segmentation head learns instance specific soft attention masks based on spatial embeddings. The semantic masks for stuff classes and soft instance masks for things classes are pixel-wise coherent and can be easily integrated in a panoptic output. The training and inference pipelines are simplified and no post-processing of the panoptic output is necessary. Benefiting from fast inference speed, the network can be deployed in automated vehicles or robotic applications. We perform extensive experiments on COCO and Cityscapes datasets and obtain competitive results in both accuracy and time. On the Cityscapes dataset we achieve 59.7 panoptic quality with an inference speed of more than 10 FPS on high resolution 1024 × 2048 images. MDPI 2022-01-20 /pmc/articles/PMC8837929/ /pubmed/35161529 http://dx.doi.org/10.3390/s22030783 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Petrovai, Andra
Nedevschi, Sergiu
Fast Panoptic Segmentation with Soft Attention Embeddings
title Fast Panoptic Segmentation with Soft Attention Embeddings
title_full Fast Panoptic Segmentation with Soft Attention Embeddings
title_fullStr Fast Panoptic Segmentation with Soft Attention Embeddings
title_full_unstemmed Fast Panoptic Segmentation with Soft Attention Embeddings
title_short Fast Panoptic Segmentation with Soft Attention Embeddings
title_sort fast panoptic segmentation with soft attention embeddings
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837929/
https://www.ncbi.nlm.nih.gov/pubmed/35161529
http://dx.doi.org/10.3390/s22030783
work_keys_str_mv AT petrovaiandra fastpanopticsegmentationwithsoftattentionembeddings
AT nedevschisergiu fastpanopticsegmentationwithsoftattentionembeddings