Cargando…

Joint Multi-object Detection and Segmentation from an Untrimmed Video

In this paper, we present a novel method for jointly detecting and segmenting multiple objects from an untrimmed video. Unlike most existing video object segmentation methods that can only handle a trimmed video in which all video frames contain the target objects, we address a more practical and di...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Xinling, Wang, Le, Zhang, Qilin, Zheng, Nanning, Hua, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7256415/
http://dx.doi.org/10.1007/978-3-030-49161-1_27
_version_ 1783539903641944064
author Liu, Xinling
Wang, Le
Zhang, Qilin
Zheng, Nanning
Hua, Gang
author_facet Liu, Xinling
Wang, Le
Zhang, Qilin
Zheng, Nanning
Hua, Gang
author_sort Liu, Xinling
collection PubMed
description In this paper, we present a novel method for jointly detecting and segmenting multiple objects from an untrimmed video. Unlike most existing video object segmentation methods that can only handle a trimmed video in which all video frames contain the target objects, we address a more practical and difficult problem, i.e., joint multi-object detection and segmentation from an untrimmed video where the target objects do not always appear per frame. In particular, our method consists of two modules, i.e., object decision module and object segmentation module. The object decision module is used to detect the objects and decide which target objects need to be separated out from video. As there are usually two or more target objects and they do not always appear in the whole video, we introduce the data association into object decision module to identify their correspondences among frames. The object segmentation module aims to separate the target objects identified by object decision module. In order to extensively evaluate the proposed method, we introduce a new dataset named UNVOSeg dataset, in which [Formula: see text] of the video frames do not contain objects. Experimental results on four datasets demonstrate that our method outperforms most of the state-of-the-art approaches.
format Online
Article
Text
id pubmed-7256415
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72564152020-05-29 Joint Multi-object Detection and Segmentation from an Untrimmed Video Liu, Xinling Wang, Le Zhang, Qilin Zheng, Nanning Hua, Gang Artificial Intelligence Applications and Innovations Article In this paper, we present a novel method for jointly detecting and segmenting multiple objects from an untrimmed video. Unlike most existing video object segmentation methods that can only handle a trimmed video in which all video frames contain the target objects, we address a more practical and difficult problem, i.e., joint multi-object detection and segmentation from an untrimmed video where the target objects do not always appear per frame. In particular, our method consists of two modules, i.e., object decision module and object segmentation module. The object decision module is used to detect the objects and decide which target objects need to be separated out from video. As there are usually two or more target objects and they do not always appear in the whole video, we introduce the data association into object decision module to identify their correspondences among frames. The object segmentation module aims to separate the target objects identified by object decision module. In order to extensively evaluate the proposed method, we introduce a new dataset named UNVOSeg dataset, in which [Formula: see text] of the video frames do not contain objects. Experimental results on four datasets demonstrate that our method outperforms most of the state-of-the-art approaches. 2020-05-06 /pmc/articles/PMC7256415/ http://dx.doi.org/10.1007/978-3-030-49161-1_27 Text en © IFIP International Federation for Information Processing 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Liu, Xinling
Wang, Le
Zhang, Qilin
Zheng, Nanning
Hua, Gang
Joint Multi-object Detection and Segmentation from an Untrimmed Video
title Joint Multi-object Detection and Segmentation from an Untrimmed Video
title_full Joint Multi-object Detection and Segmentation from an Untrimmed Video
title_fullStr Joint Multi-object Detection and Segmentation from an Untrimmed Video
title_full_unstemmed Joint Multi-object Detection and Segmentation from an Untrimmed Video
title_short Joint Multi-object Detection and Segmentation from an Untrimmed Video
title_sort joint multi-object detection and segmentation from an untrimmed video
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7256415/
http://dx.doi.org/10.1007/978-3-030-49161-1_27
work_keys_str_mv AT liuxinling jointmultiobjectdetectionandsegmentationfromanuntrimmedvideo
AT wangle jointmultiobjectdetectionandsegmentationfromanuntrimmedvideo
AT zhangqilin jointmultiobjectdetectionandsegmentationfromanuntrimmedvideo
AT zhengnanning jointmultiobjectdetectionandsegmentationfromanuntrimmedvideo
AT huagang jointmultiobjectdetectionandsegmentationfromanuntrimmedvideo