Cargando…

Deep learning-based video quality enhancement for the new versatile video coding

Multimedia IoT (M-IoT) is an emerging type of Internet of things (IoT) relaying multimedia data (images, videos, audio and speech, etc.). The rapid growth of M-IoT devices enables the creation of a massive volume of multimedia data with different characteristics and requirements. With the developmen...

Descripción completa

Detalles Bibliográficos
Autores principales: Bouaafia, Soulef, Khemiri, Randa, Messaoud, Seifeddine, Ben Ahmed, Olfa, Sayadi, Fatma Ezahra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer London 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8425022/
https://www.ncbi.nlm.nih.gov/pubmed/34511732
http://dx.doi.org/10.1007/s00521-021-06491-9
_version_ 1783749773840351232
author Bouaafia, Soulef
Khemiri, Randa
Messaoud, Seifeddine
Ben Ahmed, Olfa
Sayadi, Fatma Ezahra
author_facet Bouaafia, Soulef
Khemiri, Randa
Messaoud, Seifeddine
Ben Ahmed, Olfa
Sayadi, Fatma Ezahra
author_sort Bouaafia, Soulef
collection PubMed
description Multimedia IoT (M-IoT) is an emerging type of Internet of things (IoT) relaying multimedia data (images, videos, audio and speech, etc.). The rapid growth of M-IoT devices enables the creation of a massive volume of multimedia data with different characteristics and requirements. With the development of artificial intelligence (AI), AI-based multimedia IoT systems have been recently designed and deployed for various video-based services for contemporary daily life, like video surveillance with high definition (HD) and ultra-high definition (UHD) and mobile multimedia streaming. These new services need higher video quality in order to meet the quality of experience (QoE) required by the users. Versatile video coding (VVC) is the new video coding standard that achieves significant coding efficiency over its predecessor high-efficiency video coding (HEVC). Moreover, VVC can achieve up to 30% BD rate savings compared to HEVC. Inspired by the rapid advancements in deep learning, we propose in this paper a wide-activated squeeze-and-excitation deep convolutional neural network (WSE-DCNN) technique-based video quality enhancement for VVC. Therefore, we replace the conventional in-loop filtering in VVC by the proposed WSE-DCNN model that eliminates the compression artifacts in order to improve visual quality and hence increase the end user QoE. The obtained results prove that the proposed in-loop filtering technique achieves [Formula: see text]%, [Formula: see text]%, and [Formula: see text]% BD rate reduction for luma and both chroma components under random access configuration. Compared to the traditional CNN-based filtering approaches, the proposed WSE-DCNN-based in-loop filtering framework achieves efficient performance in terms of RD cost.
format Online
Article
Text
id pubmed-8425022
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer London
record_format MEDLINE/PubMed
spelling pubmed-84250222021-09-08 Deep learning-based video quality enhancement for the new versatile video coding Bouaafia, Soulef Khemiri, Randa Messaoud, Seifeddine Ben Ahmed, Olfa Sayadi, Fatma Ezahra Neural Comput Appl S.I. : Emerging trends in AI & ML Multimedia IoT (M-IoT) is an emerging type of Internet of things (IoT) relaying multimedia data (images, videos, audio and speech, etc.). The rapid growth of M-IoT devices enables the creation of a massive volume of multimedia data with different characteristics and requirements. With the development of artificial intelligence (AI), AI-based multimedia IoT systems have been recently designed and deployed for various video-based services for contemporary daily life, like video surveillance with high definition (HD) and ultra-high definition (UHD) and mobile multimedia streaming. These new services need higher video quality in order to meet the quality of experience (QoE) required by the users. Versatile video coding (VVC) is the new video coding standard that achieves significant coding efficiency over its predecessor high-efficiency video coding (HEVC). Moreover, VVC can achieve up to 30% BD rate savings compared to HEVC. Inspired by the rapid advancements in deep learning, we propose in this paper a wide-activated squeeze-and-excitation deep convolutional neural network (WSE-DCNN) technique-based video quality enhancement for VVC. Therefore, we replace the conventional in-loop filtering in VVC by the proposed WSE-DCNN model that eliminates the compression artifacts in order to improve visual quality and hence increase the end user QoE. The obtained results prove that the proposed in-loop filtering technique achieves [Formula: see text]%, [Formula: see text]%, and [Formula: see text]% BD rate reduction for luma and both chroma components under random access configuration. Compared to the traditional CNN-based filtering approaches, the proposed WSE-DCNN-based in-loop filtering framework achieves efficient performance in terms of RD cost. Springer London 2021-09-08 2022 /pmc/articles/PMC8425022/ /pubmed/34511732 http://dx.doi.org/10.1007/s00521-021-06491-9 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle S.I. : Emerging trends in AI & ML
Bouaafia, Soulef
Khemiri, Randa
Messaoud, Seifeddine
Ben Ahmed, Olfa
Sayadi, Fatma Ezahra
Deep learning-based video quality enhancement for the new versatile video coding
title Deep learning-based video quality enhancement for the new versatile video coding
title_full Deep learning-based video quality enhancement for the new versatile video coding
title_fullStr Deep learning-based video quality enhancement for the new versatile video coding
title_full_unstemmed Deep learning-based video quality enhancement for the new versatile video coding
title_short Deep learning-based video quality enhancement for the new versatile video coding
title_sort deep learning-based video quality enhancement for the new versatile video coding
topic S.I. : Emerging trends in AI & ML
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8425022/
https://www.ncbi.nlm.nih.gov/pubmed/34511732
http://dx.doi.org/10.1007/s00521-021-06491-9
work_keys_str_mv AT bouaafiasoulef deeplearningbasedvideoqualityenhancementforthenewversatilevideocoding
AT khemiriranda deeplearningbasedvideoqualityenhancementforthenewversatilevideocoding
AT messaoudseifeddine deeplearningbasedvideoqualityenhancementforthenewversatilevideocoding
AT benahmedolfa deeplearningbasedvideoqualityenhancementforthenewversatilevideocoding
AT sayadifatmaezahra deeplearningbasedvideoqualityenhancementforthenewversatilevideocoding