Cargando…

A Deep CNN-LSTM Framework for Fast Video Coding

High Efficiency Video Coding (HEVC) doubles the compression rates over the previous H.264 standard for the same video quality. To improve the coding efficiency, HEVC adopts the hierarchical quadtree structured Coding Unit (CU). However, the computational complexity significantly increases due to the...

Descripción completa

Detalles Bibliográficos
Autores principales: Bouaafia, Soulef, Khemiri, Randa, Sayadi, Fatma Ezahra, Atri, Mohamed, Liouane, Noureddine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7340939/
http://dx.doi.org/10.1007/978-3-030-51935-3_22
_version_ 1783555127469146112
author Bouaafia, Soulef
Khemiri, Randa
Sayadi, Fatma Ezahra
Atri, Mohamed
Liouane, Noureddine
author_facet Bouaafia, Soulef
Khemiri, Randa
Sayadi, Fatma Ezahra
Atri, Mohamed
Liouane, Noureddine
author_sort Bouaafia, Soulef
collection PubMed
description High Efficiency Video Coding (HEVC) doubles the compression rates over the previous H.264 standard for the same video quality. To improve the coding efficiency, HEVC adopts the hierarchical quadtree structured Coding Unit (CU). However, the computational complexity significantly increases due to the full search for Rate-Distortion Optimization (RDO) to find the optimal Coding Tree Unit (CTU) partition. Here, this paper proposes a deep learning model to predict the HEVC CU partition at inter-mode, instead of brute-force RDO search. To learn the learning model, a large-scale database for HEVC inter-mode is first built. Second, to predict the CU partition of HEVC, we propose as a model a combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) network. The simulation results prove that the proposed scheme can achieve a best compromise between complexity reduction and RD performance, compared to existing approaches.
format Online
Article
Text
id pubmed-7340939
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73409392020-07-08 A Deep CNN-LSTM Framework for Fast Video Coding Bouaafia, Soulef Khemiri, Randa Sayadi, Fatma Ezahra Atri, Mohamed Liouane, Noureddine Image and Signal Processing Article High Efficiency Video Coding (HEVC) doubles the compression rates over the previous H.264 standard for the same video quality. To improve the coding efficiency, HEVC adopts the hierarchical quadtree structured Coding Unit (CU). However, the computational complexity significantly increases due to the full search for Rate-Distortion Optimization (RDO) to find the optimal Coding Tree Unit (CTU) partition. Here, this paper proposes a deep learning model to predict the HEVC CU partition at inter-mode, instead of brute-force RDO search. To learn the learning model, a large-scale database for HEVC inter-mode is first built. Second, to predict the CU partition of HEVC, we propose as a model a combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) network. The simulation results prove that the proposed scheme can achieve a best compromise between complexity reduction and RD performance, compared to existing approaches. 2020-06-05 /pmc/articles/PMC7340939/ http://dx.doi.org/10.1007/978-3-030-51935-3_22 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Bouaafia, Soulef
Khemiri, Randa
Sayadi, Fatma Ezahra
Atri, Mohamed
Liouane, Noureddine
A Deep CNN-LSTM Framework for Fast Video Coding
title A Deep CNN-LSTM Framework for Fast Video Coding
title_full A Deep CNN-LSTM Framework for Fast Video Coding
title_fullStr A Deep CNN-LSTM Framework for Fast Video Coding
title_full_unstemmed A Deep CNN-LSTM Framework for Fast Video Coding
title_short A Deep CNN-LSTM Framework for Fast Video Coding
title_sort deep cnn-lstm framework for fast video coding
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7340939/
http://dx.doi.org/10.1007/978-3-030-51935-3_22
work_keys_str_mv AT bouaafiasoulef adeepcnnlstmframeworkforfastvideocoding
AT khemiriranda adeepcnnlstmframeworkforfastvideocoding
AT sayadifatmaezahra adeepcnnlstmframeworkforfastvideocoding
AT atrimohamed adeepcnnlstmframeworkforfastvideocoding
AT liouanenoureddine adeepcnnlstmframeworkforfastvideocoding
AT bouaafiasoulef deepcnnlstmframeworkforfastvideocoding
AT khemiriranda deepcnnlstmframeworkforfastvideocoding
AT sayadifatmaezahra deepcnnlstmframeworkforfastvideocoding
AT atrimohamed deepcnnlstmframeworkforfastvideocoding
AT liouanenoureddine deepcnnlstmframeworkforfastvideocoding