Cargando…
A Deep CNN-LSTM Framework for Fast Video Coding
High Efficiency Video Coding (HEVC) doubles the compression rates over the previous H.264 standard for the same video quality. To improve the coding efficiency, HEVC adopts the hierarchical quadtree structured Coding Unit (CU). However, the computational complexity significantly increases due to the...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7340939/ http://dx.doi.org/10.1007/978-3-030-51935-3_22 |
_version_ | 1783555127469146112 |
---|---|
author | Bouaafia, Soulef Khemiri, Randa Sayadi, Fatma Ezahra Atri, Mohamed Liouane, Noureddine |
author_facet | Bouaafia, Soulef Khemiri, Randa Sayadi, Fatma Ezahra Atri, Mohamed Liouane, Noureddine |
author_sort | Bouaafia, Soulef |
collection | PubMed |
description | High Efficiency Video Coding (HEVC) doubles the compression rates over the previous H.264 standard for the same video quality. To improve the coding efficiency, HEVC adopts the hierarchical quadtree structured Coding Unit (CU). However, the computational complexity significantly increases due to the full search for Rate-Distortion Optimization (RDO) to find the optimal Coding Tree Unit (CTU) partition. Here, this paper proposes a deep learning model to predict the HEVC CU partition at inter-mode, instead of brute-force RDO search. To learn the learning model, a large-scale database for HEVC inter-mode is first built. Second, to predict the CU partition of HEVC, we propose as a model a combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) network. The simulation results prove that the proposed scheme can achieve a best compromise between complexity reduction and RD performance, compared to existing approaches. |
format | Online Article Text |
id | pubmed-7340939 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-73409392020-07-08 A Deep CNN-LSTM Framework for Fast Video Coding Bouaafia, Soulef Khemiri, Randa Sayadi, Fatma Ezahra Atri, Mohamed Liouane, Noureddine Image and Signal Processing Article High Efficiency Video Coding (HEVC) doubles the compression rates over the previous H.264 standard for the same video quality. To improve the coding efficiency, HEVC adopts the hierarchical quadtree structured Coding Unit (CU). However, the computational complexity significantly increases due to the full search for Rate-Distortion Optimization (RDO) to find the optimal Coding Tree Unit (CTU) partition. Here, this paper proposes a deep learning model to predict the HEVC CU partition at inter-mode, instead of brute-force RDO search. To learn the learning model, a large-scale database for HEVC inter-mode is first built. Second, to predict the CU partition of HEVC, we propose as a model a combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) network. The simulation results prove that the proposed scheme can achieve a best compromise between complexity reduction and RD performance, compared to existing approaches. 2020-06-05 /pmc/articles/PMC7340939/ http://dx.doi.org/10.1007/978-3-030-51935-3_22 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Bouaafia, Soulef Khemiri, Randa Sayadi, Fatma Ezahra Atri, Mohamed Liouane, Noureddine A Deep CNN-LSTM Framework for Fast Video Coding |
title | A Deep CNN-LSTM Framework for Fast Video Coding |
title_full | A Deep CNN-LSTM Framework for Fast Video Coding |
title_fullStr | A Deep CNN-LSTM Framework for Fast Video Coding |
title_full_unstemmed | A Deep CNN-LSTM Framework for Fast Video Coding |
title_short | A Deep CNN-LSTM Framework for Fast Video Coding |
title_sort | deep cnn-lstm framework for fast video coding |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7340939/ http://dx.doi.org/10.1007/978-3-030-51935-3_22 |
work_keys_str_mv | AT bouaafiasoulef adeepcnnlstmframeworkforfastvideocoding AT khemiriranda adeepcnnlstmframeworkforfastvideocoding AT sayadifatmaezahra adeepcnnlstmframeworkforfastvideocoding AT atrimohamed adeepcnnlstmframeworkforfastvideocoding AT liouanenoureddine adeepcnnlstmframeworkforfastvideocoding AT bouaafiasoulef deepcnnlstmframeworkforfastvideocoding AT khemiriranda deepcnnlstmframeworkforfastvideocoding AT sayadifatmaezahra deepcnnlstmframeworkforfastvideocoding AT atrimohamed deepcnnlstmframeworkforfastvideocoding AT liouanenoureddine deepcnnlstmframeworkforfastvideocoding |