Cargando…
State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model
The development of automatic methods for image and video quality assessment that correlate well with the perception of human observers is a very challenging open problem in vision science, with numerous practical applications in disciplines such as image processing and computer vision, as well as in...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10408451/ https://www.ncbi.nlm.nih.gov/pubmed/37559700 http://dx.doi.org/10.3389/fnins.2023.1222815 |
_version_ | 1785086185069084672 |
---|---|
author | Luna, Raúl Zabaleta, Itziar Bertalmío, Marcelo |
author_facet | Luna, Raúl Zabaleta, Itziar Bertalmío, Marcelo |
author_sort | Luna, Raúl |
collection | PubMed |
description | The development of automatic methods for image and video quality assessment that correlate well with the perception of human observers is a very challenging open problem in vision science, with numerous practical applications in disciplines such as image processing and computer vision, as well as in the media industry. In the past two decades, the goal of image quality research has been to improve upon classical metrics by developing models that emulate some aspects of the visual system, and while the progress has been considerable, state-of-the-art quality assessment methods still share a number of shortcomings, like their performance dropping considerably when they are tested on a database that is quite different from the one used to train them, or their significant limitations in predicting observer scores for high framerate videos. In this work we propose a novel objective method for image and video quality assessment that is based on the recently introduced Intrinsically Non-linear Receptive Field (INRF) formulation, a neural summation model that has been shown to be better at predicting neural activity and visual perception phenomena than the classical linear receptive field. Here we start by optimizing, on a classic image quality database, the four parameters of a very simple INRF-based metric, and proceed to test this metric on three other databases, showing that its performance equals or surpasses that of the state-of-the-art methods, some of them having millions of parameters. Next, we extend to the temporal domain this INRF image quality metric, and test it on several popular video quality datasets; again, the results of our proposed INRF-based video quality metric are shown to be very competitive. |
format | Online Article Text |
id | pubmed-10408451 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-104084512023-08-09 State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model Luna, Raúl Zabaleta, Itziar Bertalmío, Marcelo Front Neurosci Neuroscience The development of automatic methods for image and video quality assessment that correlate well with the perception of human observers is a very challenging open problem in vision science, with numerous practical applications in disciplines such as image processing and computer vision, as well as in the media industry. In the past two decades, the goal of image quality research has been to improve upon classical metrics by developing models that emulate some aspects of the visual system, and while the progress has been considerable, state-of-the-art quality assessment methods still share a number of shortcomings, like their performance dropping considerably when they are tested on a database that is quite different from the one used to train them, or their significant limitations in predicting observer scores for high framerate videos. In this work we propose a novel objective method for image and video quality assessment that is based on the recently introduced Intrinsically Non-linear Receptive Field (INRF) formulation, a neural summation model that has been shown to be better at predicting neural activity and visual perception phenomena than the classical linear receptive field. Here we start by optimizing, on a classic image quality database, the four parameters of a very simple INRF-based metric, and proceed to test this metric on three other databases, showing that its performance equals or surpasses that of the state-of-the-art methods, some of them having millions of parameters. Next, we extend to the temporal domain this INRF image quality metric, and test it on several popular video quality datasets; again, the results of our proposed INRF-based video quality metric are shown to be very competitive. Frontiers Media S.A. 2023-07-25 /pmc/articles/PMC10408451/ /pubmed/37559700 http://dx.doi.org/10.3389/fnins.2023.1222815 Text en Copyright © 2023 Luna, Zabaleta and Bertalmío. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Luna, Raúl Zabaleta, Itziar Bertalmío, Marcelo State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model |
title | State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model |
title_full | State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model |
title_fullStr | State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model |
title_full_unstemmed | State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model |
title_short | State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model |
title_sort | state-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10408451/ https://www.ncbi.nlm.nih.gov/pubmed/37559700 http://dx.doi.org/10.3389/fnins.2023.1222815 |
work_keys_str_mv | AT lunaraul stateoftheartimageandvideoqualityassessmentwithametricbasedonanintrinsicallynonlinearneuralsummationmodel AT zabaletaitziar stateoftheartimageandvideoqualityassessmentwithametricbasedonanintrinsicallynonlinearneuralsummationmodel AT bertalmiomarcelo stateoftheartimageandvideoqualityassessmentwithametricbasedonanintrinsicallynonlinearneuralsummationmodel |