Cargando…

State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model

The development of automatic methods for image and video quality assessment that correlate well with the perception of human observers is a very challenging open problem in vision science, with numerous practical applications in disciplines such as image processing and computer vision, as well as in...

Descripción completa

Detalles Bibliográficos
Autores principales: Luna, Raúl, Zabaleta, Itziar, Bertalmío, Marcelo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10408451/
https://www.ncbi.nlm.nih.gov/pubmed/37559700
http://dx.doi.org/10.3389/fnins.2023.1222815
_version_ 1785086185069084672
author Luna, Raúl
Zabaleta, Itziar
Bertalmío, Marcelo
author_facet Luna, Raúl
Zabaleta, Itziar
Bertalmío, Marcelo
author_sort Luna, Raúl
collection PubMed
description The development of automatic methods for image and video quality assessment that correlate well with the perception of human observers is a very challenging open problem in vision science, with numerous practical applications in disciplines such as image processing and computer vision, as well as in the media industry. In the past two decades, the goal of image quality research has been to improve upon classical metrics by developing models that emulate some aspects of the visual system, and while the progress has been considerable, state-of-the-art quality assessment methods still share a number of shortcomings, like their performance dropping considerably when they are tested on a database that is quite different from the one used to train them, or their significant limitations in predicting observer scores for high framerate videos. In this work we propose a novel objective method for image and video quality assessment that is based on the recently introduced Intrinsically Non-linear Receptive Field (INRF) formulation, a neural summation model that has been shown to be better at predicting neural activity and visual perception phenomena than the classical linear receptive field. Here we start by optimizing, on a classic image quality database, the four parameters of a very simple INRF-based metric, and proceed to test this metric on three other databases, showing that its performance equals or surpasses that of the state-of-the-art methods, some of them having millions of parameters. Next, we extend to the temporal domain this INRF image quality metric, and test it on several popular video quality datasets; again, the results of our proposed INRF-based video quality metric are shown to be very competitive.
format Online
Article
Text
id pubmed-10408451
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104084512023-08-09 State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model Luna, Raúl Zabaleta, Itziar Bertalmío, Marcelo Front Neurosci Neuroscience The development of automatic methods for image and video quality assessment that correlate well with the perception of human observers is a very challenging open problem in vision science, with numerous practical applications in disciplines such as image processing and computer vision, as well as in the media industry. In the past two decades, the goal of image quality research has been to improve upon classical metrics by developing models that emulate some aspects of the visual system, and while the progress has been considerable, state-of-the-art quality assessment methods still share a number of shortcomings, like their performance dropping considerably when they are tested on a database that is quite different from the one used to train them, or their significant limitations in predicting observer scores for high framerate videos. In this work we propose a novel objective method for image and video quality assessment that is based on the recently introduced Intrinsically Non-linear Receptive Field (INRF) formulation, a neural summation model that has been shown to be better at predicting neural activity and visual perception phenomena than the classical linear receptive field. Here we start by optimizing, on a classic image quality database, the four parameters of a very simple INRF-based metric, and proceed to test this metric on three other databases, showing that its performance equals or surpasses that of the state-of-the-art methods, some of them having millions of parameters. Next, we extend to the temporal domain this INRF image quality metric, and test it on several popular video quality datasets; again, the results of our proposed INRF-based video quality metric are shown to be very competitive. Frontiers Media S.A. 2023-07-25 /pmc/articles/PMC10408451/ /pubmed/37559700 http://dx.doi.org/10.3389/fnins.2023.1222815 Text en Copyright © 2023 Luna, Zabaleta and Bertalmío. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Luna, Raúl
Zabaleta, Itziar
Bertalmío, Marcelo
State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model
title State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model
title_full State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model
title_fullStr State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model
title_full_unstemmed State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model
title_short State-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model
title_sort state-of-the-art image and video quality assessment with a metric based on an intrinsically non-linear neural summation model
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10408451/
https://www.ncbi.nlm.nih.gov/pubmed/37559700
http://dx.doi.org/10.3389/fnins.2023.1222815
work_keys_str_mv AT lunaraul stateoftheartimageandvideoqualityassessmentwithametricbasedonanintrinsicallynonlinearneuralsummationmodel
AT zabaletaitziar stateoftheartimageandvideoqualityassessmentwithametricbasedonanintrinsicallynonlinearneuralsummationmodel
AT bertalmiomarcelo stateoftheartimageandvideoqualityassessmentwithametricbasedonanintrinsicallynonlinearneuralsummationmodel