Cargando…

ResMem-Net: memory based deep CNN for image memorability estimation

Image memorability is a very hard problem in image processing due to its subjective nature. But due to the introduction of Deep Learning and the large availability of data and GPUs, great strides have been made in predicting the memorability of an image. In this paper, we propose a novel deep learni...

Descripción completa

Detalles Bibliográficos
Autores principales:	Praveen, Arockia, Noorwali, Abdulfattah, Samiayya, Duraimurugan, Zubair Khan, Mohammad, Vincent P M, Durai Raj, Bashir, Ali Kashif, Alagupandi, Vinoth
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2021
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8594589/ https://www.ncbi.nlm.nih.gov/pubmed/34825056 http://dx.doi.org/10.7717/peerj-cs.767

_version_	1784600020932100096
author	Praveen, Arockia Noorwali, Abdulfattah Samiayya, Duraimurugan Zubair Khan, Mohammad Vincent P M, Durai Raj Bashir, Ali Kashif Alagupandi, Vinoth
author_facet	Praveen, Arockia Noorwali, Abdulfattah Samiayya, Duraimurugan Zubair Khan, Mohammad Vincent P M, Durai Raj Bashir, Ali Kashif Alagupandi, Vinoth
author_sort	Praveen, Arockia
collection	PubMed
description	Image memorability is a very hard problem in image processing due to its subjective nature. But due to the introduction of Deep Learning and the large availability of data and GPUs, great strides have been made in predicting the memorability of an image. In this paper, we propose a novel deep learning architecture called ResMem-Net that is a hybrid of LSTM and CNN that uses information from the hidden layers of the CNN to compute the memorability score of an image. The intermediate layers are important for predicting the output because they contain information about the intrinsic properties of the image. The proposed architecture automatically learns visual emotions and saliency, shown by the heatmaps generated using the GradRAM technique. We have also used the heatmaps and results to analyze and answer one of the most important questions in image memorability: “What makes an image memorable?”. The model is trained and evaluated using the publicly available Large-scale Image Memorability dataset (LaMem) from MIT. The results show that the model achieves a rank correlation of 0.679 and a mean squared error of 0.011, which is better than the current state-of-the-art models and is close to human consistency (p = 0.68). The proposed architecture also has a significantly low number of parameters compared to the state-of-the-art architecture, making it memory efficient and suitable for production.
format	Online Article Text
id	pubmed-8594589
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-85945892021-11-24 ResMem-Net: memory based deep CNN for image memorability estimation Praveen, Arockia Noorwali, Abdulfattah Samiayya, Duraimurugan Zubair Khan, Mohammad Vincent P M, Durai Raj Bashir, Ali Kashif Alagupandi, Vinoth PeerJ Comput Sci Artificial Intelligence Image memorability is a very hard problem in image processing due to its subjective nature. But due to the introduction of Deep Learning and the large availability of data and GPUs, great strides have been made in predicting the memorability of an image. In this paper, we propose a novel deep learning architecture called ResMem-Net that is a hybrid of LSTM and CNN that uses information from the hidden layers of the CNN to compute the memorability score of an image. The intermediate layers are important for predicting the output because they contain information about the intrinsic properties of the image. The proposed architecture automatically learns visual emotions and saliency, shown by the heatmaps generated using the GradRAM technique. We have also used the heatmaps and results to analyze and answer one of the most important questions in image memorability: “What makes an image memorable?”. The model is trained and evaluated using the publicly available Large-scale Image Memorability dataset (LaMem) from MIT. The results show that the model achieves a rank correlation of 0.679 and a mean squared error of 0.011, which is better than the current state-of-the-art models and is close to human consistency (p = 0.68). The proposed architecture also has a significantly low number of parameters compared to the state-of-the-art architecture, making it memory efficient and suitable for production. PeerJ Inc. 2021-11-05 /pmc/articles/PMC8594589/ /pubmed/34825056 http://dx.doi.org/10.7717/peerj-cs.767 Text en ©2021 Praveen et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle	Artificial Intelligence Praveen, Arockia Noorwali, Abdulfattah Samiayya, Duraimurugan Zubair Khan, Mohammad Vincent P M, Durai Raj Bashir, Ali Kashif Alagupandi, Vinoth ResMem-Net: memory based deep CNN for image memorability estimation
title	ResMem-Net: memory based deep CNN for image memorability estimation
title_full	ResMem-Net: memory based deep CNN for image memorability estimation
title_fullStr	ResMem-Net: memory based deep CNN for image memorability estimation
title_full_unstemmed	ResMem-Net: memory based deep CNN for image memorability estimation
title_short	ResMem-Net: memory based deep CNN for image memorability estimation
title_sort	resmem-net: memory based deep cnn for image memorability estimation
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8594589/ https://www.ncbi.nlm.nih.gov/pubmed/34825056 http://dx.doi.org/10.7717/peerj-cs.767
work_keys_str_mv	AT praveenarockia resmemnetmemorybaseddeepcnnforimagememorabilityestimation AT noorwaliabdulfattah resmemnetmemorybaseddeepcnnforimagememorabilityestimation AT samiayyaduraimurugan resmemnetmemorybaseddeepcnnforimagememorabilityestimation AT zubairkhanmohammad resmemnetmemorybaseddeepcnnforimagememorabilityestimation AT vincentpmdurairaj resmemnetmemorybaseddeepcnnforimagememorabilityestimation AT bashiralikashif resmemnetmemorybaseddeepcnnforimagememorabilityestimation AT alagupandivinoth resmemnetmemorybaseddeepcnnforimagememorabilityestimation

ResMem-Net: memory based deep CNN for image memorability estimation

Ejemplares similares