Cargando…

Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet

This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. A two-stage approach is proposed in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts...

Descripción completa

Detalles Bibliográficos
Autor principal: Savchenko, Andrey V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924510/
https://www.ncbi.nlm.nih.gov/pubmed/33816850
http://dx.doi.org/10.7717/peerj-cs.197
_version_ 1783659106160082944
author Savchenko, Andrey V.
author_facet Savchenko, Andrey V.
author_sort Savchenko, Andrey V.
collection PubMed
description This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. A two-stage approach is proposed in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. Here the MobileNet is modified and is preliminarily trained to perform face recognition in order to additionally recognize age and gender. The age is estimated as the expected value of top predictions in the neural network. In the second stage of the proposed approach, extracted faces are grouped using hierarchical agglomerative clustering techniques. The birth year and gender of a person in each cluster are estimated using aggregation of predictions for individual photos. The proposed approach is implemented in an Android mobile application. It is experimentally demonstrated that the quality of facial clustering for the developed network is competitive with the state-of-the-art results achieved by deep neural networks, though implementation of the proposed approach is much computationally cheaper. Moreover, this approach is characterized by more accurate age/gender recognition when compared to the publicly available models.
format Online
Article
Text
id pubmed-7924510
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-79245102021-04-02 Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet Savchenko, Andrey V. PeerJ Comput Sci Artificial Intelligence This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. A two-stage approach is proposed in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. Here the MobileNet is modified and is preliminarily trained to perform face recognition in order to additionally recognize age and gender. The age is estimated as the expected value of top predictions in the neural network. In the second stage of the proposed approach, extracted faces are grouped using hierarchical agglomerative clustering techniques. The birth year and gender of a person in each cluster are estimated using aggregation of predictions for individual photos. The proposed approach is implemented in an Android mobile application. It is experimentally demonstrated that the quality of facial clustering for the developed network is competitive with the state-of-the-art results achieved by deep neural networks, though implementation of the proposed approach is much computationally cheaper. Moreover, this approach is characterized by more accurate age/gender recognition when compared to the publicly available models. PeerJ Inc. 2019-06-10 /pmc/articles/PMC7924510/ /pubmed/33816850 http://dx.doi.org/10.7717/peerj-cs.197 Text en © 2019 Savchenko http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Savchenko, Andrey V.
Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet
title Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet
title_full Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet
title_fullStr Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet
title_full_unstemmed Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet
title_short Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet
title_sort efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output convnet
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924510/
https://www.ncbi.nlm.nih.gov/pubmed/33816850
http://dx.doi.org/10.7717/peerj-cs.197
work_keys_str_mv AT savchenkoandreyv efficientfacialrepresentationsforagegenderandidentityrecognitioninorganizingphotoalbumsusingmultioutputconvnet