Cargando…
Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet
This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. A two-stage approach is proposed in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924510/ https://www.ncbi.nlm.nih.gov/pubmed/33816850 http://dx.doi.org/10.7717/peerj-cs.197 |
_version_ | 1783659106160082944 |
---|---|
author | Savchenko, Andrey V. |
author_facet | Savchenko, Andrey V. |
author_sort | Savchenko, Andrey V. |
collection | PubMed |
description | This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. A two-stage approach is proposed in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. Here the MobileNet is modified and is preliminarily trained to perform face recognition in order to additionally recognize age and gender. The age is estimated as the expected value of top predictions in the neural network. In the second stage of the proposed approach, extracted faces are grouped using hierarchical agglomerative clustering techniques. The birth year and gender of a person in each cluster are estimated using aggregation of predictions for individual photos. The proposed approach is implemented in an Android mobile application. It is experimentally demonstrated that the quality of facial clustering for the developed network is competitive with the state-of-the-art results achieved by deep neural networks, though implementation of the proposed approach is much computationally cheaper. Moreover, this approach is characterized by more accurate age/gender recognition when compared to the publicly available models. |
format | Online Article Text |
id | pubmed-7924510 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79245102021-04-02 Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet Savchenko, Andrey V. PeerJ Comput Sci Artificial Intelligence This paper is focused on the automatic extraction of persons and their attributes (gender, year of born) from album of photos and videos. A two-stage approach is proposed in which, firstly, the convolutional neural network simultaneously predicts age/gender from all photos and additionally extracts facial representations suitable for face identification. Here the MobileNet is modified and is preliminarily trained to perform face recognition in order to additionally recognize age and gender. The age is estimated as the expected value of top predictions in the neural network. In the second stage of the proposed approach, extracted faces are grouped using hierarchical agglomerative clustering techniques. The birth year and gender of a person in each cluster are estimated using aggregation of predictions for individual photos. The proposed approach is implemented in an Android mobile application. It is experimentally demonstrated that the quality of facial clustering for the developed network is competitive with the state-of-the-art results achieved by deep neural networks, though implementation of the proposed approach is much computationally cheaper. Moreover, this approach is characterized by more accurate age/gender recognition when compared to the publicly available models. PeerJ Inc. 2019-06-10 /pmc/articles/PMC7924510/ /pubmed/33816850 http://dx.doi.org/10.7717/peerj-cs.197 Text en © 2019 Savchenko http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Artificial Intelligence Savchenko, Andrey V. Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet |
title | Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet |
title_full | Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet |
title_fullStr | Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet |
title_full_unstemmed | Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet |
title_short | Efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output ConvNet |
title_sort | efficient facial representations for age, gender and identity recognition in organizing photo albums using multi-output convnet |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924510/ https://www.ncbi.nlm.nih.gov/pubmed/33816850 http://dx.doi.org/10.7717/peerj-cs.197 |
work_keys_str_mv | AT savchenkoandreyv efficientfacialrepresentationsforagegenderandidentityrecognitioninorganizingphotoalbumsusingmultioutputconvnet |