Cargando…

FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients

Early rehabilitation with the right intensity contributes to the physical recovery of stroke survivors. In clinical practice, physicians determine whether the training intensity is suitable for rehabilitation based on patients’ narratives, training scores, and evaluation scales, which puts tremendou...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Yiming, Wang, Hewei, Zhu, Xiaoyu, Cao, Xiangming, Yi, Chuanjian, Chen, Yao, Jia, Jie, Lu, Xiaofeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776282/
https://www.ncbi.nlm.nih.gov/pubmed/36552086
http://dx.doi.org/10.3390/brainsci12121626
_version_ 1784855829245067264
author Fan, Yiming
Wang, Hewei
Zhu, Xiaoyu
Cao, Xiangming
Yi, Chuanjian
Chen, Yao
Jia, Jie
Lu, Xiaofeng
author_facet Fan, Yiming
Wang, Hewei
Zhu, Xiaoyu
Cao, Xiangming
Yi, Chuanjian
Chen, Yao
Jia, Jie
Lu, Xiaofeng
author_sort Fan, Yiming
collection PubMed
description Early rehabilitation with the right intensity contributes to the physical recovery of stroke survivors. In clinical practice, physicians determine whether the training intensity is suitable for rehabilitation based on patients’ narratives, training scores, and evaluation scales, which puts tremendous pressure on medical resources. In this study, a lightweight facial expression recognition algorithm is proposed to diagnose stroke patients’ training motivations automatically. First, the properties of convolution are introduced into the Vision Transformer’s structure, allowing the model to extract both local and global features of facial expressions. Second, the pyramid-shaped feature output mode in Convolutional Neural Networks is also introduced to reduce the model’s parameters and calculation costs significantly. Moreover, a classifier that can better classify facial expressions of stroke patients is designed to improve performance further. We verified the proposed algorithm on the Real-world Affective Faces Database (RAF-DB), the Face Expression Recognition Plus Dataset (FER+), and a private dataset for stroke patients. Experiments show that the backbone network of the proposed algorithm achieves better performance than Pyramid Vision Transformer (PvT) and Convolutional Vision Transformer (CvT) with fewer parameters and Floating-point Operations Per Second (FLOPs). In addition, the algorithm reaches an 89.44% accuracy on the RAF-DB dataset, which is higher than other recent studies. In particular, it obtains an accuracy of 99.81% on the private dataset, with only 4.10M parameters.
format Online
Article
Text
id pubmed-9776282
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97762822022-12-23 FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients Fan, Yiming Wang, Hewei Zhu, Xiaoyu Cao, Xiangming Yi, Chuanjian Chen, Yao Jia, Jie Lu, Xiaofeng Brain Sci Article Early rehabilitation with the right intensity contributes to the physical recovery of stroke survivors. In clinical practice, physicians determine whether the training intensity is suitable for rehabilitation based on patients’ narratives, training scores, and evaluation scales, which puts tremendous pressure on medical resources. In this study, a lightweight facial expression recognition algorithm is proposed to diagnose stroke patients’ training motivations automatically. First, the properties of convolution are introduced into the Vision Transformer’s structure, allowing the model to extract both local and global features of facial expressions. Second, the pyramid-shaped feature output mode in Convolutional Neural Networks is also introduced to reduce the model’s parameters and calculation costs significantly. Moreover, a classifier that can better classify facial expressions of stroke patients is designed to improve performance further. We verified the proposed algorithm on the Real-world Affective Faces Database (RAF-DB), the Face Expression Recognition Plus Dataset (FER+), and a private dataset for stroke patients. Experiments show that the backbone network of the proposed algorithm achieves better performance than Pyramid Vision Transformer (PvT) and Convolutional Vision Transformer (CvT) with fewer parameters and Floating-point Operations Per Second (FLOPs). In addition, the algorithm reaches an 89.44% accuracy on the RAF-DB dataset, which is higher than other recent studies. In particular, it obtains an accuracy of 99.81% on the private dataset, with only 4.10M parameters. MDPI 2022-11-28 /pmc/articles/PMC9776282/ /pubmed/36552086 http://dx.doi.org/10.3390/brainsci12121626 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Fan, Yiming
Wang, Hewei
Zhu, Xiaoyu
Cao, Xiangming
Yi, Chuanjian
Chen, Yao
Jia, Jie
Lu, Xiaofeng
FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients
title FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients
title_full FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients
title_fullStr FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients
title_full_unstemmed FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients
title_short FER-PCVT: Facial Expression Recognition with Patch-Convolutional Vision Transformer for Stroke Patients
title_sort fer-pcvt: facial expression recognition with patch-convolutional vision transformer for stroke patients
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9776282/
https://www.ncbi.nlm.nih.gov/pubmed/36552086
http://dx.doi.org/10.3390/brainsci12121626
work_keys_str_mv AT fanyiming ferpcvtfacialexpressionrecognitionwithpatchconvolutionalvisiontransformerforstrokepatients
AT wanghewei ferpcvtfacialexpressionrecognitionwithpatchconvolutionalvisiontransformerforstrokepatients
AT zhuxiaoyu ferpcvtfacialexpressionrecognitionwithpatchconvolutionalvisiontransformerforstrokepatients
AT caoxiangming ferpcvtfacialexpressionrecognitionwithpatchconvolutionalvisiontransformerforstrokepatients
AT yichuanjian ferpcvtfacialexpressionrecognitionwithpatchconvolutionalvisiontransformerforstrokepatients
AT chenyao ferpcvtfacialexpressionrecognitionwithpatchconvolutionalvisiontransformerforstrokepatients
AT jiajie ferpcvtfacialexpressionrecognitionwithpatchconvolutionalvisiontransformerforstrokepatients
AT luxiaofeng ferpcvtfacialexpressionrecognitionwithpatchconvolutionalvisiontransformerforstrokepatients