Cargando…
An Empathy Evaluation System Using Spectrogram Image Features of Audio
Watching videos online has become part of a relaxed lifestyle. The music in videos has a sensitive influence on human emotions, perception, and imaginations, which can make people feel relaxed or sad, and so on. Therefore, it is particularly important for people who make advertising videos to unders...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587789/ https://www.ncbi.nlm.nih.gov/pubmed/34770419 http://dx.doi.org/10.3390/s21217111 |
_version_ | 1784598251221024768 |
---|---|
author | Zhang, Jing Wen, Xingyu Cho, Ayoung Whang, Mincheol |
author_facet | Zhang, Jing Wen, Xingyu Cho, Ayoung Whang, Mincheol |
author_sort | Zhang, Jing |
collection | PubMed |
description | Watching videos online has become part of a relaxed lifestyle. The music in videos has a sensitive influence on human emotions, perception, and imaginations, which can make people feel relaxed or sad, and so on. Therefore, it is particularly important for people who make advertising videos to understand the relationship between the physical elements of music and empathy characteristics. The purpose of this paper is to analyze the music features in an advertising video and extract the music features that make people empathize. This paper combines both methods of the power spectrum of MFCC and image RGB analysis to find the audio feature vector. In spectral analysis, the eigenvectors obtained in the analysis process range from blue (low range) to green (medium range) to red (high range). The machine learning random forest classifier is used to classify the data obtained by machine learning, and the trained model is used to monitor the development of an advertisement empathy system in real time. The result is that the optimal model is obtained with the training accuracy result of 99.173% and a test accuracy of 86.171%, which can be deemed as correct by comparing the three models of audio feature value analysis. The contribution of this study can be summarized as follows: (1) the low-frequency and high-amplitude audio in the video is more likely to resonate than the high-frequency and high-amplitude audio; (2) it is found that frequency and audio amplitude are important attributes for describing waveforms by observing the characteristics of the machine learning classifier; (3) a new audio extraction method is proposed to induce human empathy. That is, the feature value extracted by the method of spectrogram image features of audio has the most ability to arouse human empathy. |
format | Online Article Text |
id | pubmed-8587789 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-85877892021-11-13 An Empathy Evaluation System Using Spectrogram Image Features of Audio Zhang, Jing Wen, Xingyu Cho, Ayoung Whang, Mincheol Sensors (Basel) Article Watching videos online has become part of a relaxed lifestyle. The music in videos has a sensitive influence on human emotions, perception, and imaginations, which can make people feel relaxed or sad, and so on. Therefore, it is particularly important for people who make advertising videos to understand the relationship between the physical elements of music and empathy characteristics. The purpose of this paper is to analyze the music features in an advertising video and extract the music features that make people empathize. This paper combines both methods of the power spectrum of MFCC and image RGB analysis to find the audio feature vector. In spectral analysis, the eigenvectors obtained in the analysis process range from blue (low range) to green (medium range) to red (high range). The machine learning random forest classifier is used to classify the data obtained by machine learning, and the trained model is used to monitor the development of an advertisement empathy system in real time. The result is that the optimal model is obtained with the training accuracy result of 99.173% and a test accuracy of 86.171%, which can be deemed as correct by comparing the three models of audio feature value analysis. The contribution of this study can be summarized as follows: (1) the low-frequency and high-amplitude audio in the video is more likely to resonate than the high-frequency and high-amplitude audio; (2) it is found that frequency and audio amplitude are important attributes for describing waveforms by observing the characteristics of the machine learning classifier; (3) a new audio extraction method is proposed to induce human empathy. That is, the feature value extracted by the method of spectrogram image features of audio has the most ability to arouse human empathy. MDPI 2021-10-26 /pmc/articles/PMC8587789/ /pubmed/34770419 http://dx.doi.org/10.3390/s21217111 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhang, Jing Wen, Xingyu Cho, Ayoung Whang, Mincheol An Empathy Evaluation System Using Spectrogram Image Features of Audio |
title | An Empathy Evaluation System Using Spectrogram Image Features of Audio |
title_full | An Empathy Evaluation System Using Spectrogram Image Features of Audio |
title_fullStr | An Empathy Evaluation System Using Spectrogram Image Features of Audio |
title_full_unstemmed | An Empathy Evaluation System Using Spectrogram Image Features of Audio |
title_short | An Empathy Evaluation System Using Spectrogram Image Features of Audio |
title_sort | empathy evaluation system using spectrogram image features of audio |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587789/ https://www.ncbi.nlm.nih.gov/pubmed/34770419 http://dx.doi.org/10.3390/s21217111 |
work_keys_str_mv | AT zhangjing anempathyevaluationsystemusingspectrogramimagefeaturesofaudio AT wenxingyu anempathyevaluationsystemusingspectrogramimagefeaturesofaudio AT choayoung anempathyevaluationsystemusingspectrogramimagefeaturesofaudio AT whangmincheol anempathyevaluationsystemusingspectrogramimagefeaturesofaudio AT zhangjing empathyevaluationsystemusingspectrogramimagefeaturesofaudio AT wenxingyu empathyevaluationsystemusingspectrogramimagefeaturesofaudio AT choayoung empathyevaluationsystemusingspectrogramimagefeaturesofaudio AT whangmincheol empathyevaluationsystemusingspectrogramimagefeaturesofaudio |