Cargando…

Soundgen: An open-source tool for synthesizing nonverbal vocalizations

Voice synthesis is a useful method for investigating the communicative role of different acoustic features. Although many text-to-speech systems are available, researchers of human nonverbal vocalizations and bioacousticians may profit from a dedicated simple tool for synthesizing and manipulating n...

Descripción completa

Detalles Bibliográficos
Autor principal:	Anikin, Andrey
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6478631/ https://www.ncbi.nlm.nih.gov/pubmed/30054898 http://dx.doi.org/10.3758/s13428-018-1095-7

_version_	1783413176910479360
author	Anikin, Andrey
author_facet	Anikin, Andrey
author_sort	Anikin, Andrey
collection	PubMed
description	Voice synthesis is a useful method for investigating the communicative role of different acoustic features. Although many text-to-speech systems are available, researchers of human nonverbal vocalizations and bioacousticians may profit from a dedicated simple tool for synthesizing and manipulating natural-sounding vocalizations. Soundgen (https://CRAN.R-project.org/package=soundgen) is an open-source R package that synthesizes nonverbal vocalizations based on meaningful acoustic parameters, which can be specified from the command line or in an interactive app. This tool was validated by comparing the perceived emotion, valence, arousal, and authenticity of 60 recorded human nonverbal vocalizations (screams, moans, laughs, and so on) and their approximate synthetic reproductions. Each synthetic sound was created by manually specifying only a small number of high-level control parameters, such as syllable length and a few anchors for the intonation contour. Nevertheless, the valence and arousal ratings of synthetic sounds were similar to those of the original recordings, and the authenticity ratings were comparable, maintaining parity with the originals for less complex vocalizations. Manipulating the precise acoustic characteristics of synthetic sounds may shed light on the salient predictors of emotion in the human voice. More generally, soundgen may prove useful for any studies that require precise control over the acoustic features of nonspeech sounds, including research on animal vocalizations and auditory perception.
format	Online Article Text
id	pubmed-6478631
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-64786312019-05-14 Soundgen: An open-source tool for synthesizing nonverbal vocalizations Anikin, Andrey Behav Res Methods Article Voice synthesis is a useful method for investigating the communicative role of different acoustic features. Although many text-to-speech systems are available, researchers of human nonverbal vocalizations and bioacousticians may profit from a dedicated simple tool for synthesizing and manipulating natural-sounding vocalizations. Soundgen (https://CRAN.R-project.org/package=soundgen) is an open-source R package that synthesizes nonverbal vocalizations based on meaningful acoustic parameters, which can be specified from the command line or in an interactive app. This tool was validated by comparing the perceived emotion, valence, arousal, and authenticity of 60 recorded human nonverbal vocalizations (screams, moans, laughs, and so on) and their approximate synthetic reproductions. Each synthetic sound was created by manually specifying only a small number of high-level control parameters, such as syllable length and a few anchors for the intonation contour. Nevertheless, the valence and arousal ratings of synthetic sounds were similar to those of the original recordings, and the authenticity ratings were comparable, maintaining parity with the originals for less complex vocalizations. Manipulating the precise acoustic characteristics of synthetic sounds may shed light on the salient predictors of emotion in the human voice. More generally, soundgen may prove useful for any studies that require precise control over the acoustic features of nonspeech sounds, including research on animal vocalizations and auditory perception. Springer US 2018-07-27 2019 /pmc/articles/PMC6478631/ /pubmed/30054898 http://dx.doi.org/10.3758/s13428-018-1095-7 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle	Article Anikin, Andrey Soundgen: An open-source tool for synthesizing nonverbal vocalizations
title	Soundgen: An open-source tool for synthesizing nonverbal vocalizations
title_full	Soundgen: An open-source tool for synthesizing nonverbal vocalizations
title_fullStr	Soundgen: An open-source tool for synthesizing nonverbal vocalizations
title_full_unstemmed	Soundgen: An open-source tool for synthesizing nonverbal vocalizations
title_short	Soundgen: An open-source tool for synthesizing nonverbal vocalizations
title_sort	soundgen: an open-source tool for synthesizing nonverbal vocalizations
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6478631/ https://www.ncbi.nlm.nih.gov/pubmed/30054898 http://dx.doi.org/10.3758/s13428-018-1095-7
work_keys_str_mv	AT anikinandrey soundgenanopensourcetoolforsynthesizingnonverbalvocalizations

Soundgen: An open-source tool for synthesizing nonverbal vocalizations

Ejemplares similares